Noureddine RAMDI Dinour

Lead Developer & AI Enthusiast — Software Architecture, AI/LLM, Infrastructure Automation

Organizations

3 results for Speech-Recognition

Fun-ASR: Alibaba's multilingual speech recognition model with real-time capabilities
Fun-ASR is Alibaba Tongyi Lab’s end-to-end speech recognition model with 800M parameters, supporting 31 languages and real-time transcription in noisy environments.
github-stars speech recognition multilingual ai python Created Sat, 23 May 2026 20:41:14 +0000
Kimi-Audio: a unified hybrid-token audio foundation model with LLM core
Kimi-Audio combines continuous acoustic and discrete semantic tokens within a 7B LLM for unified audio-text understanding and generation. It achieves state-of-the-art ASR with low-latency audio synthesis.
github-stars python audio speech-recognition foundation-model Created Sat, 23 May 2026 20:41:14 +0000
LiveCaptions Translator: Real-time speech translation using Windows 11's built-in captions and LLM APIs
LiveCaptions Translator taps Windows 11’s on-device LiveCaptions for real-time speech translation via multiple LLM and traditional APIs, all in a sleek C# desktop app.
github-stars c# windows11 speech-recognition translation Created Sat, 23 May 2026 20:41:14 +0000