<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Speech-Recognition on Noureddine RAMDI</title><link>https://ramdi.fr/tags/speech-recognition/</link><description>Recent content in Speech-Recognition on Noureddine RAMDI</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sat, 23 May 2026 20:41:27 +0000</lastBuildDate><atom:link href="https://ramdi.fr/tags/speech-recognition/index.xml" rel="self" type="application/rss+xml"/><item><title>Fun-ASR: Alibaba's multilingual speech recognition model with real-time capabilities</title><link>https://ramdi.fr/github-stars/fun-asr-alibaba-s-multilingual-speech-recognition-model-with-real-time-capabilities/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/fun-asr-alibaba-s-multilingual-speech-recognition-model-with-real-time-capabilities/</guid><description>Fun-ASR is Alibaba Tongyi Lab&amp;rsquo;s end-to-end speech recognition model with 800M parameters, supporting 31 languages and real-time transcription in noisy environments.</description></item><item><title>Kimi-Audio: a unified hybrid-token audio foundation model with LLM core</title><link>https://ramdi.fr/github-stars/kimi-audio-a-unified-hybrid-token-audio-foundation-model-with-llm-core/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/kimi-audio-a-unified-hybrid-token-audio-foundation-model-with-llm-core/</guid><description>Kimi-Audio combines continuous acoustic and discrete semantic tokens within a 7B LLM for unified audio-text understanding and generation. It achieves state-of-the-art ASR with low-latency audio synthesis.</description></item><item><title>LiveCaptions Translator: Real-time speech translation using Windows 11's built-in captions and LLM APIs</title><link>https://ramdi.fr/github-stars/livecaptions-translator-real-time-speech-translation-using-windows-11-s-built-in-captions-and-llm-apis/</link><pubDate>Sat, 23 May 2026 20:41:14 +0000</pubDate><guid>https://ramdi.fr/github-stars/livecaptions-translator-real-time-speech-translation-using-windows-11-s-built-in-captions-and-llm-apis/</guid><description>LiveCaptions Translator taps Windows 11&amp;rsquo;s on-device LiveCaptions for real-time speech translation via multiple LLM and traditional APIs, all in a sleek C# desktop app.</description></item></channel></rss>