    Repositories list

    • WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
      Python
      Apache License 2.0
      2931.1k766 Updated May 25, 2025
    • Automatic Speech Recognition in Unity using Vosk library
      C#
      Apache License 2.0
      198450 Updated May 23, 2025
    • Russian speech technology links
      Apache License 2.0
      2131501 Updated May 17, 2025
    • vosk-tts

      Public
      Text To Speech Synthesis with Vosk
      Python
      Apache License 2.0
      27195290 Updated May 16, 2025
    • vosk-api

      Public
      Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
      Jupyter Notebook
      Apache License 2.0
      1.5k12k50539 Updated May 1, 2025
    • clapack

      Public
      CLAPACK clone for our builds
      C
      Other
      12310 Updated May 1, 2025
    • Real-time speech recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go
      C++
      Apache License 2.0
      742900 Updated Apr 15, 2025
    • icefall

      Public
      Python
      Apache License 2.0
      348200 Updated Apr 11, 2025
    • Website and documentation
      HTML
      222011 Updated Dec 23, 2024
    • [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
      Jupyter Notebook
      MIT License
      138300 Updated Dec 12, 2024
    • Resources that make every language unique
      Apache License 2.0
      21300 Updated Nov 24, 2024
    • Dart
      Apache License 2.0
      6167160 Updated Oct 26, 2024
    • SDDPM

      Public
      [WACV 2024] Spiking Denoising Diffusion Probabilistic Models
      Python
      12100 Updated Oct 9, 2024
    • kaldi

      Public
      An official git mirror of Kaldi project SVN repo
      Shell
      Other
      5.4k5402 Updated Aug 23, 2024
    • openfst

      Public
      Openfst mirror with some fixes
      C++
      Other
      141020 Updated Aug 23, 2024
    • Faster Whisper ASR transcription with CTranslate2
      Python
      MIT License
      1.4k100 Updated Aug 19, 2024
    • A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
      Apache License 2.0
      231400 Updated Aug 11, 2024
    • Speech Recognition in Asterisk with Vosk Server
      C
      GNU General Public License v2.0
      41114173 Updated Jun 21, 2024
    • RHVoice

      Public
      a free and open source speech synthesizer for Russian and other languages
      C++
      GNU General Public License v2.0
      245300 Updated May 28, 2024
    • Python
      Apache License 2.0
      0000 Updated Apr 24, 2024
    • TTS

      Public
      🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
      Python
      Mozilla Public License 2.0
      5.3k300 Updated Apr 8, 2024
    • ffmpeg

      Public
      C
      Other
      13k000 Updated Apr 1, 2024
    • 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
      Python
      MIT License
      5.3k100 Updated Mar 20, 2024
    • aiortc

      Public
      WebRTC and ORTC implementation for Python using asyncio
      Python
      BSD 3-Clause "New" or "Revised" License
      816000 Updated Dec 13, 2023
    • Irina - a Russian voice assistant for offline use. Supports skills through plugins.
      Python
      Other
      131200 Updated Dec 5, 2023
    • aioice

      Public
      asyncio-based Interactive Connectivity Establishment (RFC 5245)
      Python
      BSD 3-Clause "New" or "Revised" License
      60000 Updated Nov 27, 2023
    • Offline speech recognition for Android with Vosk library.
      Java
      Apache License 2.0
      240889704 Updated Nov 24, 2023
    • Application of MB-iSTFT-VITS components to vits2_pytorch
      Python
      MIT License
      28400 Updated Oct 29, 2023
    • Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
      Python
      15000 Updated Oct 20, 2023
    • OpenAI Whisper Prompt Examples
      Apache License 2.0
      35200 Updated Jul 17, 2023