Skip to content

A large-scale Automatic Speech Recognition (ASR) platform tailored for the Kinyarwanda language. Built using cutting-edge ML models like Wav2Vec2 and Whisper, this system supports multiple domains including Health, Government, Education, Finance, and Agriculture

Notifications You must be signed in to change notification settings

VugaTech/KinyaVoiceAI

Repository files navigation

🗣️ KinyVoiceAI

KinyVoiceAI is a large-scale, domain-specific Automatic Speech Recognition (ASR) system designed for the Kinyarwanda language. This initiative leverages open speech data and state-of-the-art machine learning models to build reliable and accessible ASR tools for real-world applications.


🌍 Project Goal

To develop an end-to-end ASR pipeline that supports accurate transcription of Kinyarwanda audio across five high-impact domains:

  • 🏥 Health
  • 🏛️ Government
  • 📚 Education
  • 💰 Financial Services
  • 🌾 Agriculture

🧠 Core Components

This project consists of the following main components:

Module Description Repository / Folder
Backend (ASR) Data preprocessing, model training, evaluation, and API serving kinyvoice-be
Frontend (Demo UI) Web-based interface for testing and demonstrating ASR performance kinyvoice-fe (optional)

⚙️ Technologies Used

  • Python, PyTorch, Hugging Face Transformers
  • Wav2Vec2, Whisper, ESPnet
  • FastAPI, Docker, DVC
  • Next.js, TailwindCSS

🚀 Getting Started

Each module contains its own setup and usage instructions.
Start by exploring the backend repository:

git clone https://github.com/KinyaVoiceAI/KinyVoiceAI.git

For a live demo or UI interface, check the kinyvoice-fe repo.


📄 Licensing & Credits

  • Dataset: Digital Umuganda (funded by the Gates Foundation)
  • Models: Built using open-source toolkits and research frameworks
  • License: CC BY 4.0

🤝 Contribution

We welcome contributions from the community, especially from native speakers, ML researchers, and developers interested in African language technologies. See contribution guidelines in the respective repos.


📫 Contact

For questions, collaborations, or feedback: 📧 [email protected] 🌐 GitHub Organization: KinyaVoiceAI


Bringing Kinyarwanda to the forefront of voice AI.

About

A large-scale Automatic Speech Recognition (ASR) platform tailored for the Kinyarwanda language. Built using cutting-edge ML models like Wav2Vec2 and Whisper, this system supports multiple domains including Health, Government, Education, Finance, and Agriculture

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •