Hey there! đź‘‹ I'm an AI researcher with a background spanning software engineering, information security, and
machine learning. While AI started as a hobby, it's grown into my passion project now that I've found time
to
return to the machine learning industry.
I'm currently working on Uzbek language speech technologies, specifically STT/TTS models. For now, I've
decided
to take the open source route, publishing my work right here and on my Hugging Face account.
My goal? To contribute to Uzbekistan's emerging AI landscape. Because sometimes the most meaningful
innovations
start with a passion project!
I'm developing a suite of speech AI models tailored specifically for Uzbek language. Here are the current and upcoming models:
A classic Whisper medium model fine-tuned specifically for the Uzbek language. The training dataset
included diverse audio sources: publicly available podcasts, Tashkent dialect podcasts, news
content, Google FLEURS, USC, and Common Voice 17. Data quality was mixed with 50% human
transcribed and 50% pseudo-transcribed using Gemini 2.5 Pro.
Key Improvements from v1: This version fixes problematic moments from v1 and
offers better generalization.
Fully Repeatable Open Source: Due to conflicts with data partners, v1 was
removed and the 500-hour dataset was excluded. Instead, new and different datasets were
included—all of which will be open-sourced. Training scripts will also be open-sourced,
making the entire process fully repeatable.
Dialect Coverage: This model includes some popular Uzbek dialects, providing
broader language coverage and improved performance across different regional variations.
GapTTS-1v is my upcoming Text-to-Speech project for the Uzbek language. While I have a clear vision and have gathered the necessary data for training, development will begin after I complete the current STT work. I'm planning to make GapTTS-1v open source upon completion, bringing natural Uzbek speech synthesis to the AI community.
The AI industry operates on two fronts: open source and closed models. Their competition drives technological growth. Here in Uzbekistan, we have several AI leaders emerging. To create balance in our ecosystem, I've chosen to position myself in the open source space.
This isn't about opposing closed models—in fact, my work in that sector helps fund my open source initiatives. It's about ensuring both approaches thrive for the benefit of our tech landscape.
If you'd like to support the open source movement with resources, attention, time, or work, please reach out. Every contribution helps strengthen our collective knowledge and capabilities.
Connect & Collaborate Support with Donations