Advancing the state of Uzbek language AI through open-source research, specialized datasets, and state-of-the-art model architectures.
I'm an AI researcher bridging the gap between software engineering and machine learning. My focus is on democratizing speech technologies for the Uzbek language ecosystem.
Currently, I am developing open-source STT (Speech-to-Text), TTS (Text-to-Speech), and NLP models including PII detection for privacy compliance. By publishing models, datasets, and training methodologies, I aim to make high-quality AI accessible to everyone.
A fine-tuned Whisper Medium model optimized for Uzbek language nuances. Trained on a diverse 500+ hour dataset including podcasts, news, and dialect-rich audio.
BERT-based NER model for detecting Personal Identifiable Information (PII) in Uzbek text. Now with bank card detection! 96.1% F1 score on 475K+ samples.
Upcoming high-fidelity Text-to-Speech synthesis engine. Designed to generate natural-sounding Uzbek speech with proper intonation and prosody.
Open source development requires significant computational resources. Your support helps maintain independent research and keeps these tools free for the community.