๐ŸŒ Turkic Transliteration Suite

Web Interface for exploring Turkic language transliteration tools

โš ๏ธ Model file(s) missing:

  • turkic_model.model not found in /usr/local/lib/python3.10/site-packages/turkic_translit/turkic_model.model Please ensure all required models are present for full functionality.

Explore transliteration capabilities for Turkic languages between Cyrillic, Latin, and IPA representations. Navigate through the tabs below to access different features.

Corpus Downloader: Stream sentences from public corpora (OSCAR or Wikipedia) directly in the browser. Select a source and language, optionally cap the number of sentences, and decide whether to filter by FastText language ID.
Corpus Source
Language

Keep only sentences whose FastText language-ID matches the code above (uses lid.176 model).

0 1

๐Ÿ“ Transliteration Options

Get both original + transliterated corpus files

Preview

Try this example
Corpus Source Language Max Sentences (empty = all) Filter by FastText LangID Min Lang-ID Confidence Threshold โœจ Also create transliterated version Transliteration Format

Turkic Transliteration Suite - A tool for transliterating Turkic languages between different writing systems

Use the tabs above to explore different features