Turkic Transliteration Suite

Web Interface for exploring Turkic language transliteration tools

Explore IPA transliteration for Turkic languages. Navigate through the tabs below to access different features.

Corpus Downloader: Stream sentences from public corpora (OSCAR or Wikipedia) directly in the browser. Select a source and language, optionally cap the number of sentences, and decide whether to filter by FastText language ID.

Corpus Source
Language

Keep only sentences whose FastText language-ID matches the code above (uses lid.176 model).

0 1

IPA Transliteration

Get both original + IPA-transliterated corpus files

Preview

Try this example
Corpus Source Language Max Sentences (empty = all) Filter by FastText LangID Min Lang-ID Confidence Threshold Also create IPA-transliterated version

Turkic Transliteration Suite - A tool for transliterating Turkic languages between different writing systems

Use the tabs above to explore different features