Turkic Transliteration Suite

Web Interface for exploring Turkic language transliteration tools

Explore IPA transliteration for Turkic languages. Use the tabs below to switch between features.

Corpus Downloader: Stream sentences from public corpora (OSCAR or Wikipedia) directly in the browser. Select a source and language, optionally cap the number of sentences, and choose whether to keep only sentences that FastText language identification assigns to the selected language.
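The optional sentence cap can be sketched as a lazy slice over a sentence stream. This is a minimal illustration, not the suite's own code: `take_sentences` is a hypothetical helper, and the input is assumed to be any iterable of strings (for example, lines read from an OSCAR or Wikipedia dump).

```python
from itertools import islice
from typing import Iterable, Iterator, Optional


def take_sentences(
    stream: Iterable[str], max_sentences: Optional[int] = None
) -> Iterator[str]:
    """Lazily yield stripped, non-empty sentences from `stream`.

    Hypothetical helper: `max_sentences=None` mirrors the
    "empty = all" behaviour of the Max Sentences field.
    """
    # Strip whitespace and drop blank lines without loading the corpus.
    cleaned = (s.strip() for s in stream if s.strip())
    # islice stops pulling from the stream once the cap is reached;
    # a stop value of None means "consume everything".
    return islice(cleaned, max_sentences)


if __name__ == "__main__":
    sample = ["Merhaba dünya.", "", "  Salam!  ", "Сәлем."]
    print(list(take_sentences(sample, 2)))
```

Because the slice is lazy, a capped run never reads more of the source than it needs, which matters when the source is a large remote corpus.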

Corpus Source

Language

Max Sentences (empty = all)

Keep only sentences whose FastText language-ID matches the code above (uses lid.176 model).

Filter by FastText LangID

Min Lang-ID Confidence Threshold (0 to 1)
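The language-ID filter and confidence threshold could work roughly as follows. This is a sketch under assumptions: `filter_by_langid` is a hypothetical name, and `predict` is any callable that returns `(labels, probabilities)` in the shape fastText's `model.predict(text)` uses, with labels such as `__label__tr`. How the suite actually wires in the lid.176 model is not shown on this page.

```python
from typing import Callable, Iterable, Iterator, Sequence, Tuple

# (labels, probabilities), best guess first, as fastText's predict returns.
Prediction = Tuple[Sequence[str], Sequence[float]]


def filter_by_langid(
    sentences: Iterable[str],
    predict: Callable[[str], Prediction],
    lang: str,
    min_confidence: float = 0.5,
) -> Iterator[str]:
    """Yield sentences whose top predicted language is `lang`.

    Hypothetical helper; the 0.5 default threshold is an assumption.
    """
    wanted = "__label__" + lang
    for sentence in sentences:
        # fastText's predict rejects input containing newlines.
        labels, probs = predict(sentence.replace("\n", " "))
        if len(labels) > 0 and labels[0] == wanted and probs[0] >= min_confidence:
            yield sentence


if __name__ == "__main__":
    # Stub predictor standing in for a real model, e.g.
    # fasttext.load_model("lid.176.bin").predict (assumed setup).
    def stub_predict(text: str) -> Prediction:
        if "ş" in text:
            return (("__label__tr",), [0.93])
        return (("__label__en",), [0.80])

    corpus = ["Bugün hava güneşli.", "The weather is sunny."]
    print(list(filter_by_langid(corpus, stub_predict, "tr", 0.9)))
```

Passing the predictor in as a callable keeps the filter testable without downloading the model.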

IPA Transliteration

No IPA rules for 'af' — transliteration unavailable

Also create IPA-transliterated version
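A rule-based transliterator that reports when a language has no rules, as in the 'af' message above, might look like the sketch below. The rule table is a tiny illustrative subset for Turkish and is not the suite's actual rule set; `IPA_RULES` and `to_ipa` are hypothetical names.

```python
import unicodedata
from typing import Dict, Optional

# Illustrative, incomplete grapheme-to-IPA rules (assumption, not the
# suite's real table). Only lowercase Turkish letters are covered.
IPA_RULES: Dict[str, Dict[str, str]] = {
    "tr": {
        "ç": "tʃ",
        "ş": "ʃ",
        "c": "dʒ",
        "ı": "ɯ",
        "ö": "ø",
        "ü": "y",
        "y": "j",
    },
}


def to_ipa(text: str, lang: str) -> Optional[str]:
    """Return an IPA rendering of `text`, or None if `lang` has no rules."""
    rules = IPA_RULES.get(lang)
    if rules is None:
        return None  # caller can show "transliteration unavailable"

    # Normalise so that composed letters like "ş" match the rule keys.
    text = unicodedata.normalize("NFC", text)
    # Try longer graphemes first so multi-character rules win.
    keys = sorted(rules, key=len, reverse=True)

    out = []
    i = 0
    while i < len(text):
        for key in keys:
            if text.startswith(key, i):
                out.append(rules[key])
                i += len(key)
                break
        else:
            # No rule applies: pass the character through unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)


if __name__ == "__main__":
    print(to_ipa("çay", "tr"))
    print(to_ipa("hallo", "af"))
```

Each input position is matched once against the original text, so the output of one rule (for example `ü` to `y`) is never fed back into another rule (`y` to `j`).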

Original Corpus

Preview
