badge icon

This article was automatically translated from the original Turkish version.

Article

Turkish Language Family

Quote
Family and Branch to Which It Belongs
Ural-Altaic Language FamilyAltaic Branch
Language Type According to Structure
Agglutinative
Oldest Written Work
7th CenturyIlterish Kutluk Khagan Inscription
Historical Periods
Old TurkishMiddle TurkishNew Turkish
Alphabets Used (Throughout History)
Göktürk (Runic)UyghurManichaeanBrahmiArabicCyrillicLatin
Main Language Groups
OghuzKipchakKarlukSiberianOgur
Major Languages
Turkish of TürkiyeAzerbaijani TurkishTurkmen TurkishKazakh TurkishKyrgyz TurkishUzbek TurkishNew Uyghur TurkishTatar Turkish
Geographical Distribution
AnatoliaBalkansCaucasusCentral AsiaSiberiaChina (Xinjiang)
Key Features
Vowel HarmonyRegular Affix OrderPreservation of Roots

The Turkic language family (Turkic languages) is a language family consisting of languages with a common origin that have been spoken across the vast geography of Eurasia throughout history and continue to be used by numerous communities today. This family is defined as a cohesive whole characterized by structural similarities evident in both historical documents and modern linguistic data, accepted as sharing a common origin and classified through varying approaches in different periods. The Turkic languages spread over a wide area from the Central Asian steppes to northern Siberia the Middle East Anatolia the Balkan Peninsula and Eastern Europe. This expansion was shaped by both migration movements and political and cultural interactions, giving Turkic languages a remarkable geographic diversity.


The language family holds an important position not only in terms of the number of speakers but also in its historical and cultural influence. The political organizations established by Turkic communities in different periods their cultural production literary traditions and written texts are the main factors determining the historical value of Turkic languages. These languages have formed an extensive body of texts written in various scripts across different belief systems since before Islam.


The Turkic language family refers to a group of languages spoken across a large part of Eurasia that share a common historical and structural root. Languages belonging to this family are defined by criteria such as agglutinative structure vowel harmony similar phonological and morphological features and a shared basic vocabulary. The Turkic language family is regarded as a coherent linguistic unit based on comparisons between historical texts Orhun Uyghur Karakhanid Chagatai etc. and modern languages. This language family belongs to the Altai branch of the Ural-Altaic language family.


The main languages belonging to the Turkic language family grouped by major branches are as follows:

Oghuz Group

  • Turkish of Türkiye
  • Azerbaijani Turkish
  • Turkmen Turkish
  • Gagauz Turkish
  • Balkan Oghuz dialects (Rumelia region)

Kipchak Group

  • Kazakh Turkish
  • Kyrgyz Turkish
  • Nogai Turkish
  • Tatar Turkish
  • Bashkir Turkish
  • Karachay-Balkar Turkish

Karluk Group

  • Uzbek Turkish
  • New Uyghur Turkish

Siberian Group

  • Yakut (Sakha) Turkish
  • Dolgan Turkish
  • Khakas Turkish
  • Tuvan Turkish

Oghur (Bulgar) Group

  • Chuvash


Turkic languages can also be classified as follows:

  • Southwestern (Oghuz) group: Historically Old Anatolian Turkish and Ottoman Turkish; today Turkish of Türkiye Azerbaijani Turkish Turkmen Gagauz; the dialect of Iraqi Turkmens; Horasan Turkish in Iran Kashkay Turkish in Iran and Turkish dialects in Northern Cyprus and Balkan countries.
  • Northwestern (Kipchak) group: Historically Koman Kipchak of the Mamluks Kipchak of the Golden Horde and today represented by Tatar Crimean Tatar Bashkir Karay Karaçay Balkar Kumuk Nogai Kazakh Karakalpak and Kyrgyz.
  • Southeastern (Uyghur) group: Historically Old Uyghur Karakhanid and Chagatai Turkish; today represented by Uzbek New Uyghur Sary Uyghur and Salar.
  • Northeastern (Siberian) group: Altai Khakas Tuvan and Shor.
  • Yakut.
  • Bulgar group (Chuvash).
  • Halach.

Historical Development of Turkic Languages

Early Period (Orhun and Old Turkish Period)

The oldest known written products of the Turkic language family were long dated to the 8th century Orhun Inscriptions. In 2022 the discovery of the Ilterish Kutluk Khagan Inscription in Mongolia pushed this date back to the 7th century. Until this discovery the oldest inscription bearing the name “Turk” was considered to be among the Orhun Inscriptions; it is now the Ilterish Kutluk Khagan Inscription. Ilterish Kutluk Khagan was the father of Bilge Khagan and Kül Tigin.【1】 This inscription underscores that the Orhun Inscriptions constitute the most comprehensive and linguistically significant documents of early Old Turkic. The Orhun texts are a fundamental source because they demonstrate Turkic’s deep written tradition and clearly reflect the syntax phonology and morphological features of the language at that time.


The Orhun Inscriptions indicate a highly developed written language in terms of syntax phonology and morphology. The Uyghur period manuscripts that followed reveal the diversity of early written Turkic through Manichaean Buddhist and secular texts. These documents demonstrate that Turkic languages had a long-standing written tradition while also reflecting the historical continuity of shared features. Typological characteristics such as vowel harmony agglutinative structure and verb-final word order are common traits.


The Orhun Inscriptions exhibit a clear agglutinative structure. The dominant word order places the verb at the end of the sentence and this structure is recognized as one of the fundamental enduring features of Turkic. The arrangement of sentence elements verb inflections and noun phrases demonstrate that Old Turkic possessed a sophisticated syntactic system. Phonologically vowel harmony is clearly evident in the inscriptions; this system reflecting both thick-thin and round-flat harmony reveals the phonological coherence of the period. Additionally the regular distribution of vowels and consonants and the limited occurrence of initial and final sound changes indicate the phonological stability of Old Turkic.


Old Turkish text types vary from runic inscriptions to Uyghur-era manuscripts. Runic inscriptions include not only monumental texts but also tombstones seals and short inscriptions. These writings are in the Göktürk alphabet and provide detailed information about early Turkic language and culture through their political military and social content. Uyghur-era texts encompass a broader literature including Manichaean Buddhist and secular works. These texts show that the early written tradition of Turkic was further diversified through religious texts and translated literature.


Multiple alphabets were used during the Old Turkish period. The Göktürk alphabet is the earliest writing system used in the Orhun and Yenisei inscriptions. The Uyghur alphabet derived from Sogdian and was widely used especially in Manichaean and Buddhist texts. The Manichaean alphabet is another writing system adopted in Manichaean circles. It is noted that a Brahmi-derived alphabet was used in some Buddhist Uyghur texts. This multiplicity of writing systems demonstrates that Turkic developed in early periods through contact with diverse religious and cultural environments.

Medieval Turkish Period

The Medieval Turkish period constitutes the stage following Old Turkish during which Turkic languages diversified through regional written languages. This period marks a process in which Turkic developed new forms in phonology morphology and writing tradition through the emergence of the Karakhanid Harezm-Kipchak Old Oghuz and Chagatai written languages. This period enabled Turkic to adapt and diversify in different geographical political religious and cultural contexts.

Karakhanid Turkish

Karakhanid Turkish is recognized as the first written language of Medieval Turkish and is the product of the literary culture developed during the Karakhanid state. This language preserved the core morphological and phonological features inherited from Old Turkic while reflecting the influence of Islamic culture. Karakhanid texts consist of didactic and religious works written in the Arabic alphabet. In the texts of this period vowel harmony was largely preserved the agglutinative structure remained stable and the verb-final word order of Old Turkic continued.

Harezm-Kipchak Turkish

The Harezm and Kipchak region formed the second major written language environment of Medieval Turkish following the Karakhanid tradition. This written language carries the common features of the Turkish text tradition developed in Harezm and the Turkish spoken in the Kipchak steppes. Texts show transitional features between Karakhanid Turkish and Old Oghuz Turkish. Harezm-Kipchak texts are generally religious ethical and instructional and were written using the Arabic alphabet. During this period the diversity of phonological and morphological features increased particularly with consonant changes at the beginning and end of words and variations in certain affix forms in the Kipchak region. This linguistic environment later formed the historical basis of Kipchak Turkish.

Old Oghuz Turkish

Old Oghuz Turkish is the written language of the Oghuz region encompassing Anatolia Azerbaijan and surrounding areas and represents an important branch of the Medieval Turkish period. This written language retained features inherited from Old Turkish and Karakhanid periods but developed regional phonological and morphological variations. Old Oghuz Turkish found wide usage especially in Islamic and literary texts. Pre-Ottoman and early Ottoman religious mystical ethical instructional and translated works are among the examples of this written language. This linguistic period formed the foundation of later Ottoman Turkish.

Chagatai Written Language

Chagatai Turkish is a dialect that developed in Central Asia during the Timurid period and served for a long time as the literary and cultural written language of the region. Although connected to the Karakhanid and Harezm traditions Chagatai Turkish acquired the status of a standardized written language in phonological and morphological terms. Chagatai Turkish possesses a rich literary corpus producing numerous texts in poetry and prose. The Chagatai written language was entirely written in the Arabic alphabet and represented the eastern branch of classical Turkic literature.


Medieval Turkish was not only a literary language but also a widely used written language in administration law religion and culture. The Medieval Turkish period established the historical and geographical foundations that determined the later divergence of Turkic languages into modern dialects.

New Turkish Period

The New Turkish period represents the transition from Medieval Turkish to modern Turkic languages during which written languages acquired new forms based on geographical and political developments. This period is defined by the divergent evolution of the Oghuz Kipchak and Eastern Turkish traditions with linguistic diversity standardization processes and regional fragmentation as key dynamics.

Ottoman Turkish and the Oghuz Written Tradition

The most influential written language of the New Turkish period in cultural and political contexts is Ottoman Turkish which developed on the basis of Old Oghuz Turkish and continues the Oghuz written tradition that emerged in Anatolia and the Balkans. Ottoman Turkish evolved through sustained contact with Arabic and Persian and formed an extensive written corpus through administrative legal scientific and literary texts. During this period Turkish was used as a language of education bureaucracy and literature across a vast geography stretching from the Balkans to North Africa. Literature in the Rumelia and Anatolia regions flourished through court literature and folk literature producing numerous poets and writers. During this process the Ottoman written tradition acquired the status of a superordinate language incorporating both Oghuz dialectal features and the multilayered vocabulary of classical literature. This written tradition formed the foundation of modern Turkish of Türkiye.

Kipchak Written Tradition

During the New Turkish period the Kipchak written tradition continued the earlier Harezm-Kipchak phase but developed differently in various regions due to geographical fragmentation and political structures. The Kipchak region was connected to written traditions in Eastern Europe the northern Black Sea coast and Egypt and its written language was shaped through dictionaries grammars and translated texts produced in these areas. The Kipchak written language continued to use the Arabic alphabet during this period but phonological and morphological variations emerged due to settlement changes and regional contacts following Mongol rule. By the New Turkish period the Kipchak written tradition largely lost its historical continuity; however new standardized written forms emerged in Crimea Kazan and the Volga-Ural regions carrying Kipchak elements. This transformation constituted the key stage in the transition of the Kipchak branch to the modern era.

Disintegration of the Eastern Turkish Tradition and Transition to Modern Dialects

The Eastern Turkish tradition was represented for a long time in Central Asia by Chagatai Turkish which served as the region’s literary and administrative language. As a classical written language Chagatai maintained its influence from the 15th century until the early 20th century and produced a vast literary corpus in poetry and prose. However at the end of the 19th century and the beginning of the 20th century political and social transformations in the region weakened the unifying character of the Chagatai written language and new standardized written languages based on local dialects emerged.


As a result of language planning during the Soviet period Uzbek Turkish and New Uyghur Turkish developed as independent modern written languages separate from the Chagatai tradition. During this process alphabet reforms were frequently implemented: first from the Arabic to the Latin alphabet then to the Cyrillic alphabet; after the Soviet era some regions adopted the Latin alphabet again. The disintegration of the Eastern Turkish tradition contributed to increased regional diversity in the modern Turkic language family; the emergence of contemporary Central Asian Turkic languages largely resulted from this process.


In this context the New Turkish period is regarded as a stage in which the historical written tradition of Turkic languages was reshaped modern national languages emerged and linguistic diversity became more pronounced.

Formation of Modern Languages

The emergence of modern Turkic languages is the result of regional fragmentation that began during the New Turkish period and the social political and cultural transformations experienced throughout the 19th and 20th centuries. This process occurred especially in the geographical areas of Türkiye Central Asia The Balkans Cyprus Siberia (Yakut/Sakha) and Turkmen regions. During this process modern Turkic languages became standardized reflecting both local dialectal features and factors such as state formation education policies and writing system reforms.

Formation of Turkish of Türkiye

Turkish of Türkiye is the continuation of Old Oghuz Turkish and Ottoman Turkish and constitutes the modern form of the Oghuz written tradition developed in Anatolia. The broad administrative and cultural usage during the Ottoman period prepared the foundation of Turkish of Türkiye; the language reforms standardization efforts and the transition to the Latin alphabet in the 20th century gave it its definitive modern written form. Turkish of Türkiye has acquired prestige status across a wide geography from the Balkans to Cyprus and from the Middle East to various migrant communities and is one of the Turkic languages with the largest written and spoken output in the contemporary era.

Language Standardization in Central Asia after the Soviet Era

The languages of Central Asia were standardized during the Soviet period and after the Soviet era these standards were reshaped in connection with national identities. Uzbek Turkish and New Uyghur Turkish are modern written languages developed after the dissolution of the Chagatai written language based on local dialects. Alphabet changes between Arabic Latin and Cyrillic during the Soviet period directly affected how the phonological structures of these languages were transcribed into writing. After the Soviet era transition processes to the Latin alphabet were revived in countries such as Kazakhstan and Turkmenistan and in some countries (e.g. Uzbekistan) Latin letters were officially adopted as the national writing system. This standardization process accelerated the modern differentiation of Turkic languages in the region.

Modernization of Special Dialect Areas: The Balkans Cyprus and Yakut/Sakha

Balkan Turkish dialects preserve traces of historical Rumelian settlements and have shown a trend toward standardization due to renewed ties with Turkish of Türkiye in the modern era. In the Ottoman period Turkish served as an administrative and literary written language in the Balkans but in the modern era it has persisted mainly at the dialect level; however written production (newspapers journals literary texts) has continued in some areas.


Cypriot Turkish developed on the island after Turkish settlement following 1571 and has developed distinctive phonological morphological and syntactic differences due to intensive contact with Greek and English. After 1974 increased communication and educational ties with Türkiye led to a stronger orientation toward Turkish of Türkiye yet Cypriot Turkish has retained its vitality in daily usage.


Yakut/Sakha Turkish developed in the northeast of Siberia and is one of the most divergent branches of the Turkic language family in phonological and morphological terms. Yakut has a rich literary output and was standardized as a written language during the Soviet period. This language is one of the main examples demonstrating the geographic and typological diversity of the Turkic language family.

Formation of the Turkmen Written Language

The Turkmen written language reflects the most distinctive trajectory of alphabet change among Turkic languages. The historical progression is as follows:

  • Arabic Alphabet Period: Turkmen Turkish was long written using the Arabic alphabet and this script was used in religious and cultural texts.
  • Latin Alphabet Period (early 20th century): Under Soviet language policies Turkmen Turkish was transitioned to the Latin alphabet; this facilitated the transcription of its phonological structure.
  • Cyrillic Alphabet Period: In the subsequent phase of Soviet standardization the Cyrillic alphabet was adopted as the official writing system and widely used in education and publishing.
  • Modern Latin Alphabet Period: In 1993 Turkmenistan decided to transition to the Latin alphabet; in 2000 the alphabet was reorganized and officially adopted. During this process the sound values and spelling principles were adjusted to reflect the phonological features of modern Turkmen Turkish.

These four phases form the essential outline of the modernization and standardization process of Turkmen Turkish. Changes in the writing system have led to the reformation of the language in both educational and cultural production domains.

Classification of Turkic Languages

The classification of Turkic languages is a fundamental area of study based on both historical development and modern linguistic data. Researchers have developed various classification models due to the wide geographic spread of the Turkic language family historical contacts with different communities and phonological morphological and syntactic differences among dialects. Some researchers divide the language family into historical and contemporary layers based on genetic unity and determine relationships among languages through sound changes affix systems lexical layers and geographic distributions. Classification is generally structured around the Oghuz Kipchak Karluk Siberian and Oghur/Chuvash branches each of which is defined by shared features and historical divergences.

Main Groups Subgroups and Languages

The Turkic language family is divided into various main groups based on historical development geographic spread and structural features. These groups reflect both shared linguistic characteristics rooted in a common origin and divergences that emerged in different geographic regions over time.

Oghuz Group

The Oghuz Group is one of the most extensive and culturally influential branches of the Turkic languages. Its historical origin lies in Old Oghuz Turkish which later formed the basis of written and spoken languages developed in Anatolia Azerbaijan Iran and the Balkans. This group includes Turkish of Türkiye Azerbaijani Turkish Turkmen Turkish Gagauz Turkish and Balkan Oghuz dialects. Oghuz languages share common phonological and morphological features; vowel harmony is largely preserved and verb conjugation and word structure show similarities. The Oghuz group encompasses languages spoken across a wide geographic area and is distinguished by its historical spread and dialectal diversity in the Balkans Anatolia Azerbaijan and the Middle East.


  • Turkish of Türkiye: The most populous language of the Oghuz branch developed from Ottoman Turkish and became the standard written language in modern Türkiye. It also exerts prestige influence in regions such as the Balkans and Cyprus.
  • Azerbaijani Turkish: Represents the eastern segment of the Oghuz branch; historically rooted in Old Oghuz written language and shares numerous phonological and morphological features with Anatolian Turkish.
  • Turkmen Turkish: Shaped by the influence of the Teke and Yomut dialects as a written language it has passed through Arabic Latin Cyrillic and Latin alphabet phases.
  • Gagauz Turkish: Belongs to the Oghuz group and is spoken in the Balkan region.
  • Balkan Oghuz Dialects: These dialects developed as a result of Ottoman settlements and migration movements in Rumelia.

Kipchak Group

The Kipchak Group is a historical branch associated with the western parts of Central Asia the northern Black Sea region and Eastern Europe. The structure of Kipchak languages extends from the Harezm-Kipchak tradition to the modern era. Languages such as Kazakh Kyrgyz Nogai Karachay-Balkar Turkish Tatar and Bashkir are included in this group. Kipchak languages exhibit unique phonological changes (e.g. certain consonant shifts) and morphological structures.


  • Kazakh Turkish: Represents the eastern subgroup of the Kipchak branch; has become more prominent through historical Kipchak features and modern standardization processes.
  • Kyrgyz Turkish: Related to the Kipchak group in terms of phonology and vocabulary and constitutes an important sub-branch of the Turkic language family.
  • Nogai Turkish: One of the languages preserving the Kipchak steppe tradition.
  • Tatar Turkish: Exhibits Kipchak features in the context of the historical languages of the Kazan and Volga-Ural regions.
  • Bashkir Turkish: Closely related to Tatar Turkish and is a language based on Kipchak roots.
  • Karachay-Balkar Turkish: The language of Kipchak-origin communities in the North Caucasus; its phonological and morphological features are characteristic of the Kipchak group.

Karluk Group

The Karluk Group is a branch historically rooted in the Karakhanid and Harezm regions and later evolved into modern standard languages through Uzbek and New Uyghur Turkish in Central Asia. Languages in this group are connected to the Chagatai tradition and were restructured as separate written languages during the 20th century under Soviet language policies. Karluk languages retain traces from both Old Turkic and other branches of Medieval Turkish in phonological and morphological terms.


  • Uzbek Turkish: Standardized based on local dialects after the dissolution of the Chagatai written language.
  • New Uyghur Turkish: The other main representative of the Karluk branch spoken in Eastern Turkestan and also developed as a modern written language separate from the Chagatai tradition.

Siberian–Yakut Branch

The languages developed in Siberia form a distinct branch. The most prominent representative of this branch is Yakut/Sakha Turkish; Dolgan Turkish is also included in this group. Siberian Turkic languages exhibit some of the most distinctive features within the Turkic language family in phonological and morphological terms. These languages have diverged significantly from other Turkic languages due to geographic isolation and contact with local communities. This group encompasses various Siberian Turkic languages with Yakut/Sakha as the primary example.


  • Yakut (Sakha) Turkish: One of the most divergent members of the Turkic languages with its unique sound system (e.g. initial b-p-) extensive vowel inventory and distinct morphological features. It was standardized as a modern written language during the Soviet period.
  • Dolgan Turkish: A Siberian Turkic language closely related to Yakut.
  • Khakas Turkish: Another Turkic language spoken in Siberia.
  • Tuvan Turkish: One of the languages spoken in the Siberian region.

Chuvash (Oghur Branch)

The Oghur branch is a historically distinct group that has diverged significantly from other branches of the Turkic language family. Chuvash is defined as the only living representative of the historical Oghur branch. This branch is associated with historical Bulgar Turkish and exhibits structures that differ from general Turkic linguistic features in phonological and morphological terms. When compared with other Turkic languages Chuvash is noted for systematic differences in initial consonants (e.g. r/l shifts) and certain phonological and morphological elements. In this regard the Oghur branch constitutes one of the most unique structures within the language family.

Criteria Used in Classification

The wide geographic distribution of Turkic languages and their development through contact with diverse cultural environments have required a multidimensional approach in classification studies. In this regard researchers use various criteria ranging from structural measures based on phonological and morphological changes to historical comparative methods and the effects of language contact and writing system phases.

Phonological/Morphological Criteria

Phonological and morphological features are among the most fundamental criteria in the classification of Turkic languages. Particularly the following structural divergences are decisive:

  • Initial and internal consonant changes (e.g. in Yakut b-p-; in Balkan dialects t > d k > g p > b changes)
  • Preservation or loss of original long vowels (preserved in Turkmen Turkish; lost in many other languages)
  • Differences in the affix system (e.g. in Balkan Oghuz dialects the present tense suffix -yor is replaced by -y)
  • Divergences in morphological inflection forms (e.g. in Cypriot Turkish the function of -mış is restricted and indirectness is expressed with -dı)

These differences serve as fundamental criteria for determining both historical stages and the classification of modern Turkic languages into their respective branches.

Types of Vowel Harmony

Vowel harmony is one of the common typological features of Turkic languages but this rule does not operate uniformly in all languages. The following harmony types are considered in classification:

  • Thick-thin vowel harmony: Preserved in most Turkic languages but weakened in some cases in Balkan Oghuz dialects and Cypriot Turkish due to contact influences.
  • Round-flat vowel harmony: Has undergone system changes in some Siberian Turkic languages such as Yakut/Sakha and Khakas.
  • Affix vowel harmony: Regular in Turkmen and Anatolian areas but irregularities are observed in Balkan and Cypriot dialects.

These divergences enable the grouping of Turkic languages according to their historical origins and the languages with which they have had contact.

Historical Comparative Methods

In these methods:

  • Sound correspondences of words derived from a common root are examined.
  • Formal features in historical texts are compared.
  • Developmental trends of dialects over time are identified.
  • Subgroups are defined based on shared innovations and shared preserved features.

For example the formation of the Karluk branch through the Chagatai tradition or the distinctive phonological features of the Oghur branch such as r/l shifts are determined using these comparative methods.

Geographic Spread and Language Contact

Geographic factors and language contact are also decisive criteria in the classification of the Turkic language family. Particularly the following contacts have occurred:

  • Balkans: Turkish-Slavic language contact (e.g. 4–5000 words of Turkish origin in Bulgarian)
  • Cyprus: Turkish-Greek-English contact
  • Siberia: Turkish-Evenki Tungusic-Manchu Samoyedic language contact
  • Central Asia: Turkish-Persian-Arabic contact areas
  • Eastern Europe/Kipchak region: Turkish-Slavic-Germanic influences

These contacts not only explain phonological and lexical differences among dialects but also support the distinguishing criteria used in classification.

Impact of Different Writing Systems on Classification

Different writing systems constitute an important criterion in the historical periodization of the Turkic language. Alphabet changes have highlighted both the developmental stages of languages and regional divergences.

  • Runic (Göktürk) Alphabet: Used in the earliest phase of Old Turkic identified through the Orhun and Yenisei inscriptions.
  • Uyghur Alphabet: Used in Manichaean and Buddhist Uyghur texts; continues the Old Turkic tradition.
  • Arabic Alphabet: The primary writing system of the Medieval and New Turkish periods including Karakhanid Harezm-Kipchak Ottoman and Chagatai.
  • Cyrillic Alphabet: Widely applied during the Soviet period to modern Turkic languages; established standardization in Kipchak Karluk and Siberian regions.
  • Latin Alphabet: Türkiye’s 1928 reform Turkmenistan’s 1993–2000 Latinization process and transition periods in Azerbaijan and Uzbekistan have created distinct chronological breaks in the classification of modern Turkic languages.

These writing phases are among the primary tools for periodizing the historical development of Turkic languages and shape the historical dimension of classification.

Phonology

The phonological system of Turkic languages shows broad commonality in fundamental features such as vowel harmony and syllable structure but various sound changes and unique phonetic characteristics are observed in dialects spread across different geographic regions.

Basic Features of the Sound System in Turkic Languages

The fundamental features of the phonological structure of Turkic languages are defined primarily through vowel harmony consonant harmony and syllable structure. These features have shown considerable continuity throughout the historical stages of the Turkic language family and have been preserved to varying degrees in modern dialects.

Types of Vowel Harmony

Vowel harmony is one of the most distinctive typological features of Turkic languages and is treated as a fundamental phonological principle in both historical texts and contemporary dialects. Vowel harmony is examined along three main axes.


Thick-thin Vowel Harmony: This harmony refers to the organization of vowels in a word according to their back (thick) and front (thin) characteristics.

  • The system is consistent in Old Turkish texts.
  • It is largely preserved in Oghuz and Karluk languages.
  • In Balkan Oghuz dialects and Cypriot Turkish examples of weakening due to contact languages are observed.
  • In Siberian languages such as Yakut/Sakha the system has diverged but the thick-thin contrast remains at a fundamental level.


Round-flat Vowel Harmony: This harmony concerns the reflection of the roundedness feature of the first vowel in subsequent syllables.

  • It is generally preserved in Oghuz languages.
  • In Karluk languages historical and modern variations in the use of rounded vowels are observed.
  • In Yakut/Sakha Turkish due to its extensive vowel inventory and contact influences the classical rounded harmony functions differently.


Wide-narrow Vowel Harmony: This harmony is based on the regulated arrangement of vowel width (a e ı i u ü) between syllables.

  • It can be traced back to the Old Turkish period.
  • It shows considerable stability in Oghuz and Kipchak regions.
  • In Balkan and Cypriot Turkish dialects harmony is occasionally disrupted due to contact.
  • In Siberian languages the role of wide-narrow vowels in the system differs due to unique phonological structures.

Consonant Harmony

Consonant harmony is analyzed in Turkic languages based on the relationship between the consonants of suffixes and the final consonants of roots. This harmony includes processes of voicing and devoicing (consonant assimilation).

  • The initial consonants of suffixes are shaped according to the voicing or devoicing properties of the final consonant of the word.
  • In Balkan dialects processes of voicing and devoicing have become variable due to influences from contact languages.
  • In Cypriot Turkish changes such as t > d k > g and p > b are examples of consonant harmony reshaped by regional influences.
  • In Yakut/Sakha Turkish initial consonant changes (b-p-) and different distributions of internal consonants demonstrate a unique phonological structure diverging from general Turkic patterns.

Consonant harmony serves as a fundamental phonological regulator supporting the stability of the agglutinative structure based on sound harmony.

Syllable Structure

The syllable structure of Turkic languages is defined as a simple regular and open syllable system.

  • Turkic syllables generally tend toward open syllables (ending in a vowel).
  • Although closed syllables (ending in a consonant) exist consonant clusters at the beginning of syllables are limited.
  • The syllable structure has been largely stable in historical texts; this structure is clearly evident in Orhun and Uyghur texts.
  • In Turkmen Turkish original long vowels are noted as a prominent factor affecting syllable structure.
  • In Cypriot and Balkan dialects the syllable structure sometimes changes under the influence of contact languages; particularly in Cypriot Turkish under the influence of Greek the stress and syllable length differ noticeably.
  • In Yakut/Sakha Turkish the syllable rhythm is different from other Turkic languages due to long vowels and distinct vowel structures.

Regional Diversity

Balkan Dialects

The Turkish dialects of the Balkan region constitute an important dialect area within the Oghuz group reflecting the linguistic features of historical Rumelian settlements. These dialects are generally analyzed in two main groups: Eastern Rumelia and Western Rumelia. Both groups exhibit both common features and distinctive differences resulting from regional contact influences. The fundamental phonological features of Balkan dialects can be examined through vowel changes the present tense suffix and final vowel variations.


Vowel Changes in Eastern/Western Rumelia Dialects: One of the characteristic features of Balkan Turkish dialects is the vowel changes in their vowel systems. These changes are observed in both thick-thin and wide-narrow vowel relations.

  • In the Rumelia region some words exhibit vowel narrowing or fronting.
  • Vowel changes are often influenced by the phonological structures of neighboring Slavic languages.
  • In Eastern Rumelia dialects the rounding of vowels has weakened; in Western Rumelia dialects vowels show a tendency toward narrowing.
  • Population density migration and multilingual environments in the Balkan region are among the main factors contributing to variations in the vowel system.

These changes demonstrate that the Balkan region has witnessed the most prominent phonological adaptations of Oghuz Turkish due to regional contact.


Present Tense Suffix (-y / -yor): Balkan Turkish dialects show a distinctive variation in tense suffixes particularly in the present tense form.

  • The -yor suffix used in Turkish of Türkiye is frequently reduced to the -y form in Balkan dialects.
  • This suffix can also be represented through various intermediate forms such as “geliyom > geliyem > gelyem”; however the essential characteristic is the shorter and simpler -y form.
  • This phenomenon is regarded as an example of regional morphological simplification within the Oghuz group.


Final Vowel Variations (-i -ı -u -ü): In Balkan dialects final vowels in words show variability.

  • Final -ı -u -ü vowels are often transformed into the -i form in this region.
    • Example: altı > alti oldu > oldi
  • This change is observed in both Eastern and Western Rumelia areas but is more widespread in certain regions.
  • This regular change in final vowels reveals the continuity of vowel narrowing in the phonological system of Balkan Turkish dialects.


These variations demonstrate that Balkan dialects not only maintain the general line of Oghuz Turkish but have also developed unique phonological structures due to prolonged contact with Slavic and Greek languages.

Cypriot Turkish

Cypriot Turkish is an Oghuz dialect that developed after Turkish settlement on the island following 1571 and evolved through prolonged contact with Greek and later English. Cypriot Turkish exhibits distinctive features in phonology morphology and vocabulary compared to Turkish of Türkiye; however since 1974 its orientation toward Turkish of Türkiye has increased while the island dialect has retained its vitality.


t > d k > g p > b Changes: One of the most characteristic phonological features of Cypriot Turkish is the systematic softening or voicing of certain consonants in word-initial and internal positions. The following changes occur:

  • t > d

Example: tatlı > dadlı

  • k > g

Example: kavun > gavun

  • p > b

Example: dolap > dolab


Although these changes parallel those in some Anatolian dialects the continuity and prevalence in Cyprus can be explained by the combination of the linguistic features of Turkish settlers and the multilingual structure of the island.


Shortening of Long Vowels: In Cypriot Turkish original long vowels are either completely or largely shortened. Some long vowels preserved to a limited extent in Anatolian dialects are regularly reduced or lost in the Cypriot dialect. This shortening process leads to simplification in the syllable structure of Cypriot Turkish resulting in a phonological system in which vowel length no longer serves a distinctive function.


Phonological Influence of Greek and English: Cypriot Turkish has developed through prolonged contact with Greek (Cypriot Greek) and English due to the island’s historical multilingual structure.

  • Under Greek influence sentence rhythm intonation and stress have changed in some words.
  • Greek syntactic patterns stress placement and phonetic flow are clearly evident in certain expressions.
  • When Greek-origin words are used with Turkish suffixes vowel harmony is sometimes disrupted.
  • English contact especially after 1878 has influenced the pronunciation of loanwords.
  • During the phonological adaptation of foreign words in Cypriot Turkish deviations from Turkish sound rules occur (e.g. simplification of English /ɫ/ /r/ or Greek double consonants).

Yakut / Sakha Turkish

Yakut (Sakha) Turkish is the primary representative of the Siberian branch and possesses one of the most divergent phonological systems within the Turkic language family. Its geographic location historical isolation and prolonged contact with Evenki Tungusic-Manchu and Samoyedic languages have caused Yakut to diverge significantly from other Turkic languages in its sound system.

b- > p- and Other Systematic Divergences: The most distinctive feature of Yakut Turkish is the systematic change of initial consonants compared to historical Turkish. The most prominent change is the preservation of the Old Turkic b- sound as p- in Yakut. This change is not merely an isolated phonetic shift but represents a unique evolutionary path of Yakut within the Turkic language family. Yakut also exhibits the following features:

  • Yakut does not allow consonant clusters at the beginning of words,
  • It simplifies internal consonants in some cases,
  • Historically certain consonants in Turkish have acquired entirely different phonological values in Yakut. These features clearly distinguish Yakut from the Kipchak Oghuz or Karluk branches.


Unique Vowel Inventory: Yakut/Sakha Turkish has one of the most extensive and complex vowel systems among Turkic languages.

  • Long vowels are phonemic (i.e. they distinguish meaning).
  • The harmony of vowels exhibits a more complex structure than in classical Turkic languages.
  • Although the thick-thin contrast exists the system has a broader distribution than in other Turkic languages.
  • The relationship between rounded and unrounded vowels in Yakut differs from the typical Oghuz/Kipchak systems.


This vowel system reflects both the historical connection of Yakut to Turkish and the influence of Siberian contact areas.


Tonal–Prosodic Features: The prosodic structure of Yakut/Sakha Turkish occupies a unique position among Turkic languages. These features include:

  • The ability of stress and tone to serve a distinctive function in meaning,
  • The role of long vowels in determining sentence rhythm,
  • The reorganization of syllable structure based on length,
  • Observations suggesting that the tone in Sakha Turkish may have been shaped by contact with indigenous Siberian languages.


This prosodic structure demonstrates that Yakut occupies a unique position among Turkic languages not only in terms of its sound system but also in terms of speech flow rhythm and intonation.

Turkmen Turkish

Turkmen Turkish is a Turkic language representing the eastern branch of the Oghuz group and exhibits distinctive features in its historical development phonological structure and dialectal diversity. The preservation of original long vowels and the prominence of regional dialectal differences highlight Turkmen Turkish’s distinctive position within the Turkic language family.


Original Long Vowels: One of the most distinctive phonological features of Turkmen Turkish is that it is one of the few Turkic languages that preserve original long vowels inherited from Old Turkic. Long vowels in Turkmen Turkish carry phonemic value both in word roots and in certain affixation processes.

This feature is defined by:

  • The regular preservation of long vowels in Turkmen Turkish despite their historical loss or shortening in most other Turkic languages,
  • The maintenance of length oppositions capable of distinguishing meaning between words,
  • The consideration of long vowels in standardization of the written language.


Regional Dialectal Differences (Teke Yomut etc.): Turkmen Turkish is not only a written language but also a broad dialectal area consisting of numerous regional dialects. The best-known among these are the Teke Yomut Ersari Sarik Salir and Goklen dialects. Dialectal differences are observed at both phonological and morphological levels:

  • Teke dialect forms the basis of the modern Turkmen written language and many standardization efforts have been based on this dialect.
  • Yomut dialect possesses unique features in sound changes and vocabulary; some sound correspondences diverge from the Teke dialect.
  • Ersari Sarik Salir and Goklen dialects exhibit different phonetic and morphological features depending on regional settlements.
  • Differences among dialects are related to historical migration movements tribal structures and geographic fragmentation.

Morphology

The Turkic language family is defined as a group of languages exhibiting a highly regular agglutinative structure in morphology. Throughout the broad time span from historical texts (Orhun Uyghur Karakhanid etc.) to contemporary dialects the fundamental morphological principles have been largely preserved. This continuity has made morphology a central criterion in the internal classification of the Turkic language family and in historical comparative studies. The morphology of Turkic languages is based on a “root + affix” system and word formation and inflection are clear transparent and systematic. Roots generally remain unchanged while differences in meaning and function are achieved through affixes.

General Agglutinative Structure

The core of the morphology of the Turkic language family is the agglutinative structure. This structure is based on the principle that word roots remain largely unchanged in form and grammatical functions are expressed through sequential affixes. The agglutinative structure has shown continuity from Old Turkic through the Medieval and New Turkish periods and is preserved as the fundamental morphological principle in all modern Turkic languages.

Order of Affixes

In Turkic languages affixes are added to the root in a specific and fixed order. This order has remained largely unchanged from historical texts to contemporary dialects. Generally:

  • Derivational affixes precede inflectional affixes.
  • Inflectional affixes of the same type follow a specific sequence among themselves.
  • Affixes typically carry only one grammatical function and this function is clearly distinguishable.


This regular affix order ensures high formal transparency in Turkic languages and enables easy analysis of word structure.

Time–Mood–Person Affixes

Verb inflection is one of the areas where the agglutinative structure is most clearly observed in the Turkic language family. Affixes added to the verb root follow this basic sequence:

  1. Time/Mood affixes
  2. Person affixes

Time and mood affixes express the time of occurrence and the speaker’s perspective while person affixes indicate the identity and number of the subject. Each of these affixes carries a distinct function and is clearly separable morphologically.


A significant portion of the time-mood affixes used in Old Turkic has maintained functional continuity in Medieval Turkish and modern Turkic languages. Although some time affixes have been formally simplified in regional dialects (e.g. Balkan Turkish dialects) the fundamental principle of the time-mood-person sequence has not changed.

Noun Inflection System

Noun inflection in Turkic languages also reflects the regular and systematic functioning of the agglutinative structure. Nouns are inflected through:

  • Plural affixes,
  • Possessive affixes,
  • Case affixes

These affixes are added in a specific order and generally follow this sequence:

  • noun root + plural affix + possessive affix + case affix


This order is a morphological pattern traceable from Old Turkish texts. Although minor formal differences exist among the Oghuz Kipchak Karluk and Siberian branches the fundamental structure of noun inflection is common across the entire Turkic language family. Although the frequency of case affix usage varies in contact areas such as Balkan and Cypriot Turkish the essence of the inflection system has been preserved.

Vocabulary and Language Contact

The vocabulary of the Turkic language family has been shaped by intense contact with different cultural and linguistic environments throughout history. These contacts have not altered the fundamental structure of Turkic languages but have left lasting effects especially on vocabulary and to some extent on semantic fields and usage frequency.

Chinese Mongolian Persian and Arabic Influences

The primary historical contact areas of Turkic languages are Chinese Mongolian Persian and Arabic languages. Contact with these languages occurred in different periods and with varying intensity.

  • Contact with Chinese is primarily considered in the context of early Central Asian Turkic communities. This contact occurred mainly through limited lexical exchange within the framework of administrative military and diplomatic relations.
  • Contact with Mongolian emerged from the long-term coexistence of Turkic and Mongolian communities in Central Asia and the steppes. This contact particularly affected vocabulary related to military administration and social structure. However Mongolian elements did not penetrate the inflection and affix systems of Turkic languages.
  • Persian influence is regarded as one of the strongest and longest-lasting influences on the vocabulary of Turkic languages. This interaction began in the Karakhanid period and intensified during the Medieval and New Turkish periods. Persian loanwords are especially prevalent in literature state institutions culture urban life and abstract concepts.
  • Arabic influence became prominent with the adoption of Islam. Arabic significantly enriched the vocabulary of Turkic languages particularly in religious scientific and legal terminology. Arabic-origin words exhibit a broad historical continuity from Karakhanid Turkish to Ottoman Turkish and other written languages.

Syntax

The syntax of the Turkic language family exhibits a regular and typologically distinctive structure with considerable continuity from historical periods to contemporary dialects. The syntax of Turkic languages is closely related to morphology and due to the agglutinative structure the grammatical roles of elements in a sentence are clearly determined.

Basic Sentence Order

In Turkic languages the basic sentence order is Subject – Object – Predicate. This order:

  • Can be clearly traced from the Orhun Inscriptions onwards.
  • Has shown continuity through the Old Turkish Medieval Turkish and New Turkish periods.
  • Is preserved as a fundamental principle in the Oghuz Kipchak Karluk and Siberian branches.


The placement of the predicate at the end of the sentence is considered one of the distinctive syntactic features of Turkic languages.

Flexibility in Element Order

Although the basic Subject – Object – Predicate order is maintained in Turkic languages some flexibility in word order is possible. This flexibility arises from functional reasons such as:

  • Emphasis and focus,
  • Highlighting meaning,
  • Speech context.


However this flexibility does not imply free word order; due to affixes clearly marking the grammatical roles of elements no ambiguity in meaning arises.

Modifier–Head and Qualifier–Qualified Relationships

In Turkic languages:

  • The modifier precedes the head,
  • Adjectives and qualifying elements precede the nouns they modify.


This order is consistently applied in noun phrases and adjective groups. This structure has remained unchanged from Old Turkish to the present day.

Postpositions and Auxiliary Elements

In Turkic languages elements corresponding to prepositions in Western languages are generally postpositions. Postpositions follow nouns or noun phrases and function together with case affixes. This feature demonstrates that Turkic languages exhibit a head-final syntax consistent with their typological classification.

Subordinate Clauses and Verbals

In Turkic languages subordinate clauses are mostly constructed through verbals (noun-verbs adjective-verbs adverbial-verbs). The limited use of conjunctions enables the construction of long and complex sentences through affixes and verbals. This situation demonstrates that sentence structure in Turkish is heavily based on morphology.

Current Status of Turkic Languages

The Turkic language family is a linguistic community spread across a vast area of Eurasia. The number of speakers of Turkic languages is distributed across a region extending from the Balkans to Anatolia the Caucasus to Central Asia and the northeastern parts of Siberia. In this context:

  • Türkiye Azerbaijan Turkmenistan Kazakhstan Uzbekistan and Kyrgyzstan have Turkic languages as their majority languages.
  • Balkans (Bulgaria Greece North Macedonia Kosovo Romania Moldova) and Cyprus have Turkish as a minority or regional language based on historical settlements.
  • Within the Russian Federation communities such as Tatar Bashkir Yakut (Sakha) Khakas Tuvan and Dolgan speak Turkic languages in their respective regions.
  • In China’s Xinjiang (Eastern Turkestan) region New Uyghur Turkish occupies an important position as a contemporary written language within the Karluk group.

Current Language Use

Education

The use of Turkic languages in education varies significantly by region:

  • Türkiye Azerbaijan Turkmenistan Kazakhstan Kyrgyzstan and Uzbekistan use their respective Turkic languages as official languages of education.
  • In the Russian Federation some Turkic languages (Tatar Bashkir Yakut) are used in education at the regional level but their usage is often limited.
  • In the Balkans and Cyprus Turkish is transmitted through minority schools and private educational institutions.

Media and Publishing

Today the use of Turkic languages in written and visual media is particularly strong in independent Turkic republics. Television radio newspapers and digital media play an important role in sustaining the vitality of Turkic languages. In contrast media usage in the Balkans and Siberia is more limited and localized.

Cultural Institutions

The preservation and development of Turkic languages are carried out through cultural institutions academic centers and publishing activities. Language and cultural associations universities and research centers play an important role in ensuring the continuity of language among Turkic communities in the Balkans and the Russian Federation.

Contemporary Situation in the Balkans Cyprus Central Asia and Siberia

  • Balkans: Turkish primarily survives at the dialect level with limited written usage.
  • Cyprus: Cypriot Turkish maintains its vitality in daily life while Turkish of Türkiye dominates in education and official domains.
  • Central Asia: Turkic languages hold a strong position as modern national state languages.
  • Siberia: Some languages such as Yakut have the status of written languages but their usage areas are limited.

Endangered Turkic Languages

Although the entire Turkic language family exhibits a strong structure some languages are considered endangered. This situation arises from relatively small numbers of speakers and intense contact with dominant languages. In this context:

  • Dolgan Turkish is at risk due to its very limited number of speakers.
  • Khakas Turkish is confined to a narrow regional usage area.
  • Tuvan Turkish although possessing a written language and cultural production is in a vulnerable position in terms of population and usage area.

World Turkic Languages Family Day

The World Day of the Turkic Language Family is a cultural and linguistic day aimed at making visible on an international level the common origin historical continuity and cultural bonds of Turkic languages. This day has been declared by UNESCO to be celebrated on December 15. The choice of date is based on the symbolic connection with the Orhun Inscriptions one of the oldest known written legacies of the Turkic language.


The initiative for declaring this day was shaped by a joint effort of Türkiye Azerbaijan Kazakhstan Kyrgyzstan and Uzbekistan and the decision was adopted at the 43rd Session of the UNESCO General Conference. The World Day of the Turkic Language Family provides an international framework to promote cooperation for the preservation research and intergenerational transmission of the Turkic language family.

Citations

Author Information

Avatar
AuthorMeryem Şentürk ÇobanDecember 18, 2025 at 12:33 PM

Tags

Discussions

No Discussion Added Yet

Start discussion for "Turkish Language Family" article

View Discussions

Contents

  • Oghuz Group

  • Kipchak Group

  • Karluk Group

  • Siberian Group

  • Oghur (Bulgar) Group

  • Historical Development of Turkic Languages

    • Early Period (Orhun and Old Turkish Period)

    • Medieval Turkish Period

      • Karakhanid Turkish

      • Harezm-Kipchak Turkish

      • Old Oghuz Turkish

      • Chagatai Written Language

    • New Turkish Period

      • Ottoman Turkish and the Oghuz Written Tradition

      • Kipchak Written Tradition

      • Disintegration of the Eastern Turkish Tradition and Transition to Modern Dialects

    • Formation of Modern Languages

      • Formation of Turkish of Türkiye

      • Language Standardization in Central Asia after the Soviet Era

      • Modernization of Special Dialect Areas: The Balkans Cyprus and Yakut/Sakha

      • Formation of the Turkmen Written Language

  • Classification of Turkic Languages

    • Main Groups Subgroups and Languages

      • Oghuz Group

      • Kipchak Group

      • Karluk Group

      • Siberian–Yakut Branch

      • Chuvash (Oghur Branch)

    • Criteria Used in Classification

      • Phonological/Morphological Criteria

      • Types of Vowel Harmony

      • Historical Comparative Methods

      • Geographic Spread and Language Contact

      • Impact of Different Writing Systems on Classification

  • Phonology

    • Basic Features of the Sound System in Turkic Languages

      • Types of Vowel Harmony

      • Consonant Harmony

      • Syllable Structure

    • Regional Diversity

      • Balkan Dialects

      • Cypriot Turkish

      • Yakut / Sakha Turkish

      • Turkmen Turkish

  • Morphology

    • General Agglutinative Structure

      • Order of Affixes

      • Time–Mood–Person Affixes

      • Noun Inflection System

  • Vocabulary and Language Contact

    • Chinese Mongolian Persian and Arabic Influences

  • Syntax

    • Basic Sentence Order

    • Flexibility in Element Order

    • Modifier–Head and Qualifier–Qualified Relationships

    • Postpositions and Auxiliary Elements

    • Subordinate Clauses and Verbals

  • Current Status of Turkic Languages

    • Current Language Use

      • Education

      • Media and Publishing

      • Cultural Institutions

      • Contemporary Situation in the Balkans Cyprus Central Asia and Siberia

    • Endangered Turkic Languages

  • World Turkic Languages Family Day

Ask to Küre