The Tibetic languages are a cluster of Tibeto-Burman languages descended from Old Tibetan, spoken across a wide area of eastern Central Asia bordering the Indian subcontinent, including the Tibetan Plateau and the Himalayas in Baltistan, Ladakh, Nepal, Sikkim, Bhutan, Assam and Arunachal Pradesh. Classical Tibetan is a major regional literary language, particularly for its use in Buddhist literature.
Tibetic languages are spoken by some 6 million people. With the worldwide spread of Tibetan Buddhism, the Tibetan language has spread into the western world and can be found in many Buddhist publications and prayer materials; with some western students learning the language for translation of Tibetan texts. Outside Lhasa itself, Lhasa Tibetan is spoken by approximately 200,000 exile speakers who have moved from modern-day Tibet to India and other countries. Tibetan is also spoken by groups of ethnic minorities in Tibet who have lived in close proximity to Tibetans for centuries, but nevertheless retain their own languages and cultures.
Although some of the Qiang peoples of Kham are classified by China as ethnic Tibetans (see rGyalrongic languages; rGyalrong people are identified as 'Tibetan' in China), the Qiangic languages are not Tibetic, but rather form their own branch of the Tibeto-Burman language family.
Classical Tibetan was not a tonal language, but some varieties such as Central and Khams Tibetan have developed tone registers. Amdo and Ladakhi-Balti are without tone. Tibetic morphology can generally be described as agglutinative.
Nicolas Tournadre (2008) describes the language situation of Tibetan as follows:
Based on my 20 years of field work throughout the Tibetan language area and on the existing literature, I estimate that there are 220 'Tibetan dialects' derived from Old Tibetan and nowadays spread across 5 countries: China, India, Bhutan, Nepal and Pakistan [which] may be classed within 25 dialect groups, i.e. groups which do not allow mutual intelligibility. The notion of ‘dialect group’ is equivalent to the notion of language but does not entail any standardization. Thus if we set aside the notion of standardization, I believe it would be more appropriate to speak of 25 languages derived from Old Tibetan. This is not only a terminological issue but it gives an entirely different perception of the range of variation. When we refer to 25 languages, we make clear that we are dealing with a family comparable in size to the Romance family which has 19 groups of dialects.
and another dozen minor clusters or single dialects, mostly spoken by a few hundred to a few thousand people:
In addition, there is Baima, which retains an apparent Qiangic substratum, and has multiple layers of borrowing from Amdo, Khams, and Zhongu, but does not correspond to any established branch of Tibetic. The more divergent dialects such as this are spoken in the north and east near the Qiangic and Rgyalrongic languages, and some, such as Khalong, may also be due to language shift.
Marius Zemp (2018) hypothesizes that Tibetan originated as a pidgin with the West Himalayish language Zhangzhung as its superstratum, and Rgyalrongic as its substratum. Similarly, Tamangic also has a West Himalayish superstratum, but its substratum is derived from a different Sino-Tibetan branch.
According to Bradley, the languages cluster as follows (dialect information from the Tibetan Dialects Project at the University of Bern):
Some classifications group Khams and Amdo together as Eastern Tibetan (not to be confused with East Bodish, whose speakers are not ethnically Tibetan). Some, like Tournadre, break up Central Tibetan. Phrases such as 'Central Tibetan' and 'Central Bodish' may or may not be synonymous: Southern (Central) Tibetan can be found as Southern Bodish, for example; 'Central Tibetan' may mean dBus or all dialects apart from Khams; Tibeto-Kanauri languages.
Most Tibetic languages are written in one of two Indic scripts. Standard Tibetan and most other Tibetic languages are written in the Tibetan script with a historically conservative orthography (see below) that helps unify the Tibetan-language area. Some other Tibetan languages (in India and Nepal) are written in the related Devanagari script, which is also used to write Hindi, Nepali and many other languages. However, some Ladakhi and Balti speakers write with the Urdu script; this occurs almost exclusively in Pakistan. The Tibetan script fell out of use in Pakistani Baltistan hundreds of years ago upon the region's adoption of Islam. However, increased concern among Balti people for the preservation of their language and traditions, especially in the face of strong Punjabi cultural influence throughout Pakistan, has fostered renewed interest in reviving the Tibetan script and using it alongside the Arabic-Persian script. Many shops in Baltistan's capital Skardu in Pakistan's "Northern Areas" region have begun supplementing signs written in the Arabic-Persian script with signs written in the Tibetan script. Baltis see this initiative not as separatist but rather as part of an attempt to preserve the cultural aspects of their region which has shared a close history with neighbors like Kashmiris and Punjabis since the arrival of Islam in the region many centuries ago.
Old Tibetan phonology is rather accurately rendered by the script. The finals were pronounced devoiced although they are written as voiced, the prefix letters assimilated their voicing to the root letters. The graphic combinations hr and lh represent voiceless and not necessarily aspirate correspondences to r and l respectively. The letter ' was pronounced as a voiced guttural fricative before vowels but as homorganic prenasalization before consonants. Whether the gigu verso had phonetic meaning or not remains controversial.
For instance, Srongbtsan Sgampo would have been pronounced [sroŋpʦan zɡampo] (now pronounced [sɔ́ŋʦɛ̃ ɡʌ̀mpo] in Lhasa Tibetan) and 'babs would have been pronounced [mbaps] (pronounced [bapˤ][dubious ] in Lhasa Tibetan).
Already in the 9th century the process of cluster simplification, devoicing and tonogenesis had begun in the central dialects can be shown with Tibetan words transliterated in other languages, particularly Middle Chinese but also Uyghur.
The concurrence of the evidence indicated above enables us to form the following outline of the evolution of Tibetan. In the 9th century, as shown by the bilingual Tibetan–Chinese treaty of 821–822 found in front of Lhasa's Jokhang, the complex initial clusters had already been reduced, and the process of tonogenesis was likely well underway.
The next change took place in Tsang (Gtsang) dialects: The ra-tags were altered into retroflex consonants, and the ya-tags became palatals.
Later on the superscribed letters and finals d and s disappeared, except in the east and west. It was at this stage that the language spread in Lahul and Spiti, where the superscribed letters were silent, the d and g finals were hardly heard, and as, os, us were ai, oi, ui. The words introduced from Tibet into the border languages at that time differ greatly from those introduced at an earlier period.
The other changes are more recent and restricted to Ü and Tsang. In Ü, the vowel sounds a, o, u have now mostly umlauted to ä, ö, ü when followed by the coronal sounds i, d, s, l and n. The same holds for Tsang with the exception of l which merely lengthens the vowel. The medials have become aspirate tenues with a low intonation, which also marks the words having a simple initial consonant; while the former aspirates and the complex initials simplified in speech are uttered with a high tone, shrill and rapidly.
Proto-Tibetic, the hypothetical proto-language ancestral to the Tibetic languages, has been reconstructed by Tournadre (2014). Proto-Tibetic is similar to, but not identical to, written Classical Literary Tibetan. The following phonological features are characteristic of Proto-Tibetic (Tournadre 2014: 113).
Pre-Tibetic is a hypothetical pre-formation stage of Proto-Tibetic.
*ty-, *ly-, *sy- were not palatalized in Pre-Tibetic, but underwent palatalization in Proto-Tibetic (Tournadre 2014: 113-114). Posited sound changes from Pre-Tibetic to Proto-Tibetic include *ty- > *tɕ-, *sy- > *ɕ-, *tsy- > *tɕ-, and *ly- > *ʑ-. However, Tournadre (2014: 114) notes that many Bodish languages such as Basum, Tamang, and Kurtöp (East Bodish) have not undergone these changes (e.g., Bake (Basum) ti ‘what’ vs. Proto-Tibetic *tɕ(h)i and Bake tɨ ‘one’ vs. Proto-Tibetic *g(ǝ)-tɕ(h)ik; Kurtöp Hla: ‘iron’ and Bumthap lak ‘iron’ vs. Proto-Tibetic *ltɕaks).
Some Pre-Tibetic reconstructions, along with reconstructed Proto-Tibetic forms and orthographic Classical Literary Tibetan, from Tournadre (2014: 114-116) are listed below.