Pali (/ˈpɑːli/ Pāḷi; Burmese: ပါဠိ) or Magadhan is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Pāli Canon or Tipiṭaka, and is the sacred language of Theravāda Buddhism. The earliest archaeological evidence of the existence of canonical Pali comes from inscriptions of the Pyu city-states found in Burma, dated to the mid 5th to mid 6th century CE.
The word Pali is used as a name for the language of the Theravada canon. The word seems to have its origins in commentarial traditions, wherein the Pāli (in the sense of the line of original text quoted) was distinguished from the commentary or vernacular translation that followed it in the manuscript.
As such, the name of the language has caused some debate among scholars of all ages; the spelling of the name also varies, being found with both long "ā" [ɑː] and short "a" [a], and also with either a retroflex [ɭ] or non-retroflex [l] "l" sound. Both the long ā and retroflex ḷ are seen in the ISO 15919/ALA-LC rendering, Pāḷi; however, to this day there is no single, standard spelling of the term, and all four possible spellings can be found in textbooks. R. C. Childers translates the word as "series" and states that the language "bears the epithet in consequence of the perfection of its grammatical structure".
In the 19th century, the British Orientalist Robert Caesar Childers argued that the true or geographical name of the Pali language was Magadhi Prakrit, and that because pāḷi means "line, row, series", the early Buddhists extended the meaning of the term to mean "a series of books", so pāḷibhāsā means "language of the texts". However, modern scholarship has regarded Pali as a mix of several Prakrit languages from around the 3rd century BCE, combined together and partially Sanskritized. The closest artifacts to Pali that have been found in India are Edicts of Ashoka found at Gujarat, in the west of India, leading some scholars to associate Pali with this region of western India.
There is persistent confusion as to the relation of Pāḷi to the vernacular spoken in the ancient kingdom of Magadha, which was located around modern-day Bihār. Beginning in the Theravada commentaries, Pali was identified with 'Magadhi', the language of the kingdom of Magadha, and this was taken to also be the language that the Buddha used during his life. In the modern era, it has been possible to compare Pali with inscriptions known to be in Magadhi Prakrit, as well as other texts and grammars of that language. While none of the existing sources specifically document pre-Ashokan Magadhi, the available sources suggest that Pali is not equatable with that language.
Pali, as a Middle Indo-Aryan language, differs from Sanskrit more in its dialectal base than in the time of its origin. A number of its morphological and lexical features show that it is not a direct continuation of Ṛgvedic Sanskrit. Instead it descends from one or more dialects that were, despite many similarities, different from Ṛgvedic.
Paiśācī is a largely unattested literary language of classical India that is mentioned in Prakrit and Sanskrit grammars of antiquity. It is found grouped with the Prakrit languages, with which it shares some linguistic similarities, but was not considered a spoken language by the early grammarians because it was understood to have been purely a literary language.
In works of Sanskrit poetics such as Daṇḍin's Kavyadarsha, it is also known by the name of Bhūtabhāṣā, an epithet which can be interpreted as 'dead language' (i.e., with no surviving speakers) or, taking bhūta as 'past' and bhāṣā as 'language', as 'a language spoken in the past'. Evidence supporting this interpretation is that literature in Paiśācī is fragmentary and extremely rare but may once have been common.
The 13th-century Tibetan historian Buton Rinchen Drub wrote that the early Buddhist schools were separated by choice of sacred language: the Mahāsāṃghikas used Prākrit, the Sarvāstivādins used Sanskrit, the Sthaviravādins used Paiśācī, and the Saṃmitīya used Apabhraṃśa. This observation has led some scholars to theorize connections between Pali and Paiśācī; Sten Konow concluded that it may have been an Indo-Aryan language spoken by Dravidian people in South India, and Alfred Master noted a number of similarities between surviving fragments and Pali morphology.
Many Theravada sources refer to the Pali language as "Magadhan" or the "language of Magadha". This identification first appears in the commentaries, and may have been an attempt by Buddhists to associate themselves more closely with the Maurya Empire.
However, only some of the Buddha's teachings were delivered in the historical territory of the kingdom of Magadha. Scholars consider it likely that he taught in several closely related dialects of Middle Indo-Aryan, which had a high degree of mutual intelligibility. There is no attested dialect of Middle Indo-Aryan with all the features of Pali. Pali has some commonalities with both the western Ashokan Edicts at Girnar in Saurashtra, and the Central-Western Prakrit found in the eastern Hathigumpha inscription.
Whatever the relationship of the Buddha's speech to Pali, the canon was eventually transcribed and preserved entirely in it, while the commentarial tradition that accompanied it (according to the information provided by Buddhaghosa) was translated into Sinhala and preserved in local languages for several generations. In Sri Lanka, Pali is thought to have entered into a period of decline ending around the 4th or 5th century (as Sanskrit rose in prominence, and simultaneously, as Buddhism's adherents became a smaller portion of the subcontinent), but ultimately survived. The work of Buddhaghosa was largely responsible for its reemergence as an important scholarly language in Buddhist thought. The Visuddhimagga, and the other commentaries that Buddhaghosa compiled, codified and condensed the Sinhala commentarial tradition that had been preserved and expanded in Sri Lanka since the 3rd century BCE.
T. W. Rhys Davids in his book Buddhist India, and Wilhelm Geiger in his book Pāli Literature and Language, suggested that Pali may have originated as a lingua franca or common language of culture among people who used differing dialects in North India, used at the time of the Buddha and employed by him. Another scholar states that at that time it was "a refined and elegant vernacular of all Aryan-speaking people". Modern scholarship has not arrived at a consensus on the issue; there are a variety of conflicting theories with supporters and detractors. After the death of the Buddha, Pali may have evolved among Buddhists out of the language of the Buddha as a new artificial language. R. C. Childers, who held to the theory that Pali was Old Magadhi, wrote: "Had Gautama never preached, it is unlikely that Magadhese would have been distinguished from the many other vernaculars of Hindustan, except perhaps by an inherent grace and strength which make it a sort of Tuscan among the Prakrits."
According to K. R. Norman, it is likely that the viharas in North India had separate collections of material, preserved in the local dialect. In the early period it is likely that no degree of translation was necessary in communicating this material to other areas. By around the time of Ashoka there had been more linguistic divergence, and an attempt was made to assemble all the material. It is possible that a language quite close to the Pali of the canon emerged from this process as a compromise among the various dialects in which the earliest material had been preserved, and that this language functioned as a lingua franca among Eastern Buddhists in India from then on. Following this period, the language underwent a small degree of Sanskritisation (e.g., MIA bamhana > brahmana, tta > tva in some cases).
Bhikkhu Bodhi, summarizing the current state of scholarship, states that the language is "closely related to the language (or, more likely, the various regional dialects) that the Buddha himself spoke". He goes on to write:
Scholars regard this language as a hybrid showing features of several Prakrit dialects used around the third century BCE, subjected to a partial process of Sanskritization. While the language is not identical to what Buddha himself would have spoken, it belongs to the same broad language family as those he might have used and originates from the same conceptual matrix. This language thus reflects the thought-world that the Buddha inherited from the wider Indian culture into which he was born, so that its words capture the subtle nuances of that thought-world.
According to A. K. Warder, the Pali language is a Prakrit language used in a region of Western India. Warder associates Pali with the Indian realm (janapada) of Avanti, where the Sthavira nikāya was centered. Following the initial split in the Buddhist community, the Sthavira nikāya became influential in Western and South India while the Mahāsāṃghika branch became influential in Central and East India. Akira Hirakawa and Paul Groner also associate Pali with Western India and the Sthavira nikāya, citing the Saurashtran inscriptions, which are linguistically closest to the Pali language.
Pali died out as a literary language in mainland India in the fourteenth century but survived elsewhere until the eighteenth. Today Pali is studied mainly to gain access to Buddhist scriptures, and is frequently chanted in a ritual context. The secular literature of Pali historical chronicles, medical texts, and inscriptions is also of great historical importance. The great centers of Pali learning remain in Sri Lanka and the Theravada nations of Southeast Asia: Myanmar, Thailand, Laos, and Cambodia. Since the 19th century, various societies for the revival of Pali studies in India have promoted awareness of the language and its literature, including the Maha Bodhi Society founded by Anagarika Dhammapala.
In Europe, the Pali Text Society has been a major force in promoting the study of Pali by Western scholars since its founding in 1881. Based in the United Kingdom, the society publishes romanized Pali editions, along with many English translations of these sources. In 1869, the first Pali Dictionary was published using the research of Robert Caesar Childers, one of the founding members of the Pali Text Society. It was the first Pali translated text in English and was published in 1872. Childers' dictionary later received the Volney Prize in 1876.
The Pali Text Society was founded in part to compensate for the very low level of funds allocated to Indology in late 19th-century England and the rest of the UK; incongruously, the citizens of the UK were not nearly so robust in Sanskrit and Prakrit language studies as Germany, Russia, and even Denmark. Even without the inspiration of colonial holdings such as the former British occupation of Sri Lanka and Burma, institutions such as the Danish Royal Library have built up major collections of Pali manuscripts, and major traditions of Pali studies.
Nearly every word in Pāḷi has cognates in the other Middle Indo-Aryan languages, the Prakrits. The relationship to Vedic Sanskrit is less direct and more complicated; the Prakrits were descended from Old Indo-Aryan vernaculars. Historically, influence between Pali and Sanskrit has been felt in both directions. The Pali language's resemblance to Sanskrit is often exaggerated by comparing it to later Sanskrit compositions – which were written centuries after Sanskrit ceased to be a living language, and are influenced by developments in Middle Indic, including the direct borrowing of a portion of the Middle Indic lexicon; whereas, a good deal of later Pali technical terminology has been borrowed from the vocabulary of equivalent disciplines in Sanskrit, either directly or with certain phonological adaptations.
Post-canonical Pali also possesses a few loan-words from local languages where Pali was used (e.g. Sri Lankans adding Sinhala words to Pali). These usages differentiate the Pali found in the Suttapiṭaka from later compositions such as the Pali commentaries on the canon and folklore (e.g., commentaries on the Jataka tales), and comparative study (and dating) of texts on the basis of such loan-words is now a specialized field unto itself.
Pali was not exclusively used to convey the teachings of the Buddha, as can be deduced from the existence of a number of secular texts, such as books of medical science/instruction, in Pali. However, scholarly interest in the language has been focused upon religious and philosophical literature, because of the unique window it opens on one phase in the development of Buddhism.
Although Sanskrit was said in the Brahmanical tradition to be the unchanging language spoken by the gods, in which each word had an inherent significance, this view of language was not shared in the early Buddhist tradition, in which words were only conventional and mutable signs. This view of language naturally extended to Pali, and may have contributed to its usage (as an approximation or standardization of local Middle Indic dialects) in place of Sanskrit. However, by the time of the compilation of the Pali commentaries (4th or 5th century), Pali was regarded as the natural language, the root language of all beings.
Comparable to Ancient Egyptian, Latin or Hebrew in the mystic traditions of the West, Pali recitations were often thought to have a supernatural power (which could be attributed to their meaning, the character of the reciter, or the qualities of the language itself), and in the early strata of Buddhist literature we can already see Pali dhāraṇīs used as charms, for example against the bite of snakes. Many people in Theravada cultures still believe that taking a vow in Pali has a special significance. As one example of the supernatural power assigned to chanting in the language, the recitation of the vows of Aṅgulimāla is believed in Sri Lanka to alleviate the pain of childbirth. In Thailand, the chanting of a portion of the Abhidhammapiṭaka is believed to be beneficial to the recently departed, and this ceremony routinely occupies as much as seven working days. There is nothing in the latter text that relates to this subject, and the origins of the custom are unclear.
Long and short vowels are only contrastive in open syllables; in closed syllables, all vowels are always short. Short and long e and o are in complementary distribution: the short variants occur only in closed syllables, the long variants occur only in open syllables. Short and long e and o are therefore not distinct phonemes.
A sound called anusvāra (Skt.; Pali: niggahīta), represented by the letter ṁ (ISO 15919) or ṃ (ALA-LC) in romanization, and by a raised dot in most traditional alphabets, originally marked the fact that the preceding vowel was nasalized. That is, aṁ, iṁ and uṁ represented [ã], [ĩ] and [ũ]. In many traditional pronunciations, however, the anusvāra is pronounced more strongly, like the velar nasal [ŋ], so that these sounds are pronounced instead [ãŋ], [ĩŋ] and [ũŋ]. However pronounced, ṁ never follows a long vowel; ā, ī and ū are converted to the corresponding short vowels when ṁ is added to a stem ending in a long vowel, e.g. kathā + ṁ becomes kathaṁ, not *kathāṁ, and devī + ṁ becomes deviṁ, not *devīṁ.
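The shortening rule just described can be sketched in a few lines of Python. This is only an illustration on romanized text: the function name `add_anusvara` is hypothetical, and the sketch assumes precomposed (NFC) characters and the ISO 15919 letter ṁ.

```python
import unicodedata

# Long vowels and their short counterparts (romanized Pali).
# Escapes are used so the keys are always precomposed code points.
SHORTEN = {
    "\u0101": "a",  # ā -> a
    "\u012b": "i",  # ī -> i
    "\u016b": "u",  # ū -> u
}

ANUSVARA = "\u1e41"  # ṁ (ISO 15919); ALA-LC would use ṃ (U+1E43) instead


def add_anusvara(stem: str) -> str:
    """Append ṁ to a stem, shortening a final long vowel first."""
    stem = unicodedata.normalize("NFC", stem)
    if stem and stem[-1] in SHORTEN:
        stem = stem[:-1] + SHORTEN[stem[-1]]
    return stem + ANUSVARA


print(add_anusvara("kath\u0101"))  # kathaṁ, not *kathāṁ
print(add_anusvara("dev\u012b"))   # deviṁ, not *devīṁ
```

Stems that already end in a short vowel pass through unchanged apart from the added ṁ.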
Of the sounds listed above only the three consonants in parentheses, ṅ, ḷ, and ḷh, are not distinct phonemes in Pali: ṅ only occurs before velar stops, while ḷ and ḷh are allophones of single ḍ and ḍh occurring between vowels.
Pali is a highly inflected language, in which almost every word contains, besides the root conveying the basic meaning, one or more affixes (usually suffixes) which modify the meaning in some way. Nouns are inflected for gender, number, and case; verbal inflections convey information about person, number, tense and mood.
Pali nouns inflect for three grammatical genders (masculine, feminine, and neuter) and two numbers (singular and plural). The nouns also, in principle, display eight cases: nominative or paccatta case, vocative, accusative or upayoga case, instrumental or karaṇa case, dative or sampadāna case, ablative, genitive or sāmin case, and locative or bhumma case; however, in many instances, two or more of these cases are identical in form; this is especially true of the genitive and dative cases.
a-stems, whose uninflected stem ends in short a (/ə/), are either masculine or neuter. The masculine and neuter forms differ only in the nominative, vocative, and accusative cases.
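The overlap between case forms can be illustrated with a small Python sketch for masculine a-stems. The ending table is an assumption: it lists one commonly taught ending per case and number, as found in introductory grammars, and omits variants (for example the alternative ablative and locative singular endings). The function names are hypothetical.

```python
from itertools import combinations

# Assumed endings for masculine a-stems: (singular, plural).
# One common ending per slot; variant endings are deliberately left out.
ENDINGS = {
    "nominative":   ("o",    "ā"),
    "vocative":     ("a",    "ā"),
    "accusative":   ("aṁ",   "e"),
    "instrumental": ("ena",  "ehi"),
    "dative":       ("assa", "ānaṁ"),
    "ablative":     ("ā",    "ehi"),
    "genitive":     ("assa", "ānaṁ"),
    "locative":     ("e",    "esu"),
}


def decline(stem: str) -> dict:
    """Return {case: (singular, plural)} for a masculine a-stem."""
    if not stem.endswith("a"):
        raise ValueError("expected a stem ending in short a")
    base = stem[:-1]  # drop the stem vowel before adding the ending
    return {case: (base + sg, base + pl) for case, (sg, pl) in ENDINGS.items()}


def identical_cases(stem: str, number: int) -> list:
    """List pairs of cases whose forms coincide (number: 0 = sg, 1 = pl)."""
    forms = decline(stem)
    return [
        (a, b)
        for a, b in combinations(forms, 2)
        if forms[a][number] == forms[b][number]
    ]


print(identical_cases("buddha", 0))  # [('dative', 'genitive')]
print(identical_cases("buddha", 1))
```

With this simplified table, the dative and genitive coincide in both numbers, which is the pattern singled out in the description of the case system.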
i-stems and u-stems are either masculine or neuter. The masculine and neuter forms differ only in the nominative and accusative cases. The vocative has the same form as the nominative.
The literal meaning is therefore: "The dharmas have mind as their leader, mind as their chief, are made of/by mind. If [someone] either speaks or acts with a corrupted mind, from that [cause] suffering goes after him, as the wheel [of a cart follows] the foot of a draught animal."
The Indo-Aryan languages are commonly assigned to three major groups: Old, Middle and New Indo-Aryan. The classification reflects consecutive stages in a common linguistic development, but is not merely a matter of chronology: Classical Sanskrit, as a codified derivative of Vedic Sanskrit, remains mostly representative of the Old Indo-Aryan stage, even though it continued to flourish at the same time as the Middle Indo-Aryan languages. Conversely, a number of the morphophonological and lexical features of the Middle Indo-Aryan languages show that they are not direct continuations of Rigvedic Sanskrit, the main base of Classical Sanskrit. Instead they descend from other dialects similar to, but in some ways more archaic than, Rigvedic.
Pali and Sanskrit are very closely related, and their common characteristics were always easily recognized by those in India who were familiar with both. A very large proportion of Pali and Sanskrit word-stems are identical in form, differing only in details of inflection.
Technical terms from Sanskrit were converted into Pali by a set of conventional phonological transformations. These transformations mimicked a subset of the phonological developments that had occurred in Proto-Pali. Because of the prevalence of these transformations, it is not always possible to tell whether a given Pali word is a part of the old Prakrit lexicon, or a transformed borrowing from Sanskrit. The existence of a Sanskrit word regularly corresponding to a Pali word is not always secure evidence of the Pali etymology, since, in some cases, artificial Sanskrit words were created by back-formation from Prakrit words.
The following phonological processes are not intended as an exhaustive description of the historical changes which produced Pali from its Old Indic ancestor, but rather are a summary of the most common phonological equations between Sanskrit and Pali, with no claim to completeness.
Total assimilation, where one sound becomes identical to a neighboring sound, is of two types: regressive, where the assimilated sound becomes identical to the following sound; and progressive, where it becomes identical to the preceding sound.
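The two directions of total assimilation can be made concrete with a short Python sketch. The function `assimilate` and its `to` parameter are hypothetical names. The word pairs (Sanskrit sapta and Pali satta, Sanskrit agni and Pali aggi) are standard correspondences supplied here for illustration rather than taken from the text above. The sketch works on plain ASCII romanization and handles only two-consonant clusters.

```python
def assimilate(word: str, cluster: str, to: str) -> str:
    """Replace a two-consonant cluster with a doubled consonant.

    to="following": the first consonant copies the second (pt -> tt)
    to="preceding": the second consonant copies the first (gn -> gg)
    """
    if len(cluster) != 2:
        raise ValueError("cluster must be exactly two characters")
    first, second = cluster
    if to == "following":
        doubled = second * 2
    elif to == "preceding":
        doubled = first * 2
    else:
        raise ValueError("to must be 'following' or 'preceding'")
    return word.replace(cluster, doubled)


# The first sound assimilates to the following sound.
print(assimilate("sapta", "pt", to="following"))  # satta
# The second sound assimilates to the preceding sound.
print(assimilate("agni", "gn", to="preceding"))   # aggi
```

The function does not decide which direction applies to a given cluster. That depends on the sounds involved and is supplied by the caller.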
An epenthetic vowel is sometimes inserted between certain consonant-sequences. As with ṛ, the vowel may be a, i, or u, depending on the influence of a neighboring consonant or of the vowel in the following syllable. i is often found near i, y, or palatal consonants; u is found near u, v, or labial consonants.
There are several notable exceptions to the rules above; many of them are common Prakrit words rather than borrowings from Sanskrit.
Emperor Ashoka erected a number of pillars with his edicts in at least three regional Prakrit languages in Brahmi script, all of which are quite similar to Pali. Historically, the first written record of the Pali canon is believed to have been composed in Sri Lanka, based on a prior oral tradition. As per the Mahavamsa (the chronicle of Sri Lanka), due to a major famine in the country Buddhist monks wrote down the Pali canon during the time of King Vattagamini in 100 BCE. The transmission of written Pali has retained a universal system of alphabetic values, but has expressed those values in a stunning variety of actual scripts.
In Sri Lanka, Pali texts were recorded in Sinhala script. Other local scripts, most prominently Khmer, Burmese, and in modern times Thai (since 1893), Devanāgarī and Mon script (Mon State, Burma) have been used to record Pali.
Since the 19th century, Pali has also been written in the Roman script. An alternate scheme devised by Frans Velthuis, called the Velthuis scheme (see § Text in ASCII) allows for typing without diacritics using plain ASCII methods, but is arguably less readable than the standard IAST system, which uses diacritical marks.
Several fonts are available for Pali transliteration. However, older ASCII fonts such as Leedsbit PaliTranslit, Times_Norman, Times_CSX+, Skt Times, Vri RomanPali CN/CB etc. are deprecated and not recommended, since they are incompatible with one another and technically out of date. Instead, fonts based on the Unicode standard are recommended.
However, not all Unicode fonts contain the necessary characters. To properly display all the diacritic marks used for romanized Pali (or for that matter, Sanskrit), a Unicode font must contain the following character ranges:
Some Unicode fonts freely available for typesetting Romanized Pali are as follows:
Several fonts shipped with Windows 7 can also be used to type transliterated Pali: Arial, Calibri, Cambria, Courier New, Microsoft Sans Serif, Segoe UI, Segoe UI Light, Segoe UI Semibold, Tahoma, and Times New Roman. Some of them come in four styles each and are therefore usable in professional typesetting: Arial, Calibri and Segoe UI are sans-serif fonts, Cambria and Times New Roman are serif fonts, and Courier New is a monospace font.
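To see which parts of Unicode a given romanized Pali text actually draws on, a minimal Python sketch such as the following may help. The block boundaries are taken from the Unicode code charts rather than from this article, and only four Latin blocks are listed, which is an assumption about what romanized Pali typically needs. The function names are hypothetical.

```python
import unicodedata

# (block name, first code point, last code point), per the Unicode code charts.
BLOCKS = [
    ("Basic Latin", 0x0000, 0x007F),
    ("Latin-1 Supplement", 0x0080, 0x00FF),
    ("Latin Extended-A", 0x0100, 0x017F),
    ("Latin Extended Additional", 0x1E00, 0x1EFF),
]


def block_of(char: str) -> str:
    """Return the name of the listed block containing char, or 'other'."""
    code_point = ord(char)
    for name, first, last in BLOCKS:
        if first <= code_point <= last:
            return name
    return "other"


def blocks_needed(text: str) -> set:
    """Return the set of blocks used by the non-space characters of text."""
    # NFC turns letter + combining mark into one precomposed character.
    text = unicodedata.normalize("NFC", text)
    return {block_of(ch) for ch in text if not ch.isspace()}


print(sorted(blocks_needed("P\u0101\u1e37i")))  # the word Pāḷi
```

A font that lacks glyphs for any block reported for a text will not display that text completely.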
The Velthuis scheme was originally developed in 1991 by Frans Velthuis for use with his "devnag" Devanāgarī font, designed for the TeX typesetting system. This system of representing Pali diacritical marks has been used in some websites and discussion lists. However, as the Web itself and email software slowly evolve towards the Unicode encoding standard, this system has become almost unnecessary and obsolete.
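As a rough illustration of how Velthuis-style ASCII input maps onto Unicode diacritics, the Python sketch below converts lowercase text by simple substitution. The mapping table reflects the usual Velthuis conventions (doubled vowels for length, a leading dot for retroflex consonants and anusvāra) and is stated here as an assumption. Capital letters and clashes with ordinary punctuation, such as a full stop directly before a letter, are not handled. The function name is hypothetical.

```python
# Assumed Velthuis sequences and their Unicode equivalents (lowercase only).
VELTHUIS = [
    ("aa", "\u0101"),  # ā
    ("ii", "\u012b"),  # ī
    ("uu", "\u016b"),  # ū
    ('"n', "\u1e45"),  # ṅ
    ("~n", "\u00f1"),  # ñ
    (".t", "\u1e6d"),  # ṭ
    (".d", "\u1e0d"),  # ḍ
    (".n", "\u1e47"),  # ṇ
    (".l", "\u1e37"),  # ḷ
    (".m", "\u1e43"),  # ṃ
]


def velthuis_to_diacritics(text: str) -> str:
    """Replace each Velthuis ASCII sequence with its diacritic character."""
    for ascii_sequence, character in VELTHUIS:
        text = text.replace(ascii_sequence, character)
    return text


print(velthuis_to_diacritics("paa.li"))      # pāḷi
print(velthuis_to_diacritics("nibbaana.m"))  # nibbānaṃ
print(velthuis_to_diacritics('sa"ngha'))     # saṅgha
```

Plain substitution is enough here because none of the listed sequences overlaps with another.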
The following table compares various conventional renderings and shortcut key assignments: