Incorporation (linguistics)

Incorporation is a phenomenon by which a grammatical category, such as a verb, forms a compound with its direct object (object incorporation) or adverbial modifier, while retaining its original syntactic function. The inclusion of a noun qualifies the verb, narrowing its scope rather than making reference to a specific entity.

Incorporation is central to many polysynthetic languages such as those found in North America, Siberia and northern Australia. However, polysynthesis does not necessarily imply incorporation (Mithun 2009), and the presence of incorporation does not imply that the language is polysynthetic.

Although incorporation does not occur regularly, English uses it sometimes: breastfeed, and direct object incorporation, as in babysit. Etymologically, such verbs in English are usually back-formations: the verbs breastfeed and babysit are formed from the adjective breast-fed and the noun babysitter respectively. Incorporation and plain compounding may be fuzzy categories: consider backstabbing, name-calling, axe murder.

The following example from Oneida (Iroquoian) illustrates noun incorporation.

In this example, the verbal root hninu appears with its usual verbal morphology: a factive marker (FACT), which very roughly translates as past tense, although this is not quite accurate; an agreement marker (1.SG), which tells us that the verb agrees with 1st person singular (the speaker); and an aspect marker, punctual (PUNC), which tells us that this is a completed event. The direct object ne kanaktaʼ follows the verb. The function of the particle ne is to determine the bed: in the example, I bought this specific bed. Note that the word for bed consists of a root nakt plus a prefix and a suffix. The notion of the root is important here, but the properties of the prefix and suffix do not matter for this discussion.

In the following sentence, the bed is unspecified. Unspecified nouns can be incorporated, thus creating a general statement. In this example: I bought a bed (and not a specific bed). In a broader sense, depending on context, it can even mean that I am a bed buyer, as in: I am a trader of beds, buying beds is my profession.[1]

In this example, the root for bed nakt has incorporated into the verbal construction and appears before the verbal root. Two other incidental changes are noticed here. First, the agreement marker in the first example is k and in the second example is ke. These are two phonologically-conditioned allomorphs. In other words, the choice between using k and ke is based on the other sounds in the word (and has nothing to do with noun incorporation). Also, there is an epenthetic vowel a between the nominal and verbal roots. This vowel is inserted to break up an illegal consonant cluster (and also has nothing to do with noun incorporation).

The next example, from Panare, illustrates the cross-linguistically common phenomenon that the incorporated form of a noun may be significantly different from its unincorporated form. The first sentence contains the incorporated form u' of "head", and the second its unincorporated form ipu:

Chukchi, a Chukotko-Kamchatkan language spoken in North Eastern Siberia, provides a wealth of examples of noun incorporation. The phrase təpelarkən qoraŋə means "I'm leaving the reindeer" and has two words (the verb in the first person singular, and the noun). The same idea can be expressed with the single word təqorapelarkən, in which the noun root qora- "reindeer" is incorporated into the verb word.

Mohawk, an Iroquoian language, makes heavy use of incorporation, as in: watia'tawi'tsherí:io "it is a good shirt", where the noun root atia'tawi "upper body garment" is present inside the verb.

Cheyenne, an Algonquian language of the plains, also uses noun incorporation on a regular basis. Consider nátahpe'emaheona, meaning "I have a big house", which contains the noun morpheme maheo "house".

Chinese makes extensive use of verb-object compounds, which are compounds composed of two constituents having the syntactic relation of verb and its direct object.[2] For example, the verb shuì-jiào 睡覺 'sleep (VO)' is composed of the verb shuì 'sleep (V)' and the bound morpheme object jiào 'sleep (N)'. Aspect markers (e.g. le PERFECTIVE), classifier phrases (e.g. 三個鐘頭 sān ge zhōngtóu THREE + CL + hours), and other elements may separate the two constituents of these compounds, though different verb-object compounds vary in degree of separability.

In many cases, a phrase with an incorporated noun carries a different meaning with respect to the equivalent phrase where the noun is not incorporated into the verb. The difference seems to hang around the generality and definiteness of the statement. The incorporated phrase is usually generic and indefinite, while the non-incorporated one is more specific.

In Yucatec Maya, for example, the phrase "I chopped a tree", when the word for "tree" is incorporated, changes its meaning to "I chopped wood". In Lahu (a Tibeto-Burman language), the definite phrase "I drink the liquor" becomes the more general "I drink liquor" when "liquor" is incorporated. The Japanese phrase 目を覚ます me o samasu means "to wake up" or literally to wake (one's) eyes. But when the direct object is incorporated into the nominal form of the verb, the resulting noun 目覚まし mezamashi literally means "waking up", as in 目覚まし時計 mezamashidokei meaning "alarm clock."

This tendency is not a rule. There are languages where noun incorporation does not produce a meaning change (though it may cause a change in syntax — as explained below).

An influential definition of noun incorporation (NI) by Sapir (1911) and Mithun (1984) has stated that NI is “a construction in which a noun and a verb stem combine to yield a complex verb.”[3][4] Due to the wide variation in how noun incorporation presents itself in different languages, however, it is difficult to create an agreed upon and all-encompassing definition. As a result, most syntacticians have focused on generating definitions that apply to the languages they have studied, regardless of whether or not they are cross-linguistically attested.[5]

Noun incorporation can interact with the transitivity of the verb it applies to in two different ways. In some languages, the incorporated noun deletes one of the arguments of the verb, and this is shown explicitly: if the verb is transitive, the derived verb word with an incorporated noun (which functions as the direct object) becomes formally intransitive and is marked as such. In other languages this change does not take place, or at least it is not shown by explicit morphology. A recent study found out that across languages, morphosyntactically highly transitive verbs and patientive intransitive verbs are most likely to perform noun incorporation.[6]

Incorporation looks at whether verb arguments, its nominal complements, exist on the same syntactic level or not. Incorporation is characterized as a stem combination meaning it combines independent lexical items into a modal or auxiliary verb to ultimately form a complex verb.[7] The stem of the verb will be the determiner of the new category in which the incorporation belongs to and the noun which was incorporated drops its own categorical features and grammatical markings, if employed.[7] This is done by the movement of the incorporated noun to its new position in syntax. When participating in noun incorporation, it allows for the speaker to represent an alternative expression to further explain and shift focus to the information being presented (Mithun 1984).

Although incorporation exists in many languages, incorporation is optional and non-obligatory. Incorporation is restricted to certain noun categories; namely on the degree to which they are animate or alive or suppletive forms.

If a language participates in productive compounding it does not allow for incorporation. An example of a compounding language is German. Respectively, if a language participates in incorporation it does not allow for productive compounding.

The most common type of NI is where the incorporated noun acts as the notional subject of the clause.[8] This can be observed in Onondaga, Southern Tiwa and Koryak.

In 1985, Mithun introduced a four type system to define the functionality and progression of noun incorporation in a language.[3] This system is important as many discuss this, and it is widely applied to explain the differences in NI in languages. The four types are:

According to Mithun, languages exhibiting any of these types always display all of the lower types as well. This seems to imply a pattern of progression, as Mithun describes in her 1984 paper on the evolution of noun incorporation.[3] It is argued that it is necessary to distinguish at least two types of noun incorporation.[3]

Figure 1: English syntax tree example illustrating noun incorporation following Baker's (1989) head movement hypothesis.

A large field of inquiry is whether NI is a syntactic process (verb and noun originate in different nodes and come together through syntactic means, a lexical process (word formation rules that apply in the lexicon dictate NI), or a combined process which investigates the which aspects of noun incorporation can be productively created through general syntactic rules and which must be specified in the lexicon. Of course, this will vary by language as some languages (primarily polysynthetic ones) allow for incorporated structures in a wide variety of sentences whereas in languages such as English, this incorporation is more limited. Theories of morphology-syntax interaction and the debate between syntactic and lexical accounts of NI should strive to be restrictive enough to account for the stable properties of NI in a unified way, but also account for language-specific variations.[9] Within this section, we will focus on describing the influential syntactic and combined approaches to NI, though it is important to note that highly influential lexical accounts, such as Rosen's (1989) paper[10] do exist.

One highly influential syntactic account for NI is the head movement process proposed by Baker (1988).[11] This account states that this NI head movement is distinct from but similar to the better-established phenomenon of phrase movement and involves the movement of a head noun out of object position and into a position where it adjoins to a governing verb. An example of this movement can be seen in figure 1 where the head noun 'baby' is moved out of the object N position to become incorporated with the verb as the sister to the verb 'sit'. While this theory does not account for every language, it does provide a starting point for subsequent syntactic analyses of NI, both with and without head movement. A more recent paper by Baker (2007) addresses a number of other influential accounts including Massam’s pseudo-incorporation,[12] Van Geenhoven’s base generation,[13] and Koopman and Szabolcsi’s small phrase movement.[14][15] It was concluded that while each have their strong points, they all fail to answer some important questions, thus requiring the continued use of Baker's head movement account.[15]

Others, including Barrrie and Mattieu (2016) have argued against Baker’s  head movement hypothesis. They have investigated Onondaga and Ojibwe and proposed that phrasal movement rather than head movement can account for NI in a number of languages (including Mohawk).[16]

A polysynthetic language is a language in which multiple morphemes, including affixes are often present within a single word. Each word can therefore express the meaning of full clauses or phrase and this structure has implications on how noun incorporation is exhibited in the languages in which it's observed.

In Lakhota, a Siouan language of the plains, for example, the phrase "the man is chopping wood" can be expressed either as a transitive wičháša kiŋ čháŋ kiŋ kaksáhe ("man the wood the chopping") or as an intransitive wičháša kiŋ čhaŋkáksahe ("man the wood-chopping") in which the independent nominal čháŋ, "wood," becomes a root incorporated into the verb: "wood-chopping."

Mohawk is an Iroquoian Language in which noun incorporation occurs. NI is a very salient property of Northern Iroquoian languages including Mohawk and is seen unusually often in comparison to other languages. Noun incorporation in Mohawk involves the compounding of a noun stem with a verb stem to form a new verb stem.[17]

Only the noun stem is incorporated into the verb in NI, not the whole noun word.[17]

Mohawk grammar allows for whole propositions to be expressed by one word, which is classified as a verb. Other core elements, namely nouns (subjects, objects, etc.) can be incorporated into the verb. Well-formed verb phrases contain at the bare minimum a verb root and a pronominal prefix. The rest of the elements (and therefore noun incorporation) are optional.[18] In the example sentences below, one can see the original sentence in 1a and the same sentence with noun incorporation into the verb in 1b where instead of "bought a bed," the literal translation of the sentence is "bed-bought."

It is true in Mohawk, as it is in many languages, that the direct object of a transitive verb can incorporate, but the subject of a transitive verb cannot.[9] This can be seen in the examples below as the well-formed sentence in 2a involves the incorporation of na'tar (bread), the direct object of the transitive verb kwetar (cut). Example 2b represents a sentence that is ill-formed as it cannot possess the same meaning as 2a ('this knife cuts bread'). This is because the subject of the transitive verb, a'shar (knife), is being incorporated into the verb which is not attested in Mohawk.

Further, a unique feature of Mohawk is that fact that this language allows for noun incorporation into intransitives[9] as illustrated in example sentence 3. Hri' (shatter) is an intransitive verb which the noun stem ks (dish) is being incorporated into, producing a well-formed sentence.

Another feature of Mohawk which is not as commonly attested cross-linguistically is that Mohawk allows a demonstrative, numeral, or adjective outside the complex verb to be interpreted as a modifier of the incorporated noun.[9] Example sentence 4 illustrates this below. Here, the demonstrative thinkv (that) refers to and therefore modifies the incorporated noun ather (basket).

Figure 2: Simplified syntax tree of noun incorporation in Mohawk following Baker's head movement hypothesis

According to Mithun's (1984)[3] theory of noun incorporation classification, Mohawk is generally considered a type IV language because the Incorporated noun modifies the internal argument.[9] As a result of this classification, NI is Mohawk can follow any of the four structures listed in Mithun's paper including lexical compounding, manipulation of case roles, manipulation of discourse structure, and classificatory incorporation.

Baker, Aranovich, & Golluscio claim that the structure of NI in Mohawk is the result of noun movement in the syntax.[9] This is an extension of Baker's head movement hypothesis[15] which is described above. The differences that Mohawk displays as compared to other languages therefore depends on whether or not the person, number, and gender features are retained in the ‘trace’ of the noun, the trace being the position from where the noun moved from object position before adjoinig to the governing verb.[9] Figure 2 illustrates a simplified syntax tree of noun incorporation in Mohawk following Baker's head movement hypothesis. Here, the noun -wir- (baby) is moved from the object N position to become incorporated with the verb as the sister to the verb -núhwe'- (to like). Please note that some details were not included in this tree for illustrative purposes.

In the Oneida language (an Iroquoian language spoken in Southern Ontario and Wisconsin), one finds classifier noun incorporation, in which a generic noun acting as a direct object can be incorporated into a verb, but a more specific direct object is left in place. In a rough translation, one would say for example "I animal-bought this pig", where "animal" is the generic incorporated noun. Note that this "classifier" is not an actual classifier (i.e. a class agreement morpheme) but a common noun.

Cherokee language is a language spoken by the Cherokee people, and is an Iroquoian language. Noun incorporation in Cherokee is very limited and the cases are lexicalized.[21] All of the noun incorporation in Cherokee involves a body part word and few nouns,[22] and to make up for the lack of NI, Cherokee has a system of classificatory verbs with five distinct categories.[23]

English exhibits noun incorporation in a way that's different from the polysynthetic languages described above.

Noun Incorporation is not traditionally and commonly used in English, although over time it has been incorporated into the language for productiveness.

Productive noun incorporation cannot be pluralized in English and cannot include any additions of higher elements such as determiners, quantifiers or adjuncts.

The following examples show impossible noun incorporation cases for the sentence Will enjoys watch-collecting

Noun Incorporation in English is namely used by incorporating a type of noun with a verb to form a new verb through the process of lexical compounding. Noun compounding exists in lexical items where nouns have a recognizable concept to alter the semantics of a verb.The ability of the incorporation of nouns altering the syntactic and semantics of a verb is known as an incorporation complex. The incorporation becomes a valency-changing factor to the word which either decreases or increases verb valency.[26]

In English, it is most common for an argument or an actant to be incorporated with the predicate. It is shown that the addition of an actant results in additional connotations and metaphorical meanings. E.g., house-hunt. Although this may make the semantics more complex than it originally was, it simplifies the syntactic structure of the sentence by incorporating the actant-sender house.

There are 3 different types of Noun Incorporation compounding: lexical compounding, composition by juxtaposition and morphological compounding. English participates in Lexical Compounding.[3]

Lexical compounding in English exists to categorize an entity, quality or activity that deserves its own name. For example bus money, or mountain-climbing. Though, if mountain-climbing was not an institutional activity, the language may not recognize such activity. In this form of compounding a noun adds itself to a host verb and forms a new verb. The noun drops its syntactic features.

English tends to undergo transmutation forming denominal verbs in the language.

In the listed examples above, the incorporation of the actant does not possess a separate syntactic position in the verbs.

The following illustrates the three sources of incorporation in English and corresponding examples:

In the listed examples above, the incorporation of the actant possesses a separate syntactic position in the verbs.

Hungarian is an Uralic language in which many different types of noun incorporation occurs. Hungarian's linguistic typology is agglutinative, meaning that the languages have words that consists of more than one morphemes. Hungarian combines "bare noun + verb" to form a new complex verb, and this would correspond to Mithun's first type of NI, Lexical Compounding.[27][3] Phonologically, the V and N are separate words, but in syntactically, the N loses its syntactic status as the argument of the sentence, and VN unit becomes an intransitive predicate.[3] This is demonstrated in the examples below:

To be clear, to 'house-build' is not the same as to 'build a house.' 'house-building' is a complex activity and a unitary concept, and this is applied to other examples as well.[28] The object argument of the underlying verb may be satisfied by the bare noun, but the bare noun does not act as an argument of a sentence like it usually would.[28] In Hungarian, for examples such as one mentioned, the incorporating verb must be imperfective and the complex verb formed from it always must be intransitive.[28]

Interestingly, in Hungarian, incorporated nominals may be morphologically singular or plural - this is dependent on whether languages allow this or not in their incorporation.[29]

One restriction that Hungarian forbids is that bare object nouns cannot be incorporated with prefixed verbs.[28]

Another restriction that Hungarian does not allow is stative verbs cannot allow noun incorporation, even if the stative verbs are not prefixed verbs.[28]

However, it is important note that there are some unprefixed verbs that are perfective and allows NI in Hungarian.[28]

Korean, national language of South Korea and North Korea, is part of Koreanic language family and also has noun incoporation.

Specifically, Korean obeys the Head Movement Constraint by Baker (1988) that was discussed in the prior section.[11]

Korean has incorporated nouns in the structure of [N + VStem + AN1(i)] which is different that the normal N + V like in English.[30] AN is an affix in this case.

In Korean, the noun, the head of the preceding NP, moves to the head of the VP to form a syntactic compound.[30] Then the complex VP moves up to the right of the nominal head position where is '-i' is base-generated.[31]