| Proto-Semitic | |
|---|---|
| Reconstruction of | Semitic languages | 
| Era | ca. 4500–3500 BC | 
| Reconstructed ancestor | |
| Lower-order reconstructions | |
Proto-Semitic is the hypothetical reconstructed proto-language ancestral to the Semitic languages. There is no consensus regarding the location of the Proto-Semitic Urheimat: scholars hypothesize that it may have originated in the Levant, the Sahara, the Horn of Africa, the Arabian Peninsula, or northern Africa.[1]
The Semitic language family is considered part of the broader macro-family of Afroasiatic languages.
Dating
The earliest attestations of a Semitic language are in Akkadian, dating to around the 24th to 23rd centuries BC (see Sargon of Akkad) and the Eblaite language, but earlier evidence of Akkadian comes from personal names in Sumerian texts from the first half of the third millennium BC.[2] One of the earliest known Akkadian inscriptions was found on a bowl at Ur, addressed to the very early pre-Sargonic king Meskiagnunna of Ur (c. 2485–2450 BC) by his queen Gan-saman, who is thought to have been from Akkad.[3] The earliest text fragments of West Semitic are snake spells in Egyptian pyramid texts, dated around the mid-third millennium BC.[4][5]
Proto-Semitic itself must have been spoken before the emergence of its daughters, so some time before the earliest attestation of Akkadian, and sufficiently long so for the changes leading from it to Akkadian to have taken place, which would place it in the fourth millennium BC or earlier.[2]
Urheimat
Since all modern Semitic languages can be traced back to a common ancestor, Semiticists have placed importance on locating the Urheimat of the Proto-Semitic language.[6] The Urheimat of the Proto-Semitic language may be considered within the context of the larger Afro-Asiatic family to which it belongs.
The previously popular hypothesis of an Arabian Urheimat has been largely abandoned since the region could not have supported massive waves of emigration before the domestication of camels in the 2nd millennium BC.[6]
There is also evidence that Mesopotamia and adjoining areas of modern Syria were originally inhabited by a non-Semitic population. That is suggested by non-Semitic toponyms preserved in Akkadian and Eblaite.
Levant hypothesis
A Bayesian analysis performed in 2009 suggests an origin for all known Semitic languages in the Levant around 3750 BC, with a later single introduction from South Arabia into the Horn of Africa around 800 BC. This statistical analysis could not, however, estimate when or where the ancestor of all Semitic languages diverged from Afroasiatic.[7] It thus neither contradicts nor confirms the hypothesis that the divergence of ancestral Semitic from Afroasiatic occurred in Africa.

In another variant of the theory, the earliest wave of Semitic speakers entered the Fertile Crescent via the Levant and eventually founded the Akkadian Empire. Their relatives, the Amorites, followed them and settled Syria before 2500 BC.[8] Late Bronze Age collapse in Israel led the South Semites to move southwards where they emigrated to the highlands of Yemen after the 20th century BC until those crossed Bab el-Mandeb to the Horn of Africa between 1500 and 500 BC.[8]
Phonology
Vowels
Proto-Semitic had a simple vowel system, with three qualities *a, *i, *u, and phonemic vowel length, conventionally indicated by a macron: *ā, *ī, *ū.[9] This system is preserved in Classical Arabic.[10]
Consonants
The reconstruction of Proto-Semitic was originally based primarily on Arabic, whose phonology and morphology (particularly in Classical Arabic) is extremely conservative, and which preserves as contrastive 28 out of the evident 29 consonantal phonemes.[11] Thus, the phonemic inventory of reconstructed Proto-Semitic is very similar to that of Arabic, with only one phoneme fewer in Arabic than in reconstructed Proto-Semitic, with *s and *š merging into Arabic /s/ ⟨س⟩ and *ś becoming Arabic /ʃ/ ⟨ش⟩. As such, Proto-Semitic is generally reconstructed as having the following phonemes (as usually transcribed in Semitology):[12]
| Type | Manner | Voicing | Labial | Interdental | Alveolar | Palatal | Velar/Uvular | Pharyngeal | Glottal | |
|---|---|---|---|---|---|---|---|---|---|---|
| Central | Lateral | |||||||||
| Obstruent | Stop | voiceless | *p [p] | *t [t] | *k [k] | |||||
| emphatic | (pʼ)[lower-alpha 1] | *ṭ [tʼ] | *q/ḳ [kʼ] | |||||||
| voiced | *b [b] | *d [d] | *g [g] | |||||||
| Affricate | voiceless | *s [t͡s] | *ś [t͡ɬ] | |||||||
| emphatic | *ṯ̣/θ̣/ẓ [t͡θʼ] | *ṣ [t͡sʼ] | *ṣ́/ḏ̣ [t͡ɬʼ] | |||||||
| voiced | *z [d͡z] | |||||||||
| Fricative | voiceless | *ṯ/θ [θ] | *š [ʃ] | *ḫ/k̇ [x~χ] | *ḥ [ħ] | *h [h] | ||||
| emphatic | (xʼ~χʼ)[lower-alpha 2] | |||||||||
| voiced | *ḏ [ð] | *ǵ/*ġ [ɣ~ʁ] | *ʻ,ˤ [ʕ] | |||||||
| Resonant | Trill | *r [r] | ||||||||
| Approximant | *w/u [w] | *l [l] | *y/i [j] | |||||||
| Nasal | *m [m] | *n [n] | ||||||||
The reconstructed phonemes *s *z *ṣ *ś *ṣ́ *ṯ̣, which are shown to be phonetically affricates in the table above, may also be interpreted as fricatives (/s z sʼ ɬ ɬʼ θʼ/), as discussed below. This was the traditional reconstruction and is reflected in the choice of signs.
The Proto-Semitic consonant system is based on triads of related voiceless, voiced and "emphatic" consonants. Five such triads are reconstructed in Proto-Semitic:
- Dental stops *d *t *ṭ
- Velar stops *g *k *ḳ (normally written *g *k *q)
- Dental sibilants *z *s *ṣ
- Interdental /ð θ θʼ/ (written *ḏ *ṯ *ṯ̣)
- Lateral /l ɬ ɬʼ/ (normally written *l *ś *ṣ́)
The probable phonetic realization of most consonants is straightforward and is indicated in the table with the International Phonetic Alphabet (IPA). Two subsets of consonants, however, deserve further comment.
Emphatics
The sounds notated here as "emphatic consonants" occur in nearly all Semitic languages as well as in most other Afroasiatic languages, and they are generally reconstructed as glottalization in Proto-Semitic.[14][15][nb 1] Thus, *ṭ, for example, represents [tʼ]. See below for the fricatives/affricates.
In modern Semitic languages, emphatics are variously realized as pharyngealized (Arabic, Aramaic, Tiberian Hebrew (such as [tˤ]), glottalized (Ethiopian Semitic languages, Modern South Arabian languages, such as [tʼ]), or as tenuis consonants (Turoyo language of Tur Abdin such as [t˭]);[16] Ashkenazi Hebrew and Maltese are exceptions and emphatics merge into plain consonants in various ways under the influence of Indo-European languages (Sicilian for Maltese, various languages for Hebrew).
An emphatic labial *ṗ occurs in some Semitic languages, but it is unclear whether it was a phoneme in Proto-Semitic.
- The classical Ethiopian Semitic language Geʽez is unique among Semitic languages for contrasting all three of /p/, /f/, and /pʼ/. While /p/ and /pʼ/ occur mostly in loanwords (especially from Greek), there are many other occurrences whose origin is less clear (such as hepʼä 'strike', häppälä 'wash clothes').[17]
- According to Hetzron, Hebrew developed an emphatic labial phoneme ṗ to represent unaspirated /p/ in Iranian and Greek.[18]
Fricatives
The reconstruction of Proto-Semitic has nine fricative sounds that are reflected usually as sibilants in later languages, but whether all were already sibilants in Proto-Semitic is debated:
- Two voiced fricatives *ð, *z that eventually became, for example, /z/ for both in Hebrew and Geʽez (/ð/ in early Geʽez), but /ð/ and /z/ in Arabic respectively
- Four voiceless fricatives
- *θ (*ṯ) that became /ʃ/ in Hebrew but /θ/ in Arabic and /s/ in Geʽez (/θ/ in early Geʽez)
- *š (*s₁) that became /ʃ/ in Hebrew but /s/ in Arabic and Geʽez
- *ś (*s₂) that became /s/ (transcribed ś) in Hebrew but /ʃ/ in Arabic and /ɬ/ in Geʽez
- *s (*s₃) that became /s/ in Hebrew, Arabic and Geʽez
 
- Three emphatic fricatives (*θ̣, *ṣ, *ṣ́)
The precise sound of the Proto-Semitic fricatives, notably of *š, *ś, *s and *ṣ, remains a perplexing problem, and there are various systems of notation to describe them. The notation given here is traditional and is based on their pronunciation in Hebrew, which has traditionally been extrapolated to Proto-Semitic. The notation *s₁, *s₂, *s₃ is found primarily in the literature on Old South Arabian, but more recently, it has been used by some authors to discuss Proto-Semitic to express a noncommittal view of the pronunciation of the sounds. However, the older transcription remains predominant in most literature, often even among scholars who either disagree with the traditional interpretation or remain noncommittal.[19]
The traditional view, as expressed in the conventional transcription and still maintained by some of the authors in the field[20][21][22] is that *š was a voiceless postalveolar fricative ([ʃ]), *s was a voiceless alveolar sibilant ([s]) and *ś was a voiceless alveolar lateral fricative ([ɬ]). Accordingly, *ṣ is seen as an emphatic version of *s ([sʼ]) *z as a voiced version of it ([z]) and *ṣ́ as an emphatic version of *ś ([ɬʼ]). The reconstruction of *ś ṣ́ as lateral fricatives (or affricates) is certain although few modern languages preserve the sounds. The pronunciation of *ś ṣ́ as [ɬ ɬʼ] is still maintained in the Modern South Arabian languages (such as Mehri), and evidence of a former lateral pronunciation is evident in a number of other languages. For example, Biblical Hebrew baśam was borrowed into Ancient Greek as balsamon (hence English "balsam"), and the 8th-century Arab grammarian Sibawayh explicitly described the Arabic descendant of *ṣ́, now pronounced [dˤ] in the standard pronunciation or [ðˤ] in Bedouin-influenced dialects, as a pharyngealized voiced lateral fricative [ɮˤ].[23][24] (Compare Spanish alcalde, from Andalusian Arabic اَلْقَاضِي al-qāḍī "judge".)
The primary disagreements concern whether the sounds were actually fricatives in Proto-Semitic or whether some were affricates and whether the sound designated *š was pronounced [ʃ] (or similar) in Proto-Semitic, as the traditional view posits, or had the value of [s]. The issue of the nature of the "emphatic" consonants, discussed above, is partly related (but partly orthogonal) to the issues here as well.
With respect to the traditional view, there are two dimensions of "minimal" and "maximal" modifications made:
- In how many sounds are taken to be affricates. The "minimal affricate" position takes only the emphatic *ṣ as an affricate [t͡sʼ]. The "maximal affricate" position additionally posits that *s *z were actually affricates [t͡s d͡z] while *š was actually a simple fricative [s].[25]
- In whether to extend the affricate interpretation to the interdentals and laterals. The "minimal extension" position assumes that only the sibilants were affricates, and the other "fricatives" were in fact all fricatives, but the maximal update extends the same interpretation to the other sounds. Typically, that means that the "minimal affricate, maximal extension" position takes all and only the emphatics are taken as affricates: emphatic *ṣ θ̣ ṣ́ were [t͡sʼ t͡θʼ t͡ɬʼ]. The "maximal affricate, maximal extension" position assumes not only the "maximal affricate" position for sibilants but also that non-emphatic *θ ð ś were actually affricates.
Affricates in Proto-Semitic were proposed early on but met little acceptance until the work of Alice Faber (1981) who challenged the older approach. The Semitic languages that have survived often have fricatives for these consonants. However, Ethiopic languages and Modern Hebrew, in many reading traditions, have an affricate for *ṣ.[26]
The evidence for the various affricate interpretations of the sibilants is direct evidence from transcriptions and structural evidence. However, the evidence for the "maximal extension" positions that extend affricate interpretations to non-sibilant "fricatives" is largely structural because of both the relative rarity of the interdentals and lateral obstruents among the attested Semitic language and the even-greater rarity of such sounds among the various languages in which Semitic words were transcribed. As a result, even when the sounds were transcribed, the resulting transcriptions may be difficult to interpret clearly.
The narrowest affricate view (only *ṣ was an affricate [t͡sʼ]) is the most accepted one.[27] The affricate pronunciation is directly attested in the modern Ethiopic languages and Modern Hebrew, as mentioned above, but also in ancient transcriptions of numerous Semitic languages in various other languages:
- Transcriptions of Ge'ez from the period of the Axumite Kingdom (early centuries AD): ṣəyāmo rendered as Greek τζιαμω tziamō.[27]
- The Hebrew reading tradition of ṣ as [t͡s] clearly goes back at least to medieval times, as shown by the use of Hebrew צ (ṣ) to represent affricates in early New Persian, Old Osmanli Turkic, Middle High German etc. Similarly, Old French c /t͡s/ was used to transliterate צ: Hebrew ṣɛdɛḳ "righteousness" and ʼārɛṣ "land (of Israel)" were written cedek, arec.[27]
- There is also evidence of an affricate in Ancient Hebrew and Phoenician ṣ. Punic ṣ was often transcribed as ts or t in Latin and Greek or occasionally Greek ks; correspondingly, Egyptian names and loanwords in Hebrew and Phoenician use ṣ to represent the Egyptian palatal affricate ḏ (conventionally described as voiced [d͡ʒ] but possibly instead an unvoiced ejective [t͡ʃʼ]).[28]
- Aramaic and Syriac had an affricated realization of *ṣ until some point, as is seen in Classical Armenian loanwords: Aramaic צרר 'bundle, bunch' → Classical Armenian crar /t͡sɹaɹ/.[29]
The "maximal affricate" view, applied only to sibilants, also has transcriptional evidence. According to Kogan, the affricate interpretation of Akkadian s z ṣ is generally accepted.[30]
- Akkadian cuneiform, as adapted for writing various other languages, used the z- signs to represent affricates. Examples include /ts/ in Hittite,[29] Egyptian affricate ṯ in the Amarna letters and the Old Iranian affricates /t͡ʃ d͡ʒ/ in Elamite.[31]
- Egyptian transcriptions of early Canaanite words with *z, *s, *ṣ use affricates (ṯ for *s, ḏ for *z, *ṣ).[32]
- West Semitic loanwords in the "older stratum" of Armenian reflect *s *z as affricates /t͡sʰ/, /d͡z/.[26]
- Greek borrowing of Phoenician 𐤔 *š to represent /s/ (compare Greek Σ), and 𐤎 *s to represent /ks/ (compare Greek Ξ) is difficult to explain if *s then had the value [s] in Phoenician, but it is quite easy to explain if it actually had the value [t͡s] (even more so if *š had the value [s]).[33]
- Similarly, Phoenician uses 𐤔 *š to represent sibilant fricatives in other languages rather than 𐤎 *s until the mid-3rd century BC, which has been taken by Friedrich/Röllig 1999 (pp. 27–28)[34] as evidence of an affricate pronunciation in Phoenician until then. On the other hand, Egyptian starts using s in place of earlier ṯ to represent Canaanite s around 1000 BC. As a result, Kogan[35] assumes a much earlier loss of affricates in Phoenician, and he assumes that the foreign sibilant fricatives in question had a sound closer to [ʃ] than [s]. (A similar interpretation for at least Latin s has been proposed[36] by various linguists based on evidence of similar pronunciations of written s in a number of early medieval Romance languages; a technical term for this "intermediate" sibilant is voiceless alveolar retracted sibilant.) However, it is likely that Canaanite was already dialectally split by that time and the northern, Early Phoenician dialect that the Greeks were in contact with could have preserved the affricate pronunciation until c. 800 BC at least, unlike the more southern Canaanite dialects that the Egyptians were in contact with, so that there is no contradiction.
There is also a good deal of internal evidence in early Akkadian for affricate realizations of s z ṣ. Examples are that underlying ||*t, *d, *ṭ + *š|| were realized as ss, which is more natural if the law was phonetically ||*t, *d, *ṭ + *s|| → [tt͡s],[29] and that *s *z *ṣ shift to *š before *t, which is more naturally interpreted as deaffrication.[30]
Evidence for *š as /s/ also exists but is somewhat less clear. It has been suggested that it is cross-linguistically rare for languages with a single sibilant fricative to have [ʃ] as the sound and that [s] is more likely.[33] Similarly, the use of Phoenician 𐤔 *š, as the source of Greek Σ s, seems easiest to explain if the phoneme had the sound of [s] at the time. The occurrence of [ʃ] for *š in a number of separate modern Semitic languages (such as Neo-Aramaic, Modern South Arabian, most Biblical Hebrew reading traditions) and Old Babylonian Akkadian is then suggested to result from a push-type chain shift, and the change from [t͡s] to [s] "pushes" [s] out of the way to [ʃ] in the languages in question, and a merger of the two to [s] occurs in various other languages such as Arabic and Ethiopian Semitic.
On the other hand, it has been suggested that the initial merged s in Arabic was actually a "hissing-hushing sibilant",[37] presumably something like [ɕ] (or a "retracted sibilant"), which did not become [s] until later. That would suggest a value closer to [ɕ] (or a "retracted sibilant") or [ʃ] for Proto-Semitic *š since [t͡s] and [s] would almost certainly merge directly to [s]. Furthermore, there is various evidence to suggest that the sound [ʃ] for *š existed while *s was still [ts].[38] Examples are the Southern Old Babylonian form of Akkadian, which evidently had [ʃ] along with [t͡s] as well as Egyptian transcriptions of early Canaanite words in which *š s are rendered as š ṯ. (ṯ is an affricate [t͡ʃ] and the consensus interpretation of š is [ʃ], as in Modern Coptic.[38])
Diem (1974) suggested that the Canaanite sound change of *θ → *š would be more natural if *š was [s] than if it was [ʃ]. However, Kogan argues that, because *s was [ts] at the time, the change from *θ to *š is the most likely merger, regardless of the exact pronunciation of *š while the shift was underway.[39]
Evidence for the affricate nature of the non-sibilants is based mostly on internal considerations. Ejective fricatives are quite rare cross-linguistically, and when a language has such sounds, it nearly always has [sʼ] so if *ṣ was actually affricate [tsʼ], it would be extremely unusual if *θ̣ ṣ́ was fricative [θʼ ɬʼ] rather than affricate [t͡θʼ t͡ɬʼ]. According to Rodinson (1981) and Weninger (1998), the Greek placename Mátlia, with tl used to render Ge'ez ḍ (Proto-Semitic *ṣ́), is "clear proof" that this sound was affricated in Ge'ez and quite possibly in Proto-Semitic as well.[40]
The evidence for the most maximal interpretation, with all the interdentals and lateral obstruents being affricates, appears to be mostly structural: the system would be more symmetric if reconstructed that way.
The shift of *š to h occurred in most Semitic languages (other than Akkadian, Minaean, Qatabanian) in grammatical and pronominal morphemes, and it is unclear whether reduction of *š began in a daughter proto-language or in Proto-Semitic itself. Some thus suggest that weakened *š̠ may have been a separate phoneme in Proto-Semitic.[41]
Prosody
Proto-Semitic is reconstructed as having non-phonemic stress on the third mora counted from the end of the word,[42] i.e. on the second syllable from the end, if it has the structure CVC or CVː (where C is any consonant and V is any vowel), or on the third syllable from the end, if the second one had the structure CV.[43]
Morphophonology
Proto-Semitic allowed only syllables of the structures CVC, CVː, or CV. It did not permit word-final clusters of two or more consonants, clusters of three or more consonants, hiatus of two or more vowels, or long vowels in closed syllables.[44]
Most roots consisted of three consonants. However, it appears that historically the three-consonant roots had developed from two-consonant ones (this is suggested by evidence from internal as well as external reconstruction). To construct a given grammatical form, certain vowels were inserted between the consonants of the root.[45][46] There were certain restrictions on the structure of the root: it was impossible to have roots where the first and second consonants were identical, and roots where the first and third consonants were identical were extremely rare.[47]
Correspondence of sounds with daughter languages
See Semitic languages#Phonology for a fuller discussion of the outcomes of the Proto-Semitic sounds in the various daughter languages.
Correspondence of sounds with other Afroasiatic languages
See table at Proto-Afroasiatic language#Consonant correspondences.
Grammar
Nouns
Three cases are reconstructed: nominative (marked by *-u), genitive (marked by *-i), accusative (marked by *-a).[48][49]
There were two genders: masculine (marked by a zero morpheme) and feminine (marked by *-at/*-t and *-ah/-ā).[50][51] The feminine marker was placed after the root, but before the ending, e.g.: *ba‘l- ‘lord, master’ > *ba‘lat- ‘lady, mistress’, *bin- ‘son’ > *bint- ‘daughter’.[52] Besides, there was a small group of feminine nouns that didn't have formal markers: *’imm- ‘mother’, *laxir- ‘ewe’, *’atān- ‘she-donkey’, *‘ayn- ‘eye’, *birk- ‘knee’[53]
There were three numbers: singular, plural and dual (only in nouns).[51]
There were two ways to mark the plural:[54]
- affixation
- masculine nouns formed their nominative by means of the marker *-ū, their genitive and accusative by *-ī, i.e., by lengthening the vowel of the singular case suffix;
- feminines also formed their plural by lengthening a vowel — namely, by means of the marker *-āt;
 
- apophonically (by changing the vocalisation pattern of the word, as seen e.g. in Arabic: kātib ‘writer’ — kuttāb ‘writers’) — only in the masculine.
The dual was formed by means of the markers *-ā in the nominative and *-āy in the genitive and accusative.[55]
The endings of the noun:[56]
| Singular | Plural | Dual | |
|---|---|---|---|
| Nominative | *-u | *-ū | *-ā | 
| Genitive | *-i | *-ī | *-āy | 
| Accusative | *-a | *-ī | *-āy | 
Pronouns
Like most of its daughter languages, Proto-Semitic has one free pronoun set, and case-marked bound sets of enclitic pronouns. Genitive case and accusative case are only distinguished in the first person.[57]
| independent nominative | enclitic | |||
|---|---|---|---|---|
| nominative | genitive | accusative | ||
| 1.sg. | ʼanā̆/ʼanākū̆ | -kū̆ | -ī/-ya | -nī | 
| 2.sg.masc. | ʼantā̆ | -tā̆ | -kā̆ | |
| 2.sg.fem. | ʼantī̆ | -tī̆ | -kī̆ | |
| 3.sg.masc. | šuʼa | -a | -šū̆ | |
| 3.sg.fem. | šiʼa | -at | -šā̆/-šī̆ | |
| 1.du. | ? | -nuyā ? | -niyā ? | -nayā ? | 
| 2.du. | ʼantumā | -tumā | -kumā/-kumay | |
| 3.du. | šumā | -ā | -šumā/-šumay | |
| 1.pl. | niḥnū̆ | -nū̆ | -nī̆ | -nā̆ | 
| 2.pl.masc. | ʼantum | -tum | -kum | |
| 2.pl.fem. | ʼantin | -tin | -kin | |
| 3.pl.masc. | šum/šumū | -ū | -šum | |
| 3.pl.fem. | šin/šinnā | -ā | -šin | |
For many pronouns, the final vowel is reconstructed with long and short positional variants; this is conventionally indicated by a combined macron and breve on the vowel (e.g. ā̆).
The Semitic demonstrative pronouns are usually divided into two series: those showing a relatively close object and those showing a more distant one.[58] Nonetheless, it is very difficult to reconstruct Proto-Semitic forms on the basis of the demonstratives of the individual Semitic languages.[59]
A series of interrogative pronouns are reconstructed for Proto-Semitic: *man ‘who’, *mā ‘what’ and *’ayyu ‘of what kind’ (derived from *’ay ‘where’).[60][61][62]
Numerals
Reconstruction of the cardinal numerals from one to ten (masculine):[63][64][65]
| Languages | Reconstruction | ||||||
|---|---|---|---|---|---|---|---|
| Akkadian | Ugaritic | Arabic | Sabean | Weninger | Lipiński | Huehnergard | |
| One | ištēnum | ꞻḥd | wāḥid | ’ḥd | *’aḥad- | *ḥad-, *‘išt- | *ʔaħad- | 
| Two | šena/šina | ṯn | iṯnān | ṯny | *ṯinān | *ṯin-, *kil’- | *θin̩-/*θn̩- | 
| Three | šalāšum | ṯlṯ | ṯalāṯa | s2lṯ | *śalāṯ- | *ślaṯ- | *θalaːθ- | 
| Four | erbûm | ꞻrbʻ | ’arbaʻ | ’rbʻ | *’arbaʻ- | *rbaʻ- | *ʔarbaʕ- | 
| Five | ḫamšum | ḫmš | ḫamsa | ḫms1 | *ḫamš- | *ḫamš- | *xamis- | 
| Six | ši/eššum | ṯṯ | sitta | s1dṯ/s1ṯ- | *šidṯ- | *šidṯ- | *sidθ- | 
| Seven | sebûm | šbʻ | sabʻa | s1bʻ | *šabʻ- | *šabʻ- | *sabʕ- | 
| Eight | samānûm | ṯmn | ṯamānia | ṯmny/ṯmn | *ṯamāniy- | *ṯmān- | *θamaːniy- | 
| Nine | tišûm | tšʻ | tisʻa | ts1ʻ | *tišʻ- | *tišʻ- | *tisʕ- | 
| Ten | ešrum | ʻšr | ʻašara | ʻs2r | *ʻaśr- | *ʻaśr- | *ʕaɬr- | 
All nouns from one to ten were declined as singular nouns with the exception of the numeral ‘two’, which was declined as a dual. Feminine forms of all numbers from one to ten were produced by the suffix *-at. In addition, if the name of the object counted was of the feminine gender, the numbers from 3 to 10 were in the masculine form and vice versa.[66]
The names of the numerals from 11 to 19 were formed by combining the names of the unit digits with the word ‘ten’. Twenty’ was expressed by the dual form of ‘ten’, and the names of the ten digits from 30 to 90 were plural forms of the corresponding unit digits. Besides, Proto-Semitic also had designations for hundred (*mi’t-), thousand (*li’m-) and ten thousand (*ribb-).[67][64]
Ordinal numerals cannot be reconstructed for the protolanguage because of the great diversity in the descendant languages.[65]
Verbs
Traditionally, two conjugations are reconstructed for Proto-Semitic — a prefix conjugation and a suffix conjugation.[68] According to a hypothesis that has garnered wide support, the prefix conjugation was used with verbs that expressed actions, and the suffix conjugation was used with verbs that expressed states.[69]
The prefix conjugation is reconstructed as follows:[70][71]
| Singular | Plural | Dual | ||
|---|---|---|---|---|
| 1 pers. | *’a- | *ni- | ||
| 2 pers. | ||||
| masc. | *ta- | *ta- – -ū | *ta- – -ā | |
| fem. | *ta- – -ī | *ta- – -ā | *ta- – -ā | |
| 3 pers. | ||||
| masc. | *yi- | *yi- – -ū | *ya- – -ā | |
| fem. | *ta- | *yi- – -ā | *ta- – -ā | 
The suffix conjugation is reconstructed as follows:[72]
| Singular | Plural | Dual | ||
|---|---|---|---|---|
| 1 pers. | *-ku | *-na | *-kāya/-nāya | |
| 2 pers. | ||||
| masc. | *-ka/-ta | *-kan(u)/-tanu | *-kā/-tanā | |
| fem. | *-ki/-ti | *-kin(a)/-tina | *-kā/-tanā | |
| 3 pers. | ||||
| masc. | – | *-ū | *-ā | |
| fem. | *-at | *-ā | *-atā | 
Verb stems are divided into basic (German: Grundstamm) and derived. The basic ones consist of a three-consonant root with thematic vowels. Among the derived ones, one distinguishes stems with a geminated middle consonant (German: Doppelungsstamm), stems with a lengthened first vowel, causative stems (formed by means of the prefix *ša-), nouns with the prefix *na-/*ni-, stems with the suffix *-tV-, stems that consist of a reduplicated biconsonantal root and stems with a geminated final consonant.[73][74][75]
From the basic stems, an active participle was formed on the pattern CāCiC, the passive one on the patterns CaCīC and CaCūC.[76]
From the derived stems, the participles were formed by means of the prefix *mu-, while the vocalisation of the active ones was a-i and that of the passive ones was a-a[77] (on this pattern, for example, the Arabic name muḥammad is formed from the root ḥmd ‘to praise’.[78])
The imperative mood was formed only for the second person, and the form for the singular masculine was the pure stem:[79]
| Singular | Plural | Dual | ||
|---|---|---|---|---|
| 2 pers. | ||||
| masc. | - | *-ū | *-ā | |
| fem. | *-i | *-ā | *-ā | 
Conjunctions
Three conjunctions are reconstructed for Proto-Semitic:[80]
- *wa ’and’;
- *’aw ’or’;
- *šimmā ’if’.
Syntax
The Proto-Semitic language was a language of nominative-accusative alignment, which is preserved in most of its descendant languages.[81]
The basic word order of Proto-Semitic was VSO (verb — subject — direct object), and the modifier usually followed its head.[82][65]
Lexis

Reconstruction of the Proto-Semitic lexis provides more information about the life of Proto-Semites and helps in the search for their Urheimat.
Thus, it is possible to reconstruct religious terms (*’il ‘deity’, *ḏbḥ ‘to perform a sacrifice’, *mšḥ ‘to anoint’, *ḳdš ‘be holy’, *ḥrm ‘to forbid, excommunicate’ *ṣalm- ‘idol’), agricultural terms (*ḥaḳl- ‘field’, *ḥrṯ ‘to plough’, *zrʻ ‘to sow’, *ʻṣ́d ‘to harvest’, *dyš ‘to thresh’, *ḏrw ‘to winnow’, *gurn- ‘threshing-floor’, *ḥinṭ- ‘wheat’, *kunāṯ- ‘emmer’, *duḫn- ‘millet’), animal husbandry terms (*’immar- ‘ram’, *raḫil- ‘ewe’, *‘inz- ‘goat’, *śaw- ‘a flock of sheep’, *ṣ́a’n- ‘a herd of sheep and goats’, *gzz ‘to shear sheep’, *r‘y ‘to graze (animals)’, *šḳy ‘to guide to a watering place’, *’alp- ‘bull’, *ṯawr- ‘buffalo’, *ḫzr-/*ḫnzr- ‘pig’, *kalb- ‘dog’, *ḥimār- ‘donkey’, *’atān- ‘she-donkey’, *ḥalab- ‘milk’, *lašad- ‘cream’, *ḫim’at- ‘butter’), terms of daily life (*bayt- ‘house’, *dalt- ‘door’, *kussi’- ‘chair’, *‘arś- ‘bed’, *kry ‘to dig’, *bi’r- ‘well’, *śrp ‘to kindle, *’iš- ‘fire’, *ḳly ‘to roast’, *laḥm- ‘food’), technological terms (*ṣrp ‘to smelt’, *paḥḥam- ‘coal’, *kasp- ‘silver’, *kupr- ‘bitumen’, *kuḥl- ‘antimony’, *napṭ- ‘petrol’, *ḥabl- ‘rope’, *ḳašt- ‘bow’, *ḥaṱw- ‘arrow’). Many words are useful for the identification of the Semitic Urheimat (*ti’n- ‘fig’, *ṯūm- ‘garlic’, *baṣal- ‘onion’, *tam(a)r- ‘palm tree’, *dibš- ‘date honey’, *buṭn- ‘pistachio’, *ṯaḳid- ‘almond’, *kammūn- ‘cumin’).[83][84]
The words *ṯawr- ‘buffalo’ and *ḳarn- ‘horn’ are suspected to be borrowings from Proto-Indo-European[83] or vice versa (for *ṯawr- and certain other words).[85] Sergei Starostin adduces several dozens of Semito-Indo-European correspondences, which he considers to be borrowings into Proto-Semitic from Proto-Anatolian or a disappeared branch of Proto-Indo-European.[86]
Comparative vocabulary and reconstructed roots
See List of Proto-Semitic stems (appendix in Wiktionary).
See also
Notes
- ↑ That explains the lack of voicing distinction in the emphatic series, which would be unnecessary if the emphatics were pharyngealized.
References
- ↑ The Oxford Handbook of the History of Linguistics by Keith Allan
- 1 2 Huehnergard, John (2019). "Introduction to the Semitic languages and their history". In John Huehnergard and Na‘ama Pat-El (ed.). The Semitic Languages (Second ed.). New York: Routledge.
- ↑ Bertman, Stephen (2003). Handbook to Life in Ancient Mesopotamia. Oxford University Press. p. 94. ISBN 978-019-518364-1. Retrieved 16 May 2015.
- ↑ Steiner, Richard C. (2011). Early Northwest Semitic Serpent Spells in the Pyramid Texts. Winona Lake: Eisenbrauns.
- ↑ Huehnergard, John (2020). "The Languages of the Ancient Near East". In Daniel C. Snell (ed.). A Companion to the Ancient Near East (Second ed.). Hoboken: John Wiley & Sons. pp. 341–353.
- 1 2 Lipiński 2001, pp. 42
- ↑ Kitchen, A.; Ehret, C.; Assefa, S.; Mulligan, C. J. (29 April 2009). "Bayesian phylogenetic analysis of Semitic languages identifies an Early Bronze Age origin of Semitic in the Near East". Proceedings of the Royal Society B: Biological Sciences. 276 (1668): 2703–10. doi:10.1098/rspb.2009.0408. PMC 2839953. PMID 19403539.
- 1 2 Lipiński 2001, pp. 44
- ↑ Huehnergard (2008), p. 231.
- ↑ Kogan (2011), p. 119.
- ↑ Versteegh, Cornelis Henricus Maria "Kees" (1997). The Arabic Language. Columbia University Press. p. 13. ISBN 978-0-231-11152-2.
- ↑ Sáenz Badillos, Angel (1993) [1988]. "Hebrew in the context of the Semitic Languages". Historia de la Lengua Hebrea [A History of the Hebrew Language]. Translated by John Elwolde. Cambridge, UK: Cambridge University Press. pp. 18–19. ISBN 0-521-55634-1.
- ↑ Kogan (2011), p. 54.
- ↑ Cantineau, J. (1952). "Le consonantisme du sémitique". Semitica: 79–94.
- ↑ Kogan (2011), p. 61.
- ↑ Dolgopolsky 1999, p. 29.
- ↑ Woodard 2008, p. 219.
- ↑ Hetzron 1997, p. 147.
- ↑ For an example of an author using the traditional symbols but subscribing to the new sound values, see Hackett, Joe Ann. 2008. Phoenician and Punic. The Ancient Languages of Syria-Palestine and Arabia (ed. Roger D. Woodard). Likewise, Huehnergard, John and Christopher Woods. 2008. Akkadian and Eblaite. The Ancient Languages of Mesopotamia, Egypt, and Aksum (ed. Roger D. Woodard). p. 96: "Similarly, there was a triad of affricates, voiced /ᵈz/ (⟨z⟩) voiceless /ᵗs/ (⟨s⟩), and emphatic /ᵗsʼ/ (⟨*ṣ⟩). These became fricatives in later dialects; the voiceless member of this later, fricative set was pronounced [s] in Babylonian, but [š] in Assyrian, while the reflex of Proto-Semitic *š, which was probably simple [s] originally, continued to be pronounced as such in Assyrian, but as [š] in Babylonian." Similarly, an author remaining undecided regarding the sound values of the sibilants will also use the conventional symbols, for example, Greenberg, Joseph, The Patterning of Root Morphemes in Semitic. 1990. p. 379. On language: selected writings of Joseph H. Greenberg. Ed. Keith M. Denning and Suzanne Kemme: "There is great uncertainty regarding the phonetic values of s, ś, and š in Proto-Semitic. I simply use them here as conventional transcriptions of the three sibilants corresponding to the sounds indicated by samekh, śin, and šin respectively in Hebrew orthography."
- ↑ Lipiński, Edward. 2000. Semitic languages: outline of a comparative grammar. e.g. the tables on p.113, p.131; also p.133: "Common Semitic or Proto-Semitic has a voiceless fricative prepalatal or palato-alevolar š, i.e. [ʃ] ...", p.129 ff.
- ↑ Macdonald, M.C.A. 2008. Ancient North Arabian. In: The Ancient Languages of Syria-Palestine and Arabia (ed. Roger D. Woodard). p. 190.
- ↑ Blau, Joshua (2010). Phonology and Morphology of Biblical Hebrew. Winona Lake, Indiana: Eisenbrauns. p. 25–40.
- ↑ Ferguson, Charles (1959), "The Arabic Koine", Language, 35 (4): 630, doi:10.2307/410601, JSTOR 410601.
- ↑ Versteegh, Kees (1997), The Arabic Language, Edinburgh University Press, ISBN 90-04-17702-7
- ↑ For example, Huehnergard (2008), pp. 229–231.
- 1 2 Dolgopolsky 1999, p. 33.
- 1 2 3 Kogan (2011), p. 62.
- ↑ Kogan (2011), p. 63.
- 1 2 3 Dolgopolsky 1999, p. 32.
- 1 2 Kogan (2011), p. 66.
- ↑ Kogan (2011), p. 67.
- ↑ Kogan (2011), pp. 67–68.
- 1 2 Kogan (2011), p. 69.
- ↑ Quoted in Kogan (2011), p. 68.
- ↑ Kogan (2011), p. 68.
- ↑ Vijūnas, Aurelijus (2010), "The Proto-Indo-European Sibilant */s/", Historische Sprachforschung, Göttingen, 123: 40–55, doi:10.13109/hisp.2010.123.1.40, ISSN 0935-3518
- ↑ Kogan (2011), p. 70, quoting Martinet 1953 p. 73 and Murtonen 1966 p. 138.
- 1 2 Kogan (2011), p. 70.
- ↑ Kogan (2011), pp. 92–93.
- ↑ Kogan (2011), p. 80.
- ↑ Dolgopolsky 1999, pp. 19, 69–70
- ↑ Kogan L. (2011). "Proto-Semitic Phonetics and Phonology". The Semitic languages. Berlin — Boston: Walter de Gruyter. p. 124. ISBN 978-3-11-018613-0.
- ↑ Huehnergard J. (2008). "Afro-Asiatic". The Ancient Languages of Syria-Palestine and Arabia. New York: Cambridge University Press. p. 232. ISBN 978-0-511-39338-9.
- ↑ Huehnergard J. (2008). "Afro-Asiatic". The Ancient Languages of Syria-Palestine and Arabia. New York: Cambridge University Press. p. 231. ISBN 978-0-511-39338-9.
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. pp. 72–73.{{cite book}}: CS1 maint: multiple names: authors list (link)
- ↑ Weninger S. (2011). "Reconstructive Morphology". The Semitic languages. Berlin — Boston: Walter de Gruyter. pp. 152–153. ISBN 978-3-11-018613-0.
- ↑ Huehnergard J. (2008). "Afro-Asiatic". The Ancient Languages of Syria-Palestine and Arabia. New York: Cambridge University Press. p. 233. ISBN 978-0-511-39338-9.
- ↑ Weninger S. (2011). "Reconstructive Morphology". The Semitic languages. Berlin — Boston: Walter de Gruyter. p. 165. ISBN 978-3-11-018613-0.
- ↑ Huehnergard J. (2008). "Afro-Asiatic". The Ancient Languages of Syria-Palestine and Arabia. New York: Cambridge University Press. p. 235. ISBN 978-0-511-39338-9.
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. pp. 84–85.{{cite book}}: CS1 maint: multiple names: authors list (link)
- 1 2 Weninger S. (2011). "Reconstructive Morphology". The Semitic languages. Berlin — Boston: Walter de Gruyter. p. 166. ISBN 978-3-11-018613-0.
- ↑ Huehnergard J. (2011). Proto-Semitic Language and Culture. Vol. The American Heritage dictionary of the English Language. p. 2067.
- ↑ Huehnergard J. (2008). "Afro-Asiatic". The Ancient Languages of Syria-Palestine and Arabia. New York: Cambridge University Press. p. 234. ISBN 978-0-511-39338-9.
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. pp. 87–92.{{cite book}}: CS1 maint: multiple names: authors list (link)
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. p. 93.{{cite book}}: CS1 maint: multiple names: authors list (link)
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. p. 94.{{cite book}}: CS1 maint: multiple names: authors list (link)
- ↑ Huehnergard (2008), p. 237; Huehnergard's phonetic transcription is changed to traditional symbols here.
- ↑ Lipiński E. (1997). Semitic languages:Outline of a Comparative Grammar. Leuven: Peeters. p. 315. ISBN 90-6831-939-6.
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. p. 112.{{cite book}}: CS1 maint: multiple names: authors list (link)
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. pp. 114–115.{{cite book}}: CS1 maint: multiple names: authors list (link)
- ↑ Lipiński E. (1997). Semitic languages:Outline of a Comparative Grammar. Leuven: Peeters. pp. 328–329. ISBN 90-6831-939-6.
- ↑ Huehnergard J. (2008). "Afro-Asiatic". The Ancient Languages of Syria-Palestine and Arabia. New York: Cambridge University Press. p. 238. ISBN 978-0-511-39338-9.
- ↑ Weninger S. (2011). "Reconstructive Morphology". The Semitic languages. Berlin — Boston: Walter de Gruyter. p. 167. ISBN 978-3-11-018613-0.
- 1 2 Lipiński E. (1997). Semitic languages:Outline of a Comparative Grammar. Leuven: Peeters. p. 282. ISBN 90-6831-939-6.
- 1 2 3 Huehnergard J. (2008). "Afro-Asiatic". The Ancient Languages of Syria-Palestine and Arabia. New York: Cambridge University Press. p. 241. ISBN 978-0-511-39338-9.
- ↑ Huehnergard J. (2008). "Afro-Asiatic". The Ancient Languages of Syria-Palestine and Arabia. New York: Cambridge University Press. p. 240. ISBN 978-0-511-39338-9.
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. pp. 117–118.{{cite book}}: CS1 maint: multiple names: authors list (link)
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. pp. 131–132.{{cite book}}: CS1 maint: multiple names: authors list (link)
- ↑ Коган Л. Е. (2009). "Семитские языки". Языки мира: Семитские языки. Аккадский язык. Северозападносемитские языки. М.: Academia. p. 75. ISBN 978-5-87444-284-2.
- ↑ Weninger S. (2011). "Reconstructive Morphology". The Semitic languages. Berlin — Boston: Walter de Gruyter. p. 160. ISBN 978-3-11-018613-0.
- ↑ Lipiński E. (1997). Semitic languages:Outline of a Comparative Grammar. Leuven: Peeters. p. 370. ISBN 90-6831-939-6.
- ↑ Lipiński E. (1997). Semitic languages:Outline of a Comparative Grammar. Leuven: Peeters. p. 360. ISBN 90-6831-939-6.
- ↑  Moscati S., Spitaler A., Ullendorff E., von Soden W. (1980). An Introduction to the Comparative Grammar of the Semitic Languages. Wiesbaden: Otto Harrassowitz. pp. 122–130.{{cite book}}: CS1 maint: multiple names: authors list (link)
- ↑ Lipiński E. (1997). Semitic languages:Outline of a Comparative Grammar. Leuven: Peeters. pp. 378–406. ISBN 90-6831-939-6.
- ↑ Weninger S. (2011). "Reconstructive Morphology". The Semitic languages. Berlin — Boston: Walter de Gruyter. pp. 156–157. ISBN 978-3-11-018613-0.
- ↑ Lipiński E. (1997). Semitic languages:Outline of a Comparative Grammar. Leuven: Peeters. p. 419. ISBN 90-6831-939-6.
- ↑ Lipiński E. (1997). Semitic languages:Outline of a Comparative Grammar. Leuven: Peeters. pp. 420–421. ISBN 90-6831-939-6.
- ↑ Huehnergard J. (2011), Proto-Semitic Language and Culture, vol. The American Heritage dictionary of the English Language, p. 2066
- ↑ Lipiński E. (1997). Semitic languages:Outline of a Comparative Grammar. Leuven: Peeters. pp. 366–367. ISBN 90-6831-939-6.
- ↑ Weninger S. (2011). "Reconstructive Morphology". The Semitic languages. Berlin — Boston: Walter de Gruyter. p. 169. ISBN 978-3-11-018613-0.
- ↑ Коган Л. Е. (2009). "Семитские языки". Языки мира: Семитские языки. Аккадский язык. Северозападносемитские языки. М.: Academia. p. 99. ISBN 978-5-87444-284-2.
- ↑  Huehnergard, John (2006). "Proto-Semitic and Proto-Akkadian". The Akkadian language in its Semitic Context: 1. {{cite journal}}: Cite journal requires|journal=(help)
- 1 2 Huehnergard J. (2011), Proto-Semitic Language and Culture, vol. The American Heritage dictionary of the English Language, p. 2068
- ↑ Kogan L. (2011). "Proto-Semitic Lexicon". The Semitic languages. Berlin — Boston: Walter de Gruyter. pp. 179–242. ISBN 978-3-11-018613-0.
- ↑  "Древнейшие индоевропейско-семитские языковые контакты" (Проблемы индоевропейского языкознания ed.). 1964: 3–12. {{cite journal}}: Cite journal requires|journal=(help)
- ↑ а Старостин, С. (2007). Indo-European Glottochronology and Homeland (Труды по языкознанию ed.). Языки славянских культур. pp. 821–826. ISBN 978-5-9551-0186-6.
Sources
- Blench, Roger (2006). Archaeology, Language, and the African Past. Rowman Altamira. ISBN 978-0-7591-0466-2. Retrieved 30 June 2013.
- Dolgopolsky, Aron (1999). From Proto-Semitic to Hebrew. Milan: Centro Studi Camito-Semitici di Milano.
- Hetzron; Robert (1997). The Semitic languages. Cambridge University Press. p. 572. ISBN 0-415-05767-1.
- Huehnergard, John (2000). "Proto-Semitic Language and Culture + Appendix II: Semitic Roots". American Heritage Dictionary of the English Language (Fourth ed.). Boston & New York: Houghton Mifflin Company. pp. 2056–2068. ISBN 0-395-82517-2.
- Huehnergard, John. (2003) "Akkadian ḫ and West Semitic ḥ." Studia Semitica 3, ed. Leonid E. Kogan & Alexander Militarev. Moscow: Russian State University for the Humanities. pp. 102–119. ISBN 978-5-728-10690-6
- Huehnergard, John (2008). "Appendix 1. Afro-Asiatic". In Woodard, Roger (ed.). The Ancient Languages of Syria-Palestine and Arabia. Cambridge University Press. pp. 225–246. ISBN 978-0-521-68498-9.
- Kienast, Burkhart. (2001). Historische semitische Sprachwissenschaft.
- Kogan, Leonid (2011). "Proto-Semitic Phonology and Phonetics". In Weninger, Stefan (ed.). The Semitic Languages: An International Handbook. Walter de Gruyter. pp. 54–151. ISBN 978-3-11-025158-6.
- Lipiński, Edward (2001). Semitic Languages: Outline of a Comparative Grammar. Peeters Publishers. ISBN 978-90-429-0815-4. Retrieved 30 June 2013.
- Woodard, Roger (2008). The Ancient Languages of Mesopotamia, Egypt and Aksum. Cambridge University Press. p. 250. ISBN 978-0-521-68497-2.