Persian alphabet
Persian alphabet |
---|
ا ب پ ت ث ج چ ح خ د ذ ر ز ژ س ش ص ض ط ظ ع غ ف ق ک گ ل م ن و ه ی |
Perso-Arabic script |
The Persian alphabet (Persian: الفبای فارسی alefbā-ye fārsi) or Perso-Arabic alphabet is a writing system based on the Arabic script and used for the Persian language. It has four more letters than the Arabic alphabet: پ [p], چ [t͡ʃ], ژ [ʒ], and گ [ɡ].
The Persian script is an abjad and is written exclusively in cursive: most of the letters in a word connect to each other. Computers implement this as well; when Persian text is typed, the letters are joined automatically. Words are written from right to left. Vowels are underrepresented in writing; see below for details.
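As a rough illustration of how computers handle the joining: text is normally stored as one code point per letter, and the renderer chooses the connected shape. Unicode also encodes the positional shapes separately as compatibility "presentation forms", which the contextual-forms columns of the letter table use. The sketch below relies only on Python's standard `unicodedata` module and assumes that NFKC normalization is an acceptable way to fold such forms back to the base letter; `base_letter` is a name chosen here for illustration.

```python
import unicodedata

def base_letter(glyph: str) -> str:
    """Fold a positional presentation form back to its base letter via NFKC."""
    return unicodedata.normalize("NFKC", glyph)

# U+FE8F..U+FE92 are the isolated, final, initial and medial forms of be.
for cp in range(0xFE8F, 0xFE93):
    glyph = chr(cp)
    folded = base_letter(glyph)
    print(f"U+{cp:04X} {unicodedata.name(glyph)} -> U+{ord(folded):04X}")
```

All four forms fold to the single letter ب (U+0628), which is what a keyboard normally produces; the shaping is left to the display layer.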
The Tahirid dynasty replaced the Pahlavi scripts with the Persian alphabet for writing the Persian language in ninth-century Greater Khorasan.[1][2]
Letters
Below are the 32 letters of the modern Persian alphabet. Since the script is cursive, the appearance of a letter changes depending on its position in a word: isolated, initial (joined on the left), medial (joined on both sides), or final (joined on the right).
The names of the letters are mostly the ones used in Arabic, except for the Persian pronunciation. The only ambiguous name is he, which is used for both ح and ه. For clarification, these are often called ḥe-ye jimi (literally "jim-like ḥe", after jim, the name for the letter ج, which uses the same base form) and he-ye do-češm (literally "two-eyed he", after the contextual middle letterform ﻬ), respectively.
# | Name | Name in Persian script | DIN 31635 | IPA | Final form | Medial form | Initial form | Isolated form |
---|---|---|---|---|---|---|---|---|
0 | hamza[3] | همزه | ʾ | [ʔ] | ـئ ـأ ـؤ | ـئـ | ئـ | ء أ |
1 | ʾalef | الف | ā | [ɒ] | ﺎ | ﺎ | آ / ا | آ / ا |
2 | be | بِ | b | [b] | ـب | ـﺒ | ﺑ | ب |
3 | pe | پِ | p | [p] | ـپ | ـﭙ | ﭘ | پ |
4 | te | تِ | t | [t] | ـت | ـﺘ | ﺗ | ت |
5 | s̱e | ثِ | s̱ | [s] | ـث | ـﺜ | ﺛ | ث |
6 | jim | جیم | j | [d͡ʒ] | ﺞ | ـﺠ | ﺟ | ج |
7 | che | چِ | č | [t͡ʃ] | ﭻ | ـﭽ | ﭼ | چ |
8 | ḥe(-ye jimi) | حِ | ḥ | [h] | ﺢ | ـﺤ | ﺣ | ح |
9 | khe | خِ | x | [x] | ﺦ | ـﺨ | ﺧ | خ |
10 | dāl | دال | d | [d] | ـد | ـد | د | د |
11 | ẕāl | ذال | ẕ | [z] | ـذ | ـذ | ذ | ذ |
12 | re | رِ | r | [ɾ] | ـر | ـر | ر | ر |
13 | ze | زِ | z | [z] | ـز | ـز | ز | ز |
14 | že | ژِ | ž | [ʒ] | ـژ | ـژ | ژ | ژ |
15 | sin | سین | s | [s] | ـس | ـﺴ | ﺳ | س |
16 | šin | شین | š | [ʃ] | ـش | ـﺸ | ﺷ | ش |
17 | ṣād | صاد | ṣ | [s] | ـص | ـﺼ | ﺻ | ص |
18 | z̤ād | ضاد | z̤ | [z] | ـض | ـﻀ | ﺿ | ض |
19 | ṭā, ṭoy (in Dari) | طی, طا | ṭ | [t] | ـط | ـﻄـ | ﻃ | ط |
20 | ẓā, ẓoy (in Dari) | ظی, ظا | ẓ | [z] | ـظ | ـﻈـ | ﻇ | ظ |
21 | ʿeyn | عین | ʿ | [ʔ] | ﻊ | ﻌ | ﻋ | ع |
22 | ġeyn | غین | ġ | [ɣ] / [ɢ] | ﻎ | ﻐ | ﻏ | غ |
23 | fe | فِ | f | [f] | ـف | ـﻔ | ﻓ | ف |
24 | qāf | قاف | q | [ɢ] / [ɣ] / [q] (in some dialects) | ـق | ـﻘ | ﻗ | ق |
25 | kāf | کاف | k | [k] | ـک | ـﻜ | ﻛ | ک |
26 | gāf | گاف | g | [ɡ] | ـگ | ـﮕ | ﮔ | گ |
27 | lām | لام | l | [l] | ـل | ـﻠ | ﻟ | ل |
28 | mim | میم | m | [m] | ـم | ـﻤ | ﻣ | م |
29 | nun | نون | n | [n] | ـن | ـﻨ | ﻧ | ن |
30 | vāv | واو | v / ū / ow / (w / aw / ō in Dari) | [v] / [uː] / [o] / [ow] / ([w] / [aw] / [oː] in Dari) | ـو | ـو | و | و |
31 | he(-ye do-češm) | هِ | h | [h] | ﻪ | ﻬ | ﻫ | ه |
32 | ye | یِ | y / ī / á / (ay / ē in Dari) | [j] / [i] / [ɒː] / ([aj] / [eː] in Dari) | ﯽ | ـﯿ | ﯾ | ی |
- Letters which do not link to a following letter
Seven letters (و, ژ, ز, ر, ذ, د, ا) do not connect to a following letter as the rest of the letters of the alphabet do. These seven letters have the same form in isolated and initial position, and a second form in medial and final position. For example, when the letter ا "alef" is at the beginning of a word such as اینجا "injā" (here), the same form is used as in an isolated "alef". In امروز "emruz" (today), the letter ر "re" takes the final form and the letter و "vāv" takes the isolated form, though both are in the middle of the word, and ز "ze" also has its isolated form, though it occurs at the end of the word.
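The joining rule above can be sketched in a few lines of Python. This is a simplified model, not a real text shaper: it assumes the word contains only letters of the alphabet (no diacritics, hamze, or zero-width characters), and the names `NON_FORWARD_JOINING` and `positional_forms` are made up for this illustration.

```python
# The seven letters that do not connect to a following letter:
# alef, dāl, ẕāl, re, ze, že, vāv (written as escapes to avoid bidi confusion).
NON_FORWARD_JOINING = set("\u0627\u062F\u0630\u0631\u0632\u0698\u0648")

def positional_forms(word):
    """Return (letter, form) pairs; form is isolated, initial, medial or final."""
    forms = []
    for i, ch in enumerate(word):
        # Joined to the previous letter only if that letter connects forward.
        joined_prev = i > 0 and word[i - 1] not in NON_FORWARD_JOINING
        # Joined to the next letter only if this letter connects forward.
        joined_next = i < len(word) - 1 and ch not in NON_FORWARD_JOINING
        if joined_prev and joined_next:
            form = "medial"
        elif joined_prev:
            form = "final"
        elif joined_next:
            form = "initial"
        else:
            form = "isolated"
        forms.append((ch, form))
    return forms

emruz = "\u0627\u0645\u0631\u0648\u0632"  # امروز "emruz" (today)
for letter, form in positional_forms(emruz):
    print(f"U+{ord(letter):04X} {form}")
```

For امروز this yields isolated, initial, final, isolated, isolated, matching the description: re takes its final form, while vāv and ze keep their isolated forms.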
Diacritics
Persian script has adopted a subset of the Arabic diacritics: zabar /æ/ (fatḥah in Arabic), zir /e/ (kasrah in Arabic), pesh /ou̯/ or /o/ (ḍammah in Arabic, pronounced zamme in Western Persian), sukūn, tanwīn nasb /æn/, and shadda (gemination). Other Arabic diacritics may be seen in Arabic loanwords.
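In Unicode text these diacritics are separate combining marks that follow the letter they modify. The sketch below lists the code points for the marks named above and shows one possible way to remove them, for example before comparing vocalized and unvocalized spellings. The helper name `strip_diacritics` is an assumption of this example, and filtering on the general category `Mn` is a deliberately blunt choice that removes every nonspacing mark, not just these six.

```python
import unicodedata

# Code points of the marks named above (Unicode character names in comments).
DIACRITICS = {
    "zabar": "\u064E",        # ARABIC FATHA
    "zir": "\u0650",          # ARABIC KASRA
    "pesh": "\u064F",         # ARABIC DAMMA
    "sukun": "\u0652",        # ARABIC SUKUN
    "tanwin nasb": "\u064B",  # ARABIC FATHATAN
    "shadda": "\u0651",       # ARABIC SHADDA
}

def strip_diacritics(text: str) -> str:
    """Drop every nonspacing combining mark (category Mn) from text."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Mn")

be_with_zir = "\u0628\u0650"  # be written with zir, as in the letter table
print(len(be_with_zir), len(strip_diacritics(be_with_zir)))
```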
Other characters
The following are not actual letters but different orthographical shapes for letters, and in the case of the lām alef, a ligature. As to ﺀ hamze, it has only a single graphic, since it is never tied to a preceding or following letter. However, it is sometimes 'seated' on a vāv, ye or alef, and in that case the seat behaves like an ordinary vāv, ye or alef respectively. Technically, hamze is not a letter but a diacritic.
Name | Transliteration | IPA | Final | Medial | Initial | Stand-alone |
---|---|---|---|---|---|---|
alef madde | ā | [ɒ] | ﺂ | — | — | ﺁ |
he ye | -eye or -eyeh | [eje] | ﮥ | — | — | ۀ |
lām alef | lā | [lɒ] | ﻼ | — | — | ﻻ |
Although the Persian and Arabic alphabets may seem similar at first glance, the two languages use them in many different ways. For example, similar words are written differently in Persian and Arabic.
Novel letters
The Persian alphabet adds four letters to the Arabic alphabet, for the sounds /p/, /ɡ/, /t͡ʃ/ (ch in chair), and /ʒ/ (s in measure):
Sound | Shape | Unicode name |
---|---|---|
/p/ | پ | peh |
/t͡ʃ/ (ch) | چ | tcheh |
/ʒ/ (zh) | ژ | jeh |
/ɡ/ | گ | gaf |
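The Unicode names in the table can be checked with Python's standard `unicodedata` module. This is only a quick lookup; the full character names carry the prefix "ARABIC LETTER", since the four letters are encoded in the Arabic block.

```python
import unicodedata

# pe, che, že and gāf, written as escapes to avoid bidi confusion.
ADDED_LETTERS = "\u067E\u0686\u0698\u06AF"

for letter in ADDED_LETTERS:
    print(f"U+{ord(letter):04X} {unicodedata.name(letter)}")
# U+067E ARABIC LETTER PEH
# U+0686 ARABIC LETTER TCHEH
# U+0698 ARABIC LETTER JEH
# U+06AF ARABIC LETTER GAF
```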
Differences from the Arabic writing system
Many Arabic letters represent sounds that are not present in Persian. They are typically employed only in loanwords, where native Persian sounds replace the Arabic ones; for example, ذ, ض, and ظ are all pronounced the same as ze ز, [z].
Vowel notation is simple but its history is complicated. Classical Arabic has a vowel length distinction; in writing, long vowels are normally written ambiguously by letters known as matres lectionis, while short ones are normally omitted entirely (although certain diacritics are added to indicate them in special circumstances, notably in the Quran). Middle Persian also had vowel length, and noted ā with alif ا, ē and ī with yāʾ ی, and ō and ū with wāw و. Short vowels (a, e, i, o and u) were normally not written.
The length distinction of Middle Persian no longer exists in modern Persian. The results of its collapse vary between Western Persian, Dari, and Tajiki, with eight- or six-vowel inventories. However, the alphabet retains the original spellings of most words, so that فارسی Fārsī "Persian" is pronounced [fɒrsi] in the Tehrani dialect, and شیر, which spells both shēr "lion" and shīr "milk", is pronounced [ʃir] in either sense. In Dari, the first word is likewise [fɒrsi], but the other two remain distinct: [ʃer] "lion" and [ʃir] "milk".
The following is a list of differences between the Arabic writing system and the Persian writing system:
- A hamze (ء) is not written above or below an alef (ا), unlike in Arabic.
- The Arabic letter tāʾ marbūṭa (ة), unless used in a direct Arabic quotation, is usually changed to a te (ت) or he (ه), in accordance with its actual pronunciation. Tāʾ marbūṭa, used in feminine nouns in Arabic, is a combined form of hāʾ with the dots marking tāʾ and represents a [t] that is dropped in word-final position. Since Persian does not have this grammatical issue (or grammatical gender), tāʾ marbūṭa is not necessary and is kept only to maintain fidelity in Arabic loanwords and quotations.
- The two dots are removed from the final ye (ی). Arabic distinguishes the final yāʾ, written with two dots, from the alif maqsūra, except in Egyptian, Sudanese and Maghrebi usage, where the final yāʾ is written without the two dots. Because Persian drops the two dots in the final ye, the alif maqsūra cannot be differentiated from the normal final ye. For example, the name Mūsá "Moses" is written موسی: in its final letter, Persian does not differentiate between ye and the Arabic alif maqsūra.
- The letters pe (پ), che (چ), že (ژ), and gāf (گ) are added because Arabic, lacking the phonemes represented by these letters, has no letters for them.
- Wāw (و) is used as vâv for [v], because Arabic has no [v], and standard Iranian Persian has [w] only within the diphthong [ow].
- In the Arabic alphabet, hāʾ (ﻩ) comes before wāw (و); in the Persian alphabet, however, he (ﻩ) comes after vâv (و).
- Nunation is written in a different order. In Arabic, it is more standard to write ـً (fatḥa tanwīn or fatḥatān) and then ا (alef). In Persian, the order is reversed: ا, then ـً, i.e. Arabic ـًا becomes ـاً in Persian. For example, عصًا ʿaṣan is written عصاً ʾasan in Persian. Writing ـاً is also very common in Arabic.
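One practical consequence of the ordering difference and the four added letters in the list above is that sorting Persian words by raw code point does not give Persian alphabetical order. The following is a minimal sketch of a custom sort key, assuming the 32-letter order shown in the letter table; `PERSIAN_ORDER`, `RANK` and `persian_key` are illustrative names, the sample words are chosen only for the demonstration, and a production system would more likely use a locale-aware collation library.

```python
# Code points of the 32 letters in Persian alphabetical order
# (alef ... nun, vâv, he, ye), following the letter table.
PERSIAN_ORDER = [
    0x0627, 0x0628, 0x067E, 0x062A, 0x062B, 0x062C, 0x0686, 0x062D,
    0x062E, 0x062F, 0x0630, 0x0631, 0x0632, 0x0698, 0x0633, 0x0634,
    0x0635, 0x0636, 0x0637, 0x0638, 0x0639, 0x063A, 0x0641, 0x0642,
    0x06A9, 0x06AF, 0x0644, 0x0645, 0x0646, 0x0648, 0x0647, 0x06CC,
]
RANK = {chr(cp): i for i, cp in enumerate(PERSIAN_ORDER)}

def persian_key(word):
    """Rank each letter by alphabet position; other characters sort last."""
    return [(RANK.get(ch, len(RANK)), ord(ch)) for ch in word]

# Sample words beginning with he, vâv, te and pe respectively.
words = ["\u0647\u0648\u0627", "\u0648\u0642\u062A", "\u062A\u0627", "\u067E\u0627"]

by_code_point = sorted(words)                 # te, he, vâv, pe
by_alphabet = sorted(words, key=persian_key)  # pe, te, vâv, he
print([w[0] for w in by_code_point])
print([w[0] for w in by_alphabet])
```

Code-point order places he (U+0647) before vâv (U+0648) and pushes pe (U+067E) to the end, whereas the custom key restores the Persian sequence.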
Word boundaries
Typically words are separated from each other by a space. Certain morphemes (such as the plural ending '-hâ') are written without a space. When writing on a computer, they are separated from the word using the zero-width non-joiner.
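The zero-width non-joiner is the Unicode character U+200C. A small sketch of attaching the plural ending with it follows; the helper `add_plural` and the sample stem کتاب (ketâb, "book") are assumptions made for this example, not taken from any particular library.

```python
import unicodedata

ZWNJ = "\u200C"  # zero-width non-joiner

def add_plural(stem: str) -> str:
    """Attach the plural ending -hâ to stem without a visible space."""
    return stem + ZWNJ + "\u0647\u0627"  # the ending ها

ketab_ha = add_plural("\u06A9\u062A\u0627\u0628")  # ketâb + ZWNJ + hâ
print(unicodedata.name(ZWNJ))  # ZERO WIDTH NON-JOINER
print(len(ketab_ha))           # 7: four stem letters, ZWNJ, two suffix letters
print(len(ketab_ha.split()))   # 1: ZWNJ is not whitespace, so one token
```

The ZWNJ stops the final ب of the stem from joining to the ه of the suffix while keeping the two parts in a single word for purposes such as whitespace tokenization.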
See also
- Scripts used for Persian
- Persian braille
- Nastaʿlīq, used to write Persian before the 20th century
References
- Ira M. Lapidus (2012). Islamic Societies to the Nineteenth Century: A Global History. Cambridge University Press. pp. 256–. ISBN 978-0-521-51441-5.
- Ira M. Lapidus (2002). A History of Islamic Societies. Cambridge University Press. pp. 127–. ISBN 978-0-521-77933-3.
- "??" (PDF). Persianacademy.ir. Retrieved 2015-09-05.
External links
- Persian dictionary that also provides randomization
- Virtual Persian Keyboard
- Persian Alphabet
- Persian alphabet, numerals, and pronunciation
- Persian numerals
- eiktub: web-based Perso-Arabic transliteration pad, with support for Persian characters
- Persian Character Maps
- Tests to Practice Joining and Disjoining Persian Letters and Frequently Occurring Shapes
- Alphabet Tests with Audio to learn Pronunciation
- Daoulagad - mobile Persian OCR dictionary
- Dastoor e khat - The Official document in Persian by Academy of Persian Language and Literature