|
Arabic transliteration edit
|
| This article may require cleanup to meet Wikipedia's quality standards. Please improve this article if you can. (December 2006) |
| This article contains Arabic text, written from right to left in a cursive style with some letters joined. Without proper rendering support, you may see unjoined Arabic letters written left-to-right, instead of right-to-left or other symbols instead of Arabic script. |
| Arabic alphabet | |||||
|---|---|---|---|---|---|
| ا ب ت ث ج ح | |||||
| خ د ذ ر ز س | |||||
| ش ص ض ط ظ ع | |||||
| غ ف ق ك ل | |||||
| م ن ه و ي | |||||
| History · Transliteration Diacritics · Hamza ء Numerals · Numeration |
|||||
Different approaches and methods for the romanization of Arabic exist. They vary in the way that they address the inherent problems of rendering written and spoken Arabic in the Latin alphabet; they also use different symbols for Arabic phonemes that do not exist in English or other European languages.
Contents |
Any transliteration system has to make a number of decisions which are dependent on its intended field of application. One basic problem is that written Arabic is normally unvocalized, i.e., many of the vowels are not written out, and must be supplied by a reader familiar with the language. Hence unvocalized Arabic writing does not give a reader unfamiliar with the language sufficient information for accurate pronunciation. An exact equivalent of قطر would be qṭr, which is meaningless to an untrained reader. A "full transliteration" adds information not in the text, which has to be supplied by a speaker of Arabic, qaṭar. Usually, newspapers and popular books do not use a transliteration, but a transcription: Instead of transliterating each written letter, they try to reproduce the sound of the words according to the orthography rules of the target language: Qatar.
Most issues related to the romanization of Arabic are about transliterating vs. transcribing – others, about what should be romanized:
A transcription may reflect the language as spoken, for example, by the people of Baghdad, or the official standard as spoken by a preacher in the mosque or a TV news reader. A transcription is free to add phonological (such as vowels) or morphological (such as word boundaries) information. Transcriptions will also vary depending on the writing conventions of the target language; compare English Omar Khayyam with German Omar Chajjam, both for عمر خيام (unvocalized ʿmr ḫyʾm, vocalized ʿumar ḫayyām).
A transliteration is ideally fully reversible: a machine must be able to transliterate it into Arabic and back. A transliteration can be considered as flawed for any one of the following reasons:
A fully accurate transcription may not be necessary for native Arabic speakers as they would be able to pronounce names and sentences correctly anyway, but it can be very useful for those not fully familiar with spoken Arabic and who are familiar with the Roman alphabet. An accurate transliteration serves as a valuable stepping stone for learning, pronouncing correctly, and distinguishing phonemes. It is a useful tool for anyone familiar with the sounds of Arabic but who are not fully conversant in the language.
One criticism is that a fully accurate system would require special learning that most do not have to actually pronounce names correctly, and that with a lack of a universal Romanization system they will not be pronounced correctly by non-native speakers anyway. The precision will be lost if special characters are not replicated and if someone is not familiar with Arabic pronunciation.
A table comparing romanizations using DIN 31635, ISO 233, ISO/R 233, UN, ALA-LC, and Encyclopaedia of Islam systems is available here: [11].
| Letter | Unicode | Name | SATTS | UNGEGN | ALA-LC | DIN | ISO | ISO/R | Qalam | SAS | SM | Buckwalter | IPA | BATR | ArabTeX |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ﺀ | 0621 |
hamza | E | ʼ, — | —, ’ | ʾ | ˈ, ˌ | —, ’ | ' | ʾ | ' | ' | /ʔ/ | e | ' |
| ﺍ | 0627 |
ʼalif | A | ā | ʾ | ā | aa | a, i, u; ā | aa | A | /a(ː)/ | aa or A | a | ||
| ﺏ | 0628 |
bāʼ | B | b | b | b | b | b | b | /b/ | b | b | |||
| ﺕ | 062A |
tāʼ | T | t | t | t | t | t | t | /t/ | t | t | |||
| ﺙ | 062B |
ṯāʼ | C | th | ṯ | th | ṯ | ç | v | /θ/ | c | _t | |||
| ﺝ | 062C |
ǧīm, jīm, gīm | J | j | ǧ | j | ŷ | j | j | /ʤ/ / /g/ | j | ^g | |||
| ﺡ | 062D |
ḥāʼ | H | ḩ | ḥ | ḥ | H | ḥ | ḥ | H | /ħ/ | H | .h | ||
| ﺥ | 062E |
ḫāʼ | O | kh | ḫ | ẖ | kh | j | x | x | /x/ | K | _h | ||
| ﺩ | 062F |
dāl | D | d | d | d | d | d | d | /d/ | d | d | |||
| ﺫ | 0630 |
ḏāl | Z | dh | ḏ | dh | ḏ | đ | * | /ð/ | z' | _d | |||
| ﺭ | 0631 |
rāʼ | R | r | r | r | r | r | r | /r/ | r | r | |||
| ﺯ | 0632 |
zāy | ; | z | z | z | z | z | z | /z/ | z | z | |||
| ﺱ | 0633 |
sīn | S | s | s | s | s | s | s | /s/ | s | s | |||
| ﺵ | 0634 |
šīn | : | sh | š | sh | š | š | $ | /ʃ/ | x | ^s | |||
| ﺹ | 0635 |
ṣād | X | ş | ṣ | ṣ | S | ṣ | ṣ | S | /sˁ/ | S | .s | ||
| ﺽ | 0636 |
ḍād | V | ḑ | ḍ | ḍ | D | ḍ | ḍ | D | /dˁ/ | D | .d | ||
| ﻁ | 0637 |
ṭāʼ | U | ţ | ṭ | ṭ | T | ṭ | ṭ | T | /tˁ/ | T | .t | ||
| ﻅ | 0638 |
ẓāʼ | Y | z̧ | ẓ | ẓ | Z | ẓ | đ̣ | Z | /ðˁ/ | Z | .z | ||
| ﻉ | 0639 |
ʻayn | ` | ʻ | ʿ | ` | ʿ | ř | E | /ʕ/ | E | ` | |||
| ﻍ | 063A |
ġayn | G | gh | ġ | ḡ | gh | g | ğ | g | /ɣ/ | g | .g | ||
| ﻑ | 0641 |
fāʼ | F | f | f | f | f | f | f | /f/ | f | f | |||
| ﻕ | 0642 |
qāf | Q | q | q | q | q | q | q | /q/ | q | q | |||
| ﻙ | 0643 |
kāf | K | k | k | k | k | k | k | /k/ | k | k | |||
| ﻝ | 0644 |
lām | L | l | l | l | l | l | l | /l/ | l | l | |||
| ﻡ | 0645 |
mīm | M | m | m | m | m | m | m | /m/ | m | m | |||
| ﻥ | 0646 |
nūn | N | n | n | n | n | n | n | /n/ | n | n | |||
| ﻩ | 0647 |
hāʼ | ~ | h | h | h | h | h | h | /h/ | h | h | |||
| ﻭ | 0648 |
wāw | W | w | w | w | w; ū | w; o | w | /w/, /uː/ | w or uu | w | |||
| ﻱ | 064A |
yāʼ | I | y | y | y | y; ī | y; e | y | /j/, /iː/ | y or ii | y | |||
| ﺁ | 0622 |
ʼalif madda | AEA | ā | ā, ʼā | ʾā | ʾâ | ā, ʾā | ā | 'aa | | | /ʔaː/ | eaa | 'A | |
| ﺓ | 0629 |
tāʼ marbūṭa | @ | h, t | h, t | ẗ | h, t | h, t | t; — | ŧ | p | /a/, /at/ | t' | T | |
| ﻯ | 0649 |
ʼalif maqṣūra | / | y | ā | ỳ | ae | à | à | Y | /aː/ | aaa | _A | ||
| ﻻ | FEFB |
lām ʼalif | LA | lā | lā | laʾ | lā | la | lʾ; lā | laa | /lː/ | laa | lA | ||
| ال | ʼalif lām | AL | al- | al- | ʾˈal | al- | al | al- | al-; ál- | var. | Al- | al- | |||
Online communication is sometimes restricted to an ASCII environment in which not only the Arabic letters themselves but also Roman characters with diacritics are unavailable. Even when Arabic letters and Roman characters with diacritics are available, they are often difficult to type. This problem is faced by most speakers of languages that use non-Roman alphabets, or heavily modified ones. An ad hoc solution consists of using Arabic numerals which mirror or resemble the relevant Arabic letters in shape. They appear as follows:
3 represents the Arabic letter ع .
5 or 7' represent the Arabic letter خ .
6 represents the Arabic letter ط .
6' represents the Arabic letter ظ .
7 represents the Arabic letter ح .
8 represents the Arabic letter ق .
9 represents the Arabic letter ص .
9' represents the Arabic letter ض .
2 is sometimes used to represent the أ when it is in the middle of a word
|
||||||||||||||||||||||||||||