LHSName "ArabTeX-DMG" RHSName "UNICODE" Define a U+0061 ;a Define b U+0062 ;b Define c U+0063 ;c Define d U+0064 ;d Define e U+0065 ;e Define f U+0066 ;f Define g U+0067 ;g Define h U+0068 ;h Define i U+0069 ;i Define j U+006A ;j Define k U+006B ;k Define l U+006C ;l Define m U+006D ;m Define n U+006E ;n Define o U+006F ;o Define p U+0070 ;p Define q U+0071 ;q Define r U+0072 ;r Define s U+0073 ;s Define t U+0074 ;t Define u U+0075 ;u Define v U+0076 ;v Define w U+0077 ;w Define x U+0078 ;x Define y U+0079 ;y Define z U+007A ;z Define A U+0041 ;A Define B U+0042 ;B Define E U+0045 ;E Define G U+0047 ;G Define I U+0049 ;I Define N U+004E ;N Define O U+004F ;O Define T U+0054 ;T Define U U+0055 ;U Define W U+0057 ;W Define Y U+0059 ;Y Define C U+005E ;^ << CARET Define UL U+005F ;_ << UNDERLINE Define P U+002E ;. << DOT Define V U+002C ;, << COMMA (_V_irgule) Define CO U+003A ;: << COLON Define Ain U+0060 ;` Define H U+0027 ;' << HAMZA Define Q U+0022 ;" << QUOTE Define Z U+002D ;- << HYPHEN Define BAR U+007C ;| Define LLAH U+006C U+006C U+005F U+0061 U+0068 pass(Unicode) Class [BRACKETS] = (U+003C U+003E U+00AB U+00BB U+0028 U+0029 U+005B U+005D U+2018 U+2019 U+201C U+201D); < > () « » [ ] ‘ ’ “ ” Define BEG (#|Z|[BRACKETS]) ; special ligatures (a|A) BAR l Z / _ LLAH > ;; 'a|l-ll_ah' or 'A|l-ll_ah' -> llāh (a|A) l Z / _ LLAH > A ;; 'al-ll_ah' or 'Al-ll_ah' -> Allāh U+0053 U+004C U+004D > U+1E63 U+0061 U+006C U+006C U+0101 U+0020 U+0041 U+006C U+006C U+0101 U+0068 U+0020 U+02BF U+0061 U+006C U+0061 U+0079 U+0068 U+0069 U+0020 U+0077 U+0061 U+002D U+0073 U+0061 U+006C U+006C U+0061 U+006D ; 'SLM' : .sallY al-ll_ah `alayhi wa-sallam U+0052 U+0049 U+0059 U+0041 U+004C > U+0072 U+0069 U+0079 U+0101 U+006C ;; 'RIYAL' : Saudi currency sign U+0053 U+0041 U+004C U+004C U+0041 U+0053 U+0054 U+004F U+0050 > ;; 'SALLASTOP' U+0051 U+0041 U+004C U+0041 > ;; 'QALA' U+0041 U+004B U+0042 U+0041 U+0052 > U+0041 U+006B U+0062 U+0061 U+0072 ;; 'AKBAR' U+004D U+0055 U+0048 U+0041 U+004D U+004D U+0041 U+0044 > U+004D U+0075 U+1E25 U+0061 U+006D U+006D U+0061 U+0064 ;; 'MUHAMMAD' U+0053 U+0041 U+004C U+0041 U+004D > U+0073 U+0061 U+006C U+0101 U+006D ;; 'SALAM' U+0052 U+0041 U+0053 U+004F ? U+0055 U+004C > U+0072 U+0061 U+0073 U+016B U+006C ;; 'RASUL' (also unicode 'RASOUL)' U+0041 U+004C U+0041 U+0059 U+0048 (U+0049|U+0045) > U+02BF U+0061 U+006C U+0061 U+0079 U+0068 U+0069 ;; 'ALAYHI' (also unicode 'ALAYHE)' U+0057 U+0041 U+0053 U+0041 U+004C U+004C U+0041 U+004D > U+0077 U+0061 U+002D U+0073 U+0061 U+006C U+006C U+0061 U+006D ;; 'WASALLAM' U+0053 U+0041 U+004C U+004C U+0041 > U+1E63 U+0061 U+006C U+006C U+0101 ;; 'SALLA' U+004A U+0041 U+004C U+004C U+0041 > U+006A U+0061 U+006C U+006C U+0101 ;; 'JALLA' : U+0042 U+0041 U+0053 U+004D U+0041 U+004C U+0041 > U+0062 U+0069 U+0073 U+006D U+0069 U+0020 U+0041 U+006C U+006C U+0101 U+0068 U+0020 U+0061 U+006C U+002D U+0072 U+0061 U+1E25 U+006D U+0101 U+006E U+0020 U+0061 U+006C U+002D U+0072 U+0061 U+0068 U+012B U+006D ;; 'BASMALA' U+005E U+0053 U+004C U+0059 > ;; ^SLY U+005E U+0051 U+004C U+0059 > ;; ^QLY U+005E U+004D U+0049 U+004D > ;; ^MIM U+005E U+004C U+0041 > ;; ^LA U+005E U+004A U+0049 U+004D > ;; ^JIM U+005E U+0044 U+004F U+0054 U+0053 > ;; ^DOTS U+005E U+0053 U+0049 U+004E > ;; ^SIN U+0048 U+0049 U+005A U+0042 > ;; HIZB U+0043 U+0049 U+0052 U+0043 U+005A U+0045 U+0052 U+004F > ;; CIRCZERO U+0052 U+0045 U+0043 U+0054 U+005A U+0045 U+0052 U+004F > ;; RECTZERO U+0053 U+0041 U+004A U+0044 U+0041 > ;; SAJDA U+0044 U+004F U+0054 U+0053 > ;; DOTS ; general ;persian ;; #P UL h w > U+1E2B ;; #P U+0048 > h ; capital H > h ;; ; problem of "T": It should be omitted when at end of word and when not ; the 1st member of a definite construct: al-madInaT al-`a.zImaT > al-madīna al-ʿaẓīma ; Only solution is to always transliterate to t, except when explicitly "quoted off" with |: T BAR / _ # > ; T > t ;;#U T > h N > n U+0057 > w (U|u u|UL u) > U+016B (A|a a|UL a|UL A) > U+0101 i y > U+012B y (I|i i|UL i) > U+012B ;;#U (E|a e) / _ # > H U+0113 ;;#T (E|e) e > e y ; pashto (E|e e) > U+0113 (O|o o) > U+014D C A > U+02BE U+0101 ; historical spelling C U > U+02BE U+016B ; historical spelling : madda on waw C I > U+02BE U+012B ; historical spelling : madda on ya P z > U+1E93 V z > U+017A C z > U+017E ;check V t > t U+0301 P t > U+1E6D UL t > U+1E6F C s > U+0161 UL s > U+1E6F ; sindhi V r > U+0155 P r > U+1E59; r with dot above P q > q ; ? V n > U+0144 C n > U+00F1 P n > U+1E47 CO n > n U+0324 C l > l U+0303 ; l + combining tilde CO g > g U+0324 V d > d U+0301 CO d > d U+0324 P d > U+1E0D V s > U+015B P s > U+1E63 (j|C g) > U+01E7 ; CO j > U+0237 U+0308 P h > U+1E25 V h > h (UL h|x) > U+1E2B G > g P g > U+0121 P f > f ; ? UL d > U+1E0F C c > U+010D ; CO b > b U+0324 P b > x ; ? undotted (b,t,th,n,y) ... (A|Y|UL A) / _ (N|U|W) > ; Y > U+0101 ;ā -> LOC: á : check DMG Ain > U+02BF Z H > Z H / # _ > ; H > U+02BE Z Z Z > ; Z Z > ; B > ; U+003C U+003C > U+00AB ;<< U+003E U+003E > U+00BB ;>> UL > ;_ U+003A U+003A > U+2234 ; '::' ??Arabic three dots transliterated by Therefore?? N > ; C > ;^ Q > ; pass(Unicode) Class [LOWERCASE] = ( U+0061..U+007A U+0101 U+0113 U+012B U+014D U+0161 U+016B U+1E0D U+1E25 U+1E33 U+1E63 U+1E6D U+1E93 U+1E2B U+1E0F U+1E6F U+0121 U+01E7 ) Class [UPPERCASE] = ( U+0041..U+005A U+0100 U+0112 U+012A U+014C U+0160 U+016A U+1E0C U+1E24 U+1E32 U+1E62 U+1E6C U+1E92 U+1E2A U+1E0E U+1E6E U+0120 U+01E6 ) Class [LETTER] = (U+0041..U+01FF) Class [VWL] = (U+0061 U+0065 U+0069 U+006F U+0075) ; (a e i o u) Class [SUNCONS] = (U+0064 U+1E0F U+0074 U+1E6F U+0072 U+007A U+0073 U+0161 U+1E63 U+1E0D U+1E6D U+1E93 U+006E) Class [BRACKETS] = (U+003C U+003E U+00AB U+00BB U+0028 U+0029 U+005B U+005D U+2018 U+2019 U+201C U+201D); < > () « » [ ] ‘ ’ “ ” [LOWERCASE] / U+E000 a l Z _ > [UPPERCASE] [LOWERCASE] / U+E000 _ > [UPPERCASE] ;l Z ([SUNCONS]=sc){2,2} > @sc Z @sc ;; FIXME ;l Z [SUNCONS]=sc > @sc Z @sc ;; FIXME a / [VWL] Z _ l Z > ;; bi-al- -> bi-l- ;; [VWL] / [VWL] BEG _ ^l > U+2019 ;; vowel at beg of word replaced by apostr when prec word end with vowel [LETTER] BAR > ;; pass(Unicode) U+E000 > ; A l Z > a l Z