UNPKG

cldr

Version:

Library for extracting data from CLDR (the Unicode Common Locale Data Repository)

490 lines (433 loc) 17.9 kB
<?xml version="1.0" encoding="UTF-8" ?> <!DOCTYPE supplementalData SYSTEM "../../common/dtd/ldmlSupplemental.dtd"> <!-- Copyright © 1991-2013 Unicode, Inc. CLDR data files are interpreted according to the LDML specification (http://unicode.org/reports/tr35/) For terms of use, see http://www.unicode.org/copyright.html --> <supplementalData> <version number="$Revision$"/> <transforms> <transform source="el" target="el_Latn" variant="BGN" direction="forward" draft="contributed" alias="Greek-Latin/BGN el-Latn-t-el-m0-bgn"> <tRule><![CDATA[ # ######################################################################## # BGN/PCGN 1962 System # # This system is a simplified version of the system devised by the PCGN # in 1941 and later adopted by the BGN. In 1962 the two organizations # agreed to joint adoption of certain changes in the original system, # specifically the omission of special rules for the treatment of Greek # geographic names of Albanian, Bulgarian, Italian, Macedonian, and # Turkish origin. That revision eliminated the need to consider the # origin of names and removed ambiguity from the romanization of Greek # expressions of possible non-Greek origin. This system is based on # the pronunciation of modern Greek and is not intended for use in # the romanization of classical Greek. # # The Greek Alphabet as defined by the BGN (Pages 29-31): # # ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ # αβγδεζηθικλμνξοπρσςτυφχψω # # Originally prepared by Michael Everson <everson@evertype.com> ######################################################################## # # MINIMAL FILTER: Greek-Latin # :: [ΆΈΉΊΌΎΏΐΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩΪΫάέήίΰαβγδεζηθικλμνξοπρςστυφχψωϊϋόύώἀἁἂἃἄἅἆἇἈἉἊἋἌἍἎἏἐἑἒἓἔἕἘἙἚἛἜἝἠἡἢἣἤἥἦἧἨἩἪἫἬἭἮἯἰἱἲἳἴἵἶἷἸἹἺἻἼἽἾἿὀὁὂὃὄὅὈὉὊὋὌὍὐὑὒὓὔὕὖὗὙὛὝὟὠὡὢὣὤὥὦὧὨὩὫὬὭὮὯὰάὲέὴήὶίὸόὺύὼώᾀᾁᾂᾃᾄᾅᾆᾇᾈᾉᾊᾋᾌᾍᾎᾏᾐᾑᾒᾓᾔᾕᾖᾗᾘᾙᾚᾛᾜᾝᾞᾟᾠᾡᾢᾣᾤᾥᾦᾧᾨᾩᾪᾫᾬᾭᾮᾯᾲᾳᾴᾶᾷᾺΆᾼῂῃῄῆῇῈΈῊΉῌῖῚΊῤῥῦῪΎῲῳῴῶῷῸΌῺΏῼ῾] ; :: NFD (NFC) ; # # ######################################################################## # ######################################################################## # # Define All Transformation Variables # ######################################################################## # $upperConsonants = [ΒΓΔΖΘΚΛΜΝΞΠΡΣΤΦΧΨ] ; $lowerConsonants = [βγδζθκλμνξπρσςτφχψ] ; $consonants = [$upperConsonants $lowerConsonants] ; $upperVowels = [ΑΕΗΙΟΥΩ] ; $lowerVowels = [αεηιουω] ; $vowels = [$upperVowels $lowerVowels] ; $lower = [$lowerConsonants $lowerVowels] ; # # # Use this $wordBoundary until bug 2034 is fixed in ICU: # http://bugs.icu-project.org/cgi-bin/icu-bugs/transliterate?id=2034;expression=boundary;user=guest # $wordBoundary = [^[:L:][:M:][:N:]] ; # # ######################################################################## # ######################################################################## # # Rules moved to front to avoid masking # ######################################################################## # ######################################################################## # # BGN Page 32 Rule 1: # # The apostrophe and reversed apostrophe, on or the other of which is # written in Greek in front of all initial uppercase vowel characters, # above all initial lowercase vowel characters, and above the second # character of all initial two-vowel character sequences, should not # be romanized, e.g., Ἀθῆναι → Athínai, Ἠράκλειον → Iráklion, # Οἰνόφυτα → Oinófita. These apostrophes must be distinguished from # accent marks hen they occur together, e.g. Ἄβατον → Ávaton, # Ἤλια → Ília, Οἴτη → Oíti. The reversed apostrophe is sometimes found # also with ρ and should, likewise, not be romanized: ῥέμα → réma. # # BGN Page 32 Rule 2a: # # Stress is shown in Greek by the use of the tilde or circumflex, # the acute accent, or the grave accent; all of those marks should # be represented in romanization by an acute accent, e.g., # Ἀθῆναι → Athínai, Νδία → Día, Ζεμενὸν → Zemenón. # # BGN Page 32 Rule 4: # # The character ι (ióta) is sometimes found written under, or, # in uppercase, to the right of the vowel characters α, η, and ω. # This "subscript iota" should not be romanized, e.g., # Μυρτῷον Πέλαγος or ΜΥΡΤῼΟΝ ΠΕΛΑΓΟΣ [but not ΜΥΡΤΩΙΟΝ ΠΕΛΑΓΟΣ] # → Mirtóön Pélagos. # ######################################################################## # [ἈἉᾼᾈᾉ] Α ; # GREEK CAPITAL LETTER ALPHA [ἀἁᾳᾀᾁ] α ; # GREEK SMALL LETTER ALPHA [ἊἋἌἍἎἏᾊᾋᾌᾍᾎᾏᾺΆ] Ά ; # GREEK CAPITAL LETTER ALPHA WITH TONOS [ἂἃἄἅἆἇὰάᾂᾃᾄᾅᾆᾇᾲᾴᾶᾷ] ά ; # GREEK SMALL LETTER ALPHA WITH TONOS [ἘἙ] Ε ; # GREEK CAPITAL LETTER EPSILON [ἐἑὲέ] ε ; # GREEK SMALL LETTER EPSILON [ἚἛἜἝῈΈ] Έ ; # GREEK CAPITAL LETTER EPSILON WITH TONOS [ἒἓἔἕ] έ ; # GREEK SMALL LETTER EPSILON WITH TONOS [ἨἩᾘᾙῌ] Η ; # GREEK CAPITAL LETTER ETA [ἠἡᾐᾑῃ] η ; # GREEK SMALL LETTER ETA [ἪἫἬἭἮἯᾚᾛᾜᾝᾞᾟῊΉ] Ή ; # GREEK CAPITAL LETTER ETA WITH TONOS [ἢἣἤἥἦἧὴήᾒᾓᾔᾕᾖᾗῂῄῆῇ] ή ; # GREEK SMALL LETTER ETA WITH TONOS [ἸἹ] Ι ; # GREEK CAPITAL LETTER IOTA [ἰἱ] ι ; # GREEK SMALL LETTER IOTA [ἺἻἼἽἾἿῚΊ] Ί ; # GREEK CAPITAL LETTER IOTA WITH TONOS [ἲἳἴἵἶἷὶίῖ] ί ; # GREEK SMALL LETTER IOTA WITH TONOS [ὈὉ] Ο ; # GREEK CAPITAL LETTER OMICRON [ὀὁ] ο ; # GREEK SMALL LETTER OMICRON [ὊὋὌὍῸΌ] Ό ; # GREEK CAPITAL LETTER OMICRON WITH TONOS [ὂὃὄὅὸό] ό ; # GREEK SMALL LETTER OMICRON WITH TONOS Υ ; # GREEK CAPITAL LETTER UPSILON [ὐὑ] υ ; # GREEK SMALL LETTER UPSILON [ὛὝὟῪΎ] Ύ ; # GREEK CAPITAL LETTER UPSILON WITH TONOS [ὒὓὔὕὖὗὺύῦ] ύ ; # GREEK SMALL LETTER UPSILON WITH TONOS [ὨὩᾨᾩῼ] Ω ; # GREEK CAPITAL LETTER OMEGA [ὠὡᾠᾡῳ] ω ; # GREEK SMALL LETTER OMEGA [ὬὫὬὭὮὯᾪᾫᾬᾭᾮᾯῺΏ] Ώ ; # GREEK CAPITAL LETTER OMEGA WITH TONOS [ὢὣὤὥὦὧὼώᾢᾣᾤᾥᾦᾧῲῴῶῷ] ώ ; # GREEK SMALL LETTER OMEGA WITH TONOS Ρ ; # GREEK CAPITAL LETTER RHO [ῤῥ] ρ ; # GREEK SMALL LETTER RHO # # ######################################################################## # # End of Rules 1, 2a, and 4 # ######################################################################## # ######################################################################## # # BGN Page 32 Rules 2b and 2c: # # If the stressed vowel is written as a sequence of two vowel characters # in Greek, the # second vowel character should carry the accent; # similarly, in Romanization the acute accent should be placed over the # second vowel letter, e.g., Οἰνοῦσαι → Oinoúsai, Οἴτη → Oíti, # Θεσπιαὶ → Thespiaí. # # Where a syllable containing on the combinations αυ, ευ, or ηυ # carries the stress, this is marked in Greek on the character υ. # In romanization it should be shown on the preceding vowel # letter, e.g., Πειραιεύς → Piraiévs, Αὔρα → Ávra. # Αί ; αί ; Οί ; οί ; Ού ; ού ; Αύ Άυ ; αύ άυ ; Εύ Έυ ; εύ έυ ; Ηύ Ήυ ; ηύ ήυ ; # # ######################################################################## # # End of Rules 2b and 2c # ######################################################################## # ######################################################################## # # BGN Page 32 Rule 3: # # The dieresis should be shown in romanization where it occurs in Greek, # e.g., Μαρινέϊκα → Marinéïka, Ἀχαΐα → Akhaḯa; and over the second vowel # etter in romanization of the following combinations fo Greek vowel # characters: αε, e.g., Ἀετὸς → Aëtos; αη, e.g., Ἀηδὼν → Aïdhon; οη, # e.g. Οἰνόη → Oinóï; ωο, e.g., Ἠρῶον → Iróön. # [ΪΫ] Ï ; [ϊϋ] ï ; [ΐΰ] ḯ ; Αε ; αε ; Αη ; αη ; Οη ; οη ; Ωο ; ωο ; Άε Áë ; άε áë ; Άη Áï ; άη áï ; Όη Óï ; όη óï ; Ώο Óö ; ώο óö ; # # ######################################################################## # # End of Rule 3 # ######################################################################## # ######################################################################## # # Start of Alphabetic Transformations # ######################################################################## # ΑΙ AI ; # GREEK CAPITAL LETTER ALPHA + CAPITAL IOTA Αι Ai ; # GREEK CAPITAL LETTER ALPHA + SMALL IOTA αι ai ; # GREEK SMALL LETTER ALPHA + SMALL IOTA ΑΥ AV ; # GREEK CAPITAL LETTER ALPHA + CAPITAL UPSILON Αυ Av ; # GREEK CAPITAL LETTER ALPHA + SMALL UPSILON αυ av ; # GREEK SMALL LETTER ALPHA + SMALL UPSILON Α A ; # GREEK CAPITAL LETTER ALPHA α a ; # GREEK SMALL LETTER ALPHA Ά Á ; # GREEK CAPITAL LETTER ALPHA WITH TONOS ά á ; # GREEK SMALL LETTER ALPHA WITH TONOS Β V ; # GREEK CAPITAL LETTER BETA β v ; # GREEK SMALL LETTER BETA ΓΓ NG ; # GREEK CAPITAL LETTER GAMMA + CAPITAL GAMMA Γγ Ng ; # GREEK CAPITAL LETTER GAMMA + SMALL GAMMA γγ ng ; # GREEK SMALL LETTER GAMMA + SMALL GAMMA $wordBoundary{ΓΚ G ; # GREEK CAPITAL LETTER GAMMA + CAPITAL KAPPA $wordBoundary{Γκ G ; # GREEK CAPITAL LETTER GAMMA + SMALL KAPPA $wordBoundary{γκ g ; # GREEK SMALL LETTER GAMMA + SMALL KAPPA ΓΚ NG ; # GREEK CAPITAL LETTER GAMMA + CAPITAL KAPPA Γκ Ng ; # GREEK CAPITAL LETTER GAMMA + SMALL KAPPA γκ ng ; # GREEK SMALL LETTER GAMMA + SMALL KAPPA # # ######################################################################## # # BGN Page 29 Rule 3a: # # The character γ should be romanized g before α, ο, ου, ω, and # consonants other than γ, ξ, and χ. # ######################################################################## # Γ}[ΑΟΩ [$upperConsonants - [ΓΞΧ]]] G ; # GREEK CAPITAL LETTER GAMMA Γ}[αοω [$lowerConsonants - [γξχ]]] G ; # GREEK CAPITAL LETTER GAMMA Γ}ΟΥ G ; # GREEK CAPITAL LETTER GAMMA Γ}ου G ; # GREEK CAPITAL LETTER GAMMA γ}[αοω [$lowerConsonants - [γξχ]]] g ; # GREEK SMALL LETTER GAMMA γ}ου g ; # GREEK SMALL LETTER GAMMA # # ######################################################################## # # End of Rule 3a # ######################################################################## # ######################################################################## # # BGN Page 29 Rule 3b: # # The character γ should be romanized y before αι, ε, ει, η, ι, οι, υ, # and υι. # ######################################################################## # Γ}[ΑΕΟΥ]Ι Y ; # GREEK CAPITAL LETTER GAMMA Γ}[ΕΗΙΥ] Y ; # GREEK CAPITAL LETTER GAMMA Γ}[αεου]ι Y ; # GREEK CAPITAL LETTER GAMMA Γ}[εηιυ] Y ; # GREEK CAPITAL LETTER GAMMA γ}[αεου]ι y ; # GREEK SMALL LETTER GAMMA γ}[εηιυ] y ; # GREEK SMALL LETTER GAMMA # # ######################################################################## # # End of Rule 3b # ######################################################################## # ######################################################################## # # BGN Page 29 Rule 3c: # # The character γ should be romanized n before ξ and χ. # ######################################################################## # Γ}[ΞΧ] N ; # GREEK CAPITAL LETTER GAMMA Γ}[ξχ] N ; # GREEK CAPITAL LETTER GAMMA γ}[ξχ] n ; # GREEK SMALL LETTER GAMMA # # ######################################################################## # # End of Rule 3c # ######################################################################## # Γ G ; # GREEK CAPITAL LETTER GAMMA γ g ; # GREEK SMALL LETTER GAMMA # # ######################################################################## # # BGN Page 29 Rule 4a: # # The character δ should be romanized d when between ν and ρ. # ######################################################################## # Ν{Δ}Ρ D ; # GREEK CAPITAL LETTER DELTA ν{δ}ρ d ; # GREEK SMALL LETTER GAMMA # # ######################################################################## # # End of Rule 4a # ######################################################################## # Δ} $lower Dh ; # GREEK CAPITAL LETTER PSI Δ DH ; # GREEK CAPITAL LETTER DELTA δ dh ; # GREEK SMALL LETTER DELTA ΕΙ I ; # GREEK CAPITAL LETTER EPSILON + CAPITAL IOTA Ει I ; # GREEK CAPITAL LETTER EPSILON + SMALL IOTA ει i ; # GREEK SMALL LETTER EPSILON + SMALL IOTA ΕΪ ; # GREEK CAPITAL LETTER EPSILON + CAPITAL IOTA DIAERESIS Εϊ ; # GREEK CAPITAL LETTER EPSILON + SMALL IOTA DIAERESIS εϊ ; # GREEK SMALL LETTER EPSILON + SMALL IOTA DIAERESIS ΕΥ EV ; # GREEK CAPITAL LETTER EPSILON + CAPITAL UPSILON Ευ Ev ; # GREEK CAPITAL LETTER EPSILON + SMALL UPSILON ευ ev ; # GREEK SMALL LETTER EPSILON + SMALL UPSILON Ε E ; # GREEK CAPITAL LETTER EPSILON ε e ; # GREEK SMALL LETTER EPSILON Έ É ; # GREEK CAPITAL LETTER EPSILON WITH TONOS έ é ; # GREEK SMALL LETTER EPSILON WITH TONOS Ζ Z ; # GREEK CAPITAL LETTER ZETA ζ z ; # GREEK SMALL LETTER ZETA ΗΥ IV ; # GREEK CAPITAL LETTER ALPHA + CAPITAL UPSILON Ηυ Iv ; # GREEK CAPITAL LETTER ALPHA + SMALL UPSILON ηυ iv ; # GREEK SMALL LETTER ALPHA + SMALL UPSILON Η I ; # GREEK CAPITAL LETTER ETA η i ; # GREEK SMALL LETTER ETA Ή Í ; # GREEK CAPITAL LETTER ETA WITH TONOS ή í ; # GREEK SMALL LETTER ETA WITH TONOS Θ} $lower Th ; # GREEK CAPITAL LETTER THETA Θ TH ; # GREEK CAPITAL LETTER THETA θ th ; # GREEK SMALL LETTER THETA Ι I ; # GREEK CAPITAL LETTER IOTA ι i ; # GREEK SMALL LETTER IOTA Ί Í ; # GREEK CAPITAL LETTER IOTA WITH TONOS ί í ; # GREEK SMALL LETTER IOTA WITH TONOS Κ K ; # GREEK CAPITAL LETTER KAPPA κ k ; # GREEK SMALL LETTER KAPPA Λ L ; # GREEK CAPITAL LETTER LAMDA λ l ; # GREEK SMALL LETTER LAMDA $wordBoundary{ΜΠ B ; # GREEK CAPITAL LETTER MU + CAPITAL PI $wordBoundary{Μπ B ; # GREEK CAPITAL LETTER MU + SMALL PI $wordBoundary{μπ b ; # GREEK SMALL LETTER MU + SMALL PI ΜΠ MB ; # GREEK CAPITAL LETTER MU + CAPITAL PI Μπ Mb ; # GREEK CAPITAL LETTER MU + SMALL PI μπ mb ; # GREEK SMALL LETTER MU + SMALL PI Μ M ; # GREEK CAPITAL LETTER MU μ m ; # GREEK SMALL LETTER MU $wordBoundary{ΝΤ D ; # GREEK CAPITAL LETTER NU + CAPITAL TAU $wordBoundary{Ντ D ; # GREEK CAPITAL LETTER NU + SMALL TAU $wordBoundary{ντ d ; # GREEK SMALL LETTER NU + SMALL TAU ΝΤ ND ; # GREEK CAPITAL LETTER NU + CAPITAL TAU Ντ Nd ; # GREEK CAPITAL LETTER NU + SMALL TAU ντ nd ; # GREEK SMALL LETTER NU + SMALL TAU Ν N ; # GREEK CAPITAL LETTER NU ν n ; # GREEK SMALL LETTER NU Ξ X ; # GREEK CAPITAL LETTER KSI ξ x ; # GREEK SMALL LETTER KSI ΟΙ OI ; # GREEK CAPITAL LETTER OMICRON + CAPITAL IOTA Οι Oi ; # GREEK CAPITAL LETTER OMICRON + SMALL IOTA οι oi ; # GREEK SMALL LETTER OMICRON + SMALL IOTA ΟΥ OU ; # GREEK CAPITAL LETTER OMICRON + CAPITAL UPSILON Ου Ou ; # GREEK CAPITAL LETTER OMICRON + SMALL UPSILON ου ou ; # GREEK SMALL LETTER OMICRON + SMALL UPSILON Ο O ; # GREEK CAPITAL LETTER OMICRON ο o ; # GREEK SMALL LETTER OMICRON Ό Ó ; # GREEK CAPITAL LETTER OMICRON WITH TONOS ό ó ; # GREEK SMALL LETTER OMICRON WITH TONOS Π P ; # GREEK CAPITAL LETTER PI π p ; # GREEK SMALL LETTER PI Ρ R ; # GREEK CAPITAL LETTER RHO ρ r ; # GREEK SMALL LETTER RHO Σ S ; # GREEK CAPITAL LETTER SIGMA σ s ; # GREEK SMALL LETTER SIGMA ς s ; # GREEK SMALL LETTER FINAL SIGMA Τ T ; # GREEK CAPITAL LETTER TAU τ t ; # GREEK SMALL LETTER TAU # # ######################################################################## # # End Rule 3.5 # ######################################################################## # Υ I ; # GREEK CAPITAL LETTER UPSILON υ i ; # GREEK SMALL LETTER UPSILON Ύ Í ; # GREEK CAPITAL LETTER UPSILON WITH TONOS ύ í ; # GREEK SMALL LETTER UPSILON WITH TONOS Φ F ; # GREEK CAPITAL LETTER PHI φ f ; # GREEK SMALL LETTER PHI Χ} $lower Kh ; # GREEK CAPITAL LETTER CHI Χ KH ; # GREEK CAPITAL LETTER CHI χ kh ; # GREEK SMALL LETTER CHI Ψ} $lower Ps ; # GREEK CAPITAL LETTER PSI Ψ PS ; # GREEK CAPITAL LETTER PSI ψ ps ; # GREEK SMALL LETTER PSI Ω O ; # GREEK CAPITAL LETTER OMEGA ω o ; # GREEK SMALL LETTER OMEGA Ώ Ó ; # GREEK CAPITAL LETTER OMEGA WITH TONOS ώ ó ; # GREEK SMALL LETTER OMEGA WITH TONOS # # ######################################################################## ]]></tRule> </transform> </transforms> </supplementalData>