Scientific transliteration of Cyrillic

Romanization of Macedonian Romanization of Russian Romanization

Scientific transliteration, variously called academic, linguistic, international, or scholarly transliteration, is an international system for transliteration of text from the Cyrillic script to the Latin script (romanization). This system is most often seen in linguistics publications on Slavic languages.


The scientific transliteration system is roughly as phonemic as is the orthography of the language transliterated. The deviations are with щ, where the transliteration makes clear that two phonemes are involved, and џ, where it fails to represent the (monophonemic) affricate with a single letter. The transliteration system is based on the Gaj's Latin alphabet used in Serbo-Croatian, in which each letter corresponds directly to a Cyrillic letter in Bosnian, Montenegrin and Serbian official standards, and was heavily based on the earlier Czech alphabet. It was codified in the 1898 Prussian Instructions for libraries, or Preußische Instruktionen (PI). It can also be used to romanize the early Glagolitic alphabet, which has a close correspondence to Cyrillic.

Scientific transliteration is often adapted to serve as a phonetic alphabet.[1]

Scientific transliteration was the basis for the ISO 9 transliteration standard. While linguistic transliteration tries to preserve the original language's pronunciation to a certain degree, the latest version of the ISO standard (ISO 9:1995) has abandoned this concept, which was still found in ISO/R 9:1968 and is now restricted to a one-to-one mapping of letters. It thus allows for unambiguous reverse transliteration into the original Cyrillic text and is language-independent.

The previous official Soviet romanization system, GOST 16876-71, is also based on scientific transliteration but used Latin h for Cyrillic х instead of Latin x or ssh and sth for Cyrillic Щ, and had a number of other differences. Most countries using Cyrillic script now have adopted GOST 7.79 instead, which is not the same as ISO 9 but close to it.

Representing all of the necessary diacritics on computers requires Unicode, Latin-2, Latin-4, or Latin-7 encoding.


Scientific transliteration and ISO 9
Cyrillic Church
Bulgarian Russian Belarusian Ukrainian Serbian Macedonian ISO 9
А а a a a a a a a a
Б б b b b b b b b b
В в v v v v v v v v
Г г g g g h h g g g
Ґ ґ g* g
Д д d d d d d d d d
Ѓ ѓ ǵ ǵ
Ђ ђ đ (dj) đ
Е е e e e e e e e
Ё ё ë ë ë
Є є e je ê
Ж ж ž ž ž ž ž ž ž ž
З з z z z z z z z z
Ѕ ѕ dz dz
И и i i i y i i i
I і i i* i i ì
Ї ї i ï (ji) ï
Й й j j j j j
Ј ј j j ǰ
К к k k k k k k k k
Л л l l l l l l l l
Љ љ lj (ļ) lj
М м m m m m m m m m
Н н n n n n n n n n
Њ њ nj (ń) nj
О о o o o o o o o o
П п p p p p p p p p
Р р r r r r r r r r
С с s s s s s s s s
Т т t t t t t t t t
Ќ ќ
Ћ ћ ǵ ć ć
У у u u u u u u u
ОУ оу u
Ў ў ŭ (w) ŭ
Ф ф f f f f f f f f
Х х x h x x x h h h
Ц ц c c c c c c c c
Ч ч č č č č č č č č
Џ џ dž (ģ)
Ш ш š š š š š š š š
Щ щ šč (št) št šč šč ŝ
Ъ ъ ъ (ǔ) ǎ ʺ ʺ
Ы ы y (ū) y y y
Ь ь ь (ǐ) j ʹ ʹ ʹ ʹ
Ѣ ѣ ě ě* ě* ě* ě
Э э è è è
Ю ю ju ju ju ju ju û
Я я ja ja ja ja â
 ʼ  ʼ
Ѡ ѡ o, ô
Ѧ ѧ ę
Ѩ ѩ
Ѫ ѫ ǫ ǎ
Ѭ ѭ
Ѯ ѯ ks
Ѱ ѱ ps
Ѳ ѳ th (θ) f* f* f*
Ѵ ѵ ü (i*) (i*) (i*)
Ѥ ѥ je
Ꙗ ꙗ ja
* Archaic letters.
() Letters in parentheses are older or alternate transliterations.

Ukrainian and Belarusian apostrophe are not transcribed. The early Cyrillic letter koppa (Ҁ, ҁ) was used only for transliterating Greek and its numeric value and was thus omitted. ISO 9:1995 is provided for comparison.

Unicode encoding is,

See also


  1. ^ Timberlake 2004, p 24.
  2. ^ a b The templates {{softsign}} and {{hardsign}} may be used for the proper character.