Special Romanian Unicode characters
There is a lot of confusion about how to write the Romanian characters that denote the sounds /S/ and /ts/. Although the officially preferred forms are "s with comma below" and "t with comma below", respectively, many printed texts use "s with cedilla" and "t with cedilla", and in practice the difference is treated as a font variation.
This usage has been carried over into all character encoding standards for Central and Eastern Europe (including ISO 8859-2), which include "s" and "t" with cedillas. In addition, most computer fonts draw "s with cedilla" with a true cedilla (like its Turkish equivalent) but draw "t with cedilla" with a comma below.
ISO 8859-16 places "s" and "t" with comma below at the same code positions that "s" and "t" with cedilla occupy in ISO 8859-2.
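The shared positions can be checked with a short Python sketch. It assumes Python's standard `iso8859_2` and `iso8859_16` codecs; the helper name `byte_positions` is made up for this illustration and is not part of any standard.

```python
# Code points as listed in this article.
CEDILLA = "\u015E\u015F\u0162\u0163"      # S/s/T/t with cedilla
COMMA_BELOW = "\u0218\u0219\u021A\u021B"  # S/s/T/t with comma below


def byte_positions(text, encoding):
    """Return the single-byte position of each character, as hex strings."""
    return [f"0x{b:02X}" for b in text.encode(encoding)]


cedilla_positions = byte_positions(CEDILLA, "iso8859_2")
comma_positions = byte_positions(COMMA_BELOW, "iso8859_16")
print("ISO 8859-2 (cedilla):     ", cedilla_positions)
print("ISO 8859-16 (comma below):", comma_positions)

# The same raw bytes therefore decode to different characters
# depending on which of the two encodings is assumed.
raw = CEDILLA.encode("iso8859_2")
reinterpreted = raw.decode("iso8859_16")
print([f"U+{ord(ch):04X}" for ch in reinterpreted])
```

A consequence is that a file saved as ISO 8859-2 but read as ISO 8859-16 silently changes its cedilla letters into comma-below letters, and vice versa.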
The Unicode standard defines the "comma below" characters in the Latin Extended-B block (hex range 0180-024F).
Phoneme | Character with cedilla | Unicode position (hex) | Character with comma below | Unicode position (hex) | HTML entity (comma below)
---|---|---|---|---|---
/S/ | Ş | 015E | Ș | 0218 | &#x0218;
/S/ | ş | 015F | ș | 0219 | &#x0219;
/ts/ | Ţ | 0162 | Ț | 021A | &#x021A;
/ts/ | ţ | 0163 | ț | 021B | &#x021B;
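As an illustration of how the table can be applied, the following Python sketch maps the cedilla forms to the comma-below forms. The names `CEDILLA_TO_COMMA`, `to_comma_below` and `describe` are hypothetical, and the sketch assumes the input is known to be Romanian: Turkish text legitimately uses "s with cedilla" and should not be converted this way.

```python
import unicodedata

# Cedilla code point -> comma-below code point, taken from the table above.
CEDILLA_TO_COMMA = str.maketrans({
    "\u015E": "\u0218",  # capital S
    "\u015F": "\u0219",  # small s
    "\u0162": "\u021A",  # capital T
    "\u0163": "\u021B",  # small t
})


def to_comma_below(text: str) -> str:
    """Replace the four cedilla letters; leave every other character as is."""
    return text.translate(CEDILLA_TO_COMMA)


def describe(text: str):
    """List (code point, official Unicode name) for each character."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch)) for ch in text]


# Two Romanian words typed with the cedilla forms (escaped to keep this ASCII).
sample = "\u015Fi \u0163ar\u0103"
fixed = to_comma_below(sample)
for code_point, name in describe(fixed):
    print(code_point, name)
```

Using `str.translate` keeps the conversion to a single pass over the text and leaves characters outside the table untouched.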
External links
- Unicode Latin Extended-B characters (http://www.unicode.org/charts/PDF/U0180.pdf/)