TIS-620

Thai Industrial Standard 620-2533, commonly referred to as TIS-620, is the most common character set and character encoding for the Thai language. The standard is published by the Thai Industrial Standards Institute (TISI), an organ of the Ministry of Industry under the Royal Thai Government, and is the sole official standard for encoding Thai in Thailand. The descriptive name of the standard is "Standard for Thai Character Codes for Computers" (Thai: รหัสสำหรับอักขระไทยที่ใช้กับคอมพิวเตอร์). "2533" refers to year 2533 of the Buddhist Era (1990), the year the present version of the standard was published; a previous revision, TIS 620-2529 (1986), is now obsolete.

Structure

TIS-620 is a conventionally structured extended ASCII national character set that retains full compatibility with 7-bit ASCII and uses the 8-bit range hex A1 to FB to encode the Thai alphabet. Because Thai vowels and diacritics combine with the consonants they accompany, TIS-620 is intended for information interchange only; a separate display engine is required to compose the characters correctly.

Variants

A nearly identical version of TIS-620 was adopted as ISO 8859-11 in 1999. The sole difference is that ISO 8859-11 defines hex A0 as a non-breaking space, while TIS-620 leaves it undefined but reserved. (In practice, this small distinction is usually ignored.) The ISO 8859-11 set has also been registered as ISO-IR-166 by Ecma International, but this variation adds explicit escape codes for signaling the beginning and end of Thai character sequences. Unicode (ISO 10646) also uses the TIS-620 character ordering essentially as is. Unicode's Thai block is U+0E00 through U+0E7F, and a TIS-620 Thai character can be converted to UTF-16 simply by subtracting hex A0 from the byte value and prefixing the result with 0E. For example, hex A1 becomes U+0E01 and hex FB becomes U+0E5B.
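The conversion rule above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names `tis620_to_unicode` and `unicode_to_tis620` are hypothetical, and the sketch assumes strict TIS-620, so hex A0 is rejected rather than treated as the non-breaking space of ISO 8859-11.

```python
def tis620_to_unicode(data: bytes) -> str:
    """Decode strict TIS-620 bytes with the offset rule (hypothetical helper)."""
    out = []
    for b in data:
        if b < 0x80:
            # 7-bit ASCII (including control codes) passes through unchanged.
            out.append(chr(b))
        elif 0xA1 <= b <= 0xDA or 0xDF <= b <= 0xFB:
            # Subtract hex A0, then place the result in the U+0Exx row.
            out.append(chr(0x0E00 + (b - 0xA0)))
        else:
            raise ValueError(f"byte 0x{b:02X} is not assigned in TIS-620")
    return "".join(out)


def unicode_to_tis620(text: str) -> bytes:
    """Encode a string to strict TIS-620 bytes (inverse of the rule above)."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp < 0x80:
            out.append(cp)
        elif 0x0E01 <= cp <= 0x0E3A or 0x0E3F <= cp <= 0x0E5B:
            out.append(cp - 0x0E00 + 0xA0)
        else:
            raise ValueError(f"U+{cp:04X} has no TIS-620 code")
    return bytes(out)


# The bytes E4 B7 C2 spell the word for "Thai" (U+0E44 U+0E17 U+0E22).
word = tis620_to_unicode(bytes([0xE4, 0xB7, 0xC2]))
assert unicode_to_tis620(word) == bytes([0xE4, 0xB7, 0xC2])
```

Raising an error on unassigned bytes is a design choice for the sketch; a lenient decoder might instead substitute a replacement character or accept hex A0 as a non-breaking space.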
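TIS-620
    x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0x  unused
1x  unused
2x  SP !  "  #  $  %  &  '  (  )  *  +  ,  -  .  /
3x  0  1  2  3  4  5  6  7  8  9  :  ;  <  =  >  ?
4x  @  A  B  C  D  E  F  G  H  I  J  K  L  M  N  O
5x  P  Q  R  S  T  U  V  W  X  Y  Z  [  \  ]  ^  _
6x  `  a  b  c  d  e  f  g  h  i  j  k  l  m  n  o
7x  p  q  r  s  t  u  v  w  x  y  z  {  |  }  ~
8x  unused
9x  unused
Ax     ก  ข  ฃ  ค  ฅ  ฆ  ง  จ  ฉ  ช  ซ  ฌ  ญ  ฎ  ฏ
Bx  ฐ  ฑ  ฒ  ณ  ด  ต  ถ  ท  ธ  น  บ  ป  ผ  ฝ  พ  ฟ
Cx  ภ  ม  ย  ร  ฤ  ล  ฦ  ว  ศ  ษ  ส  ห  ฬ  อ  ฮ  ฯ
Dx  ะ   ั  า  ำ   ิ   ี   ึ   ื   ุ   ู   ฺ              ฿
Ex  เ  แ  โ  ใ  ไ  ๅ  ๆ   ็   ่   ้   ๊   ๋   ์   ํ   ๎  ๏
Fx  ๐  ๑  ๒  ๓  ๔  ๕  ๖  ๗  ๘  ๙  ๚  ๛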
You may need to set your text size to about 200% to see all the characters. In the table above, hex 20 is the regular SPACE character. Code values 00-1F, 7F, 80-9F, A0, DB-DE and FC-FF are not assigned to characters by TIS-620. Code values D1, D4-DA and E7-EE are combining characters.
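The ranges listed above translate directly into a small byte classifier. The following Python sketch is illustrative only: the names `COMBINING`, `UNASSIGNED` and `classify` are hypothetical, and the category labels are an assumption made for this example rather than terminology from the standard.

```python
# Code values copied from the note on the table (all hexadecimal).
COMBINING = {0xD1, *range(0xD4, 0xDB), *range(0xE7, 0xEF)}
UNASSIGNED = {
    *range(0x00, 0x20), 0x7F,   # 00-1F and 7F
    *range(0x80, 0xA1),         # 80-9F and A0
    *range(0xDB, 0xDF),         # DB-DE
    *range(0xFC, 0x100),        # FC-FF
}


def classify(byte: int) -> str:
    """Return a rough category label for one TIS-620 code value."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected a single byte value (0-255)")
    if byte in UNASSIGNED:
        return "unassigned"
    if byte in COMBINING:
        return "combining"
    if byte < 0x80:
        return "ascii"
    return "thai"


# B7 D5 E8 is one consonant followed by two combining marks.
labels = [classify(b) for b in bytes([0xB7, 0xD5, 0xE8])]
assert labels == ["thai", "combining", "combining"]
```

A display engine needs this kind of distinction because a combining code value is drawn above or below the preceding base character instead of occupying its own cell.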

