Note: The information contained herein is provided as-is. It was compiled from the references listed at the end of the page. The author believes all information presented here to be accurate; if you find an error, please send me an e-mail.

This page provides a cross-reference table for the unambiguous, one-to-one mapping between the TIS 620-2533:1990 and ISO/IEC 10646-1:1993 character set standards. Coded values for TIS 620 are given in hexadecimal; those for 10646 use the Unicode BMP short form (the symbol U+ followed by a two-octet hexadecimal coded value).

The "lower" part of the TIS 620 table strictly follows the codepoint assignments of the graphic characters in ISO 646 (US-ASCII).

TIS 620-2533 ISO/IEC 10646-1 TIS 620-2533 ISO/IEC 10646-1 TIS 620-2533 ISO/IEC 10646-1
20 U+0020 40 U+0040 60 U+0060
21 U+0021 41 U+0041 61 U+0061
22 U+0022 42 U+0042 62 U+0062
23 U+0023 43 U+0043 63 U+0063
24 U+0024 44 U+0044 64 U+0064
25 U+0025 45 U+0045 65 U+0065
26 U+0026 46 U+0046 66 U+0066
27 U+0027 47 U+0047 67 U+0067
28 U+0028 48 U+0048 68 U+0068
29 U+0029 49 U+0049 69 U+0069
2A U+002A 4A U+004A 6A U+006A
2B U+002B 4B U+004B 6B U+006B
2C U+002C 4C U+004C 6C U+006C
2D U+002D 4D U+004D 6D U+006D
2E U+002E 4E U+004E 6E U+006E
2F U+002F 4F U+004F 6F U+006F
30 U+0030 50 U+0050 70 U+0070
31 U+0031 51 U+0051 71 U+0071
32 U+0032 52 U+0052 72 U+0072
33 U+0033 53 U+0053 73 U+0073
34 U+0034 54 U+0054 74 U+0074
35 U+0035 55 U+0055 75 U+0075
36 U+0036 56 U+0056 76 U+0076
37 U+0037 57 U+0057 77 U+0077
38 U+0038 58 U+0058 78 U+0078
39 U+0039 59 U+0059 79 U+0079
3A U+003A 5A U+005A 7A U+007A
3B U+003B 5B U+005B 7B U+007B
3C U+003C 5C U+005C 7C U+007C
3D U+003D 5D U+005D 7D U+007D
3E U+003E 5E U+005E 7E U+007E
3F U+003F 5F U+005F  
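
Because the lower part is an identity mapping, its rows can be generated rather than typed. The following is only a minimal sketch and is not taken from either standard; the function names lower_to_unicode and format_row are invented for illustration.

```python
def lower_to_unicode(code: int) -> int:
    """Map a TIS 620 code in the ISO 646 graphic range (20-7E) to its 10646 value."""
    if not 0x20 <= code <= 0x7E:
        raise ValueError(f"{code:02X} is outside the graphic range 20-7E")
    return code  # identity: the lower part follows ISO 646


def format_row(code: int) -> str:
    """Render one cell pair: TIS 620 in hexadecimal, 10646 in U+ short form."""
    return f"{code:02X} U+{lower_to_unicode(code):04X}"


print(format_row(0x41))  # prints "41 U+0041"
```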

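The second part of TIS 620 defines the Thai characters. The codepoint assignments in TIS 620 and 10646 closely resemble each other: in the table below, every assigned TIS 620 codepoint maps to the 10646 value obtained by adding the same constant (A1 maps to U+0E01, A2 to U+0E02, and so on).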
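
The resemblance can be made concrete. For the assigned Thai codepoints listed on this page, the 10646 value equals the TIS 620 value plus the constant 0D60 hexadecimal (A1 + 0D60 = 0E01). The sketch below illustrates that arithmetic only; the names THAI_OFFSET and thai_to_unicode are hypothetical, and the accepted ranges (A1-DA and DF-FB) are read off this page's table rather than quoted from the standard text.

```python
# Derived from the first table row: A1 maps to U+0E01, so the offset is 0x0D60.
THAI_OFFSET = 0x0E01 - 0xA1


def thai_to_unicode(code: int) -> int:
    """Convert one TIS 620 Thai codepoint (A1-DA or DF-FB) to its 10646 value."""
    if not (0xA1 <= code <= 0xDA or 0xDF <= code <= 0xFB):
        raise ValueError(f"{code:02X} is not an assigned Thai codepoint")
    return code + THAI_OFFSET


# Spot checks against rows of the table on this page.
for tis, ucs in [(0xA1, 0x0E01), (0xDA, 0x0E3A), (0xDF, 0x0E3F), (0xFB, 0x0E5B)]:
    assert thai_to_unicode(tis) == ucs
```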

Note: Special attention should be paid to codepoint A0. Contrary to the common belief that TIS 620 defines codepoint A0 as no-break space (U+00A0), the standard does not assign any character to this codepoint. Codepoint A0, as well as DB-DE and FC-FF, is not part of the standard. The interpretation of these unassigned codepoints is implementation-specific and may vary from one implementation to another. To ensure full data interchangeability among applications, Thai software implementors are advised to follow the TIS 620 standard strictly.

TIS 620-2533 ISO/IEC 10646-1 TIS 620-2533 ISO/IEC 10646-1 TIS 620-2533 ISO/IEC 10646-1
A0 (unassigned) C0 U+0E20 E0 U+0E40
A1 U+0E01 C1 U+0E21 E1 U+0E41
A2 U+0E02 C2 U+0E22 E2 U+0E42
A3 U+0E03 C3 U+0E23 E3 U+0E43
A4 U+0E04 C4 U+0E24 E4 U+0E44
A5 U+0E05 C5 U+0E25 E5 U+0E45
A6 U+0E06 C6 U+0E26 E6 U+0E46
A7 U+0E07 C7 U+0E27 E7 U+0E47
A8 U+0E08 C8 U+0E28 E8 U+0E48
A9 U+0E09 C9 U+0E29 E9 U+0E49
AA U+0E0A CA U+0E2A EA U+0E4A
AB U+0E0B CB U+0E2B EB U+0E4B
AC U+0E0C CC U+0E2C EC U+0E4C
AD U+0E0D CD U+0E2D ED U+0E4D
AE U+0E0E CE U+0E2E EE U+0E4E
AF U+0E0F CF U+0E2F EF U+0E4F
B0 U+0E10 D0 U+0E30 F0 U+0E50
B1 U+0E11 D1 U+0E31 F1 U+0E51
B2 U+0E12 D2 U+0E32 F2 U+0E52
B3 U+0E13 D3 U+0E33 F3 U+0E53
B4 U+0E14 D4 U+0E34 F4 U+0E54
B5 U+0E15 D5 U+0E35 F5 U+0E55
B6 U+0E16 D6 U+0E36 F6 U+0E56
B7 U+0E17 D7 U+0E37 F7 U+0E57
B8 U+0E18 D8 U+0E38 F8 U+0E58
B9 U+0E19 D9 U+0E39 F9 U+0E59
BA U+0E1A DA U+0E3A FA U+0E5A
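BB U+0E1B DB (unassigned) FB U+0E5B
BC U+0E1C DC (unassigned) FC (unassigned)
BD U+0E1D DD (unassigned) FD (unassigned)
BE U+0E1E DE (unassigned) FE (unassigned)
BF U+0E1F DF U+0E3F FF (unassigned)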
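
A converter that follows the standard strictly, as the note above recommends, can refuse the unassigned codepoints instead of guessing at them. The following is a minimal sketch under stated assumptions, not a reference implementation: the function names decode_tis620 and encode_tis620 are hypothetical, bytes 00-7F (including control characters, which the tables on this page do not list) are passed through unchanged, and bytes 80-9F are rejected because this page gives no mapping for them.

```python
THAI_OFFSET = 0x0D60  # A1 + 0D60 = 0E01, per the table on this page


def decode_tis620(data: bytes) -> str:
    """Decode TIS 620 bytes to a Unicode string, rejecting unassigned bytes."""
    out = []
    for pos, b in enumerate(data):
        if b <= 0x7F:
            out.append(chr(b))  # assumption: lower half passes through as ISO 646
        elif 0xA1 <= b <= 0xDA or 0xDF <= b <= 0xFB:
            out.append(chr(b + THAI_OFFSET))  # Thai part: constant offset
        else:
            # Covers A0, DB-DE and FC-FF, plus 80-9F (assumption, see above).
            raise ValueError(f"byte {b:02X} at position {pos} is not assigned")
    return "".join(out)


def encode_tis620(text: str) -> bytes:
    """Encode a Unicode string as TIS 620, rejecting unmappable characters."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp <= 0x7F:
            out.append(cp)
        elif 0x0E01 <= cp <= 0x0E3A or 0x0E3F <= cp <= 0x0E5B:
            out.append(cp - THAI_OFFSET)
        else:
            raise ValueError(f"U+{cp:04X} has no TIS 620 equivalent")
    return bytes(out)
```

For example, decode_tis620(b"\xa1\xd2") yields the two characters U+0E01 U+0E32, whereas decode_tis620(b"\xa0") raises ValueError rather than silently producing U+00A0. An application that must accept such bytes from legacy data could substitute its own policy at that point.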

References

  1. TIS 620-2533 Standard for Thai Character Codes for Computers (in Thai), Thai Industrial Standards Institute
  2. Thai Information Technology Standards, On-line resources at the National Electronics and Computer Technology Center (NECTEC)
  3. ISO/IEC 10646-1, ISO/IEC JTC1/SC2
  4. Thai block in Unicode 4.1, Unicode Consortium

[An annotated reference to the Thai implementations]