Oracle® Database Globalization Support Guide
10g Release 2 (10.2)

Part Number B14225-01

A Locale Data

This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server.

You can obtain information about character sets, languages, territories, and linguistic sorts by querying the V$NLS_VALID_VALUES dynamic performance view.


See Also:

Oracle Database Reference for more information about the data that can be returned by this view

Languages

The languages in Table A-1 provide support for locale-sensitive information, such as the default sort listed for each language.

By using Unicode databases and datatypes, you can store, process, and retrieve data for almost all contemporary languages, including many that do not appear in Table A-1.

Table A-1 Oracle Supported Languages

Language Name Language Abbreviation Default Sort
AMERICAN us binary
ARABIC ar ARABIC
ASSAMESE as binary
AZERBAIJANI az AZERBAIJANI
BANGLA bn binary
BRAZILIAN PORTUGUESE ptb WEST_EUROPEAN
BULGARIAN bg BULGARIAN
CANADIAN FRENCH frc CANADIAN FRENCH
CATALAN ca CATALAN
CROATIAN hr CROATIAN
CYRILLIC KAZAKH ckk GENERIC_M
CYRILLIC SERBIAN csr GENERIC_M
CYRILLIC UZBEK cuz GENERIC_M
CZECH cs CZECH
DANISH dk DANISH
DUTCH nl DUTCH
EGYPTIAN eg ARABIC
ENGLISH gb binary
ESTONIAN et ESTONIAN
FINNISH sf FINNISH
FRENCH f FRENCH
GERMAN DIN din GERMAN
GERMAN d GERMAN
GREEK el GREEK
GUJARATI gu binary
HEBREW iw HEBREW
HINDI hi binary
HUNGARIAN hu HUNGARIAN
ICELANDIC is ICELANDIC
INDONESIAN in INDONESIAN
ITALIAN i WEST_EUROPEAN
JAPANESE ja binary
KANNADA kn binary
KOREAN ko binary
LATIN AMERICAN SPANISH esa SPANISH
LATIN SERBIAN lsr binary
LATIN UZBEK luz GENERIC_M
LATVIAN lv LATVIAN
LITHUANIAN lt LITHUANIAN
MACEDONIAN mk binary
MALAY ms MALAY
MALAYALAM ml binary
MARATHI mr binary
MEXICAN SPANISH esm WEST_EUROPEAN
NORWEGIAN n NORWEGIAN
ORIYA or binary
POLISH pl POLISH
PORTUGUESE pt WEST_EUROPEAN
PUNJABI pa binary
ROMANIAN ro ROMANIAN
RUSSIAN ru RUSSIAN
SIMPLIFIED CHINESE zhs binary
SLOVAK sk SLOVAK
SLOVENIAN sl SLOVENIAN
SPANISH e SPANISH
SWEDISH s SWEDISH
TAMIL ta binary
TELUGU te binary
THAI th THAI_DICTIONARY
TRADITIONAL CHINESE zht binary
TURKISH tr TURKISH
UKRAINIAN uk UKRAINIAN
VIETNAMESE vn VIETNAMESE

Translated Messages

Oracle error messages have been translated into the languages listed in Table A-2.

Table A-2 Oracle Supported Messages

Name Abbreviation
ARABIC ar
BRAZILIAN PORTUGUESE ptb
CATALAN ca
CZECH cs
DANISH dk
DUTCH nl
FINNISH sf
FRENCH f
GERMAN d
GREEK el
HEBREW iw
HUNGARIAN hu
ITALIAN i
JAPANESE ja
KOREAN ko
NORWEGIAN n
POLISH pl
PORTUGUESE pt
ROMANIAN ro
RUSSIAN ru
SIMPLIFIED CHINESE zhs
SLOVAK sk
SPANISH e
SWEDISH s
THAI th
TRADITIONAL CHINESE zht
TURKISH tr

Territories

Table A-3 lists the territories supported by the Oracle server.

Table A-3 Oracle Supported Territories

Name Name Name
ALGERIA GREECE POLAND
AMERICA HONG KONG PORTUGAL
ARGENTINA HUNGARY PUERTO RICO
AUSTRALIA ICELAND QATAR
AUSTRIA INDIA ROMANIA
AZERBAIJAN INDONESIA RUSSIA
BAHRAIN IRAQ SAUDI ARABIA
BANGLADESH IRELAND SERBIA AND MONTENEGRO
BELGIUM ISRAEL SINGAPORE
BRAZIL ITALY SLOVAKIA
BULGARIA JAPAN SLOVENIA
CANADA JORDAN SOMALIA
CATALONIA KAZAKHSTAN SOUTH AFRICA
CHILE KOREA SPAIN
CHINA KUWAIT SUDAN
COLOMBIA LATVIA SWEDEN
COSTA RICA LEBANON SWITZERLAND
CROATIA LIBYA SYRIA
CYPRUS LITHUANIA TAIWAN
CZECH REPUBLIC LUXEMBOURG THAILAND
DENMARK MALAYSIA THE NETHERLANDS
DJIBOUTI MAURITANIA TUNISIA
ECUADOR MEXICO TURKEY
EGYPT MOROCCO UKRAINE
EL SALVADOR NEW ZEALAND UNITED ARAB EMIRATES
ESTONIA NICARAGUA UNITED KINGDOM
FINLAND NORWAY UZBEKISTAN
FRANCE OMAN VENEZUELA
FYR MACEDONIA PANAMA VIETNAM
GUATEMALA PERU YEMEN
GERMANY PHILIPPINES

Character Sets

Oracle-supported character sets are listed in the following sections according to three broad categories: recommended database character sets, other character sets, and client-only character sets.

In addition, common character set subset/superset combinations are listed. Some character sets can only be used with certain data types. For example, the AL16UTF16 character set can only be used as an NCHAR character set, and not as a database character set.

The comment column also documents other unique features of each character set that may be important to users or database administrators. For example, it indicates whether the character set supports the euro currency symbol, whether user-defined characters are supported, and whether the character set is a strict superset of ASCII. (You can use the CSALTER script to migrate an existing database to a new character set only if all of the schema data is a strict subset of the new character set.)

The following is the key for the comment column of the character set tables:

SB: single-byte encoding
MB: multibyte encoding
FIXED: fixed-width multibyte encoding
ASCII: strict superset of ASCII
EURO: euro symbol supported
UDC: user-defined characters supported

Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you can use Oracle Locale Builder. Otherwise, you should refer to the actual national, international, or vendor-specific standards.

Recommended Database Character Sets

Table A-4 lists the recommended and most commonly used ASCII-based Oracle database character sets. The list is ordered alphabetically within each language group.

Table A-4 Recommended ASCII Database Character Sets


Name Description Comments
Asian



JA16EUC EUC 24-bit Japanese MB, ASCII

JA16EUCTILDE The same as JA16EUC except for the way that the wave dash and the tilde are mapped to and from Unicode. MB, ASCII

JA16SJIS Shift-JIS 16-bit Japanese MB, ASCII, UDC

JA16SJISTILDE The same as JA16SJIS except for the way that the wave dash and the tilde are mapped to and from Unicode. MB, ASCII, UDC

KO16MSWIN949 MS Windows Code Page 949 Korean MB, ASCII, UDC

TH8TISASCII Thai Industrial Standard 620-2533 - ASCII 8-bit SB, ASCII, EURO

VN8MSWIN1258 MS Windows Code Page 1258 8-bit Vietnamese SB, ASCII, EURO

ZHS16GBK GBK 16-bit Simplified Chinese MB, ASCII, UDC

ZHT16HKSCS MS Windows Code Page 950 with Hong Kong Supplementary Character Set HKSCS-2001 (character set conversion to and from Unicode is based on Unicode 3.0) MB, ASCII, EURO

ZHT16MSWIN950 MS Windows Code Page 950 Traditional Chinese MB, ASCII, UDC

ZHT32EUC EUC 32-bit Traditional Chinese MB, ASCII
European



BLT8ISO8859P13 ISO 8859-13 Baltic SB, ASCII

BLT8MSWIN1257 MS Windows Code Page 1257 8-bit Baltic SB, ASCII, EURO

CL8ISO8859P5 ISO 8859-5 Latin/Cyrillic SB, ASCII

CL8MSWIN1251 MS Windows Code Page 1251 8-bit Latin/Cyrillic SB, ASCII, EURO

EE8ISO8859P2 ISO 8859-2 East European SB, ASCII

EL8ISO8859P7 ISO 8859-7 Latin/Greek SB, ASCII, EURO

EL8MSWIN1253 MS Windows Code Page 1253 8-bit Latin/Greek SB, ASCII, EURO

EE8MSWIN1250 MS Windows Code Page 1250 8-bit East European SB, ASCII, EURO

NE8ISO8859P10 ISO 8859-10 North European SB, ASCII

NEE8ISO8859P4 ISO 8859-4 North and North-East European SB, ASCII

WE8ISO8859P15 ISO 8859-15 West European SB, ASCII, EURO

WE8MSWIN1252 MS Windows Code Page 1252 8-bit West European SB, ASCII, EURO
Middle Eastern



AR8ISO8859P6 ISO 8859-6 Latin/Arabic SB, ASCII

AR8MSWIN1256 MS Windows Code Page 1256 8-Bit Latin/Arabic SB, ASCII, EURO

IW8ISO8859P8 ISO 8859-8 Latin/Hebrew SB, ASCII

IW8MSWIN1255 MS Windows Code Page 1255 8-bit Latin/Hebrew SB, ASCII, EURO

TR8MSWIN1254 MS Windows Code Page 1254 8-bit Turkish SB, ASCII, EURO

WE8ISO8859P9 ISO 8859-9 West European & Turkish SB, ASCII
Universal



AL32UTF8 Unicode 4.0 UTF-8 Universal character set MB, ASCII, EURO

Table A-5 lists the recommended and most commonly used EBCDIC-based Oracle database character sets. The list is ordered alphabetically within each language group.

Table A-5 Recommended EBCDIC Database Character Sets


Name Description Comments
Asian



JA16DBCS IBM EBCDIC 16-bit Japanese MB, UDC

JA16EBCDIC930 IBM DBCS Code Page 290 16-bit Japanese MB, UDC

KO16DBCS IBM EBCDIC 16-bit Korean MB, UDC

TH8TISEBCDICS Thai Industrial Standard 620-2533-EBCDIC Server 8-bit SB
European



BLT8EBCDIC1112S EBCDIC Code Page 1112 8-bit Server Baltic Multilingual SB

CE8BS2000 Siemens EBCDIC.DF.04 8-bit Central European SB

CL8BS2000 Siemens EBCDIC.EHC.LC 8-bit Cyrillic SB

CL8EBCDIC1025R EBCDIC Code Page 1025 Server 8-bit Cyrillic SB

CL8EBCDIC1158R EBCDIC Code Page 1158 Server 8-bit Cyrillic SB

D8EBCDIC1141 EBCDIC Code Page 1141 8-bit Austrian German SB, EURO

DK8EBCDIC1142 EBCDIC Code Page 1142 8-bit Danish SB, EURO

EE8BS2000 Siemens EBCDIC.DF.04 8-bit East European SB

EE8EBCDIC870S EBCDIC Code Page 870 Server 8-bit East European SB

EL8EBCDIC423R IBM EBCDIC Code Page 423 for RDBMS server-side SB

EL8EBCDIC875R EBCDIC Code Page 875 Server 8-bit Greek SB

F8EBCDIC1147 EBCDIC Code Page 1147 8-bit French SB, EURO

I8EBCDIC1144 EBCDIC Code Page 1144 8-bit Italian SB, EURO

S8EBCDIC1143 EBCDIC Code Page 1143 8-bit Swedish SB, EURO

WE8BS2000 Siemens EBCDIC.DF.04 8-bit West European SB

WE8BS2000E Siemens EBCDIC.DF.04 8-bit West European SB, EURO

WE8BS2000L5 Siemens EBCDIC.DF.L5 8-bit West European/Turkish SB

WE8EBCDIC1047E Latin 1/Open Systems 1047 SB, EBCDIC, EURO

WE8EBCDIC1140 EBCDIC Code Page 1140 8-bit West European SB, EURO

WE8EBCDIC1145 EBCDIC Code Page 1145 8-bit West European SB, EURO

WE8EBCDIC1146 EBCDIC Code Page 1146 8-bit West European SB, EURO

WE8EBCDIC1148 EBCDIC Code Page 1148 8-bit West European SB, EURO
Middle Eastern



AR8EBCDIC420S EBCDIC Code Page 420 Server 8-bit Latin/Arabic SB

IW8EBCDIC424S EBCDIC Code Page 424 Server 8-bit Latin/Hebrew SB

TR8EBCDIC1026S EBCDIC Code Page 1026 Server 8-bit Turkish SB

Other Character Sets

Table A-6 lists the other ASCII-based Oracle character sets. The list is ordered alphabetically within their language groups.

Table A-6 Other ASCII Character Sets


Name Description Comments
Asian



BN8BSCII Bangladesh National Code 8-bit BSCII SB, ASCII

IN8ISCII Multiple-Script Indian Standard 8-bit Latin/Indian Languages SB, ASCII

JA16VMS JVMS 16-bit Japanese MB, ASCII

KO16KSC5601 KSC5601 16-bit Korean MB, ASCII

KO16KSCCS KSCCS 16-bit Korean MB, ASCII

TH8MACTHAIS Mac Server 8-bit Latin/Thai SB, ASCII

VN8VN3 VN3 8-bit Vietnamese SB, ASCII

ZHS16CGB231280 CGB2312-80 16-bit Simplified Chinese MB, ASCII

ZHT16BIG5 BIG5 16-bit Traditional Chinese MB, ASCII

ZHT16CCDC HP CCDC 16-bit Traditional Chinese MB, ASCII

ZHT16DBT Taiwan Taxation 16-bit Traditional Chinese MB, ASCII

ZHT16HKSCS31 MS Windows Code Page 950 with Hong Kong Supplementary Character Set HKSCS-2001 (character set conversion to and from Unicode is based on Unicode 3.1) MB, ASCII, EURO

ZHT32SOPS SOPS 32-bit Traditional Chinese MB, ASCII

ZHT32TRIS TRIS 32-bit Traditional Chinese MB, ASCII
Middle Eastern



AR8ADOS710 Arabic MS-DOS 710 Server 8-bit Latin/Arabic SB, ASCII

AR8ADOS710T Arabic MS-DOS 710 8-bit Latin/Arabic SB

AR8ADOS720 Arabic MS-DOS 720 Server 8-bit Latin/Arabic SB, ASCII

AR8ADOS720T Arabic MS-DOS 720 8-bit Latin/Arabic SB

AR8APTEC715 APTEC 715 Server 8-bit Latin/Arabic SB, ASCII

AR8APTEC715T APTEC 715 8-bit Latin/Arabic SB

AR8ASMO708PLUS ASMO 708 Plus 8-bit Latin/Arabic SB, ASCII

AR8ASMO8X ASMO Extended 708 8-bit Latin/Arabic SB, ASCII

AR8HPARABIC8T HP 8-bit Latin/Arabic SB

AR8ISO8859P6 ISO 8859-6 Latin/Arabic SB, ASCII

AR8MUSSAD768 Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic SB, ASCII

AR8MUSSAD768T Mussa'd Alarabi/2 768 8-bit Latin/Arabic SB

AR8NAFITHA711 Nafitha Enhanced 711 Server 8-bit Latin/Arabic SB, ASCII

AR8NAFITHA711T Nafitha Enhanced 711 8-bit Latin/Arabic SB

AR8NAFITHA721 Nafitha International 721 Server 8-bit Latin/Arabic SB, ASCII

AR8NAFITHA721T Nafitha International 721 8-bit Latin/Arabic SB

AR8SAKHR706 SAKHR 706 Server 8-bit Latin/Arabic SB, ASCII

AR8SAKHR707 SAKHR 707 Server 8-bit Latin/Arabic SB, ASCII

AR8SAKHR707T SAKHR 707 8-bit Latin/Arabic SB

AR8XBASIC XBASIC 8-bit Latin/Arabic SB

AZ8ISO8859PE ISO 8859-9 Latin Azerbaijani SB, ASCII

IN8ISCII Multiple-Script Indian Standard 8-bit Latin/Indian Languages SB, ASCII

IW8MACHEBREW Mac Client 8-bit Hebrew SB

IW8PC1507 IBM-PC Code Page 1507/862 8-bit Latin/Hebrew SB, ASCII

LA8ISO6937 ISO 6937 8-bit Coded Character Set for Text Communication SB, ASCII

TR7DEC DEC VT100 7-bit Turkish SB

TR8DEC DEC 8-bit Turkish SB, ASCII

TR8PC857 IBM-PC Code Page 857 8-bit Turkish SB, ASCII
European



AR8ARABICMAC Mac Client 8-bit Latin/Arabic SB

AR8ARABICMACS Mac Server 8-bit Latin/Arabic SB, ASCII

BG8MSWIN MS Windows 8-bit Bulgarian Cyrillic SB, ASCII

BG8PC437S IBM-PC Code Page 437 8-bit (Bulgarian Modification) SB, ASCII

BLT8CP921 Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic SB, ASCII

BLT8PC775 IBM-PC Code Page 775 8-bit Baltic SB, ASCII

CDN8PC863 IBM-PC Code Page 863 8-bit Canadian French SB, ASCII

CEL8ISO8859P14 ISO 8859-14 Celtic SB, ASCII

CL8ISOIR111 ISOIR111 Cyrillic SB

CL8KOI8R RELCOM Internet Standard 8-bit Latin/Cyrillic SB, ASCII

CL8KOI8U KOI8 Ukrainian Cyrillic SB

CL8MACCYRILLICS Mac Server 8-bit Latin/Cyrillic SB, ASCII

EE8MACCES Mac Server 8-bit Central European SB, ASCII

EE8MACCROATIANS Mac Server 8-bit Croatian SB, ASCII

EE8PC852 IBM-PC Code Page 852 8-bit East European SB, ASCII

EL8DEC DEC 8-bit Latin/Greek SB

EL8MACGREEKS Mac Server 8-bit Greek SB, ASCII

EL8PC437S IBM-PC Code Page 437 8-bit (Greek modification) SB, ASCII

EL8PC851 IBM-PC Code Page 851 8-bit Greek/Latin SB, ASCII

EL8PC869 IBM-PC Code Page 869 8-bit Greek/Latin SB, ASCII

ET8MSWIN923 MS Windows Code Page 923 8-bit Estonian SB, ASCII

HU8ABMOD Hungarian 8-bit Special AB Mod SB, ASCII

HU8CWI2 Hungarian 8-bit CWI-2 SB, ASCII

IS8PC861 IBM-PC Code Page 861 8-bit Icelandic SB, ASCII

IW7IS960 Israeli Standard 960 7-bit Latin/Hebrew SB

IW8ISO8859P8 ISO 8859-8 Latin/Hebrew SB, ASCII

LA8ISO6937 ISO 6937 8-bit Coded Character Set for Text Communication SB, ASCII

LA8PASSPORT German Government Printer 8-bit All-European Latin SB, ASCII

LT8MSWIN921 MS Windows Code Page 921 8-bit Lithuanian SB, ASCII

LT8PC772 IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic) SB, ASCII

LT8PC774 IBM-PC Code Page 774 8-bit Lithuanian (Latin) SB, ASCII

LV8PC8LR Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic SB, ASCII

LV8PC1117 IBM-PC Code Page 1117 8-bit Latvian SB, ASCII

LV8RST104090 IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic) SB, ASCII

N8PC865 IBM-PC Code Page 865 8-bit Norwegian SB, ASCII

RU8BESTA BESTA 8-bit Latin/Cyrillic SB, ASCII

RU8PC855 IBM-PC Code Page 855 8-bit Latin/Cyrillic SB, ASCII

RU8PC866 IBM-PC Code Page 866 8-bit Latin/Cyrillic SB, ASCII

SE8ISO8859P3 ISO 8859-3 South European SB, ASCII

TR8MACTURKISH Mac Client 8-bit Turkish SB

TR8MACTURKISHS Mac Server 8-bit Turkish SB, ASCII

TR8PC857 IBM-PC Code Page 857 8-bit Turkish SB, ASCII

US7ASCII ASCII 7-bit American SB, ASCII

US8PC437 IBM-PC Code Page 437 8-bit American SB, ASCII

WE8DEC DEC 8-bit West European SB, ASCII

WE8DG DG 8-bit West European SB, ASCII

WE8ISO8859P1 ISO 8859-1 West European SB, ASCII

WE8MACROMAN8S Mac Server 8-bit Extended Roman8 West European SB, ASCII

WE8NCR4970 NCR 4970 8-bit West European SB, ASCII

WE8NEXTSTEP NeXTSTEP PostScript 8-bit West European SB, ASCII

WE8PC850 IBM-PC Code Page 850 8-bit West European SB, ASCII

WE8PC858 IBM-PC Code Page 858 8-bit West European SB, ASCII, EURO

WE8PC860 IBM-PC Code Page 860 8-bit West European SB, ASCII

WE8ROMAN8 HP Roman8 8-bit West European SB, ASCII
Universal



UTF8 Unicode 3.0 UTF-8 Universal character set, CESU-8 compliant MB, ASCII, EURO

Table A-7 lists the other EBCDIC-based Oracle character sets. The list is ordered alphabetically within their language groups.

Table A-7 Other EBCDIC Character Sets


Name Description Comments
Asian



TH8TISEBCDIC Thai Industrial Standard 620-2533 - EBCDIC 8-bit SB

ZHS16DBCS IBM EBCDIC 16-bit Simplified Chinese MB, UDC

ZHT16DBCS IBM EBCDIC 16-bit Traditional Chinese MB, UDC
Middle Eastern



AR8EBCDICX EBCDIC XBASIC Server 8-bit Latin/Arabic SB

IW8EBCDIC424 EBCDIC Code Page 424 8-bit Latin/Hebrew SB

IW8EBCDIC1086 EBCDIC Code Page 1086 8-bit Hebrew SB

TR8EBCDIC1026 EBCDIC Code Page 1026 8-bit Turkish SB

WE8EBCDIC37C EBCDIC Code Page 37 8-bit Oracle/c SB
European



BLT8EBCDIC1112 EBCDIC Code Page 1112 8-bit Server Baltic Multilingual SB

CL8EBCDIC1025 EBCDIC Code Page 1025 8-bit Cyrillic SB

CL8EBCDIC1025C EBCDIC Code Page 1025 Client 8-bit Cyrillic SB

CL8EBCDIC1025S EBCDIC Code Page 1025 Server 8-bit Cyrillic SB

CL8EBCDIC1025X EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic SB

CL8EBCDIC1158 EBCDIC Code Page 1158 8-bit Cyrillic SB

D8BS2000 Siemens 9750-62 EBCDIC 8-bit German SB

D8EBCDIC273 EBCDIC Code Page 273/1 8-bit Austrian German SB

DK7SIEMENS9780X Siemens 97801/97808 7-bit Danish SB

DK8BS2000 Siemens 9750-62 EBCDIC 8-bit Danish SB

DK8EBCDIC277 EBCDIC Code Page 277/1 8-bit Danish SB

E8BS2000 Siemens 9750-62 EBCDIC 8-bit Spanish SB

EE8EBCDIC870 EBCDIC Code Page 870 8-bit East European SB

EE8EBCDIC870C EBCDIC Code Page 870 Client 8-bit East European SB

EL8EBCDIC875 EBCDIC Code Page 875 8-bit Greek SB

EL8GCOS7 Bull EBCDIC GCOS7 8-bit Greek SB

F8BS2000 Siemens 9750-62 EBCDIC 8-bit French SB

F8EBCDIC297 EBCDIC Code Page 297 8-bit French SB

I8EBCDIC280 EBCDIC Code Page 280/1 8-bit Italian SB

S8BS2000 Siemens 9750-62 EBCDIC 8-bit Swedish SB

S8EBCDIC278 EBCDIC Code Page 278/1 8-bit Swedish SB

US8ICL ICL EBCDIC 8-bit American SB

US8BS2000 Siemens 9750-62 EBCDIC 8-bit American SB

WE8EBCDIC924 Latin 9 EBCDIC 924 SB, EBCDIC

WE8EBCDIC37 EBCDIC Code Page 37 8-bit West European SB

WE8EBCDIC284 EBCDIC Code Page 284 8-bit Latin American/Spanish SB

WE8EBCDIC285 EBCDIC Code Page 285 8-bit West European SB

WE8EBCDIC1047 EBCDIC Code Page 1047 8-bit West European SB

WE8EBCDIC1140C EBCDIC Code Page 1140 8-bit West European SB, EURO

WE8EBCDIC1148C EBCDIC Code Page 1148 Client 8-bit West European SB, EURO

WE8EBCDIC500C EBCDIC Code Page 500 8-bit Oracle/c SB

WE8EBCDIC500 EBCDIC Code Page 500 8-bit West European SB

WE8EBCDIC871 EBCDIC Code Page 871 8-bit Icelandic SB

WE8ICL ICL EBCDIC 8-bit West European SB

WE8GCOS7 Bull EBCDIC GCOS7 8-bit West European SB
Universal



UTFE EBCDIC form of Unicode 3.0 UTF-8 Universal character set (UTF-EBCDIC) MB, EURO

Character Sets that Support the Euro Symbol

Table A-8 lists the character sets that support the Euro symbol.

Table A-8 Character Sets that Support the Euro Symbol

Character Set Name Hexadecimal Code Value of the Euro Symbol
AL16UTF16 20AC
AL32UTF8 E282AC
AR8MSWIN1256 80
BLT8MSWIN1257 80
CL8EBCDIC1158 E1
CL8EBCDIC1158R 9F
CL8MSWIN1251 88
D8EBCDIC1141 9F
DK8EBCDIC1142 5A
EE8MSWIN1250 80
EL8EBCDIC423R FD
EL8EBCDIC875R DF
EL8ISO8859P7 A4
EL8MSWIN1253 80
F8EBCDIC1147 9F
I8EBCDIC1144 9F
IW8MSWIN1255 80
KO16KSC5601 A2E6
KO16KSCCS D9E6
KO16MSWIN949 A2E6
S8EBCDIC1143 5A
TH8TISASCII 80
TR8MSWIN1254 80
UTF8 E282AC
UTFE CA4653
VN8MSWIN1258 80
WE8BS2000E 9F
WE8EBCDIC1047E 9F
WE8EBCDIC1140 9F
WE8EBCDIC1140C 9F
WE8EBCDIC1145 9F
WE8EBCDIC1146 9F
WE8EBCDIC1148 9F
WE8EBCDIC1148C 9F
WE8EBCDIC924 9F
WE8ISO8859P15 A4
WE8MACROMAN8 DB
WE8MACROMAN8S DB
WE8MSWIN1252 80
WE8PC858 DF
ZHS32GB18030 A2E3
ZHT16HKSCS A3E1
ZHT16HKSCS31 A3E1
ZHT16MSWIN950 A3E1
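
As a quick cross-check of a few rows in Table A-8, the following sketch encodes the euro sign (U+20AC) with Python's standard codecs and prints the resulting bytes in hexadecimal. The pairing of Oracle character set names with Python codec names is an assumption made for illustration only; the two implementations are maintained independently and may differ for other characters.

```python
# Euro sign, U+20AC.
EURO = "\u20ac"

# Assumed correspondence between Oracle character set names and
# Python codec names (illustrative only; not defined by Oracle).
ORACLE_TO_PYTHON = {
    "AL16UTF16": "utf-16-be",
    "AL32UTF8": "utf-8",
    "CL8MSWIN1251": "cp1251",
    "EE8MSWIN1250": "cp1250",
    "WE8ISO8859P15": "iso8859_15",
    "WE8MSWIN1252": "cp1252",
}


def euro_hex(oracle_name: str) -> str:
    """Return the euro sign's encoded bytes as uppercase hexadecimal."""
    codec = ORACLE_TO_PYTHON[oracle_name]
    return EURO.encode(codec).hex().upper()


if __name__ == "__main__":
    for name in ORACLE_TO_PYTHON:
        print(f"{name:15} {euro_hex(name)}")
```

For the character sets chosen here, the printed values should agree with the corresponding rows of Table A-8.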

Client-Only Character Sets

Table A-9 lists the Oracle character sets that are supported as client-only character sets. The list is ordered alphabetically within their respective language groups.

Table A-9 Client-Only Character Sets


Name Description Comments
Asian



JA16EUCYEN EUC 24-bit Japanese with '\' mapped to the Japanese yen character MB

JA16MACSJIS Mac client Shift-JIS 16-bit Japanese MB

JA16SJISYEN Shift-JIS 16-bit Japanese with '\' mapped to the Japanese yen character MB, UDC

TH8MACTHAI Mac Client 8-bit Latin/Thai SB

ZHS32GB18030 GB18030-2000 MB, ASCII, EURO

ZHS16MACCGB231280 Mac client CGB2312-80 16-bit Simplified Chinese MB
European



CH7DEC DEC VT100 7-bit Swiss (German/French) SB

CL8MACCYRILLIC Mac Client 8-bit Latin/Cyrillic SB

D7SIEMENS9780X Siemens 97801/97808 7-bit German SB

D7DEC DEC VT100 7-bit German SB

EEC8EUROASCI EEC Targon 35 ASCI West European/Greek SB

EEC8EUROPA3 EEC EUROPA3 8-bit West European/Greek SB

EE8MACCROATIAN Mac Client 8-bit Croatian SB

EE8MACCE Mac Client 8-bit Central European SB

EL8PC737 IBM-PC Code Page 737 8-bit Greek/Latin SB

EL8MACGREEK Mac Client 8-bit Greek SB

E7DEC DEC VT100 7-bit Spanish SB

E7SIEMENS9780X Siemens 97801/97808 7-bit Spanish SB

F7DEC DEC VT100 7-bit French SB

F7SIEMENS9780X Siemens 97801/97808 7-bit French SB

I7DEC DEC VT100 7-bit Italian SB

I7SIEMENS9780X Siemens 97801/97808 7-bit Italian SB

IS8MACICELANDICS Mac Server 8-bit Icelandic SB

IS8MACICELANDIC Mac Client 8-bit Icelandic SB

NL7DEC DEC VT100 7-bit Dutch SB

NDK7DEC DEC VT100 7-bit Norwegian/Danish SB

N7SIEMENS9780X Siemens 97801/97808 7-bit Norwegian SB

SF7DEC DEC VT100 7-bit Finnish SB

S7SIEMENS9780X Siemens 97801/97808 7-bit Swedish SB

S7DEC DEC VT100 7-bit Swedish SB

SF7ASCII ASCII 7-bit Finnish SB

TR7DEC DEC VT100 7-bit Turkish SB

WE8ISOICLUK ICL special version ISO8859-1 SB

WE8MACROMAN8 Mac Client 8-bit Extended Roman8 West European SB

WE8HP HP LaserJet 8-bit West European SB

YUG7ASCII ASCII 7-bit Yugoslavian SB
Middle Eastern



AR8ARABICMAC Mac Client 8-bit Latin/Arabic SB

AR8ARABICMACT Mac 8-bit Latin/Arabic SB

AR8MUSSAD768 Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic SB, ASCII

IW7IS960 Israeli Standard 960 7-bit Latin/Hebrew SB

IW8MACHEBREW Mac Client 8-bit Hebrew SB

TR8MACTURKISH Mac Client 8-bit Turkish SB

Universal Character Sets

Table A-10 lists the Oracle character sets that provide universal language support. They attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.

Table A-10 Universal Character Sets

Name Description Comments
AL16UTF16 Unicode 4.0 UTF-16 Universal character set MB, EURO, FIXED
AL32UTF8 Unicode 4.0 UTF-8 Universal character set MB, ASCII, EURO
UTF8 Unicode 3.0 UTF-8 Universal character set, CESU-8 compliant MB, ASCII, EURO
UTFE EBCDIC form of Unicode 3.0 UTF-8 Universal character set (UTF-EBCDIC) MB, EURO


Note:

CESU-8 defines an encoding scheme for Unicode that is identical to UTF-8 except for its representation of supplementary characters. In CESU-8, supplementary characters are represented as six-byte sequences that result from the transformation of each UTF-16 surrogate code unit into an eight-bit form that is similar to the UTF-8 transformation, but without first converting the input surrogate pairs to a scalar value. See Unicode Technical Report #26.
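
The difference described in the note can be illustrated with a minimal sketch. The function name cesu8_encode is hypothetical, and the code follows the general CESU-8 description above rather than Oracle's own implementation: each supplementary character is split into its UTF-16 surrogate pair, and each surrogate is then written as a three-byte sequence.

```python
def cesu8_encode(text: str) -> bytes:
    """Encode text as CESU-8 (hypothetical helper, for illustration)."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp >= 0x10000:
            # Supplementary character: derive the UTF-16 surrogate pair.
            cp -= 0x10000
            high = 0xD800 + (cp >> 10)
            low = 0xDC00 + (cp & 0x3FF)
            # Encode each surrogate separately as a 3-byte sequence.
            for unit in (high, low):
                out += chr(unit).encode("utf-8", "surrogatepass")
        else:
            # Characters in the Basic Multilingual Plane match UTF-8.
            out += ch.encode("utf-8")
    return bytes(out)


if __name__ == "__main__":
    ch = "\U00010400"  # a supplementary character
    print(ch.encode("utf-8").hex())   # four bytes in UTF-8
    print(cesu8_encode(ch).hex())     # six bytes in CESU-8
```

For characters outside the supplementary range, the two encodings produce identical bytes, which is why the distinction matters only for data that contains supplementary characters.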

Character Set Conversion Support

The following character set encodings are supported for conversion only. They cannot be used as the database or national character set:

AL16UTF16LE
ISO2022-CN
ISO2022-JP
ISO2022-KR
HZ-GB-2312

You can use these character sets as the source_char_set or dest_char_set in the CONVERT function.

See Oracle Database SQL Reference and "The CONVERT Function" for more information about the CONVERT function.

Subsets and Supersets

Table A-11 lists common subset/superset relationships.

Table A-11 Subset-Superset Pairs

Subset Superset
AR8ADOS710 AR8ADOS710T
AR8ADOS720 AR8ADOS720T
AR8ADOS720T AR8ADOS720
AR8APTEC715 AR8APTEC715T
AR8ARABICMACT AR8ARABICMAC
AR8ISO8859P6 AR8ASMO708PLUS
AR8ISO8859P6 AR8ASMO8X
AR8MUSSAD768 AR8MUSSAD768T
AR8MUSSAD768T AR8MUSSAD768
AR8NAFITHA711 AR8NAFITHA711T
AR8NAFITHA721 AR8NAFITHA721T
AR8SAKHR707 AR8SAKHR707T
AR8SAKHR707T AR8SAKHR707
BLT8CP921 BLT8ISO8859P13
BLT8CP921 LT8MSWIN921
D7DEC D7SIEMENS9780X
D7SIEMENS9780X D7DEC
DK7SIEMENS9780X N7SIEMENS9780X
I7DEC I7SIEMENS9780X
I7SIEMENS9780X IW8EBCDIC424
IW8EBCDIC424 IW8EBCDIC1086
KO16KSC5601 KO16MSWIN949
LT8MSWIN921 BLT8ISO8859P13
LT8MSWIN921 BLT8CP921
N7SIEMENS9780X DK7SIEMENS9780X
US7ASCII See Table A-12, "US7ASCII Supersets".
UTF8 AL32UTF8
WE8DEC TR8DEC
WE8DEC WE8NCR4970
WE8ISO8859P1 WE8MSWIN1252
WE8ISO8859P9 TR8MSWIN1254
WE8NCR4970 TR8DEC
WE8NCR4970 WE8DEC
WE8PC850 WE8PC858

US7ASCII is a special case because so many other character sets are supersets of it. Table A-12 lists supersets for US7ASCII.

Table A-12 US7ASCII Supersets

Supersets Supersets Supersets
AL32UTF8 EE8ISO8859P2 RU8BESTA
AR8ADOS710 EE8MACCES RU8PC855
AR8ADOS710T EE8MACCROATIANS RU8PC866
AR8ADOS720 EE8MSWIN1250 SE8ISO8859P3
AR8ADOS720T EE8PC852 TH8MACTHAIS
AR8APTEC715 EL8DEC TH8TISASCII
AR8APTEC715T EL8ISO8859P7 TR8DEC
AR8ARABICMACS EL8MACGREEKS TR8MACTURKISHS
AR8ASMO708PLUS EL8MSWIN1253 TR8MSWIN1254
AR8ASMO8X EL8PC437S TR8PC857
AR8HPARABIC8T EL8PC851 US8PC437
AR8ISO8859P6 EL8PC869 UTF8
AR8MSWIN1256 ET8MSWIN923 VN8MSWIN1258
AR8MUSSAD768 HU8ABMOD VN8VN3
AR8MUSSAD768T HU8CWI2 WE8DEC
AR8NAFITHA711 IN8ISCII WE8DG
AR8NAFITHA711T IS8PC861 WE8ISO8859P1
AR8NAFITHA721 IW8ISO8859P8 WE8ISO8859P15
AR8NAFITHA721T IW8MACHEBREWS WE8ISO8859P9
AR8SAKHR706 IW8MSWIN1255 WE8MACROMAN8S
AR8SAKHR707 IW8PC1507 WE8MSWIN1252
AR8SAKHR707T JA16EUC WE8NCR4970
AZ8ISO8859PE JA16SJIS WE8NEXTSTEP
BG8MSWIN JA16VMS WE8PC850
BG8PC437S KO16KSC5601 WE8PC858
BLT8CP921 KO16KSCCS WE8PC860
BLT8ISO8859P13 KO16MSWIN949 WE8ROMAN8
BLT8MSWIN1257 LA8ISO6937 ZHS16CGB231280
BLT8PC775 LA8PASSPORT ZHS16GBK
BN8BSCII LT8MSWIN921 ZHT16BIG5
CDN8PC863 LT8PC772 ZHT16CCDC
CEL8ISO8859P14 LT8PC774 ZHT16DBT
CL8ISO8859P5 LV8PC1117 ZHT16HKSCS
CL8KOI8R LV8PC8LR ZHT16MSWIN950
CL8KOI8U LV8RST104090 ZHT32EUC
CL8ISOIR111 N8PC865 ZHT32SOPS
CL8MACCYRILLICS NE8ISO8859P10 ZHT32TRIS
CL8MSWIN1251 NEE8ISO8859P4 ZHS32GB18030

Language and Character Set Detection Support

Table A-13 displays the languages and character sets that are supported by the language and character set detection in the Character Set Scanner utilities (CSSCAN and LCSSCAN) and the Globalization Development Kit (GDK).

Each language has several character sets that can be detected.

When the binary values for a language match two or more encodings that have a subset/superset relationship, the subset character set is returned. For example, if the language is German and all characters are 7-bit, then US7ASCII is returned instead of WE8MSWIN1252, WE8ISO8859P15, or WE8ISO8859P1.

When the character set is determined to be UTF-8, the Oracle character set UTF8 is returned by default unless 4-byte characters (supplementary characters) are detected within the text. If 4-byte characters are detected, then the character set is reported as AL32UTF8.
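
The two reporting rules above can be sketched as follows. This is only an illustration of the rules as stated, not the detection algorithm used by CSSCAN, LCSSCAN, or the GDK; the function name report_charset is hypothetical, and the input is assumed to be either 7-bit data or valid UTF-8.

```python
def report_charset(data: bytes) -> str:
    """Name the character set to report for 7-bit or UTF-8 input."""
    # Subset rule: if every byte is 7-bit, report the subset US7ASCII.
    if all(b < 0x80 for b in data):
        return "US7ASCII"

    # Assumption: the input is UTF-8; this raises UnicodeDecodeError otherwise.
    data.decode("utf-8")

    # In valid UTF-8, a byte of 0xF0 or above can only be the lead byte
    # of a 4-byte sequence, that is, a supplementary character.
    if any(b >= 0xF0 for b in data):
        return "AL32UTF8"
    return "UTF8"


if __name__ == "__main__":
    print(report_charset(b"Strasse"))
    print(report_charset("Stra\u00dfe".encode("utf-8")))
    print(report_charset("\U00010400".encode("utf-8")))
```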

Table A-13 Languages and Character Sets Supported by CSSCAN, LCSSCAN, and GDK

Language Character Sets
Arabic AL16UTF16, AL32UTF8, AR8ISO8859P6, AR8MSWIN1256, UTF8
Bulgarian AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8MSWIN1251, UTF8
Catalan AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Croatian AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Czech AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Danish AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Dutch AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
English AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Estonian AL16UTF16, AL32UTF8, NEE8ISO8859P4, UTF8
Finnish AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
French AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
German AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Greek AL16UTF16, AL32UTF8, EL8ISO8859P7, EL8MSWIN1253, UTF8
Hebrew AL16UTF16, AL32UTF8, IW8ISO8859P8, IW8MSWIN1255, UTF8
Hungarian AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Italian AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Japanese AL16UTF16, AL32UTF8, ISO2022-JP, JA16EUC, JA16SJIS, UTF8
Korean AL16UTF16, AL32UTF8, ISO2022-KR, KO16KSC5601, KO16MSWIN949, UTF8
Malay AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Norwegian AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Polish AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Portuguese AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Romanian AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Russian AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8KOI8R, CL8MSWIN1251, UTF8
Simplified Chinese AL16UTF16, AL32UTF8, HZ-GB-2312, UTF8, ZHS16GBK, ZHS16CGB231280
Slovak AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Spanish AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Swedish AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Thai AL16UTF16, AL32UTF8, TH8TISASCII, UTF8
Traditional Chinese AL16UTF16, AL32UTF8, UTF8, ZHT16MSWIN950
Turkish AL16UTF16, AL32UTF8, TR8MSWIN1254, UTF8, WE8ISO8859P9

Linguistic Sorts

Oracle offers two kinds of linguistic sorts, monolingual and multilingual. In addition, monolingual sorts can be extended to handle special cases. These special cases (represented with a prefix X) typically mean that the characters are sorted differently from their ASCII values. For example, ch and ll are treated as a single character in XSPANISH.

All of the linguistic sorts can also be performed as case-insensitive or accent-insensitive by appending _CI or _AI to the linguistic sort name.
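
To illustrate what treating ch and ll as single characters means for ordering, the following sketch builds a sort key based on the traditional Spanish alphabet, in which ch follows c and ll follows l. It is a simplified illustration, not Oracle's XSPANISH definition: the alphabet list and the function name are assumptions, words are lowercased before comparison, and characters outside the list (for example, accented vowels) are simply placed last.

```python
from typing import List

# Traditional Spanish alphabet with the digraphs "ch" and "ll" as letters
# (assumed ordering, for illustration only).
ALPHABET = [
    "a", "b", "c", "ch", "d", "e", "f", "g", "h", "i", "j", "k", "l", "ll",
    "m", "n", "\u00f1", "o", "p", "q", "r", "s", "t", "u", "v", "w", "x",
    "y", "z",
]
RANK = {letter: index for index, letter in enumerate(ALPHABET)}


def traditional_spanish_key(word: str) -> List[int]:
    """Return a sort key in which "ch" and "ll" each count as one letter."""
    word = word.lower()
    key = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair in ("ch", "ll"):
            key.append(RANK[pair])
            i += 2
        else:
            # Unknown characters sort after every listed letter.
            key.append(RANK.get(word[i], len(ALPHABET)))
            i += 1
    return key


if __name__ == "__main__":
    words = ["llama", "luz", "chico", "cuna", "dama", "lima"]
    print(sorted(words))                               # code point order
    print(sorted(words, key=traditional_spanish_key))  # digraph-aware order
```

With the digraph-aware key, chico sorts after cuna and llama sorts after luz, whereas a plain code point sort places them the other way around.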

Table A-14 lists the monolingual linguistic sorts supported by the Oracle server.


See Also:

Table A-1, "Oracle Supported Languages" for a list of the default sort for each language

Table A-14 Monolingual Linguistic Sorts

Basic Name Extended Name Special Cases
ARABIC - -
ARABIC_MATCH - -
ARABIC_ABJ_SORT - -
ARABIC_ABJ_MATCH - -
ASCII7 - -
AZERBAIJANI XAZERBAIJANI i, I, lowercase i without dot, uppercase I with dot
BENGALI - -
BIG5 - -
BINARY - -
BULGARIAN - -
CATALAN XCATALAN æ, AE, ß
CROATIAN XCROATIAN D, L, N, d, l, n, ß
CZECH XCZECH ch, CH, Ch, ß
CZECH_PUNCTUATION XCZECH_PUNCTUATION ch, CH, Ch, ß
DANISH XDANISH A, ß, Å, å
DUTCH XDUTCH ij, IJ
EBCDIC - -
EEC_EURO - -
EEC_EUROPA3 - -
ESTONIAN - -
FINNISH - -
FRENCH XFRENCH -
GERMAN XGERMAN ß
GERMAN_DIN XGERMAN_DIN ß, ä, ö, ü, Ä, Ö, Ü
GBK - -
GREEK - -
HEBREW - -
HKSCS - -
HUNGARIAN XHUNGARIAN cs, gy, ny, sz, ty, zs, ß, CS, Cs, GY, Gy, NY, Ny, SZ, Sz, TY, Ty, ZS, Zs
ICELANDIC - -
INDONESIAN - -
ITALIAN - -
LATIN - -
LATVIAN - -
LITHUANIAN - -
MALAY - -
NORWEGIAN - -
POLISH - -
PUNCTUATION XPUNCTUATION -
ROMANIAN - -
RUSSIAN - -
SLOVAK XSLOVAK dz, DZ, Dz, ß (caron)
SLOVENIAN XSLOVENIAN ß
SPANISH XSPANISH ch, ll, CH, Ch, LL, Ll
SWEDISH - -
SWISS XSWISS ß
TURKISH XTURKISH æ, AE, ß
UKRAINIAN - -
UNICODE_BINARY - -
VIETNAMESE - -
WEST_EUROPEAN XWEST_EUROPEAN ß

Table A-15 lists the multilingual linguistic sorts available in Oracle. All of them include GENERIC_M (an ISO standard for sorting Latin-based characters) as a base. Multilingual linguistic sorts are used for a specific primary language together with Latin-based characters. For example, KOREAN_M sorts Korean and Latin-based characters, but it does not collate Chinese, Thai, or Japanese characters.

Table A-15 Multilingual Linguistic Sorts

Sort Name Description
CANADIAN_M Canadian French sort supports reverse secondary, special expanding characters
DANISH_M Danish sort supports sorting uppercase characters before lowercase characters
FRENCH_M French sort supports reverse sort for secondary
GENERIC_M Generic sorting order which is based on ISO14651 and Unicode canonical equivalence rules but excluding compatible equivalence rules
JAPANESE_M Japanese sort supports SJIS character set order and EUC characters which are not included in SJIS
KOREAN_M Korean sort: Hangul characters are based on Unicode binary order. Hanja characters are based on pronunciation order. All Hangul characters sort before Hanja characters
SPANISH_M Traditional Spanish sort supports special contracting characters
THAI_M Thai sort supports swap characters for some vowels and consonants
SCHINESE_RADICAL_M Simplified Chinese sort based on radical as primary order and number of strokes order as secondary order
SCHINESE_STROKE_M Simplified Chinese sort uses number of strokes as primary order and radical as secondary order
SCHINESE_PINYIN_M Simplified Chinese PinYin sorting order
TCHINESE_RADICAL_M Traditional Chinese sort based on radical as primary order and number of strokes order as secondary order
TCHINESE_STROKE_M Traditional Chinese sort uses number of strokes as primary order and radical as secondary order. It supports supplementary characters.

Calendar Systems

By default, most territory definitions use the Gregorian calendar system. Table A-16 lists the other calendar systems supported by the Oracle server.

Table A-16 Supported Calendar Systems

Name Default Date Format Character Set Used For Default Date Format
Japanese Imperial EEYYMMDD JA16EUC
ROC Official EEyymmdd ZHT32EUC
Thai Buddha dd month EE yyyy TH8TISASCII
Persian DD Month YYYY AR8ASMO8X
Arabic Hijrah DD Month YYYY AR8ISO8859P6
English Hijrah DD Month YYYY AR8ISO8859P6

Figure A-1 shows how March 27, 1998 appears in Japanese Imperial.

Figure A-1 Japanese Imperial Example


Time Zone Names

Table A-17 shows the time zone names in the default time zone file that is supplied with Oracle Database. The default time zone file is $ORACLE_HOME/oracore/zoneinfo/timezlrg.dat. Oracle also supplies a smaller time zone file, $ORACLE_HOME/oracore/zoneinfo/timezone.dat. See Chapter 4, "Datetime Datatypes and Time Zone Support" for more information regarding time zone files.

Table A-17 Time Zone Names

Time Zone Name Is It in the Smaller Time Zone File? Time Zone Name Is It in the Smaller Time Zone File?
Africa/Algiers No Australia/Perth Yes
Africa/Cairo Yes Australia/Queensland Yes
Africa/Casablanca No Australia/South Yes
Africa/Ceuta No Australia/Sydney Yes
Africa/Djibouti No Australia/Tasmania Yes
Africa/Freetown No Australia/Victoria Yes
Africa/Johannesburg No Australia/West Yes
Africa/Khartoum No Australia/Yancowinna Yes
Africa/Mogadishu No Brazil/Acre Yes
Africa/Nairobi No Brazil/DeNoronha Yes
Africa/Nouakchott No Brazil/East Yes
Africa/Tripoli Yes Brazil/West Yes
Africa/Tunis No CET Yes
Africa/Windhoek No CST Yes
America/Adak Yes CST6CDT Yes
America/Anchorage Yes Canada/Atlantic Yes
America/Anguilla No Canada/Central Yes
America/Araguaina No Canada/East-Saskatchewan Yes
America/Aruba No Canada/Eastern Yes
America/Asuncion No Canada/Mountain Yes
America/Atka Yes Canada/Newfoundland Yes
America/Belem No Canada/Pacific Yes
America/Boa_Vista No Canada/Saskatchewan Yes
America/Bogota No Canada/Yukon Yes
America/Boise No Chile/Continental Yes
America/Buenos_Aires No Chile/EasterIsland Yes
America/Cambridge_Bay No Cuba Yes
America/Cancun No EET Yes
America/Caracas No EST Yes
America/Cayenne No EST5EDT Yes
America/Cayman No Egypt Yes
America/Chicago Yes Eire Yes
America/Chihuahua No Etc/GMT Yes
America/Costa_Rica No Etc/GMT+0 Yes
America/Cuiaba No Etc/GMT+1 Yes
America/Curacao No Etc/GMT+10 Yes
America/Dawson No Etc/GMT+11 Yes
America/Dawson_Creek No Etc/GMT+12 Yes
America/Denver Yes Etc/GMT+2 Yes
America/Detroit Yes Etc/GMT+3 Yes
America/Edmonton Yes Etc/GMT+4 Yes
America/El_Salvador No Etc/GMT+5 Yes
America/Ensenada Yes Etc/GMT+6 Yes
America/Fort_Wayne Yes Etc/GMT+7 Yes
America/Fortaleza No Etc/GMT+8 Yes
America/Godthab No Etc/GMT+9 Yes
America/Goose_Bay No Etc/GMT-0 Yes
America/Grand_Turk No Etc/GMT-1 Yes
America/Guadeloupe No Etc/GMT-10 Yes
America/Guatemala No Etc/GMT-11 Yes
America/Guayaquil No - -
America/Halifax Yes Etc/GMT-12 Yes
America/Havana Yes Etc/GMT-13 Yes
America/Indiana/Indianapolis Yes Etc/GMT-2 Yes
America/Indiana/Knox No Etc/GMT-3 Yes
America/Indiana/Marengo No Etc/GMT-4 Yes
America/Indiana/Vevay No Etc/GMT-5 Yes
America/Indianapolis Yes Etc/GMT-6 Yes
America/Inuvik No Etc/GMT-7 Yes
America/Iqaluit No Etc/GMT-8 Yes
America/Jamaica Yes Etc/GMT-9 Yes
America/Juneau No Etc/GMT0 Yes
America/Knox_IN No Etc/Greenwich Yes
America/La_Paz No Europe/Amsterdam No
America/Lima No Europe/Athens No
America/Los_Angeles Yes Europe/Belfast No
America/Louisville No Europe/Belgrade No
America/Maceio No Europe/Berlin No
America/Managua No Europe/Bratislava No
America/Manaus Yes Europe/Brussels No
America/Martinique No Europe/Bucharest No
America/Mazatlan Yes Europe/Budapest No
America/Mexico_City Yes Europe/Copenhagen No
America/Miquelon No Europe/Dublin Yes
America/Montevideo No Europe/Gibraltar No
America/Montreal Yes Europe/Helsinki No
America/Montserrat No Europe/Istanbul Yes
America/New_York Yes Europe/Kaliningrad No
America/Nome No Europe/Kiev No
America/Noronha Yes Europe/Lisbon Yes
America/Panama No Europe/Ljubljana No
America/Phoenix Yes Europe/London Yes
America/Porto_Acre No Europe/Luxembourg No
America/Porto_Velho No Europe/Madrid No
America/Puerto_Rico No Europe/Minsk No
America/Rankin_Inlet No Europe/Monaco No
America/Regina Yes Europe/Moscow Yes
America/Rio_Branco Yes - -
America/Santiago Yes Europe/Oslo No
America/Sao_Paulo Yes Europe/Paris No
America/Scoresbysund No Europe/Prague No
America/Shiprock Yes Europe/Riga No
America/St_Johns Yes Europe/Rome No
America/St_Thomas No Europe/Samara No
America/Swift_Current No Europe/San_Marino No
America/Tegucigalpa No Europe/Sarajevo No
America/Thule No Europe/Simferopol No
America/Thunder_Bay No Europe/Skopje No
America/Tijuana Yes Europe/Sofia No
America/Tortola No Europe/Stockholm No
America/Vancouver Yes Europe/Tallinn No
America/Virgin No Europe/Tirane No
America/Whitehorse Yes Europe/Vatican No
America/Winnipeg Yes Europe/Vienna No
America/Yellowknife No Europe/Vilnius No
Arctic/Longyearbyen No Europe/Warsaw Yes
Asia/Aden No Europe/Zagreb No
Asia/Almaty No Europe/Zurich No
Asia/Amman No GB Yes
Asia/Anadyr No GB-Eire Yes
Asia/Aqtau No GMT Yes
Asia/Aqtobe No GMT+0 Yes
Asia/Baghdad No GMT-0 Yes
Asia/Bahrain No GMT0 Yes
Asia/Baku No Greenwich Yes
Asia/Bangkok No HST Yes
Asia/Beirut No Hongkong Yes
Asia/Bishkek No Iceland Yes
Asia/Calcutta Yes Indian/Chagos No
Asia/Chongqing No - -
Asia/Chungking No Indian/Christmas No
Asia/Dacca No Indian/Cocos No
Asia/Damascus No Indian/Mayotte No
Asia/Dhaka No - -
Asia/Dubai No Indian/Reunion No
Asia/Gaza No Iran Yes
Asia/Harbin No Israel Yes
Asia/Hong_Kong Yes Jamaica Yes
Asia/Irkutsk No Japan Yes
Asia/Istanbul Yes Kwajalein Yes
Asia/Jakarta No Libya Yes
Asia/Jayapura No MET Yes
Asia/Jerusalem Yes MST Yes
Asia/Kabul No MST7MDT Yes
Asia/Kamchatka No Mexico/BajaNorte Yes
Asia/Karachi No Mexico/BajaSur Yes
Asia/Kashgar No Mexico/General Yes
Asia/Krasnoyarsk No NZ Yes
Asia/Kuala_Lumpur No NZ-CHAT Yes
Asia/Kuching No Navajo Yes
Asia/Kuwait No PRC Yes
Asia/Macao No PST Yes
Asia/Macau No - -
Asia/Magadan No PST8PDT Yes
Asia/Manila No Pacific/Auckland Yes
Asia/Muscat No Pacific/Chatham Yes
Asia/Nicosia No Pacific/Easter Yes
Asia/Novosibirsk No Pacific/Fakaofo No
Asia/Omsk No Pacific/Fiji No
Asia/Qatar No Pacific/Gambier No
Asia/Rangoon No Pacific/Guam No
Asia/Riyadh Yes Pacific/Honolulu Yes
Asia/Saigon No Pacific/Johnston No
Asia/Seoul Yes Pacific/Kiritimati No
Asia/Shanghai Yes Pacific/Kwajalein Yes
Asia/Singapore Yes Pacific/Marquesas No
Asia/Taipei Yes Pacific/Midway No
Asia/Tashkent No Pacific/Niue No
Asia/Tbilisi No Pacific/Norfolk No
Asia/Tehran Yes Pacific/Noumea No
Asia/Tel_Aviv Yes Pacific/Pago_Pago Yes
Asia/Tokyo Yes Pacific/Pitcairn No
Asia/Ujung_Pandang No Pacific/Rarotonga No
Asia/Urumqi No Pacific/Saipan No
Asia/Vladivostok No Pacific/Samoa Yes
Asia/Yakutsk No Pacific/Tahiti No
Asia/Yekaterinburg No Pacific/Tongatapu No
Asia/Yerevan No Pacific/Wake No
Atlantic/Azores No Pacific/Wallis No
Atlantic/Bermuda No Poland Yes
Atlantic/Canary No Portugal Yes
Atlantic/Faeroe No ROC Yes
Atlantic/Madeira No ROK Yes
Atlantic/Reykjavik Yes Singapore Yes
Atlantic/St_Helena No Turkey Yes
Atlantic/Stanley No US/Alaska Yes
Australia/ACT Yes US/Aleutian Yes
Australia/Adelaide Yes US/Arizona Yes
Australia/Brisbane Yes US/Central Yes
Australia/Broken_Hill Yes US/East-Indiana Yes
Australia/Canberra Yes US/Eastern Yes
Australia/Darwin Yes US/Hawaii Yes
Australia/Hobart Yes US/Indiana-Starke No
Australia/LHI Yes US/Michigan Yes
Australia/Lindeman Yes US/Mountain Yes
Australia/Lord_Howe Yes US/Pacific Yes
Australia/Melbourne Yes US/Pacific-New Yes
Australia/NSW Yes US/Samoa Yes
Australia/North Yes UTC No
- - W-SU Yes
- - WET Yes
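
As an illustration of how the "Is It in the Smaller Time Zone File?" column might be used, the following Python sketch reports which of the time zone names an application relies on are absent from the smaller file. The dictionary holds only a handful of rows copied from Table A-17, and the names IN_SMALLER_FILE and zones_missing_from_smaller_file are hypothetical helpers, not part of any Oracle tool.

```python
# Subset of Table A-17. True means the name is also in the smaller
# time zone file. Only a few rows are copied here for illustration.
IN_SMALLER_FILE = {
    "Africa/Algiers": False,
    "Africa/Cairo": True,
    "America/New_York": True,
    "Asia/Calcutta": True,
    "Asia/Tokyo": True,
    "Europe/London": True,
    "Europe/Paris": False,
    "UTC": False,
}


def zones_missing_from_smaller_file(zone_names):
    """Return the names that Table A-17 marks as absent from the smaller file.

    Raises KeyError for a name that is not in the subset above, so an
    unknown name is never silently treated as present or absent.
    """
    return [name for name in zone_names if not IN_SMALLER_FILE[name]]


wanted = ["America/New_York", "Europe/Paris", "Asia/Tokyo"]
print(zones_missing_from_smaller_file(wanted))  # ['Europe/Paris']
```

A non-empty result suggests that the application depends on names that only the default (larger) file provides.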

Obsolete Locale Data

This section contains information about obsolete linguistic sorts, character sets, languages, and territories. The obsolete linguistic sort, language, and territory definitions are still available. However, they are supported for backward compatibility only; they may be desupported in a future release. You can obtain a listing of the obsolete character sets, languages, territories, and linguistic sorts for the current database release by querying the V$NLS_VALID_VALUES view.
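
The query against V$NLS_VALID_VALUES could look like the following Python sketch. It assumes that the view exposes PARAMETER, VALUE, and ISDEPRECATED columns (check Oracle Database Reference for your release) and that cursor is a Python DB-API 2.0 cursor opened on an Oracle connection by whichever driver you use. The function name deprecated_locale_values is hypothetical.

```python
# Assumption: V$NLS_VALID_VALUES has an ISDEPRECATED column whose value
# is the string 'TRUE' for obsolete entries. Verify this in the
# Oracle Database Reference for your release before relying on it.
DEPRECATED_VALUES_SQL = (
    "SELECT parameter, value "
    "FROM v$nls_valid_values "
    "WHERE isdeprecated = 'TRUE' "
    "ORDER BY parameter, value"
)


def deprecated_locale_values(cursor):
    """Run the query and group the returned values by the PARAMETER column.

    `cursor` is any DB-API 2.0 cursor; no specific Oracle driver is assumed.
    """
    cursor.execute(DEPRECATED_VALUES_SQL)
    grouped = {}
    for parameter, value in cursor.fetchall():
        grouped.setdefault(parameter, []).append(value)
    return grouped
```

The SQL text in DEPRECATED_VALUES_SQL can also be run on its own from any SQL client.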

Obsolete Linguistic Sorts

Table A-18 contains linguistic sorts that have been desupported in Oracle Database 10g.

Table A-18 Obsolete Linguistic Sorts in Oracle Database 10g

Obsolete Sort Name Replacement Sort
THAI_TELEPHONE THAI_M
THAI_DICTIONARY THAI_M
CANADIAN FRENCH CANADIAN_M
JAPANESE JAPANESE_M

Obsolete Territories

Table A-19 contains territories that have been desupported in Oracle Database 10g.

Table A-19 Obsolete Territories

Obsolete Territory Name Replacement Territory
CIS RUSSIA
MACEDONIA FYR MACEDONIA
YUGOSLAVIA SERBIA AND MONTENEGRO
CZECHOSLOVAKIA CZECH REPUBLIC or SLOVAKIA

Obsolete Languages

Table A-20 contains languages that have been desupported in Oracle Database 10g.

Table A-20 Obsolete Languages

Obsolete Language Name Replacement Language
BENGALI BANGLA

New Names for Obsolete Character Sets

Table A-21 lists the obsolete character sets. If you reference any of these character sets in your code, then replace them with their new name.

Table A-21 New Names for Obsolete Character Sets

Old Name New Name
AL24UTFFSS UTF8, AL32UTF8
AR8MSAWIN AR8MSWIN1256
CL8EBCDIC875S CL8EBCDIC875R
CL8MSWINDOW31 CL8MSWIN1251
EL8EBCDIC875S EL8EBCDIC875R
JVMS JA16VMS
JEUC JA16EUC
SJIS JA16SJIS
JDBCS JA16DBCS
KSC5601 KO16KSC5601
KDBCS KO16DBCS
CGB2312-80 ZHS16CGB231280
CNS 11643-86 ZHT32EUC
JA16EUCFIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS32EUCFIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS16GBKFIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
JA16DBCSFIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
KO16DBCSFIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS16DBCSFIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS16CGB231280FIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHT16DBCSFIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
KO16KSC5601FIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
JA16SJISFIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHT16BIG5FIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHT32TRISFIXED None. Replaced by the new national character sets, UTF8 and AL16UTF16.
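
One way to find references to the old names is to scan scripts and configuration files for them. The Python sketch below is a minimal illustration: the mapping is copied from the rows of Table A-21 that have exactly one new name, while the function name find_obsolete_character_sets and the whole-identifier matching rule are assumptions made for this example. The old Unicode name with two candidate replacements and the FIXED character sets are left out because they need a manual decision.

```python
import re

# Rows of Table A-21 that map an old name to exactly one new name.
RENAMED_CHARACTER_SETS = {
    "AR8MSAWIN": "AR8MSWIN1256",
    "CL8EBCDIC875S": "CL8EBCDIC875R",
    "CL8MSWINDOW31": "CL8MSWIN1251",
    "EL8EBCDIC875S": "EL8EBCDIC875R",
    "JVMS": "JA16VMS",
    "JEUC": "JA16EUC",
    "SJIS": "JA16SJIS",
    "JDBCS": "JA16DBCS",
    "KSC5601": "KO16KSC5601",
    "KDBCS": "KO16DBCS",
    "CGB2312-80": "ZHS16CGB231280",
    "CNS 11643-86": "ZHT32EUC",
}

# Match whole identifiers only, so that JA16SJIS is not reported
# merely because it contains SJIS.
_PATTERN = re.compile(
    r"(?<![A-Za-z0-9_])("
    + "|".join(map(re.escape, RENAMED_CHARACTER_SETS))
    + r")(?![A-Za-z0-9_])"
)


def find_obsolete_character_sets(text):
    """Return (line_number, old_name, new_name) for each old name found."""
    hits = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        for match in _PATTERN.finditer(line):
            old = match.group(1)
            hits.append((line_number, old, RENAMED_CHARACTER_SETS[old]))
    return hits
```

The function only reports matches; it deliberately does not rewrite anything, so each hit can be reviewed before the name is changed.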

AL24UTFFSS Character Set Desupported

The Unicode character set AL24UTFFSS was desupported in Oracle9i. AL24UTFFSS was introduced in version 7 as the Unicode character set supporting the UTF-8 encoding scheme based on the Unicode standard 1.1, which is now obsolete. In Oracle Database 10g, Oracle offers the Unicode database character set AL32UTF8, which is based on Unicode 4.0, and UTF8, which is based on Unicode 3.0.

The migration path for an existing AL24UTFFSS database is to upgrade to UTF8 prior to upgrading to Oracle9i. As with all migrations to a new database character set, Oracle recommends that you use the Character Set Scanner for data analysis before attempting to migrate your existing database character set to UTF8.

Updates to the Oracle Language and Territory Definition Files

Changes have been made to the content of some of the language and territory definition files in Oracle Database 10g. These updates correct legacy definitions that no longer match the local conventions in some of the Oracle supported languages and territories. The changes include modifications to currency symbols, month names, and group separators. For example, the local currency symbol for Brazil has been updated from Cr$ to R$ in Oracle Database 10g.

Please refer to the "Oracle Language and Territory definition changes" table documented in the $ORACLE_HOME/nls/data/old/data_changes.html file for a detailed list of the changes.

Oracle Database 10g customers should review their existing application code to make sure that the correct cultural conventions that are defined in Oracle Database 10g are being used. For customers who may not be able to make the necessary code changes to support their applications, Oracle offers Oracle9i Database locale definition files with Oracle Database 10g.

To revert to the Oracle9i Database language and territory behavior, perform the following steps:

  1. Shut down the database.

  2. Run the script cr9idata.pl from the $ORACLE_HOME/nls/data/old directory.

  3. Set the ORA_NLS10 environment variable to the newly created $ORACLE_HOME/nls/data/9idata directory.

  4. Restart the database.

Repeat steps 2 and 3 on every Oracle Database 10g client that needs to revert to the Oracle9i Database definition files.
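
The paths named in steps 2 and 3 can be derived from ORACLE_HOME. The Python sketch below only computes those paths and a modified copy of the environment; it does not shut down or restart the database and does not run cr9idata.pl. The function names are hypothetical, and UNIX-style paths are assumed.

```python
import os
import posixpath


def oracle9i_locale_paths(oracle_home):
    """Derive the paths named in the revert steps from an ORACLE_HOME value."""
    data_dir = posixpath.join(oracle_home, "nls", "data")
    return {
        # Step 2: directory from which cr9idata.pl is run.
        "script_dir": posixpath.join(data_dir, "old"),
        "script_name": "cr9idata.pl",
        # Step 3: value for the ORA_NLS10 environment variable.
        "ORA_NLS10": posixpath.join(data_dir, "9idata"),
    }


def environment_with_oracle9i_locale(oracle_home, base_env=None):
    """Return a copy of the environment with ORA_NLS10 set as in step 3."""
    env = dict(os.environ if base_env is None else base_env)
    env["ORA_NLS10"] = oracle9i_locale_paths(oracle_home)["ORA_NLS10"]
    return env
```

Because the second function returns a copy, the calling process keeps its own environment unchanged; the copy can be handed to whatever starts the database or client.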

Oracle strongly recommends that customers use the Oracle Database 10g locale definition files; Oracle9i Database locale definition files will be desupported in a future release.