Turkish National Language Support

This section describes the plug-in converters for Turkish national language support and the alphabet each converter supports.

Introduction

To convert a Turkish SMS message using the National Language Encoding, Symbian provides three Turkish plug-in converters:

  • Default GSM7 Turkish Single Converter. The highlighted boxes in Figure 1 show that it supports the default GSM 7-bit alphabet, the Turkish single shift alphabet and the set of lossy conversions supported by the standard converter. This converter is identified by the KCharacterSetIdentifierTurkishSingleSms7Bit UID, which is defined in the charconv.h header file.

    Figure 1

    Figure: Alphabet of the Default GSM7 Turkish Single Converter

  • Turkish Locking GSM7 Extension Converter. The highlighted boxes in Figure 2 show that it supports the Turkish locking shift alphabet, default GSM extension alphabet and the set of lossy conversions supported by the standard converter. This converter is identified by the KCharacterSetIdentifierTurkishLockingSms7Bit UID, which is defined in the charconv.h header file.

    Figure 2

    Figure: Alphabet of the Turkish Locking GSM7 Extension Converter

  • Turkish Locking Single Converter. The highlighted boxes in Figure 3 show that it supports the Turkish locking shift alphabet, Turkish single shift alphabet and the set of lossy conversions supported by the standard converter. This converter is identified by the KCharacterSetIdentifierTurkishLockingAndSingleSms7Bit UID, which is defined in the charconv.h header file.

    Figure 3

    Figure: Alphabet of the Turkish Locking Single Converter

Note: For all of the converters, any Unicode character that has no defined mapping is converted to a question mark (?), GSM code 0x3F. Any GSM code outside the range 0x00 to 0x7F is converted to the Unicode replacement character 0xFFFD.
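
The two fallback rules in the note can be sketched in a few lines of Python. This is an illustration only, not the Symbian implementation: the function names and the three-entry sample table are hypothetical, and the question mark is assumed to sit at 0x3F, its position in the GSM 03.38 default alphabet.

```python
# Hypothetical sketch of the fallback rules; not the Symbian converter code.
GSM_QUESTION_MARK = 0x3F          # '?' in the GSM 03.38 default alphabet
UNICODE_REPLACEMENT = "\uFFFD"    # Unicode replacement character

# Deliberately tiny sample table (assumption): three letters whose GSM code
# equals their ASCII code.
UNICODE_TO_GSM = {"A": 0x41, "B": 0x42, "C": 0x43}
GSM_TO_UNICODE = {code: ch for ch, code in UNICODE_TO_GSM.items()}


def encode_char(ch):
    """Return the GSM code for ch, or the '?' code if ch has no mapping."""
    return UNICODE_TO_GSM.get(ch, GSM_QUESTION_MARK)


def decode_code(code):
    """Return the character for a GSM code.

    Codes outside 0x00..0x7F become U+FFFD. Codes inside that range which
    this sample table does not list are also returned as U+FFFD, purely to
    keep the sketch short.
    """
    if not 0x00 <= code <= 0x7F:
        return UNICODE_REPLACEMENT
    return GSM_TO_UNICODE.get(code, UNICODE_REPLACEMENT)


print(hex(encode_char("\u4E2D")))   # a character with no mapping
print(repr(decode_code(0x80)))      # a code outside the 7-bit range
```

A real converter would carry the full alphabet tables; only the two fallback branches matter here.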

Alphabet

The Standard SMS Converter serves as the baseline for describing the character set of each Turkish converter. The following comparison tables show where the mappings of each Turkish converter differ from those of the Standard SMS Converter.

Table 1A Default GSM7 Turkish Single Converter vs. Standard SMS Converter – Unicode to GSM Code

Table 1B Default GSM7 Turkish Single Converter vs. Standard SMS Converter – GSM Code to Unicode

Table 2A Turkish Locking GSM7 Extension Converter vs. Standard SMS Converter – Unicode to GSM Code

Table 2B Turkish Locking GSM7 Extension Converter vs. Standard SMS Converter – GSM Code to Unicode

Table 3A Turkish Locking Single Converter vs. Standard SMS Converter – Unicode to GSM Code

Table 3B Turkish Locking Single Converter vs. Standard SMS Converter – GSM Code to Unicode

Default GSM7 Turkish Single Converter vs. Standard SMS Converter – Unicode to GSM Code

Table 1A

Columns 1 to 4: Default GSM7 Turkish Single Converter. Columns 5 to 8: Standard SMS Converter.

| Character | Unicode | GSM | Converted Character | Character | Unicode | GSM | Converted Character |
|---|---|---|---|---|---|---|---|
| | | | | ç LATIN SMALL LETTER C WITH CEDILLA | 0x00E7 | 0x09 | Ç LATIN CAPITAL LETTER C WITH CEDILLA |
| Ğ LATIN CAPITAL LETTER G WITH BREVE | 0x011E | 0x1B47 | Ğ LATIN CAPITAL LETTER G WITH BREVE | | | | |
| Ǧ LATIN CAPITAL LETTER G WITH CARON | 0x01E6 | 0x1B47 | Ğ LATIN CAPITAL LETTER G WITH BREVE | | | | |
| İ LATIN CAPITAL LETTER I WITH DOT ABOVE | 0x0130 | 0x1B49 | İ LATIN CAPITAL LETTER I WITH DOT ABOVE | | | | |
| Ş LATIN CAPITAL LETTER S WITH CEDILLA * | 0x015E | 0x1B53 | Ş LATIN CAPITAL LETTER S WITH CEDILLA * | | | | |
| ç LATIN SMALL LETTER C WITH CEDILLA | 0x00E7 | 0x1B63 | ç LATIN SMALL LETTER C WITH CEDILLA | | | | |
| ğ LATIN SMALL LETTER G WITH BREVE | 0x011F | 0x1B67 | ğ LATIN SMALL LETTER G WITH BREVE | | | | |
| ǧ LATIN SMALL LETTER G WITH CARON | 0x01E7 | 0x1B67 | ğ LATIN SMALL LETTER G WITH BREVE | | | | |
| ı LATIN SMALL LETTER DOTLESS I | 0x0131 | 0x1B69 | ı LATIN SMALL LETTER DOTLESS I | | | | |
| ş LATIN SMALL LETTER S WITH CEDILLA * | 0x015F | 0x1B73 | ş LATIN SMALL LETTER S WITH CEDILLA * | | | | |
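
The Unicode to GSM direction of Table 1A can be illustrated with a short Python sketch. The Turkish single shift entries are copied from the table; everything else is an assumption made for the example: the function names are hypothetical, the default alphabet is reduced to the space, digits and unaccented letters (whose GSM codes equal their ASCII codes), and any other character falls back to 0x3F, the question mark of the GSM 03.38 default alphabet.

```python
# Illustrative sketch only; not the Symbian converter implementation.
ESCAPE = 0x1B  # introduces one character from the single shift table

# Unicode -> code in the Turkish single shift table (values from Table 1A,
# without the leading 0x1B).
TURKISH_SINGLE_SHIFT = {
    "\u011E": 0x47,  # G with breve
    "\u01E6": 0x47,  # G with caron (lossy: decodes as G with breve)
    "\u0130": 0x49,  # I with dot above
    "\u015E": 0x53,  # S with cedilla
    "\u00E7": 0x63,  # c with cedilla
    "\u011F": 0x67,  # g with breve
    "\u01E7": 0x67,  # g with caron (lossy: decodes as g with breve)
    "\u0131": 0x69,  # dotless i
    "\u015F": 0x73,  # s with cedilla
}


def default_alphabet_code(ch):
    """Partial GSM default alphabet: space, digits and unaccented letters
    share their ASCII codes. Everything else is omitted from this sketch."""
    if ch == " " or (ch.isascii() and ch.isalnum()):
        return ord(ch)
    return None


def encode(text):
    """Encode text as a sequence of GSM septet values (one per byte)."""
    out = []
    for ch in text:
        if ch in TURKISH_SINGLE_SHIFT:
            out += [ESCAPE, TURKISH_SINGLE_SHIFT[ch]]
        else:
            code = default_alphabet_code(ch)
            out.append(0x3F if code is None else code)  # '?' fallback
    return bytes(out)


# "agac" spelled with g-breve and c-cedilla
print(encode("a\u011Fa\u00E7").hex(" "))   # 61 1b 67 61 1b 63
```

Each character taken from the single shift table costs two septets, the escape code plus the character code, which is why a message containing many Turkish letters holds fewer characters under this converter.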

 

       

Default GSM7 Turkish Single Converter vs. Standard SMS Converter – GSM Code to Unicode

Table 1B

Columns 1 to 4: Default GSM7 Turkish Single Converter. Columns 5 to 8: Standard SMS Converter.

| Character | GSM | Unicode | Converted Character | Character | GSM | Unicode | Converted Character |
|---|---|---|---|---|---|---|---|
| Ç LATIN CAPITAL LETTER C WITH CEDILLA | 0x09 | 0x00C7 | Ç LATIN CAPITAL LETTER C WITH CEDILLA | Ç LATIN CAPITAL LETTER C WITH CEDILLA | 0x09 | 0x00E7 | ç LATIN SMALL LETTER C WITH CEDILLA |
| FORM FEED | 0x1B0A | 0x000C | FORM FEED | (PAGE BREAK) | 0x1B0A | 0x000C | (PAGE BREAK) |
| Ğ LATIN CAPITAL LETTER G WITH BREVE | 0x1B47 | 0x011E | Ğ LATIN CAPITAL LETTER G WITH BREVE | | 0x1B47 | 0x0047 | G LATIN CAPITAL LETTER G |
| İ LATIN CAPITAL LETTER I WITH DOT ABOVE | 0x1B49 | 0x0130 | İ LATIN CAPITAL LETTER I WITH DOT ABOVE | | 0x1B49 | 0x0049 | I LATIN CAPITAL LETTER I |
| Ş LATIN CAPITAL LETTER S WITH CEDILLA * | 0x1B53 | 0x015E | Ş LATIN CAPITAL LETTER S WITH CEDILLA * | | 0x1B53 | 0x0053 | S LATIN CAPITAL LETTER S |
| ğ LATIN SMALL LETTER G WITH BREVE | 0x1B67 | 0x011F | ğ LATIN SMALL LETTER G WITH BREVE | | 0x1B67 | 0x0067 | g LATIN SMALL LETTER G |
| ı LATIN SMALL LETTER DOTLESS I | 0x1B69 | 0x0131 | ı LATIN SMALL LETTER DOTLESS I | | 0x1B69 | 0x0069 | i LATIN SMALL LETTER I |
| ş LATIN SMALL LETTER S WITH CEDILLA * | 0x1B73 | 0x015F | ş LATIN SMALL LETTER S WITH CEDILLA * | | 0x1B73 | 0x0073 | s LATIN SMALL LETTER S |

Turkish Locking GSM7 Extension Converter vs. Standard SMS Converter – Unicode to GSM Code

Table 2A

Columns 1 to 4: Turkish Locking GSM7 Extension Converter. Columns 5 to 8: Standard SMS Converter.

| Character | Unicode | GSM | Converted Character | Character | Unicode | GSM | Converted Character |
|---|---|---|---|---|---|---|---|
| € EURO SIGN | 0x20AC | 0x04 | € EURO SIGN | è LATIN SMALL LETTER E WITH GRAVE | 0x00E8 | 0x04 | è LATIN SMALL LETTER E WITH GRAVE |
| ı LATIN SMALL LETTER DOTLESS I | 0x0131 | 0x07 | ı LATIN SMALL LETTER DOTLESS I | ì LATIN SMALL LETTER I WITH GRAVE | 0x00EC | 0x07 | ì LATIN SMALL LETTER I WITH GRAVE |
| Ğ LATIN CAPITAL LETTER G WITH BREVE | 0x011E | 0x0B | Ğ LATIN CAPITAL LETTER G WITH BREVE | ç LATIN SMALL LETTER C WITH CEDILLA | 0x00E7 | 0x09 | Ç LATIN CAPITAL LETTER C WITH CEDILLA |
| Ǧ LATIN CAPITAL LETTER G WITH CARON | 0x01E6 | 0x0B | Ğ LATIN CAPITAL LETTER G WITH BREVE | Ø LATIN CAPITAL LETTER O WITH STROKE | 0x00D8 | 0x0B | Ø LATIN CAPITAL LETTER O WITH STROKE |
| ğ LATIN SMALL LETTER G WITH BREVE | 0x011F | 0x0C | ğ LATIN SMALL LETTER G WITH BREVE | ø LATIN SMALL LETTER O WITH STROKE | 0x00F8 | 0x0C | ø LATIN SMALL LETTER O WITH STROKE |
| ǧ LATIN SMALL LETTER G WITH CARON | 0x01E7 | 0x0C | ğ LATIN SMALL LETTER G WITH BREVE | (FORM FEED) | 0x000C | 0x1B0A | (PAGE BREAK) |
| Ş LATIN CAPITAL LETTER S WITH CEDILLA * | 0x015E | 0x1C | Ş LATIN CAPITAL LETTER S WITH CEDILLA * | € EURO SIGN | 0x20AC | 0x1B65 | € EURO SIGN |
| ş LATIN SMALL LETTER S WITH CEDILLA * | 0x015F | 0x1D | ş LATIN SMALL LETTER S WITH CEDILLA * | Æ LATIN CAPITAL LETTER AE | 0x00C6 | 0x1C | Æ LATIN CAPITAL LETTER AE |
| İ LATIN CAPITAL LETTER I WITH DOT ABOVE | 0x0130 | 0x40 | İ LATIN CAPITAL LETTER I WITH DOT ABOVE | æ LATIN SMALL LETTER AE | 0x00E6 | 0x1D | æ LATIN SMALL LETTER AE |
| ç LATIN SMALL LETTER C WITH CEDILLA | 0x00E7 | 0x60 | ç LATIN SMALL LETTER C WITH CEDILLA | ¡ INVERTED EXCLAMATION MARK | 0x00A1 | 0x40 | ¡ INVERTED EXCLAMATION MARK |
| | | | | Í LATIN CAPITAL LETTER I WITH ACUTE | 0x00CD | 0x49 | I LATIN CAPITAL LETTER I |
| | | | | ¿ INVERTED QUESTION MARK | 0x00BF | 0x60 | ¿ INVERTED QUESTION MARK |

Turkish Locking GSM7 Extension Converter vs. Standard SMS Converter – GSM Code to Unicode

Table 2B

Columns 1 to 4: Turkish Locking GSM7 Extension Converter. Columns 5 to 8: Standard SMS Converter.

| Character | GSM | Unicode | Converted Character | Character | GSM | Unicode | Converted Character |
|---|---|---|---|---|---|---|---|
| Ç LATIN CAPITAL LETTER C WITH CEDILLA | 0x09 | 0x00C7 | Ç LATIN CAPITAL LETTER C WITH CEDILLA | Ç LATIN CAPITAL LETTER C WITH CEDILLA | 0x09 | 0x00E7 | ç LATIN SMALL LETTER C WITH CEDILLA |
| Ğ LATIN CAPITAL LETTER G WITH BREVE | 0x0B | 0x011E | Ğ LATIN CAPITAL LETTER G WITH BREVE | Ø LATIN CAPITAL LETTER O WITH STROKE | 0x0B | 0x00D8 | Ø LATIN CAPITAL LETTER O WITH STROKE |
| ğ LATIN SMALL LETTER G WITH BREVE | 0x0C | 0x011F | ğ LATIN SMALL LETTER G WITH BREVE | ø LATIN SMALL LETTER O WITH STROKE | 0x0C | 0x00F8 | ø LATIN SMALL LETTER O WITH STROKE |
| | 0x1B07 | 0x0131 | ı LATIN SMALL LETTER DOTLESS I | | 0x1B07 | 0x00EC | ì LATIN SMALL LETTER I WITH GRAVE |
| | 0x1B0A | 0x000C | (FORM FEED) | (PAGE BREAK) | 0x1B0A | 0x000C | (PAGE BREAK) |
| | 0x1B0B | 0x011E | Ğ LATIN CAPITAL LETTER G WITH BREVE | | 0x1B0B | 0x00D8 | Ø LATIN CAPITAL LETTER O WITH STROKE |
| | 0x1B0C | 0x011F | ğ LATIN SMALL LETTER G WITH BREVE | | 0x1B0C | 0x00F8 | ø LATIN SMALL LETTER O WITH STROKE |
| | 0x1B1C | 0x015E | Ş LATIN CAPITAL LETTER S WITH CEDILLA * | | 0x1B1C | 0x00C6 | Æ LATIN CAPITAL LETTER AE (ash) * |
| | 0x1B1D | 0x015F | ş LATIN SMALL LETTER S WITH CEDILLA * | | 0x1B1D | 0x00E6 | æ LATIN SMALL LETTER AE (ash) * |
| | 0x1B60 | 0x00E7 | ç LATIN SMALL LETTER C WITH CEDILLA | | 0x1B60 | 0x00BF | ¿ INVERTED QUESTION MARK |
| Ş LATIN CAPITAL LETTER S WITH CEDILLA * | 0x1C | 0x015E | Ş LATIN CAPITAL LETTER S WITH CEDILLA * | Æ LATIN CAPITAL LETTER AE | 0x1C | 0x00C6 | Æ LATIN CAPITAL LETTER AE |
| ş LATIN SMALL LETTER S WITH CEDILLA * | 0x1D | 0x015F | ş LATIN SMALL LETTER S WITH CEDILLA * | æ LATIN SMALL LETTER AE | 0x1D | 0x00E6 | æ LATIN SMALL LETTER AE |
| İ LATIN CAPITAL LETTER I WITH DOT ABOVE | 0x40 | 0x0130 | İ LATIN CAPITAL LETTER I WITH DOT ABOVE | ¡ INVERTED EXCLAMATION MARK | 0x40 | 0x00A1 | ¡ INVERTED EXCLAMATION MARK |

Turkish Locking Single Converter vs. Standard SMS Converter – Unicode to GSM Code

Table 3A

Columns 1 to 4: Turkish Locking Single Converter. Columns 5 to 8: Standard SMS Converter.

| Character | Unicode | GSM | Converted Character | Character | Unicode | GSM | Converted Character |
|---|---|---|---|---|---|---|---|
| € EURO SIGN | 0x20AC | 0x04 | € EURO SIGN | è LATIN SMALL LETTER E WITH GRAVE | 0x00E8 | 0x04 | è LATIN SMALL LETTER E WITH GRAVE |
| ı LATIN SMALL LETTER DOTLESS I | 0x0131 | 0x07 | ı LATIN SMALL LETTER DOTLESS I | ì LATIN SMALL LETTER I WITH GRAVE | 0x00EC | 0x07 | ì LATIN SMALL LETTER I WITH GRAVE |
| Ğ LATIN CAPITAL LETTER G WITH BREVE | 0x011E | 0x0B | Ğ LATIN CAPITAL LETTER G WITH BREVE | ç LATIN SMALL LETTER C WITH CEDILLA | 0x00E7 | 0x09 | Ç LATIN CAPITAL LETTER C WITH CEDILLA |
| Ǧ LATIN CAPITAL LETTER G WITH CARON | 0x01E6 | 0x0B | Ğ LATIN CAPITAL LETTER G WITH BREVE | Ø LATIN CAPITAL LETTER O WITH STROKE | 0x00D8 | 0x0B | Ø LATIN CAPITAL LETTER O WITH STROKE |
| ğ LATIN SMALL LETTER G WITH BREVE | 0x011F | 0x0C | ğ LATIN SMALL LETTER G WITH BREVE | ø LATIN SMALL LETTER O WITH STROKE | 0x00F8 | 0x0C | ø LATIN SMALL LETTER O WITH STROKE |
| ǧ LATIN SMALL LETTER G WITH CARON | 0x01E7 | 0x0C | ğ LATIN SMALL LETTER G WITH BREVE | (FORM FEED) | 0x000C | 0x1B0A | (PAGE BREAK) |
| Ş LATIN CAPITAL LETTER S WITH CEDILLA * | 0x015E | 0x1C | Ş LATIN CAPITAL LETTER S WITH CEDILLA * | € EURO SIGN | 0x20AC | 0x1B65 | € EURO SIGN |
| ş LATIN SMALL LETTER S WITH CEDILLA * | 0x015F | 0x1D | ş LATIN SMALL LETTER S WITH CEDILLA * | Æ LATIN CAPITAL LETTER AE | 0x00C6 | 0x1C | Æ LATIN CAPITAL LETTER AE |
| İ LATIN CAPITAL LETTER I WITH DOT ABOVE | 0x0130 | 0x40 | İ LATIN CAPITAL LETTER I WITH DOT ABOVE | æ LATIN SMALL LETTER AE | 0x00E6 | 0x1D | æ LATIN SMALL LETTER AE |
| ç LATIN SMALL LETTER C WITH CEDILLA | 0x00E7 | 0x60 | ç LATIN SMALL LETTER C WITH CEDILLA | ¡ INVERTED EXCLAMATION MARK | 0x00A1 | 0x40 | ¡ INVERTED EXCLAMATION MARK |
| | | | | ¿ INVERTED QUESTION MARK | 0x00BF | 0x60 | ¿ INVERTED QUESTION MARK |

Turkish Locking Single Converter vs. Standard SMS Converter – GSM Code to Unicode

Table 3B

Columns 1 to 4: Turkish Locking Single Converter. Columns 5 to 8: Standard SMS Converter.

| Character | GSM | Unicode | Converted Character | Character | GSM | Unicode | Converted Character |
|---|---|---|---|---|---|---|---|
| € EURO SIGN | 0x04 | 0x20AC | € EURO SIGN | è LATIN SMALL LETTER E WITH GRAVE | 0x04 | 0x00E8 | è LATIN SMALL LETTER E WITH GRAVE |
| ı LATIN SMALL LETTER DOTLESS I | 0x07 | 0x0131 | ı LATIN SMALL LETTER DOTLESS I | ì LATIN SMALL LETTER I WITH GRAVE | 0x07 | 0x00EC | ì LATIN SMALL LETTER I WITH GRAVE |
| Ç LATIN CAPITAL LETTER C WITH CEDILLA | 0x09 | 0x00C7 | Ç LATIN CAPITAL LETTER C WITH CEDILLA | Ç LATIN CAPITAL LETTER C WITH CEDILLA | 0x09 | 0x00E7 | ç LATIN SMALL LETTER C WITH CEDILLA |
| Ğ LATIN CAPITAL LETTER G WITH BREVE | 0x0B | 0x011E | Ğ LATIN CAPITAL LETTER G WITH BREVE | Ø LATIN CAPITAL LETTER O WITH STROKE | 0x0B | 0x00D8 | Ø LATIN CAPITAL LETTER O WITH STROKE |
| ğ LATIN SMALL LETTER G WITH BREVE | 0x0C | 0x011F | ğ LATIN SMALL LETTER G WITH BREVE | ø LATIN SMALL LETTER O WITH STROKE | 0x0C | 0x00F8 | ø LATIN SMALL LETTER O WITH STROKE |
| | 0x1B04 | 0x20AC | € EURO SIGN | | 0x1B04 | 0x00E8 | è LATIN SMALL LETTER E WITH GRAVE |
| | 0x1B07 | 0x0131 | ı LATIN SMALL LETTER DOTLESS I | | 0x1B07 | 0x00EC | ì LATIN SMALL LETTER I WITH GRAVE |
| | 0x1B0B | 0x011E | Ğ LATIN CAPITAL LETTER G WITH BREVE | | 0x1B0B | 0x00D8 | Ø LATIN CAPITAL LETTER O WITH STROKE |
| | 0x1B0C | 0x011F | ğ LATIN SMALL LETTER G WITH BREVE | | 0x1B0C | 0x00F8 | ø LATIN SMALL LETTER O WITH STROKE |
| | 0x1B1C | 0x015E | Ş LATIN CAPITAL LETTER S WITH CEDILLA * | | 0x1B1C | 0x00C6 | Æ LATIN CAPITAL LETTER AE (ash) * |
| | 0x1B1D | 0x015F | ş LATIN SMALL LETTER S WITH CEDILLA * | | 0x1B1D | 0x00E6 | æ LATIN SMALL LETTER AE (ash) * |
| | 0x1B47 | 0x011E | Ğ LATIN CAPITAL LETTER G WITH BREVE | | 0x1B47 | 0x0047 | G LATIN CAPITAL LETTER G |
| | 0x1B49 | 0x0130 | İ LATIN CAPITAL LETTER I WITH DOT ABOVE | | 0x1B49 | 0x0049 | I LATIN CAPITAL LETTER I |
| | 0x1B53 | 0x015E | Ş LATIN CAPITAL LETTER S WITH CEDILLA * | | 0x1B53 | 0x0053 | S LATIN CAPITAL LETTER S |
| | 0x1B60 | 0x00E7 | ç LATIN SMALL LETTER C WITH CEDILLA | | 0x1B60 | 0x00BF | ¿ INVERTED QUESTION MARK |
| | 0x1B67 | 0x011F | ğ LATIN SMALL LETTER G WITH BREVE | | 0x1B67 | 0x0067 | g LATIN SMALL LETTER G |
| | 0x1B69 | 0x0131 | ı LATIN SMALL LETTER DOTLESS I | | 0x1B69 | 0x0069 | i LATIN SMALL LETTER I |
| Ş LATIN CAPITAL LETTER S WITH CEDILLA * | 0x1C | 0x015E | Ş LATIN CAPITAL LETTER S WITH CEDILLA * | Æ LATIN CAPITAL LETTER AE | 0x1C | 0x00C6 | Æ LATIN CAPITAL LETTER AE |
| ş LATIN SMALL LETTER S WITH CEDILLA * | 0x1D | 0x015F | ş LATIN SMALL LETTER S WITH CEDILLA * | æ LATIN SMALL LETTER AE | 0x1D | 0x00E6 | æ LATIN SMALL LETTER AE |
| ç LATIN SMALL LETTER C WITH CEDILLA | 0x60 | 0x00E7 | ç LATIN SMALL LETTER C WITH CEDILLA | ¿ INVERTED QUESTION MARK | 0x60 | 0x00BF | ¿ INVERTED QUESTION MARK |
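
Table 3B suggests how the Turkish Locking Single Converter resolves an escaped code. When 0x1B is followed by a code that the Turkish single shift table defines (for example 0x47), the single shift character is produced. Otherwise (for example 0x04 or 0x07) the result is the character at that position in the Turkish locking shift table. The Python sketch below follows that lookup order using rows from Table 3B. The names are hypothetical, the lookup order is inferred from the table rather than taken from the converter source, and codes that the sketch does not list are handled in a simplified way, as the comments state.

```python
# Illustrative sketch only; not the Symbian converter implementation.
ESCAPE = 0x1B
REPLACEMENT = "\uFFFD"  # Unicode replacement character

# GSM code -> Unicode, Turkish locking shift table (rows from Table 3B).
LOCKING = {
    0x04: "\u20AC", 0x07: "\u0131", 0x09: "\u00C7", 0x0B: "\u011E",
    0x0C: "\u011F", 0x1C: "\u015E", 0x1D: "\u015F", 0x60: "\u00E7",
}

# Code following 0x1B -> Unicode, Turkish single shift table (Table 3B).
SINGLE = {
    0x47: "\u011E", 0x49: "\u0130", 0x53: "\u015E",
    0x67: "\u011F", 0x69: "\u0131",
}


def lookup_locking(code):
    """Decode one unescaped code with the locking shift table."""
    if code > 0x7F:
        return REPLACEMENT            # outside the 7-bit range
    if code in LOCKING:
        return LOCKING[code]
    ch = chr(code)
    if ch == " " or ch.isalnum():     # same codes as ASCII
        return ch
    return REPLACEMENT                # not listed in this short sketch


def decode(septets):
    """Decode an iterable of GSM septet values into a Unicode string."""
    out = []
    it = iter(septets)
    for code in it:
        if code == ESCAPE:
            nxt = next(it, None)
            if nxt is None:
                break                 # dangling escape: dropped in this sketch
            # Single shift table first, then fall back to the locking table.
            out.append(SINGLE[nxt] if nxt in SINGLE else lookup_locking(nxt))
        else:
            out.append(lookup_locking(code))
    return "".join(out)


print(decode([0x1B, 0x69, 0x1D, 0x1B, 0x07, 0x6B]))
```

In the usage line, the first dotless i arrives through the single shift table (0x1B 0x69) and the second through the fallback to the locking shift table (0x1B 0x07), matching the corresponding rows of Table 3B.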

See also

SMS Encodings and Converters Overview