MicrosoftTokenizerLanguage Enum

Definition

Lists the languages supported by the Microsoft language tokenizer.

C#: public enum MicrosoftTokenizerLanguage
F#: type MicrosoftTokenizerLanguage =
VB: Public Enum MicrosoftTokenizerLanguage
Inheritance
MicrosoftTokenizerLanguage

Fields

Name                 Value  Description
Bangla               0      Selects the Microsoft tokenizer for Bangla.
Bulgarian            1      Selects the Microsoft tokenizer for Bulgarian.
Catalan              2      Selects the Microsoft tokenizer for Catalan.
ChineseSimplified    3      Selects the Microsoft tokenizer for Chinese (Simplified).
ChineseTraditional   4      Selects the Microsoft tokenizer for Chinese (Traditional).
Croatian             5      Selects the Microsoft tokenizer for Croatian.
Czech                6      Selects the Microsoft tokenizer for Czech.
Danish               7      Selects the Microsoft tokenizer for Danish.
Dutch                8      Selects the Microsoft tokenizer for Dutch.
English              9      Selects the Microsoft tokenizer for English.
French               10     Selects the Microsoft tokenizer for French.
German               11     Selects the Microsoft tokenizer for German.
Greek                12     Selects the Microsoft tokenizer for Greek.
Gujarati             13     Selects the Microsoft tokenizer for Gujarati.
Hindi                14     Selects the Microsoft tokenizer for Hindi.
Icelandic            15     Selects the Microsoft tokenizer for Icelandic.
Indonesian           16     Selects the Microsoft tokenizer for Indonesian.
Italian              17     Selects the Microsoft tokenizer for Italian.
Japanese             18     Selects the Microsoft tokenizer for Japanese.
Kannada              19     Selects the Microsoft tokenizer for Kannada.
Korean               20     Selects the Microsoft tokenizer for Korean.
Malay                21     Selects the Microsoft tokenizer for Malay.
Malayalam            22     Selects the Microsoft tokenizer for Malayalam.
Marathi              23     Selects the Microsoft tokenizer for Marathi.
NorwegianBokmaal     24     Selects the Microsoft tokenizer for Norwegian (Bokmål).
Polish               25     Selects the Microsoft tokenizer for Polish.
Portuguese           26     Selects the Microsoft tokenizer for Portuguese.
PortugueseBrazilian  27     Selects the Microsoft tokenizer for Portuguese (Brazil).
Punjabi              28     Selects the Microsoft tokenizer for Punjabi.
Romanian             29     Selects the Microsoft tokenizer for Romanian.
Russian              30     Selects the Microsoft tokenizer for Russian.
SerbianCyrillic      31     Selects the Microsoft tokenizer for Serbian (Cyrillic).
SerbianLatin         32     Selects the Microsoft tokenizer for Serbian (Latin).
Slovenian            33     Selects the Microsoft tokenizer for Slovenian.
Spanish              34     Selects the Microsoft tokenizer for Spanish.
Swedish              35     Selects the Microsoft tokenizer for Swedish.
Tamil                36     Selects the Microsoft tokenizer for Tamil.
Telugu               37     Selects the Microsoft tokenizer for Telugu.
Thai                 38     Selects the Microsoft tokenizer for Thai.
Ukrainian            39     Selects the Microsoft tokenizer for Ukrainian.
Urdu                 40     Selects the Microsoft tokenizer for Urdu.
Vietnamese           41     Selects the Microsoft tokenizer for Vietnamese.
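
Example

The following is a minimal sketch, not taken from the SDK, of how a language name read from configuration might be resolved to one of the fields listed above. To stay self-contained it declares a local stand-in enum with only four members whose values are copied from the Fields entries; real code would reference the library's MicrosoftTokenizerLanguage type instead. The TokenizerLanguageDemo class and its TryResolve helper are hypothetical names introduced for illustration.

```csharp
using System;

// Local stand-in for illustration only: a few members copied from the
// Fields list. Real code would use the SDK's MicrosoftTokenizerLanguage.
public enum MicrosoftTokenizerLanguage
{
    Bangla = 0,
    English = 9,
    NorwegianBokmaal = 24,
    Vietnamese = 41,
}

public static class TokenizerLanguageDemo
{
    // Hypothetical helper: resolves a configured name to an enum member.
    public static bool TryResolve(string name, out MicrosoftTokenizerLanguage language)
    {
        // Enum.TryParse also accepts numeric strings such as "99", so the
        // IsDefined check rejects values that do not match a declared member.
        return Enum.TryParse(name, ignoreCase: true, out language)
            && Enum.IsDefined(typeof(MicrosoftTokenizerLanguage), language);
    }

    public static void Main()
    {
        if (TryResolve("norwegianbokmaal", out var language))
        {
            // Prints "NorwegianBokmaal = 24"
            Console.WriteLine($"{language} = {(int)language}");
        }

        Console.WriteLine(TryResolve("Klingon", out _)); // Prints "False"
        Console.WriteLine(TryResolve("99", out _));      // Prints "False"
    }
}
```

The IsDefined check is a design choice for this sketch: without it, a numeric string outside the 0 to 41 range would parse successfully into a value that names no tokenizer language.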

Applies to