WhiteSpace Class
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
The pre-tokenizer which split the text at the word boundary. The word is a set of alphabet, numeric, and underscore characters.
public sealed class WhiteSpace : Microsoft.ML.Tokenizers.PreTokenizer
type WhiteSpace = class
inherit PreTokenizer
Public NotInheritable Class WhiteSpace
Inherits PreTokenizer
- Inheritance
Constructors
WhiteSpace() |
Fields
Instance |
Gets a singleton instance of the WhiteSpace pre-tokenizer.. |
Methods
PreTokenize(String) |
Splits the given string in multiple substrings at the word boundary, keeping track of the offsets of said substrings from the original string. |