RobertaPreTokenizer Class
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
The pre-tokenizer for Roberta English tokenizer.
public sealed class RobertaPreTokenizer : Microsoft.ML.Tokenizers.PreTokenizer
type RobertaPreTokenizer = class
inherit PreTokenizer
Public NotInheritable Class RobertaPreTokenizer
Inherits PreTokenizer
- Inheritance
Constructors
RobertaPreTokenizer() |
Fields
Instance |
Gets a singleton instance of the Roberta pre-tokenizer.. |
Methods
PreTokenize(String) |
Splits the given string in multiple substrings at the word boundary, keeping track of the offsets of said substrings from the original string. |