Share via


RobertaPreTokenizer Class

Definition

The pre-tokenizer for Roberta English tokenizer.

public sealed class RobertaPreTokenizer : Microsoft.ML.Tokenizers.PreTokenizer
type RobertaPreTokenizer = class
    inherit PreTokenizer
Public NotInheritable Class RobertaPreTokenizer
Inherits PreTokenizer
Inheritance
RobertaPreTokenizer

Constructors

RobertaPreTokenizer()

Fields

Instance

Gets a singleton instance of the Roberta pre-tokenizer..

Methods

PreTokenize(String)

Splits the given string in multiple substrings at the word boundary, keeping track of the offsets of said substrings from the original string.

Applies to