Can build content for a custom classifier be specified using generic URLs?

Abby Newcomb 0 Reputation points
2025-01-06T18:48:35.04+00:00

I’m a bit confused by this API documentation regarding how to train a custom document classifier in v4.0. The ClassifierDocumentTypeDetails specifies the build content files and has two possible formats (azureBlobSource and azureBlobFileListSource), both of which require the files to be stored in an Azure Blob container.

There are two other ContentSourceKinds listed, url and base64, but no information on how to specify the build content in that format.

Are those ContentSourceKinds, url and base64, available? If so, what is the spec for the format of the build content?
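
For reference, the two blob-based formats named in the question can be sketched as a build request body. This is a minimal illustration, not taken from the thread: the field names (classifierId, docTypes, containerUrl, prefix, fileList) are assumed from my reading of the v4.0 REST reference, and the classifier ID, container URL, folder prefix, and file-list name are hypothetical placeholders.

```python
import json


def blob_source_doc_type(container_url, prefix=None):
    """Doc type entry that trains from every file under a folder prefix
    in an Azure Blob container (field names are assumptions)."""
    source = {"containerUrl": container_url}
    if prefix is not None:
        source["prefix"] = prefix
    return {"azureBlobSource": source}


def blob_file_list_doc_type(container_url, file_list):
    """Doc type entry that trains from the files named in a file list
    stored in the container (field names are assumptions)."""
    return {
        "azureBlobFileListSource": {
            "containerUrl": container_url,
            "fileList": file_list,
        }
    }


# Hypothetical placeholder values, for illustration only.
CONTAINER_URL = "https://example.blob.core.windows.net/training-data"

build_body = {
    "classifierId": "my-classifier",
    "docTypes": {
        "invoice": blob_source_doc_type(CONTAINER_URL, prefix="invoices/"),
        "receipt": blob_file_list_doc_type(CONTAINER_URL, "receipts.jsonl"),
    },
}

print(json.dumps(build_body, indent=2))
```

Both shapes point at a blob container, which is why the question asks whether a generic URL or Base64 payload can be used instead.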

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

1 answer

  1. Vikram Singh 100 Reputation points Microsoft Employee
    2025-01-07T05:11:41.14+00:00

    Hello @Abby Newcomb ,

    Welcome to the Microsoft Q&A Forum! Thank you for your question.

    Yes, the mentioned ContentSourceKinds are available. To clarify, the ClassifierDocumentTypeDetails allows for two content source formats: urlSource and base64Source. You need to specify one of these formats for your document classification:

    • urlSource: This requires a URI pointing to the document.

    • base64Source: This requires the document to be provided in Base64 encoding.
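
As a rough sketch of the two formats listed above, the snippet below builds a urlSource payload and a base64Source payload. The shapes are an assumption modeled on the request bodies used when submitting a document for analysis or classification; the thread does not show the exact spec, so whether a classifier build request accepts these fields inside a doc type entry should be verified against the API reference. The document URL and sample bytes are hypothetical.

```python
import base64
import json


def url_source(document_url: str) -> dict:
    """Payload that references the document by a URI (assumed shape)."""
    return {"urlSource": document_url}


def base64_source(document_bytes: bytes) -> dict:
    """Payload that embeds the document as a Base64 string (assumed shape)."""
    encoded = base64.b64encode(document_bytes).decode("ascii")
    return {"base64Source": encoded}


# Hypothetical inputs, for illustration only.
sample_bytes = b"%PDF-1.7 placeholder bytes"
url_body = url_source("https://example.com/sample-invoice.pdf")
b64_body = base64_source(sample_bytes)

print(json.dumps(url_body))
print(json.dumps(b64_body))
```

The Base64 value must be a plain string, so the encoded bytes are decoded to ASCII before being placed in the JSON body.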

    If you have any further questions, feel free to ask!

    1 person found this answer helpful.
