Share via


Document Parser Definition Schema Overview

This content is outdated and is no longer being maintained. It is provided as a courtesy for individuals who are still using these technologies. This page may contain URLs that were valid when originally published, but now link to sites or pages that no longer exist.

To register a custom document parser with Windows SharePoint Services, you must add a node to the document parser definition file that identifies your parser and the file type or types it can parse. You can specify these file types by file extension or file type program ID.

For more information about associating file types with document parsers, see the topic that includes the parser association section.

Windows SharePoint Services stores the document parser definition file, DOCPARSE.XML, at the following location:

Web Server Extensions\12\CONFIG\DOCPARSE.XML

The document parser definition schema is as follows:

<docParsers>
  <docParser/>
</docParsers>

Following is a list of the elements in the document parser definition schema.

  • docParsers   Required. Represents the root element of the document parser definition schema.

  • docParser   Required. Each docParser element represents a document parser and its associated file type. This element contains the following attributes:

    • Name   Required String**. The file type associated with the parser. For** docParser elements within the ByExtension element, set the Nameattribute to the file extension. For docParser elements within the ByProdId element, set the Name attribute to the program ID of the file type. To associate a parser with multiple file types, add a docParser element for each file type.

    • ProgId   Required String**.** The program ID of the parser that represents the "friendly name" of the parser. This enables you to upgrade a parser without having to edit its document parser definition entry in the DOCPARSE.XML file. However, this prevents you from installing different versions of the same parser side by side.

Document Parser Definition Example

Following is an example of a document parser definition file.

<docParsers>
  <docParser name="abc" ProgId="AdventureWorks.AWDocumentParser.ABCParser"/>
  <DocParser name="AWApplication.Document" ProgId="AdventureWorks.AWDocumentParser.ABCParser"/>
</docParsers>

See Also

Concepts

Custom Document Parsers

Mapping Document Properties to Columns

Document Parsing and Content Types

Document Parser Definition Schema Overview