DocumentPage Class
- java.lang.Object
  - com.azure.ai.formrecognizer.documentanalysis.models.DocumentPage
public final class DocumentPage
Content and layout elements extracted from a page of the input document.
Constructor Summary
Constructor | Description |
---|---|
DocumentPage() | Creates a DocumentPage object. |
Method Summary
Modifier and Type | Method | Description |
---|---|---|
Float | getAngle() | Get the general orientation of the content in clockwise direction, measured in degrees between (-180, 180]. |
List<DocumentBarcode> | getBarcodes() | Get the extracted barcodes from the page. |
List<DocumentFormula> | getFormulas() | Get the extracted formulas from the page. |
Float | getHeight() | Get the height of the image/PDF in pixels/inches, respectively. |
List<DocumentLine> | getLines() | Get the extracted lines from the page, potentially containing both textual and visual elements. |
int | getPageNumber() | Get the 1-based page number in the input document. |
List<DocumentSelectionMark> | getSelectionMarks() | Get the extracted selection marks from the page. |
List<DocumentSpan> | getSpans() | Get the location of the page in the reading order concatenated content. |
DocumentPageLengthUnit | getUnit() | Get the unit used by the width, height, and boundingBox properties. For images, the unit is "pixel". For PDF, the unit is "inch". |
Float | getWidth() | Get the width of the image/PDF in pixels/inches, respectively. |
List<DocumentWord> | getWords() | Get the extracted words from the page. |
Methods inherited from java.lang.Object
Constructor Details
DocumentPage
public DocumentPage()
Creates a DocumentPage object.
Method Details
getAngle
public Float getAngle()
Get the general orientation of the content in clockwise direction, measured in degrees between (-180, 180].
Returns: the angle value.
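The interval (-180, 180] excludes -180 and includes 180, which is easy to get wrong when combining or inverting angles. The following is a minimal sketch of one way to keep derived angles inside that interval. It uses only the standard library; the class `PageAngles` and its methods are hypothetical helpers, not part of the SDK, and treating the negated page angle as the deskew rotation is an assumption about how a caller might use the value.

```java
// Hypothetical helper, not part of the SDK: keeps angles in (-180, 180].
final class PageAngles {
    private PageAngles() { }

    /** Maps any angle in degrees onto the half-open interval (-180, 180]. */
    static float normalize(float degrees) {
        // Java's % keeps the sign of the dividend, so r lies in (-360, 360).
        float r = degrees % 360f;
        if (r > 180f) {
            r -= 360f;
        } else if (r <= -180f) {
            r += 360f;
        }
        return r;
    }

    /**
     * Clockwise rotation that would undo a page angle (assumption: the caller
     * wants to bring the content upright by rotating it back).
     */
    static float correction(float pageAngle) {
        return normalize(-pageAngle);
    }

    public static void main(String[] args) {
        // In real code the input would come from page.getAngle(), which
        // returns a boxed Float, so a null check before unboxing is prudent.
        System.out.println(normalize(190f));
        System.out.println(correction(90f));
    }
}
```

Negating 180 yields -180, which lies outside the interval, so `correction` passes the result through `normalize` again rather than returning the negated value directly.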
getBarcodes
public List<DocumentBarcode> getBarcodes()
Get the extracted barcodes from the page.
Returns: the barcodes value.
getFormulas
public List<DocumentFormula> getFormulas()
Get the extracted formulas from the page.
Returns: the formulas value.
getHeight
public Float getHeight()
Get the height of the image/PDF in pixels/inches, respectively.
Returns: the height value.
getLines
public List<DocumentLine> getLines()
Get the extracted lines from the page, potentially containing both textual and visual elements.
Returns: the lines value.
getPageNumber
public int getPageNumber()
Get the 1-based page number in the input document.
Returns: the pageNumber value.
getSelectionMarks
public List<DocumentSelectionMark> getSelectionMarks()
Get the extracted selection marks from the page.
Returns: the selectionMarks value.
getSpans
public List<DocumentSpan> getSpans()
Get the location of the page in the reading order concatenated content.
Returns: the spans value.
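A span locates a piece of the page inside the concatenated content string by offset and length. As a rough illustration, the sketch below recovers the text for a list of spans. To stay self-contained it uses a hypothetical stand-in class `Span` with `offset` and `length` fields instead of the SDK's `DocumentSpan`, and it assumes that offsets count Java `char` units; both are assumptions rather than statements about the service.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of the SDK.
final class PageText {
    private PageText() { }

    /** Stand-in for DocumentSpan: an offset and a length into the content string. */
    static final class Span {
        final int offset;
        final int length;

        Span(int offset, int length) {
            this.offset = offset;
            this.length = length;
        }
    }

    /** Concatenates the substrings of content that the spans point at, in list order. */
    static String slice(String content, List<Span> spans) {
        StringBuilder sb = new StringBuilder();
        for (Span s : spans) {
            // append(CharSequence, start, end) copies the half-open range [start, end).
            sb.append(content, s.offset, s.offset + s.length);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Invented sample data: two pages of 14 characters each.
        String content = "Page one text.Page two text.";
        List<Span> secondPage = Arrays.asList(new Span(14, 14));
        System.out.println(slice(content, secondPage));
    }
}
```

With the real types, the same loop would read the offset and length from each `DocumentSpan` returned by `getSpans()` and slice the content string of the overall analysis result.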
getUnit
public DocumentPageLengthUnit getUnit()
Get the unit used by the width, height, and boundingBox properties. For images, the unit is "pixel". For PDF, the unit is "inch".
Returns: the unit value.
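Because width and height are reported in pixels for images and in inches for PDFs, code that compares pages from mixed inputs needs to convert to a common unit first. The sketch below shows one possible conversion to inches. The enum `LengthUnit` is a hypothetical stand-in for `DocumentPageLengthUnit`, and the `dpi` parameter is an assumption: the page object does not supply an image resolution here, so the caller has to provide one.

```java
// Hypothetical helper, not part of the SDK.
final class PageSizes {
    private PageSizes() { }

    /** Stand-in for DocumentPageLengthUnit ("pixel" for images, "inch" for PDF). */
    enum LengthUnit { PIXEL, INCH }

    /**
     * Converts a page dimension to inches.
     * dpi is only used for PIXEL values and must be supplied by the caller.
     */
    static double toInches(float value, LengthUnit unit, double dpi) {
        switch (unit) {
            case INCH:
                return value;
            case PIXEL:
                if (dpi <= 0) {
                    throw new IllegalArgumentException("dpi must be positive");
                }
                return value / dpi;
            default:
                throw new IllegalArgumentException("Unsupported unit: " + unit);
        }
    }

    public static void main(String[] args) {
        // Invented sample values: a PDF page width and an image width at 96 dpi.
        System.out.println(toInches(8.5f, LengthUnit.INCH, 96));
        System.out.println(toInches(960f, LengthUnit.PIXEL, 96));
    }
}
```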
getWidth
public Float getWidth()
Get the width of the image/PDF in pixels/inches, respectively.
Returns: the width value.
getWords
public List<DocumentWord> getWords()
Get the extracted words from the page.
Returns: the words value.