Hi Anchit Gupta,
Greetings & Welcome to Microsoft Q&A forum! Thanks for posting your query!
To address your requirement:
To extract figures with labels from a PDF, follow these steps:
- Convert the PDF to Images: Use a library like pdf2image to convert each page of the PDF into images for processing.
- Use Azure Document Intelligence: Leverage the prebuilt-layout model to detect and extract text regions, questions, options, and layout elements. This will help identify figures and their associated text labels.
- Extract Graphical Labels: For graphical or embedded text labels within figures, utilize the Azure Computer Vision Read API to perform OCR on the figure regions.
- Optional Customization: If the figures or labels follow a unique pattern, consider training a custom model using Azure Custom Vision or Document Intelligence Custom Model for better accuracy.
Hope this helps. Do let us know if you have any further queries.
If this answers your query, do click Accept Answer
and Yes
for was this answer helpful.