PDFlib
PDFlib

Special

Other topics

get_attachments

Extract text and images from attachments.

identify_ocr

Classify the pages in a document according to text or image content.

region_of_interest

Restrict text extraction to a particular area on the page.

multiple_documents

Process multiple documents in a loop.