PDFlib

Special

get_attachmentsExtract text and images from attachments.
emptycheckCheck whether a specified area on the page is empt.
identify_ocrClassify the pages in a document according to text or image content.
region_of_interestRestrict text extraction to a particular area on the page.
multiple_documentsProcess multiple documents in a loop.