PDFlib

Text Extraction

Process the text contents of PDF documents

extractor

Simple text extractor

concordance

Create a list of all unique words in the document.
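
As a rough illustration of the idea, the following Python sketch collects the unique words from text that has already been extracted. It does not use the PDFlib API: the `pages` list, the `unique_words` helper, and the choice to lowercase words and split on `\w+` are all assumptions made for this example.

```python
import re

def unique_words(pages):
    """Return the sorted unique words found in a list of page texts.

    `pages` is assumed to hold one already-extracted text string per page.
    """
    words = set()
    for text in pages:
        # \w+ matches runs of letters, digits, and underscores;
        # lowercasing folds "The" and "the" into one entry.
        words.update(w.lower() for w in re.findall(r"\w+", text))
    return sorted(words)

if __name__ == "__main__":
    sample_pages = ["The quick brown fox", "the lazy dog and the fox"]
    print(unique_words(sample_pages))
    # ['and', 'brown', 'dog', 'fox', 'lazy', 'quick', 'the']
```

A set keeps each word once regardless of how often it occurs; sorting only at the end keeps the collection step cheap.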

back_of_the_book_index

Create a sorted list of all words in the document along with the page numbers where the words occur.
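
The index can be sketched in the same spirit: map each word to the set of pages it appears on, then sort. This is a minimal Python sketch over already-extracted page texts, not the PDFlib API. The `build_index` helper, the 1-based page numbering, and the `\w+` word rule are assumptions for illustration.

```python
import re
from collections import defaultdict

def build_index(pages):
    """Map each word to the sorted list of page numbers where it occurs.

    `pages` is assumed to hold one already-extracted text string per page;
    pages are numbered from 1 in list order.
    """
    index = defaultdict(set)
    for page_number, text in enumerate(pages, start=1):
        for word in re.findall(r"\w+", text):
            # A set records each page once, even if the word repeats on it.
            index[word.lower()].add(page_number)
    # Sort the words alphabetically and each page list numerically.
    return {word: sorted(nums) for word, nums in sorted(index.items())}

if __name__ == "__main__":
    sample_pages = ["The quick brown fox", "the lazy dog and the fox"]
    for word, nums in build_index(sample_pages).items():
        print(f"{word}: {', '.join(map(str, nums))}")
```

A real back-of-the-book index would usually also drop common stop words such as "the" and "and"; that filtering is left out here to keep the sketch short.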