TET PDF IFilter Overview

PDFlib TET PDF IFilter 5 - Enterprise PDF Search for Windows

TET PDF IFilter extracts text and metadata from PDF documents and makes it available to search and retrieval software on Windows. This allows PDF documents to be searched on the local desktop, a corporate server or the Web. TET PDF IFilter is based on the patented PDFlib Text and Image Extraction Toolkit (TET), an established developer product for reliably extracting text from PDF documents.

TET PDF IFilter is a robust implementation of Microsoft’s IFilter indexing interface. It works with all search and retrieval products which support the IFilter interface, e.g. SharePoint and SQL Server. Such products use format-specific filter programs - called IFilters - for particular file formats, e.g. HTML. TET PDF IFilter is such a program, aimed at PDF documents. The user interface for searching documents may be the Windows Explorer, a Web or database frontend, a query script or a custom application. As an alternative to interactive searches, queries can also be submitted programmatically without any user interface.

Unique Advantages

TET PDF IFilter offers the following advantages:

  • Supports Western text, Chinese, Japanese, and Korean (CJK) text and right-to-left languages such as Arabic and Hebrew
  • Text from bookmarks, annotations (comments) and form fields
  • Indexes protected documents and extracts text even from PDFs where Acrobat fails
  • Configurable metadata indexing for document properties
  • Automatic script and language detection for improved search

Based on patented TET Technology

PDFlib TET, the basis of TET PDF IFilter, was first released in 2002, and is used by customers worldwide in server and desktop environments. As an alternative to extracting PDF page contents and metadata as raw text, TET can supply the document contents in XML format. TET is also available as a free plugin for Adobe Acrobat. This plugin allows interactive test and evaluation of TET’s superior text and image extraction.

Enterprise PDF Search

TET PDF IFilter is available in thread-safe 32- and 64-bit versions. You can implement enterprise PDF search solutions with TET PDF IFilter and all products which support the IFilter interface including the following:

  • Microsoft SharePoint Server
  • Microsoft Search Server
  • Microsoft SQL Server
  • Microsoft Exchange Server

Desktop PDF Search

TET PDF IFilter can also be used to implement desktop PDF search with Windows Search which is integrated in Windows.

TET PDF IFilter is free for non-commercial use on desktop operating systems, which provides a convenient basis for test and evaluation.