Text Miner failed to index PDF files

Text Miner failed to index PDF files

book

Article ID: KB0079298

calendar_today

Updated On:

Products Versions
Spotfire Statistica 13.3.0 and higher

Description

Statistica Text Miner failed to index PDF files. Below error message appears when user tries to index one or more PDF documents: 
"Failed to index document: xxxxxx.pdf".

Issue/Introduction

Text Miner failed to index PDF files

Environment

Windows

Resolution

This error is due to the missing Text Miner sub-component that is used to convert PDF documents to text (command-line tool called "Xpdf"). This component was removed from the distribution 13.3.0 and higher versions due to TIBCO GPL policy. 

Workaround

User can follow below instructions to manually add the Xpdf tool to Statistica to resolve. 
1. Download "Xpdf command line tools" for Windows: https://www.xpdfreader.com/download.html

User-added image


2. Unzip the downloaded file and find the tool pdftotext.exe

User-added image

3. Copy pdftotext.exe and place it into "Support\xpdf" subfolder of the Statistica installation folder.

User-added image