You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by Apache Wiki <wi...@apache.org> on 2018/12/11 17:24:56 UTC

[Tika Wiki] Update of "VirtualMachine" by TimothyAllison

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.

The "VirtualMachine" page has been changed by TimothyAllison:
https://wiki.apache.org/tika/VirtualMachine?action=diff&rev1=25&rev2=26

  
  4. cat xpdf-tools-linux-4.00/doc/sample-xpdfrc tmp_xpdfrc >> /usr/local/etc/xpdfrc
  
+ ''NOTE:'' We found that ''pdftotext'' was not correctly reading the ''xpdfrc'' file in this location.  We found no differences in extracted text when we removed the ''xpdfrc'' file and when we had it there.  We did find a difference, especially in CJK PDFs, when we specified the ''xpdfrc'' file from the commandline with the ''-cfg'' option.
+ 
  
  == Other data ==
  See ApacheTikaHtmlEncodingStudy for a description of gathering data for TIKA-2038.