You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by Apache Wiki <wi...@apache.org> on 2016/11/10 19:34:18 UTC

[Tika Wiki] Update of "Troubleshooting Tika" by TimothyAllison

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.

The "Troubleshooting Tika" page has been changed by TimothyAllison:
https://wiki.apache.org/tika/Troubleshooting%20Tika?action=diff&rev1=11&rev2=12

  
  If that shows the same problem, it's a PDFBox bug. Please [[http://pdfbox.apache.org/support.html|file an Apache PDFBox bug report]] and attach at least one failing file to the bug. When that gets fixed, Tika will pick up the new release and will get the fix
  
- If PDFBox !ExtractText works fine, it's likely a Tika bug. Please [[http://tika.apache.org/contribute.html|report an Apache Tika bug]], attach at least one failing file, and mention that PDFBox !ExtractText doesn't have the issue.
+ If PDFBox !ExtractText works fine, it may* be a Tika bug. Please [[http://tika.apache.org/contribute.html|report an Apache Tika bug]], attach at least one failing file, and mention that PDFBox !ExtractText doesn't have the issue.  
  
+ *PDFBox's ExtractText does not pull text from Annotations or Acroforms, so it is possible that a problem not encountered by PDFBox's ExtractText reveals a bug in Annotations or Acroforms; might be a bug in Tika, too.  When in doubt, ask.
+