Posted to commits@tika.apache.org by Apache Wiki <wi...@apache.org> on 2016/11/10 19:34:18 UTC
[Tika Wiki] Update of "Troubleshooting Tika" by TimothyAllison
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "Troubleshooting Tika" page has been changed by TimothyAllison:
https://wiki.apache.org/tika/Troubleshooting%20Tika?action=diff&rev1=11&rev2=12
If that shows the same problem, it's a PDFBox bug. Please [[http://pdfbox.apache.org/support.html|file an Apache PDFBox bug report]] and attach at least one failing file to the bug. Once that is fixed, Tika will pick up the new PDFBox release and get the fix.
- If PDFBox !ExtractText works fine, it's likely a Tika bug. Please [[http://tika.apache.org/contribute.html|report an Apache Tika bug]], attach at least one failing file, and mention that PDFBox !ExtractText doesn't have the issue.
+ If PDFBox !ExtractText works fine, it may* be a Tika bug. Please [[http://tika.apache.org/contribute.html|report an Apache Tika bug]], attach at least one failing file, and mention that PDFBox !ExtractText doesn't have the issue.
+ *PDFBox's ExtractText does not pull text from Annotations or Acroforms, so a problem that ExtractText does not encounter may still come from a PDFBox bug in Annotations or Acroforms; it might be a bug in Tika, too. When in doubt, ask.
+