Posted to dev@poi.apache.org by bu...@apache.org on 2013/05/08 15:27:31 UTC
[Bug 54937] New: Strange author table structures in word documents
failing the text extraction entirely.
https://issues.apache.org/bugzilla/show_bug.cgi?id=54937
Bug ID: 54937
Summary: Strange author table structures in word documents
failing the text extraction entirely.
Product: POI
Version: unspecified
Hardware: PC
Status: NEW
Severity: normal
Priority: P2
Component: HWPF
Assignee: dev@poi.apache.org
Reporter: shu.yang@icims.com
Classification: Unclassified
Here's the stack trace of the exception.
Caused by: java.lang.UnsupportedOperationException: Non-extended character Pascal strings are not supported right now. Please, contact POI developers for update.
    at org.apache.poi.hwpf.model.SttbUtils.read(SttbUtils.java:66)
    at org.apache.poi.hwpf.model.SttbUtils.readSttbSavedBy(SttbUtils.java:116)
    at org.apache.poi.hwpf.model.SavedByTable.<init>(SavedByTable.java:53)
    at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:360)
    at org.apache.poi.hwpf.extractor.WordExtractor.<init>(WordExtractor.java:80)
This happens with Tika 1.3.
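The message suggests that the author ("saved by") table in these documents stores its strings as non-extended Pascal strings, meaning one byte per character, not two. To illustrate what reading both forms might look like, here is a minimal sketch. It is not POI code: the class SttbSketch and its read method are hypothetical, and the layout it assumes (an optional 0xFFFF marker, then a 2-byte string count, a 2-byte extra-data size, and length-prefixed strings) is my understanding of the string table format, not something taken from this report. The single-byte charset is also an assumption.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of POI: reads a string table whose entries
// are either 2-byte ("extended") or 1-byte ("non-extended") Pascal strings.
final class SttbSketch {

    static List<String> read(byte[] data) {
        ByteBuffer buf = ByteBuffer.wrap(data).order(ByteOrder.LITTLE_ENDIAN);

        // Assumption: an extended table starts with the marker 0xFFFF.
        boolean extended = (buf.getShort(0) & 0xFFFF) == 0xFFFF;
        if (extended) {
            buf.position(2); // skip the marker
        }

        int count = buf.getShort() & 0xFFFF;    // number of strings
        int extraLen = buf.getShort() & 0xFFFF; // extra bytes after each string

        List<String> result = new ArrayList<>(count);
        for (int i = 0; i < count; i++) {
            // The length prefix counts characters: 2 bytes wide if extended, else 1.
            int chars = extended ? (buf.getShort() & 0xFFFF) : (buf.get() & 0xFF);
            byte[] raw = new byte[extended ? chars * 2 : chars];
            buf.get(raw);
            // Assumption: non-extended strings use a single-byte charset.
            result.add(new String(raw, extended
                    ? StandardCharsets.UTF_16LE
                    : StandardCharsets.ISO_8859_1));
            buf.position(buf.position() + extraLen); // skip per-entry extra data
        }
        return result;
    }

    public static void main(String[] args) {
        // Non-extended table: 2 strings, no extra data, "Ab" and "C".
        byte[] nonExtended = {2, 0, 0, 0, 2, 'A', 'b', 1, 'C'};
        System.out.println(read(nonExtended));
    }
}
```

Judging from the trace, SttbUtils.read appears to handle only the extended case and throws for the other one; a fix along these lines would let the rest of the document be extracted even when the author table uses the older form.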
Thanks!