You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2020/08/19 13:30:00 UTC
[jira] [Commented] (TIKA-3174) tika解析ofd文档时,除了正文内容外,还出现了多余的数字。
[ https://issues.apache.org/jira/browse/TIKA-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17180550#comment-17180550 ]
Tim Allison commented on TIKA-3174:
-----------------------------------
Can you attach an example file?
> tika解析ofd文档时,除了正文内容外,还出现了多余的数字。
> -------------------------------
>
> Key: TIKA-3174
> URL: https://issues.apache.org/jira/browse/TIKA-3174
> Project: Tika
> Issue Type: Bug
> Reporter: 天空
> Priority: Major
>
> ofd文档中正文内容:各地各单位要高度重视、精心谋划,确保活动取得实效。
> 但通过tika解析ofd文档的内容如下:
> 0 0 262.46683 371.12244 16 23 16 -4- 2618 3430 2618 2443 1411 16311 20750 5340 18435 16380 573 13044 5625 16961 2120 952 11940 1555 9073 2270 2572 5581 4564 7038 574 9073 2270 13577 各地各单位要高度重视、精心谋划,确保活动取得实效。
>
> 故障:除了正文内容外,多出来了很多数字。
--
This message was sent by Atlassian Jira
(v8.3.4#803005)