You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2020/08/19 13:30:00 UTC

[jira] [Commented] (TIKA-3174) tika解析ofd文档时,除了正文内容外,还出现了多余的数字。

    [ https://issues.apache.org/jira/browse/TIKA-3174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17180550#comment-17180550 ] 

Tim Allison commented on TIKA-3174:
-----------------------------------

Can you attach an example file?

> tika解析ofd文档时,除了正文内容外,还出现了多余的数字。
> -------------------------------
>
>                 Key: TIKA-3174
>                 URL: https://issues.apache.org/jira/browse/TIKA-3174
>             Project: Tika
>          Issue Type: Bug
>            Reporter: 天空
>            Priority: Major
>
> ofd文档中正文内容:各地各单位要高度重视、精心谋划,确保活动取得实效。
> 但通过tika解析ofd文档的内容如下:
> 0 0 262.46683 371.12244 16 23 16 -4- 2618 3430 2618 2443 1411 16311 20750 5340 18435 16380 573 13044 5625 16961 2120 952 11940 1555 9073 2270 2572 5581 4564 7038 574 9073 2270 13577 各地各单位要高度重视、精心谋划,确保活动取得实效。
>  
> 故障:除了正文内容外,多出来了很多数字。



--
This message was sent by Atlassian Jira
(v8.3.4#803005)