You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "阿里木 (Jira)" <ji...@apache.org> on 2020/06/23 07:36:00 UTC

[jira] [Created] (TIKA-3123) request to parse Chinese, but return Russian

阿里木 created TIKA-3123:
-------------------------

             Summary: request to parse Chinese, but return Russian
                 Key: TIKA-3123
                 URL: https://issues.apache.org/jira/browse/TIKA-3123
             Project: Tika
          Issue Type: Bug
    Affects Versions: 1.24.1
            Reporter: 阿里木


Try to parse html text containing Chinese:
{code:java}
 <span>被</span>{code}
tika-server 返回的解析结果是:
{code:java}
<p>швл</p>
{code}
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)