Posted to dev@tika.apache.org by "Ken Krugler (JIRA)" <ji...@apache.org> on 2013/02/28 17:39:13 UTC

[jira] [Closed] (TIKA-1091) Class LanguageIdentifier wrongly detecting the English language sentence

     [ https://issues.apache.org/jira/browse/TIKA-1091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ken Krugler closed TIKA-1091.
-----------------------------

    Resolution: Duplicate
      Assignee: Ken Krugler

Poor language detection results, especially for short text snippets, are a known issue in Tika; see TIKA-369.
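
To illustrate why short snippets are hard, here is a minimal sketch that counts the character trigrams in the reported sentence. It assumes the detector compares character n-gram frequencies against per-language profiles, which this message does not state; TrigramCount and trigrams are hypothetical names for illustration and are not part of Tika.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

class TrigramCount {
    // Hypothetical helper, not a Tika API: counts overlapping
    // 3-character windows in the lowercased input.
    static Map<String, Integer> trigrams(String text) {
        String s = text.toLowerCase(Locale.ROOT);
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (int i = 0; i + 3 <= s.length(); i++) {
            counts.merge(s.substring(i, i + 3), 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // The sentence from the report; any short string behaves similarly.
        Map<String, Integer> counts = trigrams("Text content as String");
        int total = 0;
        for (int c : counts.values()) {
            total += c;
        }
        System.out.println("windows: " + total + ", distinct: " + counts.size());
    }
}
```

A 22-character sentence yields only 20 trigram windows, so under this assumption any frequency comparison rests on very little evidence.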
                
> Class LanguageIdentifier wrongly detecting the English language sentence
> ------------------------------------------------------------------------
>
>                 Key: TIKA-1091
>                 URL: https://issues.apache.org/jira/browse/TIKA-1091
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Sudhakar Vankamamidi
>            Assignee: Ken Krugler
>
> The class org.apache.tika.language.LanguageIdentifier wrongly detects the language of the English sentence "Text content as String", which is provided as part of the sample documentation.
