You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2012/08/22 16:20:42 UTC
[jira] [Created] (STANBOL-719) Change from Langid to the Langdetect
engine as default for Language detection
Rupert Westenthaler created STANBOL-719:
-------------------------------------------
Summary: Change from Langid to the Langdetect engine as default for Language detection
Key: STANBOL-719
URL: https://issues.apache.org/jira/browse/STANBOL-719
Project: Stanbol
Issue Type: Sub-task
Reporter: Rupert Westenthaler
Assignee: Rupert Westenthaler
After looking at the documentation and supported languages of both I think that we should switch from the LangId Engine (based on Apache Tika Language detection) to the Langdetect Engine (based on http://code.google.com/p/language-detection/).
Normal users should not notice any difference as both engines create the same Annotations. However the later supports considerable more languages.
This change will come along with a lot of changes in the integration tests as those check on a lot of places for the LangId Engine. Those need to be changed to the Langdetect Engine.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (STANBOL-719) Change from Langid to the
Langdetect engine as default for Language detection
Posted by "Rupert Westenthaler (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/STANBOL-719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rupert Westenthaler resolved STANBOL-719.
-----------------------------------------
Resolution: Fixed
applied with revision 1376385
> Change from Langid to the Langdetect engine as default for Language detection
> -----------------------------------------------------------------------------
>
> Key: STANBOL-719
> URL: https://issues.apache.org/jira/browse/STANBOL-719
> Project: Stanbol
> Issue Type: Sub-task
> Reporter: Rupert Westenthaler
> Assignee: Rupert Westenthaler
>
> After looking at the documentation and supported languages of both I think that we should switch from the LangId Engine (based on Apache Tika Language detection) to the Langdetect Engine (based on http://code.google.com/p/language-detection/).
> Normal users should not notice any difference as both engines create the same Annotations. However the later supports considerable more languages.
> This change will come along with a lot of changes in the integration tests as those check on a lot of places for the LangId Engine. Those need to be changed to the Langdetect Engine.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira