Posted to commits@stanbol.apache.org by "harish suvarna (JIRA)" <ji...@apache.org> on 2012/08/21 02:21:38 UTC
[jira] [Created] (STANBOL-716) Contenthub UI does not display Chinese characters
harish suvarna created STANBOL-716:
--------------------------------------
Summary: Contenthub UI does not display Chinese characters
Key: STANBOL-716
URL: https://issues.apache.org/jira/browse/STANBOL-716
Project: Stanbol
Issue Type: Wish
Components: Content Hub
Affects Versions: commons.web.base-0.10.0-incubating
Environment: Firefox 14.01 Mac 10.6
Reporter: harish suvarna
Put some Chinese content into the Contenthub text box and submit the text for analysis. It displays some links, but all characters in the resulting form are question marks. This looks like a font display issue. To test this you need an engine chain of tika, langdetect (not lang id), and keywordExtraction with the full dbpedia Solr index. The keywordExtraction engine currently does not process Chinese text, but it will if the method (isprocessableLanguage()) returns true for all languages. Also, langdetect returns the language id as zh_cn, while the dbpedia Solr dump uses only zh. All in all, it needs some enhanced engines for Chinese. I am filing this just so that I don't forget.
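The mismatch described above between the langdetect id (zh_cn) and the shorter code used by the dbpedia Solr dump could be bridged by reducing a detected tag to its base language before lookup. The following is only a minimal sketch under that assumption: the class LanguageCodes and the method toBaseLanguage are hypothetical names for illustration and are not part of Stanbol or langdetect.

```java
import java.util.Locale;

// Hypothetical helper, not a Stanbol API: reduces a detected language tag
// such as "zh_cn" or "zh-TW" to its base language code ("zh").
final class LanguageCodes {

    private LanguageCodes() {
        // static utility, no instances
    }

    static String toBaseLanguage(String tag) {
        if (tag == null) {
            return null;
        }
        String trimmed = tag.trim();
        // Find the first region separator; both '_' and '-' occur in practice.
        int cut = -1;
        for (int i = 0; i < trimmed.length(); i++) {
            char c = trimmed.charAt(i);
            if (c == '_' || c == '-') {
                cut = i;
                break;
            }
        }
        String base = cut < 0 ? trimmed : trimmed.substring(0, cut);
        // An empty base (blank input or a tag like "_cn") carries no language.
        return base.isEmpty() ? null : base.toLowerCase(Locale.ROOT);
    }
}
```

With this sketch, LanguageCodes.toBaseLanguage("zh_cn") yields "zh", which could then be compared against the language codes present in the index. Dropping the region also discards the distinction between zh_cn and zh_tw, so this only fits where the index does not separate the two.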
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira