Posted to commits@stanbol.apache.org by "harish suvarna (JIRA)" <ji...@apache.org> on 2012/08/21 02:21:38 UTC

[jira] [Created] (STANBOL-716) Contenthub UI does not display Chinese characters

harish suvarna created STANBOL-716:
--------------------------------------

             Summary: Contenthub UI does not display Chinese characters 
                 Key: STANBOL-716
                 URL: https://issues.apache.org/jira/browse/STANBOL-716
             Project: Stanbol
          Issue Type: Wish
          Components: Content Hub
    Affects Versions: commons.web.base-0.10.0-incubating
         Environment: Firefox 14.01 Mac 10.6
            Reporter: harish suvarna


Paste some Chinese content into the Contenthub text box and submit it for analysis. The result displays some links, but every character in the resulting form is a question mark. This looks like a font or display issue. To reproduce it you need an engine chain of tika, langdetect (not lang id), and keywordExtraction with the full dbpedia Solr index. The keywordExtraction engine currently does not process Chinese text, but it will if the method (isprocessableLanguage()) returns true for all languages. Also, langdetect returns the language id as zh_cn, while the dbpedia Solr dump uses only zh. All in all, this needs some enhanced engines for Chinese. I am filing this just so that I don't forget.
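
The language id mismatch described above (langdetect reporting zh_cn while the index uses only the bare language code) could be bridged by reducing the detected id to its primary language part before lookup. The following is a minimal sketch under that assumption only; the class LanguageTagNormalizer and the method baseLanguage() are hypothetical names for illustration and are not part of Stanbol or langdetect.

```java
import java.util.Locale;

// Hypothetical helper for illustration; not part of Stanbol.
final class LanguageTagNormalizer {

    private LanguageTagNormalizer() {
    }

    // Reduces a detected language id such as "zh_cn" or "zh-CN" to its
    // primary language part ("zh"). Returns null for null input.
    static String baseLanguage(String detected) {
        if (detected == null) {
            return null;
        }
        // Accept both '_' and '-' as separators and keep only the first part.
        String primary = detected.trim().split("[_-]", 2)[0];
        return primary.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        System.out.println(baseLanguage("zh_cn"));
        System.out.println(baseLanguage("zh-CN"));
        System.out.println(baseLanguage("zh"));
    }
}
```

Dropping the region part loses the distinction between variants such as zh_cn and zh_tw, so this would only fit if the index really stores a single code for all Chinese content.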
