You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hivemall.apache.org by "yangjd (JIRA)" <ji...@apache.org> on 2017/06/30 03:26:00 UTC

[jira] [Resolved] (HIVEMALL-122) Added tokenize_cn UDF based upon SmartChineseAnalyzer

     [ https://issues.apache.org/jira/browse/HIVEMALL-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

yangjd resolved HIVEMALL-122.
-----------------------------
    Resolution: Fixed

> Added tokenize_cn UDF based upon SmartChineseAnalyzer
> -----------------------------------------------------
>
>                 Key: HIVEMALL-122
>                 URL: https://issues.apache.org/jira/browse/HIVEMALL-122
>             Project: Hivemall
>          Issue Type: New Feature
>            Reporter: yangjd
>
> Support word segmentation for Simplified Chinese text based upon [org.apache.lucene.analysis.cn.smart.SmartChineseAnalyzer|http://lucene.apache.org/core/5_3_1/analyzers-smartcn/org/apache/lucene/analysis/cn/smart/SmartChineseAnalyzer.html]



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)