You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Robert Muir (JIRA)" <ji...@apache.org> on 2009/11/01 19:47:59 UTC

[jira] Commented: (LUCENE-2023) Improve performance of SmartChineseAnalyzer

    [ https://issues.apache.org/jira/browse/LUCENE-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772348#action_12772348 ] 

Robert Muir commented on LUCENE-2023:
-------------------------------------

Question, the smartcn internals are pkg private (and marked experimental to boot),
I'd like to keep this clean and theres some unused stuff that could now be deprecated or removed.

should this be 3.0 or 3.1? should i deprecate or clean house (since its experimental and pkg private)?

Thanks!

> Improve performance of SmartChineseAnalyzer
> -------------------------------------------
>
>                 Key: LUCENE-2023
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2023
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/analyzers
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.0
>
>         Attachments: LUCENE-2023.patch, LUCENE-2023.patch, LUCENE-2023.patch, LUCENE-2023.patch, LUCENE-2023.patch
>
>
> I've noticed SmartChineseAnalyzer is a bit slow, compared to say CJKAnalyzer on chinese text.
> This patch improves the internal hhmm implementation. 
> Time to index my chinese corpus is 75% of the previous time.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org