You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/04 00:15:18 UTC

[jira] Created: (NUTCH-261) Multi Language Support

Multi Language Support
----------------------

         Key: NUTCH-261
         URL: http://issues.apache.org/jira/browse/NUTCH-261
     Project: Nutch
        Type: New Feature

  Components: indexer, searcher  
    Versions: 0.7, 0.8-dev, 0.6, 0.7.1, 0.7.2    
    Reporter: Jerome Charron
 Assigned to: Jerome Charron 
     Fix For: 0.8-dev


Add multi-lingual support in Nutch, as described in http://wiki.apache.org/nutch/MultiLingualSupport
The document analysis part is actually implemented, and two analysis plugins (fr and de) are provided for testing (not deployed by default).
The query analysis part is missing for a complete multi-lingual support.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


[jira] Updated: (NUTCH-261) Multi Language Support

Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-261?page=all ]

Sami Siren updated NUTCH-261:
-----------------------------

    Fix Version/s: 0.9
                       (was: 0.8)

> Multi Language Support
> ----------------------
>
>                 Key: NUTCH-261
>                 URL: http://issues.apache.org/jira/browse/NUTCH-261
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer, searcher
>    Affects Versions: 0.7, 0.8, 0.6, 0.7.1, 0.7.2
>            Reporter: Jerome Charron
>         Assigned To: Jerome Charron
>             Fix For: 0.9
>
>         Attachments: query-lang.patch
>
>
> Add multi-lingual support in Nutch, as described in http://wiki.apache.org/nutch/MultiLingualSupport
> The document analysis part is actually implemented, and two analysis plugins (fr and de) are provided for testing (not deployed by default).
> The query analysis part is missing for a complete multi-lingual support.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Updated: (NUTCH-261) Multi Language Support

Posted by "Jerome Charron (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-261?page=all ]

Jerome Charron updated NUTCH-261:
---------------------------------

    Attachment: query-lang.patch

Here is a patch that provides a language dependent analysis of the queries.

If you have activated some language analysis plugins (such as analysis-fr or analysis-de) during indexing, if
these plugins are activated during searching phase, the analyzer corresponding to the browser's language
will be applied: For instance, if you search for the french term "moteurs" it will returns documents containing
"moteur" or "moteurs".

Please notice that if no analyzer plugin is activated, nutch behaviors must be unchanged (backward functional).
There is some well known issues about the summaries (I have planned to solve this very soon).

Thanks for reviewing this patch.
Thanks for your feedback.

Regards

Jérôme

> Multi Language Support
> ----------------------
>
>          Key: NUTCH-261
>          URL: http://issues.apache.org/jira/browse/NUTCH-261
>      Project: Nutch
>         Type: New Feature

>   Components: indexer, searcher
>     Versions: 0.7, 0.8-dev, 0.6, 0.7.1, 0.7.2
>     Reporter: Jerome Charron
>     Assignee: Jerome Charron
>      Fix For: 0.8-dev
>  Attachments: query-lang.patch
>
> Add multi-lingual support in Nutch, as described in http://wiki.apache.org/nutch/MultiLingualSupport
> The document analysis part is actually implemented, and two analysis plugins (fr and de) are provided for testing (not deployed by default).
> The query analysis part is missing for a complete multi-lingual support.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


[jira] Commented: (NUTCH-261) Multi Language Support

Posted by "Jerome Charron (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-261?page=comments#action_12378166 ] 

Jerome Charron commented on NUTCH-261:
--------------------------------------

Since there is no functional regression if no analyzer is activated (it is the default case), if there is no objection, I would like to commit this patch.

> Multi Language Support
> ----------------------
>
>          Key: NUTCH-261
>          URL: http://issues.apache.org/jira/browse/NUTCH-261
>      Project: Nutch
>         Type: New Feature

>   Components: indexer, searcher
>     Versions: 0.7, 0.8-dev, 0.6, 0.7.1, 0.7.2
>     Reporter: Jerome Charron
>     Assignee: Jerome Charron
>      Fix For: 0.8-dev
>  Attachments: query-lang.patch
>
> Add multi-lingual support in Nutch, as described in http://wiki.apache.org/nutch/MultiLingualSupport
> The document analysis part is actually implemented, and two analysis plugins (fr and de) are provided for testing (not deployed by default).
> The query analysis part is missing for a complete multi-lingual support.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira