Posted to dev@nutch.apache.org by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/05/04 00:15:18 UTC
[jira] Created: (NUTCH-261) Multi Language Support
Multi Language Support
----------------------
Key: NUTCH-261
URL: http://issues.apache.org/jira/browse/NUTCH-261
Project: Nutch
Type: New Feature
Components: indexer, searcher
Versions: 0.7, 0.8-dev, 0.6, 0.7.1, 0.7.2
Reporter: Jerome Charron
Assigned to: Jerome Charron
Fix For: 0.8-dev
Add multi-lingual support to Nutch, as described in http://wiki.apache.org/nutch/MultiLingualSupport
The document analysis part is already implemented, and two analysis plugins (fr and de) are provided for testing (not deployed by default).
The query analysis part is still missing for complete multi-lingual support.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
[jira] Updated: (NUTCH-261) Multi Language Support
Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
[ http://issues.apache.org/jira/browse/NUTCH-261?page=all ]
Sami Siren updated NUTCH-261:
-----------------------------
Fix Version/s: 0.9
(was: 0.8)
[jira] Updated: (NUTCH-261) Multi Language Support
Posted by "Jerome Charron (JIRA)" <ji...@apache.org>.
[ http://issues.apache.org/jira/browse/NUTCH-261?page=all ]
Jerome Charron updated NUTCH-261:
---------------------------------
Attachment: query-lang.patch
Here is a patch that provides language-dependent analysis of queries.
If you activated some language analysis plugins (such as analysis-fr or analysis-de) during indexing, and
these plugins are also activated during the search phase, the analyzer corresponding to the browser's language
will be applied. For instance, if you search for the French term "moteurs", it will return documents containing
"moteur" or "moteurs".
Please note that if no analyzer plugin is activated, Nutch's behavior should be unchanged (backward compatible).
There are some known issues with the summaries (I plan to solve them very soon).
Thanks for reviewing this patch and for your feedback.
Regards
Jérôme
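To illustrate the behavior described in the message above, here is a minimal sketch in Java. It is not the code from query-lang.patch and does not use the Nutch or Lucene APIs. The class name QueryAnalysisSketch, the analyzer registry, the Accept-Language parsing, and the toy French "stemmer" (which only strips a trailing "s") are all assumptions made for illustration. It only shows the idea of picking an analyzer from the browser's language and falling back to the default analysis when no plugin is activated.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.function.Function;

// Illustrative sketch only: hypothetical names, not the Nutch plugin API.
class QueryAnalysisSketch {

    // Hypothetical registry standing in for activated analysis plugins
    // (language code -> analyzer function).
    private final Map<String, Function<String, List<String>>> analyzers = new HashMap<>();

    // Default analysis, used when no language plugin is activated:
    // lowercase the text and split it on whitespace.
    static List<String> defaultAnalyze(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).trim().split("\\s+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    // Toy stand-in for a French analyzer: default analysis plus removal of a
    // trailing plural "s". A real analyzer would use a proper stemmer.
    static List<String> toyFrenchAnalyze(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : defaultAnalyze(text)) {
            boolean plural = t.length() > 3 && t.endsWith("s");
            terms.add(plural ? t.substring(0, t.length() - 1) : t);
        }
        return terms;
    }

    // "Activates" an analyzer for a language code such as "fr" or "de".
    void activate(String lang, Function<String, List<String>> analyzer) {
        analyzers.put(lang, analyzer);
    }

    // Reduces an Accept-Language style value such as "fr-FR,fr;q=0.9,en;q=0.8"
    // to its first primary language code ("fr"). Returns "" if none is given.
    static String primaryLanguage(String acceptLanguage) {
        if (acceptLanguage == null) {
            return "";
        }
        String first = acceptLanguage.trim();
        for (char separator : new char[] {',', ';', '-'}) {
            int cut = first.indexOf(separator);
            if (cut >= 0) {
                first = first.substring(0, cut);
            }
        }
        return first.trim().toLowerCase(Locale.ROOT);
    }

    // Applies the analyzer matching the browser's language, or the default
    // analysis when no matching analyzer has been activated.
    List<String> analyzeQuery(String query, String acceptLanguage) {
        Function<String, List<String>> analyzer = analyzers.get(primaryLanguage(acceptLanguage));
        return analyzer != null ? analyzer.apply(query) : defaultAnalyze(query);
    }

    public static void main(String[] args) {
        QueryAnalysisSketch sketch = new QueryAnalysisSketch();
        // No analyzer activated: the query term is left as typed.
        System.out.println(sketch.analyzeQuery("moteurs", "fr-FR,fr;q=0.9")); // prints [moteurs]
        // French analyzer activated: "moteurs" is reduced to "moteur".
        sketch.activate("fr", QueryAnalysisSketch::toyFrenchAnalyze);
        System.out.println(sketch.analyzeQuery("moteurs", "fr-FR,fr;q=0.9")); // prints [moteur]
    }
}
```

In this sketch, a document indexed with the same toy analyzer would store "moteur" for both "moteur" and "moteurs", which is why a query for "moteurs" could match either form. With no analyzer registered, the query goes through the default path only, mirroring the "unchanged behavior" case.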
[jira] Commented: (NUTCH-261) Multi Language Support
Posted by "Jerome Charron (JIRA)" <ji...@apache.org>.
[ http://issues.apache.org/jira/browse/NUTCH-261?page=comments#action_12378166 ]
Jerome Charron commented on NUTCH-261:
--------------------------------------
Since there is no functional regression when no analyzer is activated (the default case), I would like to commit this patch if there is no objection.