You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by Apache Wiki <wi...@apache.org> on 2005/05/21 00:46:21 UTC
[Nutch Wiki] Update of "LanguageIdentifierPlugin" by JeromeCharron
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by JeromeCharron:
http://wiki.apache.org/nutch/LanguageIdentifierPlugin
The comment on the change is:
A draft version. More to come
New page:
* plugin name: languageidentifier
* plugin version: none
* provider: SamiSiren, JeromeCharron
* plugin home url: LanguageIdentifierPlugin
* plugin download url: Included with nutch source distribution
* license: Same as Nutch
* short description: Analyzer plugin that identifies the language of documents.
* long description:
* configureable parameters: lang.ngram.min.length, lang.ngram.max.length, lang.analyze.max.length
* meta data added to index: lang
* required jars:
* plugin extension points:
* plugin extension point interface:
* plugin extension point xml snippet: