You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by wku_kunal <wk...@yahoo.com> on 2009/04/14 17:17:52 UTC

Re: Language Identifier plugin

Hello Neera,

Even I was looking for solution for the same problem. I did not find it yet.
Please let me know if you find the solution.

Thanks,
Kunal


Neera wrote:
> 
> Hi,
> 
> I am trying to use LanguageIdentifier plugin for detecting language for
> crawled results and found the following link :
> http://wiki.apache.org/nutch/LanguageIdentifier
> 
> This page mentions some open issues on the lab test benchmark. Since these
> numbers were reported by analyzing results
> from the previous version nutch-0.7, I am curious if these issues have
> been
> fixed in the newer versions (nutch-0.9) ?
> Is there a newer link/thread for the LanguageIdentifier plugin.
> 
> Also this plugin API assumes that the given contents are in UTF-8 format.
> Are the contents of nutch dump file in UTF-8 fomat?
> 
> Thanks and Regards,
> Neera
> 
> 

-- 
View this message in context: http://www.nabble.com/Language-Identifier-plugin-tp22318564p23041507.html
Sent from the Nutch - User mailing list archive at Nabble.com.