You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "saravana kumar.r" <02...@gmail.com> on 2007/08/13 09:59:36 UTC

nutch plugin-analyser language identifier

hai all,

i am working on nutch plugin to plug an language analyzer for tamil
(analysis-ta) and i have done the steps for writing plugin nutch and from
the forum i got info that i have to create the ngram profile for tamil
language .......
where i have to execute NGramProfile class to create some NGrmProfiles (ngp
files)

a) place where the NGrameProfile class file is located   or
b) inside the nutch directory or
c) place where NGramProfile.java is located

where i have to do this ???\


very urgent

with regards
saran

Re: nutch plugin-analyser language identifier

Posted by Jérôme Charron <je...@shopreflex.com>.
The best way is to use the following command:
nutch plugin language-identifier
org.apache.nutch.analysis.lang.NGramProfile-create ta sample_ta.txt
UTF8

Best regards

Jérôme


On 8/18/07, k.g.kumare san <ku...@gmail.com> wrote:
>
> dear sagar,
>
>
>                 thank you for your kind information and it really
> helped us.I need to
>
> know   how to create a new ngp file for tamil.Please send me the
> details as soon
>
> as possible.
>
> i try to set the classpath for ngp file for tamil.But i have the
> following errors
>
> as  and i run this command in
> ...../nutch0.9      the nutch directory for creating new ngp file
>
> java -classpath build/language-identifier/language-
>
> identifier.jar:build/language-identifier/classes/org/apache/nutch/analysis/lang/NGramProfile
> -create ta
> sample.ta.utf8.txt UTF8 and i  got the error as follows
>
> Exception in thread "main" java.lang.NoClassDefFoundError: loaded class
> NGramProfile was in fact named org.apache.nutch.analysis.lang.NGramProfile
>   at java.lang.VMClassLoader.defineClass(libgcj.so.7)
>   at java.lang.ClassLoader.defineClass(libgcj.so.7)
>   at java.security.SecureClassLoader.defineClass(libgcj.so.7)
>   at java.net.URLClassLoader.findClass (libgcj.so.7)
>   at java.lang.ClassLoader.loadClass(libgcj.so.7)
>   at java.lang.ClassLoader.loadClass(libgcj.so.7)
>   at java.lang.Class.forName(libgcj.so.7)
>   at gnu.java.lang.MainThread.run(libgcj.so.7)
>
>
> and i am not able to figure out that can u please help me .....
>



-- 
Jérôme Charron
Directeur Technique @ WebPulse
Tel: +33673716743 - jerome.charron@shopreflex.com
http://blog.shopreflex.com/
Tous les goûts sont dans la nature, les vôtres sont sur
http://www.shopreflex.com

Re: nutch plugin-analyser language identifier

Posted by "k.g.kumare san" <ku...@gmail.com>.
dear sagar,


                thank you for your kind information and it really
helped us.I need to

know   how to create a new ngp file for tamil.Please send me the
details as soon

as possible.

 i try to set the classpath for ngp file for tamil.But i have the
following errors

as  and i run this command in
...../nutch0.9      the nutch directory for creating new ngp file

java -classpath build/language-identifier/language-
identifier.jar:build/language-identifier/classes/org/apache/nutch/analysis/lang/NGramProfile
-create ta
sample.ta.utf8.txt UTF8 and i  got the error as follows

Exception in thread "main" java.lang.NoClassDefFoundError: loaded class
NGramProfile was in fact named org.apache.nutch.analysis.lang.NGramProfile
  at java.lang.VMClassLoader.defineClass(libgcj.so.7)
  at java.lang.ClassLoader.defineClass(libgcj.so.7)
  at java.security.SecureClassLoader.defineClass(libgcj.so.7)
  at java.net.URLClassLoader.findClass (libgcj.so.7)
  at java.lang.ClassLoader.loadClass(libgcj.so.7)
  at java.lang.ClassLoader.loadClass(libgcj.so.7)
  at java.lang.Class.forName(libgcj.so.7)
  at gnu.java.lang.MainThread.run(libgcj.so.7)


and i am not able to figure out that can u please help me .....

Re: nutch plugin-analyser language identifier

Posted by Sagar Naik <sa...@visvo.com>.
Hey Saravana,

The new NGP file shoud be kept in the place where NGramProfile.java is 
placed i.e where all other ngp files are present.
The ant script will then copy all the .class , ngp files, 
langmappings.properties file in the language-identifier.jar


- Sagar Naik

saravana kumar.r wrote:
> hai all,
>
> i am working on nutch plugin to plug an language analyzer for tamil
> (analysis-ta) and i have done the steps for writing plugin nutch and from
> the forum i got info that i have to create the ngram profile for tamil
> language .......
> where i have to execute NGramProfile class to create some NGrmProfiles (ngp
> files)
>
> a) place where the NGrameProfile class file is located   or
> b) inside the nutch directory or
> c) place where NGramProfile.java is located
>
> where i have to do this ???\
>
>
> very urgent
>
> with regards
> saran
>
>   


-- 
Always vizz it us @ visvo.com


-- 
This message has been scanned for viruses and
dangerous content and is believed to be clean.