You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Suneel Marthi (JIRA)" <ji...@apache.org> on 2013/12/01 13:12:38 UTC

[jira] [Commented] (MAHOUT-1367) WikipediaXmlSplitter --> Exception in thread "main" java.lang.NullPointerException

    [ https://issues.apache.org/jira/browse/MAHOUT-1367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13836010#comment-13836010 ] 

Suneel Marthi commented on MAHOUT-1367:
---------------------------------------

The original issue that's been reported could have been caused by a wrong path for the input file and the present implementation doesn't check if the input file exists before trying to read from the file which throws the reported stack trace.  Committing a patch to validate that the input file path exists before trying to read from the file.

> WikipediaXmlSplitter --> Exception in thread "main" java.lang.NullPointerException
> ----------------------------------------------------------------------------------
>
>                 Key: MAHOUT-1367
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1367
>             Project: Mahout
>          Issue Type: Bug
>    Affects Versions: 0.8
>         Environment: ubuntu
>            Reporter: ollagnier
>            Assignee: Suneel Marthi
>            Priority: Minor
>             Fix For: 0.9
>
>         Attachments: MAHOUT-1367.patch
>
>
> Hi !
> When I run this command : $MAHOUT_HOME/bin/mahout org.apache.mahout.text.wikipedia.WikipediaXmlSplitter -d frwiki-latest-pages-articles.xml.bz2 -o wikipedia-xml-chunks -c 100
> I have this error :
> MAHOUT-JOB: /home/ollagnier/Documents/tools/mahout-distribution-0.8/bin/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar
> 13/11/29 14:51:08 WARN driver.MahoutDriver: No org.apache.mahout.text.wikipedia.WikipediaXmlSplitter.props found on classpath, will use command-line arguments only
> 13/11/29 14:51:08 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
> Exception in thread "main" java.lang.NullPointerException
> 	at org.apache.hadoop.io.compress.bzip2.Bzip2Factory.isNativeBzip2Loaded(Bzip2Factory.java:54)
> 	at org.apache.hadoop.io.compress.bzip2.Bzip2Factory.getBzip2Decompressor(Bzip2Factory.java:131)
> 	at org.apache.hadoop.io.compress.BZip2Codec.createDecompressor(BZip2Codec.java:250)
> 	at org.apache.hadoop.io.compress.BZip2Codec.createInputStream(BZip2Codec.java:156)
> 	at org.apache.mahout.text.wikipedia.WikipediaXmlSplitter.main(WikipediaXmlSplitter.java:190)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:606)
> 	at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
> 	at org.apache.hadoop.util.ProgramDriver.run(ProgramDriver.java:144)
> 	at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:152)
> 	at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:606)
> 	at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
> And I have a little trouble seeing where it comes from
> Thank you for your help



--
This message was sent by Atlassian JIRA
(v6.1#6144)