Posted to user@nutch.apache.org by keren nutch <ke...@yahoo.ca> on 2006/03/24 17:29:33 UTC

Problem with nutch-0.7.1.tar.gz

Hi,
 
 I extracted the archive with "tar -xf nutch-0.7.1.tar.gz" and got this message:
 "tar: A lone zero block at 132784"
 
 When I then rebuilt it with 'ant', I got this error:
 srcdir "/home/scott/downloads/nutch-0.7.1/src/plugin/nutch-extensionpoints/src/java" does not exist!
 
 Please help. Thanks,
 Keren
 
				

Re: Problem with nutch-0.7.1.tar.gz

Posted by kauu <ba...@gmail.com>.
Hi all,
  I have a problem now.
After changing nutch/conf/nutch-site.xml as follows:

<property>
 <name>plugin.includes</name>
 <value>nutch-extensionpoints|protocol-http|urlfilter-regex|parse-(msword|pdf|text|html)|index-basic|query-(basic|site|url|jobs)</value>
 <description>Regular expression naming plugin directory names to
 include.  Any plugin not matching this expression is excluded.
 In any case you need at least include the nutch-extensionpoints plugin. By
 default Nutch includes crawling just HTML and plain text via HTTP,
 and basic indexing and search plugins.
 </description>
</property>

I thought I could get the .doc and .pdf files on the web. What surprised
me is that the PDF files are fetched but cannot be parsed (an error is
reported when I run the command "bin/nutch dir -dir crawldir"). So when I
search for PDF files at "localhost:8080", no PDFs show up at all.
  Can anyone help? Any reply will be appreciated!
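
The expression itself can be sanity-checked outside Nutch. The sketch below copies the plugin.includes value from the message and tests a few directory names against it with grep. It assumes the expression has to match the whole directory name (hence grep -x), and 'parse-other' is a made-up name used only to show a non-match. It says nothing about why parsing fails at crawl time.

```shell
#!/bin/sh
# The plugin.includes value from the message above, copied verbatim.
PLUGIN_INCLUDES='nutch-extensionpoints|protocol-http|urlfilter-regex|parse-(msword|pdf|text|html)|index-basic|query-(basic|site|url|jobs)'

# Assumption: the expression must match the entire directory name,
# which grep expresses with -x (whole-line match) on top of -E (extended regex).
plugin_included() {
    printf '%s\n' "$1" | grep -E -x -q -- "$PLUGIN_INCLUDES"
}

# 'parse-other' is a hypothetical name, included only as a non-matching case.
for name in parse-pdf parse-msword parse-other; do
    if plugin_included "$name"; then
        echo "$name: included"
    else
        echo "$name: excluded"
    fi
done
```

With this value, parse-pdf and parse-msword match the expression, so the plugin selection is at least spelled correctly.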
--
www.babatu.com

Re: Problem with nutch-0.7.1.tar.gz

Posted by kauu <ba...@gmail.com>.
Hi all,
  I found the answer to the problem of garbled search results for
Chinese queries.
  I use Tomcat, so I added one line to ~\Tomcat 5.0\conf\server.xml.
After the change, the Connector looks like this:
   <Connector port="8080"
               maxThreads="150" minSpareThreads="25" maxSpareThreads="75"
               enableLookups="false" redirectPort="8443" acceptCount="100"
               debug="0" connectionTimeout="20000"
               useBodyEncodingForURI="true"
               disableUploadTimeout="true" />

The line useBodyEncodingForURI="true" is the one I added.

After restarting Tomcat, I get the correct results.
I hope someone in my situation will be happy to see this in the morning.







--
www.babatu.com

Re: Problem with nutch-0.7.1.tar.gz

Posted by kauu <ba...@gmail.com>.
I did exactly what you said, and it works well. Thanks so much!




--
www.babatu.com

Re: Problem with nutch-0.7.1.tar.gz

Posted by kauu <ba...@gmail.com>.
Thanks, I'll try it.



--
www.babatu.com

Re: Problem with nutch-0.7.1.tar.gz

Posted by TDLN <di...@gmail.com>.
Just create the directory
'/home/scott/downloads/nutch-0.7.1/src/plugin/nutch-extensionpoints/src/java'
and run ant again.

Regards, Thomas
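
The two steps can be sketched as a small shell script. This is only an illustration: NUTCH_HOME is a hypothetical variable that defaults to ./nutch-0.7.1 and should point at wherever the archive was extracted (in the original report, /home/scott/downloads/nutch-0.7.1). The build is re-run only if ant and a build.xml are actually found there.

```shell
#!/bin/sh
# Hypothetical variable: set NUTCH_HOME to your extracted Nutch tree.
NUTCH_HOME="${NUTCH_HOME:-./nutch-0.7.1}"
SRC_DIR="$NUTCH_HOME/src/plugin/nutch-extensionpoints/src/java"

# -p creates any missing parent directories and does nothing
# if the directory already exists.
mkdir -p "$SRC_DIR"

# Re-run the build only when ant and a build file are present.
if command -v ant >/dev/null 2>&1 && [ -f "$NUTCH_HOME/build.xml" ]; then
    (cd "$NUTCH_HOME" && ant)
else
    echo "created $SRC_DIR; run 'ant' from $NUTCH_HOME to rebuild"
fi
```

For the setup in the original report, it could be run as NUTCH_HOME=/home/scott/downloads/nutch-0.7.1 sh fix-srcdir.sh, where fix-srcdir.sh is whatever name the script was saved under.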


