Posted to dev@nutch.apache.org by lujinhong <lu...@yahoo.com> on 2015/02/24 16:05:52 UTC
questions about the webui packages
Hi, all.
I found some code in the package “org.apache.nutch.webui” in the Nutch source.
What is this code for? Can I configure and run Nutch in a web UI mode? Is there a tutorial?
thanks.
lujinhong
Re: questions about the webui packages
Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
Yep, Seb, that’s right.
I have a student (Sujeh Shah) at USC working on a
Nutch 1.x REST API, with the goal of eventually
making D3 visualizations of crawl graphs and
seeing what’s going on in a crawl while it’s
happening! :)
We are working on Wiki pages and have some patches
coming that build on the 2.x work.
Cheers,
Chris
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW: http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
-----Original Message-----
From: Sebastian Nagel <wa...@googlemail.com>
Reply-To: "dev@nutch.apache.org" <de...@nutch.apache.org>
Date: Tuesday, February 24, 2015 at 11:25 AM
To: "dev@nutch.apache.org" <de...@nutch.apache.org>
Subject: Re: questions about the webui packages
>Hi,
>
>yes, there is a Nutch server providing a REST API
>and a web app client to run Nutch (a result of our
>participation in GSoC 2014, by Fjodor Vershinin).
>There are some limitations:
>- only 2.x for now (please, follow NUTCH-1040 for a 1.x port)
>- not complete (e.g., cannot configure a crawl)
>
>For more details, see
>- https://wiki.apache.org/nutch/NutchRESTAPI
>- https://wiki.apache.org/nutch/bin/nutch%20nutchserver
>- https://wiki.apache.org/nutch/bin/nutch%20webapp
>- http://lucene.472066.n3.nabble.com/Running-Crawls-via-REST-API-td4159019.html
>- https://issues.apache.org/jira/browse/NUTCH-880
>- https://issues.apache.org/jira/browse/NUTCH-1040
>
>Best,
>Sebastian
Re: questions about the webui packages
Posted by Sebastian Nagel <wa...@googlemail.com>.
Hi,
yes, there is a Nutch server providing a REST API
and a web app client to run Nutch (a result of our
participation in GSoC 2014, by Fjodor Vershinin).
There are some limitations:
- only 2.x for now (please, follow NUTCH-1040 for a 1.x port)
- not complete (e.g., cannot configure a crawl)
For more details, see
- https://wiki.apache.org/nutch/NutchRESTAPI
- https://wiki.apache.org/nutch/bin/nutch%20nutchserver
- https://wiki.apache.org/nutch/bin/nutch%20webapp
- http://lucene.472066.n3.nabble.com/Running-Crawls-via-REST-API-td4159019.html
- https://issues.apache.org/jira/browse/NUTCH-880
- https://issues.apache.org/jira/browse/NUTCH-1040
Best,
Sebastian
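
For illustration only, here is a minimal sketch of how a client might prepare a job request for the Nutch server described above. The wiki page names suggest the server and web app are started with "bin/nutch nutchserver" and "bin/nutch webapp". Everything else here is an assumption to be checked against the NutchRESTAPI wiki page: the base URL and port, the "/job/create" path, the JSON field names, and the helper name build_job_request (hypothetical).

```python
import json
from urllib.request import Request

# Assumption: address of a locally running Nutch server; adjust as needed.
BASE_URL = "http://localhost:8081"


def build_job_request(crawl_id, job_type, args=None, base_url=BASE_URL):
    """Build (but do not send) a POST request asking the server to create a job.

    The endpoint path and payload field names are assumptions, not taken
    from this thread; verify them against the NutchRESTAPI wiki page.
    """
    payload = {
        "crawlId": crawl_id,   # assumed field: identifier of the crawl
        "type": job_type,      # assumed field: e.g. "INJECT", "FETCH"
        "confId": "default",   # assumed field: which configuration to use
        "args": args or {},    # assumed field: job-specific arguments
    }
    return Request(
        base_url + "/job/create",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_job_request("crawl-01", "INJECT", {"seedDir": "urls"})
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Passing the returned object to urllib.request.urlopen would actually send it. The sketch stops short of that so it can be run without a server.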