Posted to dev@nutch.apache.org by Ivan Vershinin <iv...@vershinin.net> on 2013/05/19 21:30:58 UTC

Re: GSOC 2013 project: Apache-Wicket based Nutch webapp

Hello!
I am a student from Estonia (Tartu University). I want to participate in GSoC
2013, and I selected your project because I have experience in Java and
Wicket.
Can you give me some advice on where I can start my investigations?
Best regards,
Ivan Vershinin

Re: GSOC 2013 project: Apache-Wicket based Nutch webapp

Posted by "Mattmann, Chris A (398J)" <ch...@jpl.nasa.gov>.
Hi Tejas,

I was actually not thinking that this was a project for the Nutch
Admin GUI, but for the actual search web app, which is no longer present.

But the Admin GUI would be icing too!

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++








Re: GSOC 2013 project: Apache-Wicket based Nutch webapp

Posted by Tejas Patil <te...@gmail.com>.
@dev,

I just realised that the images on the wiki page [0] are missing. Those
were displayed as external images from
101tec.com <http://101tec.com/wp-content/themes/101tec/images/instanceNew.jpg>,
which is down. Is there any other place where those images might still be
present?

[0] : http://wiki.apache.org/nutch/NutchAdministrationUserInterface



Re: GSOC 2013 project: Apache-Wicket based Nutch webapp

Posted by Tejas Patil <te...@gmail.com>.
This will help you get an idea of what is needed:
http://wiki.apache.org/nutch/NutchAdministrationUserInterface

REST API in Nutch (the JIRA comments and the patch will help you here):
https://issues.apache.org/jira/browse/NUTCH-880

Also, it's worth investing some time in getting to know Nutch.
This is an old paper by Doug Cutting on Nutch:
http://www.master.netseven.it/files/262-Nutch.pdf

Here is a video of a presentation by Julien @ Lucene Eurocon last year:
http://vimeopro.com/user11514798/apache-lucene-eurocon-2012/video/55566234

After that, roll up your sleeves, get the source code and start off
crawling. These are the relevant tutorials:
http://wiki.apache.org/nutch/NutchTutorial
http://wiki.apache.org/nutch/Nutch2Tutorial

Also, you will find some configuration and feature-centric documentation on
the wiki pages. Here is the wiki main page:
http://wiki.apache.org/nutch/

I think that your work would be a great contribution to Nutch. Looking
forward to seeing this feature in the next release cycle.

Thanks,
Tejas Patil



Re: GSOC 2013 project: Apache-Wicket based Nutch webapp

Posted by "Mattmann, Chris A (398J)" <ch...@jpl.nasa.gov>.
Thanks Ivan.

I commented on JIRA too. Unfortunately, the deadline has passed
for student submissions to GSoC.

But you are free to work on the project regardless, just not through
GSoC.

Cheers,
Chris
