You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Andrzej Bialecki <ab...@getopt.org> on 2010/07/02 12:18:59 UTC

[Nutchbase] WebPage class is a generated code?

Hi,

(This question is mostly to Dogacan & Enis, but I encourage anyone 
familiar with the code to join the threads with [Nutchbase] - the sooner 
the better ;) ).

I'm looking at src/gora/webpage.avsc and WebPage.java & friends... 
presumably the java code was autogenerated from avsc using Gora? If so, 
we should put this autogeneration step in our build.xml. Or am I missing 
something?

-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Re: [Nutchbase] WebPage class is a generated code?

Posted by Andrzej Bialecki <ab...@getopt.org>.
On 2010-07-02 12:24, Julien Nioche wrote:
>>
>> (This question is mostly to Dogacan&  Enis, but I encourage anyone familiar
>> with the code to join the threads with [Nutchbase] - the sooner the better
>> ;) ).
>>
>> I'm looking at src/gora/webpage.avsc and WebPage.java&  friends...
>> presumably the java code was autogenerated from avsc using Gora? If so, we
>> should put this autogeneration step in our build.xml. Or am I missing
>> something?
>>
>>
> correct. if we keep the generated java classes in svn then we probably want
> to make this task optional i.e. it would not be done as part of the build
> tasks OR we can add it to the build but remove it from svn (or better add to
> svn ignore or whatever-it-is-called).

There was a similar situation with NutchAnalysis.jj ... I think that 
task used to detect if the generated classes were missing and it would 
regenerate them. We should do something like that too, and have a 
specific clean target to remove the generated artifacts (the target 
wouldn't be called during regular 'clean').


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Re: [Nutchbase] WebPage class is a generated code?

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Thanks, Doğacan!

Cheers,
Chris



On 7/3/10 12:49 AM, "Doğacan Güney" <do...@gmail.com> wrote:

Hey,

On Fri, Jul 2, 2010 at 17:26, Mattmann, Chris A (388J) <ch...@jpl.nasa.gov> wrote:
Hey Guys,

Since they are generated, +1 to:


 *   adding a filepattern to svn:ignore to ignore them
 *   updating build.xml to autogenerate


I created NUTCH-842 to track this problem.

Cheers,
Chris




On 7/2/10 3:24 AM, "Julien Nioche" <lists.digitalpebble@gmail.com <ht...@gmail.com> > wrote:



(This question is mostly to Dogacan & Enis, but I encourage anyone familiar with the code to join the threads with [Nutchbase] - the sooner the better ;) ).

I'm looking at src/gora/webpage.avsc and WebPage.java & friends... presumably the java code was autogenerated from avsc using Gora? If so, we should put this autogeneration step in our build.xml. Or am I missing something?


correct. if we keep the generated java classes in svn then we probably want to make this task optional i.e. it would not be done as part of the build tasks OR we can add it to the build but remove it from svn (or better add to svn ignore or whatever-it-is-called).

J.


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov <ht...@jpl.nasa.gov>
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [Nutchbase] WebPage class is a generated code?

Posted by Doğacan Güney <do...@gmail.com>.
Hey,

On Fri, Jul 2, 2010 at 17:26, Mattmann, Chris A (388J) <
chris.a.mattmann@jpl.nasa.gov> wrote:

>  Hey Guys,
>
> Since they are generated, +1 to:
>
>
>    - adding a filepattern to svn:ignore to ignore them
>    - updating build.xml to autogenerate
>
>
>
I created NUTCH-842 to track this problem.


>  Cheers,
> Chris
>
>
>
>
> On 7/2/10 3:24 AM, "Julien Nioche" <li...@gmail.com> wrote:
>
>
>
> (This question is mostly to Dogacan & Enis, but I encourage anyone familiar
> with the code to join the threads with [Nutchbase] - the sooner the better
> ;) ).
>
> I'm looking at src/gora/webpage.avsc and WebPage.java & friends...
> presumably the java code was autogenerated from avsc using Gora? If so, we
> should put this autogeneration step in our build.xml. Or am I missing
> something?
>
>
> correct. if we keep the generated java classes in svn then we probably want
> to make this task optional i.e. it would not be done as part of the build
> tasks OR we can add it to the build but remove it from svn (or better add to
> svn ignore or whatever-it-is-called).
>
> J.
>
>
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: *Chris.Mattmann@jpl.nasa.gov
> *WWW:   *http://sunset.usc.edu/~mattmann/
> *++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>


-- 
Doğacan Güney

Re: [Nutchbase] WebPage class is a generated code?

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Guys,

Since they are generated, +1 to:


 *   adding a filepattern to svn:ignore to ignore them
 *   updating build.xml to autogenerate

Cheers,
Chris



On 7/2/10 3:24 AM, "Julien Nioche" <li...@gmail.com> wrote:



(This question is mostly to Dogacan & Enis, but I encourage anyone familiar with the code to join the threads with [Nutchbase] - the sooner the better ;) ).

I'm looking at src/gora/webpage.avsc and WebPage.java & friends... presumably the java code was autogenerated from avsc using Gora? If so, we should put this autogeneration step in our build.xml. Or am I missing something?


correct. if we keep the generated java classes in svn then we probably want to make this task optional i.e. it would not be done as part of the build tasks OR we can add it to the build but remove it from svn (or better add to svn ignore or whatever-it-is-called).

J.


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [Nutchbase] WebPage class is a generated code?

Posted by Julien Nioche <li...@gmail.com>.
>
> (This question is mostly to Dogacan & Enis, but I encourage anyone familiar
> with the code to join the threads with [Nutchbase] - the sooner the better
> ;) ).
>
> I'm looking at src/gora/webpage.avsc and WebPage.java & friends...
> presumably the java code was autogenerated from avsc using Gora? If so, we
> should put this autogeneration step in our build.xml. Or am I missing
> something?
>
>
correct. if we keep the generated java classes in svn then we probably want
to make this task optional i.e. it would not be done as part of the build
tasks OR we can add it to the build but remove it from svn (or better add to
svn ignore or whatever-it-is-called).

J.
-- 
DigitalPebble Ltd

Open Source Solutions for Text Engineering
http://www.digitalpebble.com