You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Matthias Paul <ma...@gmail.com> on 2012/05/18 15:08:30 UTC

Re: [VOTE] Apache Nutch 1.5 release rc #1

When will Nutch 1.5 be released?

Matthias

On Wed, Apr 18, 2012 at 1:46 PM, Bharat Goyal <bh...@shiksha.com> wrote:
> +1
>
>
> On Monday 16 April 2012 12:34 PM, Markus Jelsma wrote:
>>
>>  +1
>>
>>  On Mon, 16 Apr 2012 05:43:22 +0000, "Mattmann, Chris A (388J)"
>>  <ch...@jpl.nasa.gov>  wrote:
>>>
>>> Hi Folks,
>>>
>>> A candidate for the Nutch 1.5 release is available at:
>>>
>>>   http://people.apache.org/~mattmann/apache-nutch-1.5/rc1/
>>>
>>> The release candidate is a zip and tar.gz archive of the sources in:
>>>
>>>   http://svn.apache.org/repos/asf/nutch/tags/release-1.5/
>>>
>>> And a binary build suitable for deployment.
>>>
>>> A staged Maven repository is available here:
>>>
>>>
>>> https://repository.apache.org/content/repositories/orgapachenutch-054/
>>>
>>> Please vote on releasing this package as Apache Nutch 1.5.
>>> The vote is open for the next 72 hours and passes if a majority of at
>>> least three +1 Nutch PMC votes are cast.
>>>
>>>   [ ] +1 Release this package as Apache Nutch 1.5
>>>   [ ] -1 Do not release this package because...
>>>
>>> Thanks!
>>>
>>> Cheers,
>>> Chris
>>>
>>> P.S. Here's my +1.
>>>
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
> DISCLAIMER
> This email is intended only for the person or the entity to whom it is
> addressed and may contain information which is confidential and privileged.
> Any review, retransmission, dissemination or any other use of the said
> information by person or entities other than intended recipient is
> unauthorized and prohibited. If you are not the intended recipient, please
> delete this email and contact the sender.

RE: [VOTE] Apache Nutch 1.5 release rc #1

Posted by Markus Jelsma <ma...@openindex.io>.
As soon as the release manager finds some spare time to manage the release process.
Please be patient or build from trunk which is the next 1.5.
 
 
-----Original message-----
> From:Matthias Paul <ma...@gmail.com>
> Sent: Fri 18-May-2012 15:09
> To: user@nutch.apache.org
> Subject: Re: [VOTE] Apache Nutch 1.5 release rc #1
> 
> When will Nutch 1.5 be released?
> 
> Matthias
> 
> On Wed, Apr 18, 2012 at 1:46 PM, Bharat Goyal <bh...@shiksha.com> wrote:
> > +1
> >
> >
> > On Monday 16 April 2012 12:34 PM, Markus Jelsma wrote:
> >>
> >>  +1
> >>
> >>  On Mon, 16 Apr 2012 05:43:22 +0000, "Mattmann, Chris A (388J)"
> >>  <ch...@jpl.nasa.gov>  wrote:
> >>>
> >>> Hi Folks,
> >>>
> >>> A candidate for the Nutch 1.5 release is available at:
> >>>
> >>>   http://people.apache.org/~mattmann/apache-nutch-1.5/rc1/
> >>>
> >>> The release candidate is a zip and tar.gz archive of the sources in:
> >>>
> >>>   http://svn.apache.org/repos/asf/nutch/tags/release-1.5/
> >>>
> >>> And a binary build suitable for deployment.
> >>>
> >>> A staged Maven repository is available here:
> >>>
> >>>
> >>> https://repository.apache.org/content/repositories/orgapachenutch-054/
> >>>
> >>> Please vote on releasing this package as Apache Nutch 1.5.
> >>> The vote is open for the next 72 hours and passes if a majority of at
> >>> least three +1 Nutch PMC votes are cast.
> >>>
> >>>   [ ] +1 Release this package as Apache Nutch 1.5
> >>>   [ ] -1 Do not release this package because...
> >>>
> >>> Thanks!
> >>>
> >>> Cheers,
> >>> Chris
> >>>
> >>> P.S. Here's my +1.
> >>>
> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>> Chris Mattmann, Ph.D.
> >>> Senior Computer Scientist
> >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >>> Office: 171-266B, Mailstop: 171-246
> >>> Email: chris.a.mattmann@nasa.gov
> >>> WWW:   http://sunset.usc.edu/~mattmann/
> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>> Adjunct Assistant Professor, Computer Science Department
> >>> University of Southern California, Los Angeles, CA 90089 USA
> >>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
> >
> >
> > DISCLAIMER
> > This email is intended only for the person or the entity to whom it is
> > addressed and may contain information which is confidential and privileged.
> > Any review, retransmission, dissemination or any other use of the said
> > information by person or entities other than intended recipient is
> > unauthorized and prohibited. If you are not the intended recipient, please
> > delete this email and contact the sender.
> 

Re: [VOTE] Apache Nutch 1.5 release rc #1

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Lewis,

On May 19, 2012, at 7:15 AM, Lewis John Mcgibbney wrote:

> Hi Chris,
> 
> On Fri, May 18, 2012 at 7:19 PM, Mattmann, Chris A (388J) <
>> 
>> Sorry I've been on hiatus enjoying a trip with my family :)
> 
> Hope you had a nice time.

Thanks!

> 
>> 
>> I was hoping to respin rc #2 before I left, but I didn't find the
>> spare cycles. Lewis, basically if you look through the rc #1 thread there are
>> about 3-4 comments from Julien, you, and I think from Sami.
> 
> OK so for reference, the original thread is here [0].
> 
> Summary of tasks
> 
> 1) fix pom.xml as the versions of the deps for hadoop, tika and
> possibly others are not
> correct in the pom.xml found in the src archive and on the mvn
> repository. Are we generating the pom.xml with an Ant task? ant
> deploy?
> 2) concerning Julien's comments w.r.t delivering the content of
> runtime/local in the
> binary archive instead of having the sources + runtime/deploy as
> well... are we near to a decision on this one? Chris, you said you are
> happy to incorporate the suggestion but this will take place @ release
> stage not before... is this an accurate description? Also Julien's
> commit to build.xml should help us out here.
> 3) add missing license headers to the following files
> src/java/org/apache/nutch/indexer/IndexingFiltersChecker.java
> src/plugin/creativecommons/src/web/web.xml
> src/plugin/protocol-httpclient/src/test/conf/httpclient-auth-test.xml
> src/plugin/protocol-httpclient/src/test/conf/nutch-site-test.xml
> 4) update NOTICE file, it stated a date of 2009

GREAT summary thanks Lewis!

> 
> So my question now... regarding making the changes which can be done
> locally e.g. 3 & 4, is it OK for me to commit to trunk? I don't know
> what the current state of play is with this and don't want to mess up
> the RC

Yep, no worries. Let's commit the above to the branch that will 
help me out a lot. Then I will reroll another tag 1.5-rc2 and we
should be good. Then we can forward merge the branch
into the trunk post release. Sound good?

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Apache Nutch 1.5 release rc #1

Posted by Lewis John Mcgibbney <le...@gmail.com>.
Hi Again,

I've moved this over to dev@

New branch with resolved discrepancies identified within release-1.5
RC1 and subsequent conversations. Branch can be seen here
http://svn.apache.org/repos/asf/nutch/branches/branch-1.5/

I've also made these changes to trunk. N.B. No commits have been made
in the 4 weeks since the release-1.5 RC so nothing else needs to be
committed over to the new branch or forthcoming tag.

I think this now paves the way for us to roll the RC taking into
consideration Julien's new target.

Thanks and enjoy the rest of the weekend.

best
Lewis

On Sat, May 19, 2012 at 7:33 PM, Julien Nioche
<li...@gmail.com> wrote:
>>
>> 1) fix pom.xml as the versions of the deps for hadoop, tika and
>> possibly others are not
>> correct in the pom.xml found in the src archive and on the mvn
>> repository. Are we generating the pom.xml with an Ant task? ant
>> deploy?
>>
>
> can't remember the name of the task right now but should be easy to find
> out by looking at the build.xml. You'll need to make sure that the maven
> tasks jars are in the lib dirr. Don't think they are there by default
>
>
>
>> 2) concerning Julien's comments w.r.t delivering the content of
>> runtime/local in the
>> binary archive instead of having the sources + runtime/deploy as
>> well... are we near to a decision on this one? Chris, you said you are
>> happy to incorporate the suggestion but this will take place @ release
>> stage not before... is this an accurate description? Also Julien's
>> commit to build.xml should help us out here.
>>
>
> might as well do it in the RC to check that my changes work fine
>
>
>
>> 3) add missing license headers to the following files
>> src/java/org/apache/nutch/indexer/IndexingFiltersChecker.java
>> src/plugin/creativecommons/src/web/web.xml
>> src/plugin/protocol-httpclient/src/test/conf/httpclient-auth-test.xml
>> src/plugin/protocol-httpclient/src/test/conf/nutch-site-test.xml
>>
>
> Hasn't this been fixed already?
>
>
>> 4) update NOTICE file, it stated a date of 2009
>>
>> So my question now... regarding making the changes which can be done
>> locally e.g. 3 & 4, is it OK for me to commit to trunk? I don't know
>> what the current state of play is with this and don't want to mess up
>> the RC
>>
>
> you'll probably have to redo the 1.5 branch from trunk to reflect the
> latest changes
>
> Thanks Lewis
>
> J.
>
>
> --
> *
> *Open Source Solutions for Text Engineering
>
> http://digitalpebble.blogspot.com/
> http://www.digitalpebble.com
> http://twitter.com/digitalpebble



-- 
Lewis

Re: [VOTE] Apache Nutch 1.5 release rc #1

Posted by Julien Nioche <li...@gmail.com>.
>
> 1) fix pom.xml as the versions of the deps for hadoop, tika and
> possibly others are not
> correct in the pom.xml found in the src archive and on the mvn
> repository. Are we generating the pom.xml with an Ant task? ant
> deploy?
>

can't remember the name of the task right now but should be easy to find
out by looking at the build.xml. You'll need to make sure that the maven
tasks jars are in the lib dirr. Don't think they are there by default



> 2) concerning Julien's comments w.r.t delivering the content of
> runtime/local in the
> binary archive instead of having the sources + runtime/deploy as
> well... are we near to a decision on this one? Chris, you said you are
> happy to incorporate the suggestion but this will take place @ release
> stage not before... is this an accurate description? Also Julien's
> commit to build.xml should help us out here.
>

might as well do it in the RC to check that my changes work fine



> 3) add missing license headers to the following files
> src/java/org/apache/nutch/indexer/IndexingFiltersChecker.java
> src/plugin/creativecommons/src/web/web.xml
> src/plugin/protocol-httpclient/src/test/conf/httpclient-auth-test.xml
> src/plugin/protocol-httpclient/src/test/conf/nutch-site-test.xml
>

Hasn't this been fixed already?


> 4) update NOTICE file, it stated a date of 2009
>
> So my question now... regarding making the changes which can be done
> locally e.g. 3 & 4, is it OK for me to commit to trunk? I don't know
> what the current state of play is with this and don't want to mess up
> the RC
>

you'll probably have to redo the 1.5 branch from trunk to reflect the
latest changes

Thanks Lewis

J.


-- 
*
*Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com
http://twitter.com/digitalpebble

Re: [VOTE] Apache Nutch 1.5 release rc #1

Posted by Lewis John Mcgibbney <le...@gmail.com>.
Hi Chris,

On Fri, May 18, 2012 at 7:19 PM, Mattmann, Chris A (388J) <
>
> Sorry I've been on hiatus enjoying a trip with my family :)

Hope you had a nice time.

>
> I was hoping to respin rc #2 before I left, but I didn't find the
> spare cycles. Lewis, basically if you look through the rc #1 thread there are
> about 3-4 comments from Julien, you, and I think from Sami.

OK so for reference, the original thread is here [0].

Summary of tasks

1) fix pom.xml as the versions of the deps for hadoop, tika and
possibly others are not
correct in the pom.xml found in the src archive and on the mvn
repository. Are we generating the pom.xml with an Ant task? ant
deploy?
2) concerning Julien's comments w.r.t delivering the content of
runtime/local in the
binary archive instead of having the sources + runtime/deploy as
well... are we near to a decision on this one? Chris, you said you are
happy to incorporate the suggestion but this will take place @ release
stage not before... is this an accurate description? Also Julien's
commit to build.xml should help us out here.
3) add missing license headers to the following files
src/java/org/apache/nutch/indexer/IndexingFiltersChecker.java
src/plugin/creativecommons/src/web/web.xml
src/plugin/protocol-httpclient/src/test/conf/httpclient-auth-test.xml
src/plugin/protocol-httpclient/src/test/conf/nutch-site-test.xml
4) update NOTICE file, it stated a date of 2009

So my question now... regarding making the changes which can be done
locally e.g. 3 & 4, is it OK for me to commit to trunk? I don't know
what the current state of play is with this and don't want to mess up
the RC

Thanks have a great weekend

Lewis

[0] http://www.mail-archive.com/dev%40nutch.apache.org/msg07087.html

Re: [VOTE] Apache Nutch 1.5 release rc #1

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Guys,

Sorry I've been on hiatus enjoying a trip with my family :)

I was hoping to respin rc #2 before I left, but I didn't find the
spare cycles. Lewis, basically if you look through the rc #1 thread there are
about 3-4 comments from Julien, you, and I think from Sami.

I have them written down somewhere and prioritized and I
think I even replied to the thread with the ones I will work
on so let me see if I can nudge this along. Your help is always
welcome too!

Cheers,
Chris

On May 18, 2012, at 3:52 AM, Lewis John Mcgibbney wrote:

> When the community is satisfied that we have a good release candidate
> and when the VOTE'ing suits the required conditions.
> 
> Ultimately the timing for a release is down to the release manager but
> I think it is fair to say that we are on our way to getting 1.5
> released soon as the (trunk) codebase is in a good condition (as it
> has been for some time now :))
> 
> Thanks
> 
> Lewis
> 
> On Fri, May 18, 2012 at 2:08 PM, Matthias Paul <ma...@gmail.com> wrote:
>> When will Nutch 1.5 be released?
>> 
>> Matthias
>> 
>> On Wed, Apr 18, 2012 at 1:46 PM, Bharat Goyal <bh...@shiksha.com> wrote:
>>> +1
>>> 
>>> 
>>> On Monday 16 April 2012 12:34 PM, Markus Jelsma wrote:
>>>> 
>>>>  +1
>>>> 
>>>>  On Mon, 16 Apr 2012 05:43:22 +0000, "Mattmann, Chris A (388J)"
>>>>  <ch...@jpl.nasa.gov>  wrote:
>>>>> 
>>>>> Hi Folks,
>>>>> 
>>>>> A candidate for the Nutch 1.5 release is available at:
>>>>> 
>>>>>   http://people.apache.org/~mattmann/apache-nutch-1.5/rc1/
>>>>> 
>>>>> The release candidate is a zip and tar.gz archive of the sources in:
>>>>> 
>>>>>   http://svn.apache.org/repos/asf/nutch/tags/release-1.5/
>>>>> 
>>>>> And a binary build suitable for deployment.
>>>>> 
>>>>> A staged Maven repository is available here:
>>>>> 
>>>>> 
>>>>> https://repository.apache.org/content/repositories/orgapachenutch-054/
>>>>> 
>>>>> Please vote on releasing this package as Apache Nutch 1.5.
>>>>> The vote is open for the next 72 hours and passes if a majority of at
>>>>> least three +1 Nutch PMC votes are cast.
>>>>> 
>>>>>   [ ] +1 Release this package as Apache Nutch 1.5
>>>>>   [ ] -1 Do not release this package because...
>>>>> 
>>>>> Thanks!
>>>>> 
>>>>> Cheers,
>>>>> Chris
>>>>> 
>>>>> P.S. Here's my +1.
>>>>> 
>>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>> Chris Mattmann, Ph.D.
>>>>> Senior Computer Scientist
>>>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>>> Office: 171-266B, Mailstop: 171-246
>>>>> Email: chris.a.mattmann@nasa.gov
>>>>> WWW:   http://sunset.usc.edu/~mattmann/
>>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>>> Adjunct Assistant Professor, Computer Science Department
>>>>> University of Southern California, Los Angeles, CA 90089 USA
>>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>>> 
>>> 
>>> DISCLAIMER
>>> This email is intended only for the person or the entity to whom it is
>>> addressed and may contain information which is confidential and privileged.
>>> Any review, retransmission, dissemination or any other use of the said
>>> information by person or entities other than intended recipient is
>>> unauthorized and prohibited. If you are not the intended recipient, please
>>> delete this email and contact the sender.
> 
> 
> 
> -- 
> Lewis


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Apache Nutch 1.5 release rc #1

Posted by Lewis John Mcgibbney <le...@gmail.com>.
When the community is satisfied that we have a good release candidate
and when the VOTE'ing suits the required conditions.

Ultimately the timing for a release is down to the release manager but
I think it is fair to say that we are on our way to getting 1.5
released soon as the (trunk) codebase is in a good condition (as it
has been for some time now :))

Thanks

Lewis

On Fri, May 18, 2012 at 2:08 PM, Matthias Paul <ma...@gmail.com> wrote:
> When will Nutch 1.5 be released?
>
> Matthias
>
> On Wed, Apr 18, 2012 at 1:46 PM, Bharat Goyal <bh...@shiksha.com> wrote:
>> +1
>>
>>
>> On Monday 16 April 2012 12:34 PM, Markus Jelsma wrote:
>>>
>>>  +1
>>>
>>>  On Mon, 16 Apr 2012 05:43:22 +0000, "Mattmann, Chris A (388J)"
>>>  <ch...@jpl.nasa.gov>  wrote:
>>>>
>>>> Hi Folks,
>>>>
>>>> A candidate for the Nutch 1.5 release is available at:
>>>>
>>>>   http://people.apache.org/~mattmann/apache-nutch-1.5/rc1/
>>>>
>>>> The release candidate is a zip and tar.gz archive of the sources in:
>>>>
>>>>   http://svn.apache.org/repos/asf/nutch/tags/release-1.5/
>>>>
>>>> And a binary build suitable for deployment.
>>>>
>>>> A staged Maven repository is available here:
>>>>
>>>>
>>>> https://repository.apache.org/content/repositories/orgapachenutch-054/
>>>>
>>>> Please vote on releasing this package as Apache Nutch 1.5.
>>>> The vote is open for the next 72 hours and passes if a majority of at
>>>> least three +1 Nutch PMC votes are cast.
>>>>
>>>>   [ ] +1 Release this package as Apache Nutch 1.5
>>>>   [ ] -1 Do not release this package because...
>>>>
>>>> Thanks!
>>>>
>>>> Cheers,
>>>> Chris
>>>>
>>>> P.S. Here's my +1.
>>>>
>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>> Chris Mattmann, Ph.D.
>>>> Senior Computer Scientist
>>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>>> Office: 171-266B, Mailstop: 171-246
>>>> Email: chris.a.mattmann@nasa.gov
>>>> WWW:   http://sunset.usc.edu/~mattmann/
>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>> Adjunct Assistant Professor, Computer Science Department
>>>> University of Southern California, Los Angeles, CA 90089 USA
>>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>>
>> DISCLAIMER
>> This email is intended only for the person or the entity to whom it is
>> addressed and may contain information which is confidential and privileged.
>> Any review, retransmission, dissemination or any other use of the said
>> information by person or entities other than intended recipient is
>> unauthorized and prohibited. If you are not the intended recipient, please
>> delete this email and contact the sender.



-- 
Lewis