You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/09/21 07:10:24 UTC

[VOTE] Apache Nutch 1.2 Release Candidate #4

Hi Folks,

I have posted a 4th release candidate for the Apache Nutch 1.2 release. The
source code is at:

http://people.apache.org/~mattmann/apache-nutch-1.2/rc4/

One highlight of this release candidate is including Markus Jelsma's patch
for NUTCH-901 that makes the index-more plugin and its mimeType extraction
configurable, allowing for back compat with the current behavior of type
indexed as a multi-valued field (splitting on the type/subtype), but also
allowing the entire mime type to be indexed as a single value, for use in
e.g., Solr via the SolrIndexer. For more detailed information, see the
included CHANGES.txt file for details on release contents and latest
changes. 

The release was made using the Nutch release process, documented on the Wiki
here:

http://bit.ly/d5ugid

The release was made from the Nutch 1.2 branch (r955767) at:

http://svn.apache.org/repos/asf/nutch/branches/branch-1.2/

Sami Siren previously indicated to integrate RAT into the build, but I
haven't had a chance to do it yet. If someone else has time, or wants to,
please go ahead and I'd be happy to roll another RC.

Please vote on releasing these packages as Apache Nutch 1.2. The vote is
open for the next 72 hours.

Only votes from Nutch PMC are binding, but folks are welcome to check the
release candidate and voice their approval or disapproval. The vote passes
if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache Nutch 1.2.

[ ] -1 Do not release the packages because...

Thanks!

Cheers,
Chris

P.S. Here is my +1.

P.P.S. Nutch PMC members unite! Please check this release out and VOTE on
it. I'd love to push this one out the door soon...and stop the RC waiting
game and get onto 2.0! ;)

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





Re: [VOTE] Apache Nutch 1.2 Release Candidate #4

Posted by Andrzej Bialecki <ab...@getopt.org>.
On 2010-09-24 20:40, Mattmann, Chris A (388J) wrote:
> Thanks Andrzej, appreciate it. I know you’ve been really vigilant with
> the other RCs I’ve thrown up about testing and I appreciate it. Other
> Nutch PMC’ers: just need one more VOTE. Help, please? :)

+1, all unit tests pass, and a test crawl + indexing to Solr went just fine.


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Re: [VOTE] Apache Nutch 1.2 Release Candidate #4

Posted by Dennis Kubes <ku...@apache.org>.
  +1  good to go

On 09/24/2010 01:40 PM, Mattmann, Chris A (388J) wrote:
> Thanks Andrzej, appreciate it. I know you've been really vigilant with 
> the other RCs I've thrown up about testing and I appreciate it. Other 
> Nutch PMC'ers: just need one more VOTE. Help, please? :)
>
> Cheers,
> Chris
>
>
> On 9/24/10 11:38 AM, "Andrzej Bialecki" <ab...@getopt.org> wrote:
>
>     On 2010-09-24 04:38, Mattmann, Chris A (388J) wrote:
>     > Hi Nutch PMC:
>     >
>     > /nudge
>     >
>     > Anyone get a chance to review this yet? I have some free cycles
>     tomorrow
>     > and would really think it's cool if I could finally push out the
>     1.2 RC.
>
>     I had little time this week, but I'm testing it now... I should be
>     done
>     tomorrow.
>
>
>     --
>     Best regards,
>     Andrzej Bialecki <><
>       ___. ___ ___ ___ _ _   __________________________________
>     [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
>     ___|||__||  \|  ||  |  Embedded Unix, System Integration
>     http://www.sigram.com  Contact: info at sigram dot com
>
>
>
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: _Chris.Mattmann@jpl.nasa.gov
> _WWW: _http://sunset.usc.edu/~mattmann/ 
> <http://sunset.usc.edu/%7Emattmann/>
> _++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>

Re: [VOTE] Apache Nutch 1.2 Release Candidate #4

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Thanks Andrzej, appreciate it. I know you've been really vigilant with the other RCs I've thrown up about testing and I appreciate it. Other Nutch PMC'ers: just need one more VOTE. Help, please? :)

Cheers,
Chris


On 9/24/10 11:38 AM, "Andrzej Bialecki" <ab...@getopt.org> wrote:

On 2010-09-24 04:38, Mattmann, Chris A (388J) wrote:
> Hi Nutch PMC:
>
> /nudge
>
> Anyone get a chance to review this yet? I have some free cycles tomorrow
> and would really think it's cool if I could finally push out the 1.2 RC.

I had little time this week, but I'm testing it now... I should be done
tomorrow.


--
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com




++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Apache Nutch 1.2 Release Candidate #4

Posted by Andrzej Bialecki <ab...@getopt.org>.
On 2010-09-24 04:38, Mattmann, Chris A (388J) wrote:
> Hi Nutch PMC:
>
> /nudge
>
> Anyone get a chance to review this yet? I have some free cycles tomorrow
> and would really think it’s cool if I could finally push out the 1.2 RC.

I had little time this week, but I'm testing it now... I should be done 
tomorrow.


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Re: [VOTE] Apache Nutch 1.2 Release Candidate #4

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Nutch PMC:

/nudge

Anyone get a chance to review this yet? I have some free cycles tomorrow and would really think it's cool if I could finally push out the 1.2 RC.

Cheers,
Chris



On 9/20/10 10:10 PM, "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> wrote:

Hi Folks,

I have posted a 4th release candidate for the Apache Nutch 1.2 release. The
source code is at:

http://people.apache.org/~mattmann/apache-nutch-1.2/rc4/

One highlight of this release candidate is including Markus Jelsma's patch
for NUTCH-901 that makes the index-more plugin and its mimeType extraction
configurable, allowing for back compat with the current behavior of type
indexed as a multi-valued field (splitting on the type/subtype), but also
allowing the entire mime type to be indexed as a single value, for use in
e.g., Solr via the SolrIndexer. For more detailed information, see the
included CHANGES.txt file for details on release contents and latest
changes.

The release was made using the Nutch release process, documented on the Wiki
here:

http://bit.ly/d5ugid

The release was made from the Nutch 1.2 branch (r955767) at:

http://svn.apache.org/repos/asf/nutch/branches/branch-1.2/

Sami Siren previously indicated to integrate RAT into the build, but I
haven't had a chance to do it yet. If someone else has time, or wants to,
please go ahead and I'd be happy to roll another RC.

Please vote on releasing these packages as Apache Nutch 1.2. The vote is
open for the next 72 hours.

Only votes from Nutch PMC are binding, but folks are welcome to check the
release candidate and voice their approval or disapproval. The vote passes
if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache Nutch 1.2.

[ ] -1 Do not release the packages because...

Thanks!

Cheers,
Chris

P.S. Here is my +1.

P.P.S. Nutch PMC members unite! Please check this release out and VOTE on
it. I'd love to push this one out the door soon...and stop the RC waiting
game and get onto 2.0! ;)

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++







++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Apache Nutch 1.2 Release Candidate #4

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Nutch PMC:

/nudge

Anyone get a chance to review this yet? I have some free cycles tomorrow and would really think it's cool if I could finally push out the 1.2 RC.

Cheers,
Chris



On 9/20/10 10:10 PM, "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> wrote:

Hi Folks,

I have posted a 4th release candidate for the Apache Nutch 1.2 release. The
source code is at:

http://people.apache.org/~mattmann/apache-nutch-1.2/rc4/

One highlight of this release candidate is including Markus Jelsma's patch
for NUTCH-901 that makes the index-more plugin and its mimeType extraction
configurable, allowing for back compat with the current behavior of type
indexed as a multi-valued field (splitting on the type/subtype), but also
allowing the entire mime type to be indexed as a single value, for use in
e.g., Solr via the SolrIndexer. For more detailed information, see the
included CHANGES.txt file for details on release contents and latest
changes.

The release was made using the Nutch release process, documented on the Wiki
here:

http://bit.ly/d5ugid

The release was made from the Nutch 1.2 branch (r955767) at:

http://svn.apache.org/repos/asf/nutch/branches/branch-1.2/

Sami Siren previously indicated to integrate RAT into the build, but I
haven't had a chance to do it yet. If someone else has time, or wants to,
please go ahead and I'd be happy to roll another RC.

Please vote on releasing these packages as Apache Nutch 1.2. The vote is
open for the next 72 hours.

Only votes from Nutch PMC are binding, but folks are welcome to check the
release candidate and voice their approval or disapproval. The vote passes
if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache Nutch 1.2.

[ ] -1 Do not release the packages because...

Thanks!

Cheers,
Chris

P.S. Here is my +1.

P.P.S. Nutch PMC members unite! Please check this release out and VOTE on
it. I'd love to push this one out the door soon...and stop the RC waiting
game and get onto 2.0! ;)

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++







++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++