Posted to dev@tika.apache.org by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/03/07 22:35:27 UTC

[VOTE] Apache Tika 1.1 release rc #1

Hi Folks,

A candidate for the Tika 1.1 release is available at:

  http://people.apache.org/~mattmann/apache-tika-1.1/rc1/

The release candidate is a zip archive of the sources in:

   http://svn.apache.org/repos/asf/tika/tags/1.1/

The SHA1 checksum of the archive is d3185bb22fa3c7318488838989aff0cc9ee025df.

Please vote on releasing this package as Apache Tika 1.1.
The vote is open for at least the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.

   [ ] +1 Release this package as Apache Tika 1.1
   [ ] -1 Do not release this package because...
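
One way to read the passing rule above is: at least three +1 votes from PMC members, and more +1 than -1 votes among them. The sketch below encodes that reading only; it is an assumption about how the tally works, and it takes the PMC votes as given since the thread does not say which voters are PMC members.

```python
def vote_passes(binding_votes):
    """Apply the assumed rule to PMC votes given as +1, 0 or -1.

    Passes when there are at least three +1 votes and the +1 votes
    outnumber the -1 votes. Votes of 0 are ignored.
    """
    votes = list(binding_votes)
    plus = sum(1 for v in votes if v == 1)
    minus = sum(1 for v in votes if v == -1)
    return plus >= 3 and plus > minus
```

Under this reading, `vote_passes([1, 1, 1])` is True, while two +1 votes alone are not enough.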

Thanks!

Cheers,
Chris

P.S. Here's my +1.

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by Dave Meikle <lo...@gmail.com>.
Hi,

On 7 March 2012 21:35, Mattmann, Chris A (388J) <
chris.a.mattmann@jpl.nasa.gov> wrote:

>
>   [X ] +1 Release this package as Apache Tika 1.1
>   [ ] -1 Do not release this package because...
>
>
Cheers,
Dave

Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by Zabrane Mickael <za...@gmail.com>.
Hi guys,

Congrats on the v1.1 rc1.

Compiles fine for me (OS X Lion 10.7.3 + OS X Snow Leopard 10.6.8). All tests passed.

+1

Regards,
Zabrane

On Mar 7, 2012, at 10:35 PM, Mattmann, Chris A (388J) wrote:

> Hi Folks,
> 
> A candidate for the Tika 1.1 release is available at:
> 
>  http://people.apache.org/~mattmann/apache-tika-1.1/rc1/
> 
> The release candidate is a zip archive of the sources in:
> 
>   http://svn.apache.org/repos/asf/tika/tags/1.1/
> 
> The SHA1 checksum of the archive is d3185bb22fa3c7318488838989aff0cc9ee025df.
> 
> Please vote on releasing this package as Apache Tika 1.1.
> The vote is open for at least the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
> 
>   [ ] +1 Release this package as Apache Tika 1.1
>   [ ] -1 Do not release this package because...
> 
> Thanks!
> 
> Cheers,
> Chris
> 
> P.S. Here's my +1.
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 



Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by Oleg Tikhonov <ol...@apache.org>.
Here is my +1, this time tested only on Windows 7 x86-64 PE.

BR,
Oleg

On Thu, Mar 8, 2012 at 5:11 PM, Alex Ott <al...@gmail.com> wrote:

> +1
>
> Unpacked the sources, compiled, and the tests passed. The compiled
> tika-app works correctly.
>
> The separately downloaded tika-app-1.1.jar also works correctly for me.
>
> One small problem: the md5sum file for tika-app-1.1.jar isn't
> correctly formatted. The file name is missing, so md5sum -c can't
> check it.
>
> P.S. System: Debian Linux testing, JVM version 1.6.0_26
>
> On Wed, Mar 7, 2012 at 10:35 PM, Mattmann, Chris A (388J)
> <ch...@jpl.nasa.gov> wrote:
> > Hi Folks,
> >
> > A candidate for the Tika 1.1 release is available at:
> >
> >  http://people.apache.org/~mattmann/apache-tika-1.1/rc1/
> >
> > The release candidate is a zip archive of the sources in:
> >
> >   http://svn.apache.org/repos/asf/tika/tags/1.1/
> >
> > The SHA1 checksum of the archive is
> d3185bb22fa3c7318488838989aff0cc9ee025df.
> >
> > Please vote on releasing this package as Apache Tika 1.1.
> > The vote is open for at least the next 72 hours and passes if a majority
> of at
> > least three +1 Tika PMC votes are cast.
> >
> >   [ ] +1 Release this package as Apache Tika 1.1
> >   [ ] -1 Do not release this package because...
> >
> > Thanks!
> >
> > Cheers,
> > Chris
> >
> > P.S. Here's my +1.
> >
> > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> > Chris Mattmann, Ph.D.
> > Senior Computer Scientist
> > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > Office: 171-266B, Mailstop: 171-246
> > Email: chris.a.mattmann@nasa.gov
> > WWW:   http://sunset.usc.edu/~mattmann/
> > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> > Adjunct Assistant Professor, Computer Science Department
> > University of Southern California, Los Angeles, CA 90089 USA
> > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
>
>
>
> --
> With best wishes,                    Alex Ott
> http://alexott.net/
> Twitter: alexott_en (English), alexott (Russian)
> Skype: alex.ott
>

Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by Alex Ott <al...@gmail.com>.
+1

Unpacked the sources, compiled, and the tests passed. The compiled tika-app works correctly.

The separately downloaded tika-app-1.1.jar also works correctly for me.

One small problem: the md5sum file for tika-app-1.1.jar isn't
correctly formatted. The file name is missing, so md5sum -c can't
check it.
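
To illustrate the formatting issue: md5sum -c reads lines made of the digest, a space, a second space (or `*` for binary mode), and then the file name. The sketch below checks a line for that shape and rebuilds a checkable line from a bare digest. The function names are illustrative, and the digest in the usage note is the MD5 of empty input, not the real checksum of the jar.

```python
import re

# "<32 hex digits>", a space, then " " or "*", then the file name.
MD5_LINE = re.compile(r"^([0-9a-fA-F]{32}) [ *](.+)$")


def is_checkable(line):
    """True if `line` has the '<digest>  <file name>' shape md5sum -c reads."""
    return MD5_LINE.match(line.rstrip("\n")) is not None


def add_file_name(bare_digest, file_name):
    """Turn a bare 32-hex-digit digest into a line md5sum -c can check."""
    digest = bare_digest.strip()
    if not re.fullmatch(r"[0-9a-fA-F]{32}", digest):
        raise ValueError("not a bare MD5 digest: %r" % bare_digest)
    # Two spaces between digest and name is the text-mode format.
    return "%s  %s" % (digest.lower(), file_name)
```

For example, `add_file_name("d41d8cd98f00b204e9800998ecf8427e", "tika-app-1.1.jar")` yields a line that `is_checkable` accepts, whereas the bare digest on its own is rejected.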

P.S. System: Debian Linux testing, JVM version 1.6.0_26

On Wed, Mar 7, 2012 at 10:35 PM, Mattmann, Chris A (388J)
<ch...@jpl.nasa.gov> wrote:
> Hi Folks,
>
> A candidate for the Tika 1.1 release is available at:
>
>  http://people.apache.org/~mattmann/apache-tika-1.1/rc1/
>
> The release candidate is a zip archive of the sources in:
>
>   http://svn.apache.org/repos/asf/tika/tags/1.1/
>
> The SHA1 checksum of the archive is d3185bb22fa3c7318488838989aff0cc9ee025df.
>
> Please vote on releasing this package as Apache Tika 1.1.
> The vote is open for at least the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
>   [ ] +1 Release this package as Apache Tika 1.1
>   [ ] -1 Do not release this package because...
>
> Thanks!
>
> Cheers,
> Chris
>
> P.S. Here's my +1.
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>



-- 
With best wishes,                    Alex Ott
http://alexott.net/
Twitter: alexott_en (English), alexott (Russian)
Skype: alex.ott

Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by Markus Jelsma <ma...@openindex.io>.
+1



On Wednesday 07 March 2012 22:35:27 Mattmann, Chris A (388J) wrote:
> Hi Folks,
> 
> A candidate for the Tika 1.1 release is available at:
> 
>   http://people.apache.org/~mattmann/apache-tika-1.1/rc1/
> 
> The release candidate is a zip archive of the sources in:
> 
>    http://svn.apache.org/repos/asf/tika/tags/1.1/
> 
> The SHA1 checksum of the archive is
> d3185bb22fa3c7318488838989aff0cc9ee025df.
> 
> Please vote on releasing this package as Apache Tika 1.1.
> The vote is open for at least the next 72 hours and passes if a majority of
> at least three +1 Tika PMC votes are cast.
> 
>    [ ] +1 Release this package as Apache Tika 1.1
>    [ ] -1 Do not release this package because...
> 
> Thanks!
> 
> Cheers,
> Chris
> 
> P.S. Here's my +1.
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

-- 
Markus Jelsma - CTO - Openindex

Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by Ken Krugler <kk...@transpac.com>.
Hi Chris,

Built/tested/installed fine on Mac OS X 10.7.3

Switched Bixo to use Tika 1.1, and Bixo built/passed all tests.

+1

-- Ken

On Mar 7, 2012, at 1:35pm, Mattmann, Chris A (388J) wrote:

> Hi Folks,
> 
> A candidate for the Tika 1.1 release is available at:
> 
>  http://people.apache.org/~mattmann/apache-tika-1.1/rc1/
> 
> The release candidate is a zip archive of the sources in:
> 
>   http://svn.apache.org/repos/asf/tika/tags/1.1/
> 
> The SHA1 checksum of the archive is d3185bb22fa3c7318488838989aff0cc9ee025df.
> 
> Please vote on releasing this package as Apache Tika 1.1.
> The vote is open for at least the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
> 
>   [ ] +1 Release this package as Apache Tika 1.1
>   [ ] -1 Do not release this package because...
> 
> Thanks!
> 
> Cheers,
> Chris
> 
> P.S. Here's my +1.
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 

--------------------------
Ken Krugler
http://www.scaleunlimited.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr





Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by Michael McCandless <lu...@mikemccandless.com>.
+1 to release.

I used tika-app-1.1.jar to successfully extract all text from the
Lucene in Action 2nd ed manuscript (PDF and MS Word).

CHANGES looks good too.

Mike McCandless

http://blog.mikemccandless.com

On Wed, Mar 7, 2012 at 4:35 PM, Mattmann, Chris A (388J)
<ch...@jpl.nasa.gov> wrote:
> Hi Folks,
>
> A candidate for the Tika 1.1 release is available at:
>
>  http://people.apache.org/~mattmann/apache-tika-1.1/rc1/
>
> The release candidate is a zip archive of the sources in:
>
>   http://svn.apache.org/repos/asf/tika/tags/1.1/
>
> The SHA1 checksum of the archive is d3185bb22fa3c7318488838989aff0cc9ee025df.
>
> Please vote on releasing this package as Apache Tika 1.1.
> The vote is open for at least the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
>   [ ] +1 Release this package as Apache Tika 1.1
>   [ ] -1 Do not release this package because...
>
> Thanks!
>
> Cheers,
> Chris
>
> P.S. Here's my +1.
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>

Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by Jukka Zitting <ju...@gmail.com>.
Hi,

On Wed, Mar 7, 2012 at 10:35 PM, Mattmann, Chris A (388J)
<ch...@jpl.nasa.gov> wrote:
> Please vote on releasing this package as Apache Tika 1.1.

    [x] +1 Release this package as Apache Tika 1.1

Thanks!

BR,

Jukka Zitting

Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Ken,

Sorry about that! Forgot to include the link to the staged Maven2 repo, here:

https://repository.apache.org/content/repositories/orgapachetika-066/

There ya go.

Cheers,
Chris

On Mar 7, 2012, at 4:36 PM, Ken Krugler wrote:

> Hi Chris,
> 
> On Mar 7, 2012, at 1:35pm, Mattmann, Chris A (388J) wrote:
> 
>> Hi Folks,
>> 
>> A candidate for the Tika 1.1 release is available at:
>> 
>> http://people.apache.org/~mattmann/apache-tika-1.1/rc1/
> 
> I'm curious why you've got just the tika-app-1.1.jar (plus release sources), and not any of the other artifacts?
> 
> I was hoping to grab the jars, do a manual mvn install onto my Mac, and then try them out with some web crawling code.
> 
> I can of course build from source, but it seems like that adds another potential delta between the artifacts that get released and what I'm testing.
> 
> Thanks,
> 
> -- Ken
> 
> 
>> 
>> The release candidate is a zip archive of the sources in:
>> 
>>  http://svn.apache.org/repos/asf/tika/tags/1.1/
>> 
>> The SHA1 checksum of the archive is d3185bb22fa3c7318488838989aff0cc9ee025df.
>> 
>> Please vote on releasing this package as Apache Tika 1.1.
>> The vote is open for at least the next 72 hours and passes if a majority of at
>> least three +1 Tika PMC votes are cast.
>> 
>>  [ ] +1 Release this package as Apache Tika 1.1
>>  [ ] -1 Do not release this package because...
>> 
>> Thanks!
>> 
>> Cheers,
>> Chris
>> 
>> P.S. Here's my +1.
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
> 
> --------------------------
> Ken Krugler
> http://www.scaleunlimited.com
> custom big data solutions & training
> Hadoop, Cascading, Mahout & Solr
> 
> 
> 
> 



Re: [VOTE] Apache Tika 1.1 release rc #1

Posted by Ken Krugler <kk...@transpac.com>.
Hi Chris,

On Mar 7, 2012, at 1:35pm, Mattmann, Chris A (388J) wrote:

> Hi Folks,
> 
> A candidate for the Tika 1.1 release is available at:
> 
>  http://people.apache.org/~mattmann/apache-tika-1.1/rc1/

I'm curious why you've got just the tika-app-1.1.jar (plus release sources), and not any of the other artifacts?

I was hoping to grab the jars, do a manual mvn install onto my Mac, and then try them out with some web crawling code.

I can of course build from source, but it seems like that adds another potential delta between the artifacts that get released and what I'm testing.

Thanks,

-- Ken


> 
> The release candidate is a zip archive of the sources in:
> 
>   http://svn.apache.org/repos/asf/tika/tags/1.1/
> 
> The SHA1 checksum of the archive is d3185bb22fa3c7318488838989aff0cc9ee025df.
> 
> Please vote on releasing this package as Apache Tika 1.1.
> The vote is open for at least the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
> 
>   [ ] +1 Release this package as Apache Tika 1.1
>   [ ] -1 Do not release this package because...
> 
> Thanks!
> 
> Cheers,
> Chris
> 
> P.S. Here's my +1.
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 
