You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by Tim Allison <ta...@apache.org> on 2017/05/22 19:25:07 UTC
[VOTE] Release Apache Tika 1.15 Candidate #1
A candidate for the Tika 1.15 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.15-rc1
The SHA1 checksum of the archive is
e82697a6804373367fbba98d47426ab74e036eb1.
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1022
Please vote on releasing this package as Apache Tika 1.15.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.15
[ ] -1 Do not release this package because...
***This is my first time as release manager. Please kick the tires thoroughly.***
This is my +1.
Cheers,
Tim
RE: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by "Allison, Timothy B." <ta...@mitre.org>.
Doh. Thank you!
From: JB Data [mailto:jbdata31@gmail.com]
Sent: Saturday, July 8, 2017 2:34 AM
To: user@tika.apache.org; Tim Allison <ta...@apache.org>
Cc: dev@tika.apache.org
Subject: Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Warn: link https://github.com/apache/tika/tree/1.16-rc1<https://github.com/apache/tika/tree/1.15-rc1> "hrefs" to the 1.15-rc1.
@JBΔ<http://jbigdata.fr>
2017-07-08 4:40 GMT+02:00 Tim Allison <ta...@apache.org>>:
A candidate for the Tika 1.16 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.16-rc1<https://github.com/apache/tika/tree/1.15-rc1>
The SHA1 checksum of the archive is
e6884af0209ace42bf0b9b59d72c3c5a0052055e
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1025
Please vote on releasing this package as Apache Tika 1.16.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.16
[ ] -1 Do not release this package because...
This is my +1.
Cheers,
Tim
RE: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by "Allison, Timothy B." <ta...@mitre.org>.
Doh. Thank you!
From: JB Data [mailto:jbdata31@gmail.com]
Sent: Saturday, July 8, 2017 2:34 AM
To: user@tika.apache.org; Tim Allison <ta...@apache.org>
Cc: dev@tika.apache.org
Subject: Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Warn: link https://github.com/apache/tika/tree/1.16-rc1<https://github.com/apache/tika/tree/1.15-rc1> "hrefs" to the 1.15-rc1.
@JBΔ<http://jbigdata.fr>
2017-07-08 4:40 GMT+02:00 Tim Allison <ta...@apache.org>>:
A candidate for the Tika 1.16 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.16-rc1<https://github.com/apache/tika/tree/1.15-rc1>
The SHA1 checksum of the archive is
e6884af0209ace42bf0b9b59d72c3c5a0052055e
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1025
Please vote on releasing this package as Apache Tika 1.16.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.16
[ ] -1 Do not release this package because...
This is my +1.
Cheers,
Tim
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by JB Data <jb...@gmail.com>.
Warn: link https://github.com/apache/tika/tree/1.16-rc1
<https://github.com/apache/tika/tree/1.15-rc1> "hrefs" to the 1.15-rc1.
*@**JB*Δ <http://jbigdata.fr>
2017-07-08 4:40 GMT+02:00 Tim Allison <ta...@apache.org>:
>
>
>
> A candidate for the Tika 1.16 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> https://github.com/apache/tika/tree/1.16-rc1
> <https://github.com/apache/tika/tree/1.15-rc1>
>
> The SHA1 checksum of the archive is
> e6884af0209ace42bf0b9b59d72c3c5a0052055e
>
> In addition, a staged maven repository is available here:
> *https://repository.apache.org/content/repositories/orgapachetika-1025
> <https://repository.apache.org/content/repositories/orgapachetika-1025>*
>
>
>
> Please vote on releasing this package as Apache Tika 1.16.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.16
> [ ] -1 Do not release this package because...
>
>
> This is my +1.
>
> Cheers,
>
> Tim
>
>
>
[RESULT][VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Tim Allison <ta...@apache.org>.
All, This VOTE has passed with the following tallies:
+1 PMCTim AllisonChris MattmannDave MeikleLuís Filipe Nassif
Oleg Tikhonov
+1 CommunityJB Data
I'll push the dists out, update the site and send the ANNOUNCE email later today. Thank you, all!
Cheers,
Tim
From: Tim Allison <ta...@apache.org>
To: "dev@tika.apache.org" <de...@tika.apache.org>; "user@tika.apache.org" <us...@tika.apache.org>
Sent: Friday, July 7, 2017 10:40 PM
Subject: [VOTE] Release Apache Tika 1.16 Candidate #1
A candidate for the Tika 1.16 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.16-rc1
The SHA1 checksum of the archive is
e6884af0209ace42bf0b9b59d72c3c5a0052055e
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1025
Please vote on releasing this package as Apache Tika 1.16.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.16
[ ] -1 Do not release this package because...
This is my +1.
Cheers,
Tim
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Luís Filipe Nassif <lf...@gmail.com>.
I don't think it is needed. Built on Win7, jdk1.8.0_131. Tests passed with
and without tesseract 3.05.
+1 from me.
Regards,
Luis
2017-07-10 14:10 GMT-03:00 Allison, Timothy B. <ta...@mitre.org>:
> Is this worth a re-spin?
>
> -----Original Message-----
> From: Allison, Timothy B. [mailto:tallison@mitre.org]
> Sent: Monday, July 10, 2017 10:26 AM
> To: lfcnassif@gmail.com
> Cc: dev@tika.apache.org
> Subject: RE: [VOTE] Release Apache Tika 1.16 Candidate #1
>
> Y. I need to fix that unit test. Thank you!
>
> https://issues.apache.org/jira/browse/TIKA-2426
>
> From: Luís Filipe Nassif [mailto:lfcnassif@gmail.com]
> Sent: Monday, July 10, 2017 9:29 AM
> To: user@tika.apache.org
> Cc: dev@tika.apache.org; Tim Allison <ta...@apache.org>
> Subject: Re: [VOTE] Release Apache Tika 1.16 Candidate #1
>
> OK, that is a Locale issue, working around...
>
> 2017-07-10 10:24 GMT-03:00 Luís Filipe Nassif <lfcnassif@gmail.com<mailto:
> lfcnassif@gmail.com>>:
> I got the following failure on Window7, jdk1.8.0_131, in OOXMLParserTest.testXLSBVarious:1537.
> Any ideas?
>
> Failed tests:
> OOXMLParserTest.testXLSBVarious:1537->TikaTest.assertContains:102
> <td>13.1211231321</td> not found in:
> <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <meta name="date" content="2017-03-10T14:58:49Z" /> <meta
> name="extended-properties:AppVersion" content="16.0300" /> <meta
> name="dc:creator" content="Allison, Timothy B." /> <meta
> name="extended-properties:Company" content="" /> <meta
> name="dcterms:created" content="2017-03-09T12:24:26Z" /> <meta
> name="Last-Modified" content="2017-03-10T14:58:49Z" /> <meta
> name="dcterms:modified" content="2017-03-10T14:58:49Z" /> <meta
> name="Last-Save-Date" content="2017-03-10T14:58:49Z" /> <meta
> name="protected" content="false" /> <meta name="meta:save-date"
> content="2017-03-10T14:58:49Z" /> <meta name="Application-Name"
> content="Microsoft Excel" /> <meta name="modified"
> content="2017-03-10T14:58:49Z" /> <meta name="Content-Type"
> content="application/vnd.ms-excel.sheet.binary.macroenabled.12" /> <meta
> name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
> <meta name="X-Parsed-By" content="org.apache.tika.parser.microsoft.ooxml.OOXMLParser"
> /> <meta name="creator" content="Allison, Timothy B." /> <meta
> name="meta:author" content="Allison, Timothy B." /> <meta
> name="meta:creation-date" content="2017-03-09T12:24:26Z" /> <meta
> name="extended-properties:Application" content="Microsoft Excel" /> <meta
> name="meta:last-author" content="Allison, Timothy B." /> <meta
> name="Creation-Date" content="2017-03-09T12:24:26Z" /> <meta
> name="Last-Author" content="Allison, Timothy B." /> <meta
> name="X-TIKA:origResourceName" content="C:\Users\tallison\Desktop\working\xlsb\"
> /> <meta name="Application-Version" content="16.0300" /> <meta
> name="Author" content="Allison, Timothy B." /> <meta name="publisher"
> content="" /> <meta name="dc:publisher" content="" /> <title></title>
> </head> <body><div><h1>mySheet1</h1> <table><tbody><tr> <td>String</td>
> <td>This is a string</td></tr> <tr> <td>integer</td> <td>13</td></tr> <tr>
> <td>float</td> <td>13,1211231321</td></tr>
> <tr> <td>currency</td> <td>$ 0,003,,03.00</td></tr>
> <tr> <td>percent</td> <td>20%</td></tr>
> <tr> <td>float 2</td> <td>13,12</td></tr> <tr> <td>long int</td>
> <td>123456789012345</td></tr> <tr> <td>longer int</td>
> <td>1,23456789012345E+15</td> <td><br /> Allison, Timothy B.: Allison,
> Timothy B.:
> test comment2
> </td></tr>
> <tr> <td>fraction</td> <td>1/4</td></tr> <tr> <td>date</td>
> <td>3/9/17</td></tr> <tr> <td>comment</td> <td>contents<br /> Allison,
> Timothy B.: Allison, Timothy B.:
> test comment
> </td></tr>
> <tr> <td>hyperlink</td> <td>tika_link</td></tr> <tr> <td>formula</td>
> <td>4</td> <td>2</td></tr> <tr> <td>formulaErr</td> <td>ERROR</td></tr>
> <tr> <td>formulaFloat</td> <td>0,5</td> <td>March</td> <td>April</td></tr>
> <tr> <td>customFormat1</td> <td> 46/1963</td> <td>merchant1</td>
> <td>1</td> <td>3</td></tr>
> <tr> <td>customFormat2</td> <td> 3/128</td> <td>merchant2</td> <td>2</td>
> <td>4</td></tr> <tr> <td>text test</td></tr> <tr> <td><br /> Allison,
> Timothy B.: Allison, Timothy B.:
> comment1
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment2
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment3
> </td></tr>
> <tr> <td>the</td> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment4 (end of row)
> </td></tr>
> <tr> <td>the</td> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment5 between cells
> </td> <td>quick</td></tr>
> <tr> <td>comment6<br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment6 actually in cell
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment7 end of file
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment8 end of file</td></tr>
> </tbody></table>
> <p>OddLeftHeader OddCenterHeader OddRightHeader</p> <p>EvenLeftHeader
> EvenCenterHeader EvenRightHeader </p> <p>FirstPageLeftHeader
> FirstPageCenterHeader FirstPageRightHeader</p> <p>OddLeftFooter
> OddCenterFooter OddRightFooter</p> <p>EvenLeftFooter EvenCenterFooter
> EvenRightFooter</p> <p>FirstPageLeftFooter FirstPageCenterFooter
> FirstPageRightFooter</p> <p>test textbox </p> <a href="
> http://lucene.apache.org/">http://lucene.apache.org/
> </a><p>myChartTitle</p>
> <p />
> merchant1 March April 1 3 merchant2 March April 2 4 <p /> <p /> <p /> <p
> /> <p>test WordArt</p> <p>myChartTitle</p> <p />
> merchant1 March April 1 3 merchant2 March April 2 4 <p /> <p /> <p /> <p
> /> <p>myChartTitle</p> <p />
> merchant1 March April 1 3 merchant2 March April 2 4 <p /> <p /> <p /> <p
> /> <a href="http://tika.apache.org/">http://tika.apache.org/</a></div>
> <div class="package-entry" /><div class="package-entry" /><div
> class="package-entry" /></body></html>
>
> 2017-07-10 10:17 GMT-03:00 JB Data <jbdata31@gmail.com<mailto:jbd
> ata31@gmail.com>>:
> +1.
> No regression in my 1.15 env<http://jbigdata.fr/jbigdata/ged-02.html>.
> Test docx chart extraction (TIKA-2254): OK.
>
> @JBΔ<http://jbigdata.fr>
>
> 2017-07-08 22:29 GMT+02:00 Chris Mattmann <mattmann@apache.org<mailto:ma
> ttmann@apache.org>>:
> +1 from me SIGS and CHECKSUMS look good.
>
> Thanks Tim!
>
> Cheers,
> Chris
>
> LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval
> \-server; do $HOME/bin/stage_apache_rc tika$type 1.16
> https://dist.apache.org/repos/dist/dev/tika/; done
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:--
> 5122k
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 836 100 836 0 0 1092 0 --:--:-- --:--:-- --:--:--
> 1092
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 34 100 34 0 0 96 0 --:--:-- --:--:-- --:--:--
> 96
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:--
> 8297k
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 836 100 836 0 0 1012 0 --:--:-- --:--:-- --:--:--
> 1012
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 34 100 34 0 0 46 0 --:--:-- --:--:-- --:--:--
> 46
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:--
> 4742k
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 836 100 836 0 0 1470 0 --:--:-- --:--:-- --:--:--
> 1469
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 34 100 34 0 0 65 0 --:--:-- --:--:-- --:--:--
> 65
> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc tika
> 1.16-src https://dist.apache.org/repos/dist/dev/tika/
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:--
> 5261k
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 836 100 836 0 0 2129 0 --:--:-- --:--:-- --:--:--
> 2127
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 34 100 34 0 0 47 0 --:--:-- --:--:-- --:--:--
> 47
> LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
> tika-1.16-src.zip tika-app-1.16.jar
> tika-eval-1.16.jar tika-server-1.16.jar
> tika-1.16-src.zip.asc tika-app-1.16.jar.asc
> tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
> tika-1.16-src.zip.md5 tika-app-1.16.jar.md5
> tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs
> Verifying Signature for file tika-1.16-src.zip.asc
> gpg: assuming signed data in `tika-1.16-src.zip'
> gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID EF0CF38A
> gpg: Good signature from "Tim Allison (ASF signing key) <
> tallison@apache.org<ma...@apache.org>>"
> gpg: WARNING: This key is not certified with a trusted signature!
> gpg: There is no indication that the signature belongs to the
> owner.
> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
> F38A Verifying Signature for file tika-app-1.16.jar.asc
> gpg: assuming signed data in `tika-app-1.16.jar'
> gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID EF0CF38A
> gpg: Good signature from "Tim Allison (ASF signing key) <
> tallison@apache.org<ma...@apache.org>>"
> gpg: WARNING: This key is not certified with a trusted signature!
> gpg: There is no indication that the signature belongs to the
> owner.
> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
> F38A Verifying Signature for file tika-eval-1.16.jar.asc
> gpg: assuming signed data in `tika-eval-1.16.jar'
> gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID EF0CF38A
> gpg: Good signature from "Tim Allison (ASF signing key) <
> tallison@apache.org<ma...@apache.org>>"
> gpg: WARNING: This key is not certified with a trusted signature!
> gpg: There is no indication that the signature belongs to the
> owner.
> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
> F38A Verifying Signature for file tika-server-1.16.jar.asc
> gpg: assuming signed data in `tika-server-1.16.jar'
> gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID EF0CF38A
> gpg: Good signature from "Tim Allison (ASF signing key) <
> tallison@apache.org<ma...@apache.org>>"
> gpg: WARNING: This key is not certified with a trusted signature!
> gpg: There is no indication that the signature belongs to the
> owner.
> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
> md5sum: stat '*.tar.gz': No such file or directory
> md5sum: stat '*.bz2': No such file or directory
> md5sum: stat '*.tgz': No such file or directory
> tika-1.16-src.zip: OK
> LMC-053601:apache-tika-1.16-rc1 mattmann$
>
>
>
>
> On 7/7/17, 7:40 PM, "Tim Allison" <tallison@apache.org<mailto:ta
> llison@apache.org>> wrote:
>
>
>
>
> A candidate for the Tika 1.16 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> https://github.com/apache/tika/tree/1.16-rc1
>
> The SHA1 checksum of the archive is
> e6884af0209ace42bf0b9b59d72c3c5a0052055e
>
> In addition, a staged maven repository is available here:
> https://repository.apache.org/content/repositories/orgapachetika-1025
>
>
>
> Please vote on releasing this package as Apache Tika 1.16.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.16
> [ ] -1 Do not release this package because...
>
>
> This is my +1.
>
> Cheers,
>
> Tim
>
>
>
>
>
>
>
>
RE: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by "Allison, Timothy B." <ta...@mitre.org>.
Is this worth a re-spin?
-----Original Message-----
From: Allison, Timothy B. [mailto:tallison@mitre.org]
Sent: Monday, July 10, 2017 10:26 AM
To: lfcnassif@gmail.com
Cc: dev@tika.apache.org
Subject: RE: [VOTE] Release Apache Tika 1.16 Candidate #1
Y. I need to fix that unit test. Thank you!
https://issues.apache.org/jira/browse/TIKA-2426
From: Luís Filipe Nassif [mailto:lfcnassif@gmail.com]
Sent: Monday, July 10, 2017 9:29 AM
To: user@tika.apache.org
Cc: dev@tika.apache.org; Tim Allison <ta...@apache.org>
Subject: Re: [VOTE] Release Apache Tika 1.16 Candidate #1
OK, that is a Locale issue, working around...
2017-07-10 10:24 GMT-03:00 Luís Filipe Nassif <lf...@gmail.com>>:
I got the following failure on Window7, jdk1.8.0_131, in OOXMLParserTest.testXLSBVarious:1537. Any ideas?
Failed tests:
OOXMLParserTest.testXLSBVarious:1537->TikaTest.assertContains:102 <td>13.1211231321</td> not found in:
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="date" content="2017-03-10T14:58:49Z" /> <meta name="extended-properties:AppVersion" content="16.0300" /> <meta name="dc:creator" content="Allison, Timothy B." /> <meta name="extended-properties:Company" content="" /> <meta name="dcterms:created" content="2017-03-09T12:24:26Z" /> <meta name="Last-Modified" content="2017-03-10T14:58:49Z" /> <meta name="dcterms:modified" content="2017-03-10T14:58:49Z" /> <meta name="Last-Save-Date" content="2017-03-10T14:58:49Z" /> <meta name="protected" content="false" /> <meta name="meta:save-date" content="2017-03-10T14:58:49Z" /> <meta name="Application-Name" content="Microsoft Excel" /> <meta name="modified" content="2017-03-10T14:58:49Z" /> <meta name="Content-Type" content="application/vnd.ms-excel.sheet.binary.macroenabled.12" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.microsoft.ooxml.OOXMLParser" /> <meta name="creator" content="Allison, Timothy B." /> <meta name="meta:author" content="Allison, Timothy B." /> <meta name="meta:creation-date" content="2017-03-09T12:24:26Z" /> <meta name="extended-properties:Application" content="Microsoft Excel" /> <meta name="meta:last-author" content="Allison, Timothy B." /> <meta name="Creation-Date" content="2017-03-09T12:24:26Z" /> <meta name="Last-Author" content="Allison, Timothy B." /> <meta name="X-TIKA:origResourceName" content="C:\Users\tallison\Desktop\working\xlsb\" /> <meta name="Application-Version" content="16.0300" /> <meta name="Author" content="Allison, Timothy B." /> <meta name="publisher" content="" /> <meta name="dc:publisher" content="" /> <title></title> </head> <body><div><h1>mySheet1</h1> <table><tbody><tr> <td>String</td> <td>This is a string</td></tr> <tr> <td>integer</td> <td>13</td></tr> <tr> <td>float</td> <td>13,1211231321</td></tr>
<tr> <td>currency</td> <td>$ 0,003,,03.00</td></tr>
<tr> <td>percent</td> <td>20%</td></tr>
<tr> <td>float 2</td> <td>13,12</td></tr> <tr> <td>long int</td> <td>123456789012345</td></tr> <tr> <td>longer int</td> <td>1,23456789012345E+15</td> <td><br /> Allison, Timothy B.: Allison, Timothy B.:
test comment2
</td></tr>
<tr> <td>fraction</td> <td>1/4</td></tr> <tr> <td>date</td> <td>3/9/17</td></tr> <tr> <td>comment</td> <td>contents<br /> Allison, Timothy B.: Allison, Timothy B.:
test comment
</td></tr>
<tr> <td>hyperlink</td> <td>tika_link</td></tr> <tr> <td>formula</td> <td>4</td> <td>2</td></tr> <tr> <td>formulaErr</td> <td>ERROR</td></tr> <tr> <td>formulaFloat</td> <td>0,5</td> <td>March</td> <td>April</td></tr>
<tr> <td>customFormat1</td> <td> 46/1963</td> <td>merchant1</td> <td>1</td> <td>3</td></tr>
<tr> <td>customFormat2</td> <td> 3/128</td> <td>merchant2</td> <td>2</td> <td>4</td></tr> <tr> <td>text test</td></tr> <tr> <td><br /> Allison, Timothy B.: Allison, Timothy B.:
comment1
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment2
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment3
</td></tr>
<tr> <td>the</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment4 (end of row)
</td></tr>
<tr> <td>the</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment5 between cells
</td> <td>quick</td></tr>
<tr> <td>comment6<br />
Allison, Timothy B.: Allison, Timothy B.:
comment6 actually in cell
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment7 end of file
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment8 end of file</td></tr>
</tbody></table>
<p>OddLeftHeader OddCenterHeader OddRightHeader</p> <p>EvenLeftHeader EvenCenterHeader EvenRightHeader </p> <p>FirstPageLeftHeader FirstPageCenterHeader FirstPageRightHeader</p> <p>OddLeftFooter OddCenterFooter OddRightFooter</p> <p>EvenLeftFooter EvenCenterFooter EvenRightFooter</p> <p>FirstPageLeftFooter FirstPageCenterFooter FirstPageRightFooter</p> <p>test textbox </p> <a href="http://lucene.apache.org/">http://lucene.apache.org/</a><p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p /> <p /> <p /> <p /> <p>test WordArt</p> <p>myChartTitle</p> <p />
merchant1 March April 1 3 merchant2 March April 2 4 <p /> <p /> <p /> <p /> <p>myChartTitle</p> <p />
merchant1 March April 1 3 merchant2 March April 2 4 <p /> <p /> <p /> <p /> <a href="http://tika.apache.org/">http://tika.apache.org/</a></div>
<div class="package-entry" /><div class="package-entry" /><div class="package-entry" /></body></html>
2017-07-10 10:17 GMT-03:00 JB Data <jb...@gmail.com>>:
+1.
No regression in my 1.15 env<http://jbigdata.fr/jbigdata/ged-02.html>.
Test docx chart extraction (TIKA-2254): OK.
@JBΔ<http://jbigdata.fr>
2017-07-08 22:29 GMT+02:00 Chris Mattmann <ma...@apache.org>>:
+1 from me SIGS and CHECKSUMS look good.
Thanks Tim!
Cheers,
Chris
LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval \-server; do $HOME/bin/stage_apache_rc tika$type 1.16 https://dist.apache.org/repos/dist/dev/tika/; done
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:-- 5122k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1092 0 --:--:-- --:--:-- --:--:-- 1092
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 96 0 --:--:-- --:--:-- --:--:-- 96
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:-- 8297k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1012 0 --:--:-- --:--:-- --:--:-- 1012
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 46 0 --:--:-- --:--:-- --:--:-- 46
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:-- 4742k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1470 0 --:--:-- --:--:-- --:--:-- 1469
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 65 0 --:--:-- --:--:-- --:--:-- 65
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc tika 1.16-src https://dist.apache.org/repos/dist/dev/tika/
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:-- 5261k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 2129 0 --:--:-- --:--:-- --:--:-- 2127
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 47 0 --:--:-- --:--:-- --:--:-- 47
LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
tika-1.16-src.zip tika-app-1.16.jar tika-eval-1.16.jar tika-server-1.16.jar
tika-1.16-src.zip.asc tika-app-1.16.jar.asc tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
tika-1.16-src.zip.md5 tika-app-1.16.jar.md5 tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs Verifying Signature for file tika-1.16-src.zip.asc
gpg: assuming signed data in `tika-1.16-src.zip'
gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A Verifying Signature for file tika-app-1.16.jar.asc
gpg: assuming signed data in `tika-app-1.16.jar'
gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A Verifying Signature for file tika-eval-1.16.jar.asc
gpg: assuming signed data in `tika-eval-1.16.jar'
gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A Verifying Signature for file tika-server-1.16.jar.asc
gpg: assuming signed data in `tika-server-1.16.jar'
gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
md5sum: stat '*.tar.gz': No such file or directory
md5sum: stat '*.bz2': No such file or directory
md5sum: stat '*.tgz': No such file or directory
tika-1.16-src.zip: OK
LMC-053601:apache-tika-1.16-rc1 mattmann$
On 7/7/17, 7:40 PM, "Tim Allison" <ta...@apache.org>> wrote:
A candidate for the Tika 1.16 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.16-rc1
The SHA1 checksum of the archive is
e6884af0209ace42bf0b9b59d72c3c5a0052055e
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1025
Please vote on releasing this package as Apache Tika 1.16.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.16
[ ] -1 Do not release this package because...
This is my +1.
Cheers,
Tim
RE: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by "Allison, Timothy B." <ta...@mitre.org>.
Y. I need to fix that unit test. Thank you!
https://issues.apache.org/jira/browse/TIKA-2426
From: Luís Filipe Nassif [mailto:lfcnassif@gmail.com]
Sent: Monday, July 10, 2017 9:29 AM
To: user@tika.apache.org
Cc: dev@tika.apache.org; Tim Allison <ta...@apache.org>
Subject: Re: [VOTE] Release Apache Tika 1.16 Candidate #1
OK, that is a Locale issue, working around...
2017-07-10 10:24 GMT-03:00 Luís Filipe Nassif <lf...@gmail.com>>:
I got the following failure on Window7, jdk1.8.0_131, in OOXMLParserTest.testXLSBVarious:1537. Any ideas?
Failed tests:
OOXMLParserTest.testXLSBVarious:1537->TikaTest.assertContains:102 <td>13.1211231321</td> not found in:
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="date" content="2017-03-10T14:58:49Z" />
<meta name="extended-properties:AppVersion" content="16.0300" />
<meta name="dc:creator" content="Allison, Timothy B." />
<meta name="extended-properties:Company" content="" />
<meta name="dcterms:created" content="2017-03-09T12:24:26Z" />
<meta name="Last-Modified" content="2017-03-10T14:58:49Z" />
<meta name="dcterms:modified" content="2017-03-10T14:58:49Z" />
<meta name="Last-Save-Date" content="2017-03-10T14:58:49Z" />
<meta name="protected" content="false" />
<meta name="meta:save-date" content="2017-03-10T14:58:49Z" />
<meta name="Application-Name" content="Microsoft Excel" />
<meta name="modified" content="2017-03-10T14:58:49Z" />
<meta name="Content-Type" content="application/vnd.ms-excel.sheet.binary.macroenabled.12" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.microsoft.ooxml.OOXMLParser" />
<meta name="creator" content="Allison, Timothy B." />
<meta name="meta:author" content="Allison, Timothy B." />
<meta name="meta:creation-date" content="2017-03-09T12:24:26Z" />
<meta name="extended-properties:Application" content="Microsoft Excel" />
<meta name="meta:last-author" content="Allison, Timothy B." />
<meta name="Creation-Date" content="2017-03-09T12:24:26Z" />
<meta name="Last-Author" content="Allison, Timothy B." />
<meta name="X-TIKA:origResourceName" content="C:\Users\tallison\Desktop\working\xlsb\" />
<meta name="Application-Version" content="16.0300" />
<meta name="Author" content="Allison, Timothy B." />
<meta name="publisher" content="" />
<meta name="dc:publisher" content="" />
<title></title>
</head>
<body><div><h1>mySheet1</h1>
<table><tbody><tr> <td>String</td> <td>This is a string</td></tr>
<tr> <td>integer</td> <td>13</td></tr>
<tr> <td>float</td> <td>13,1211231321</td></tr>
<tr> <td>currency</td> <td>$ 0,003,,03.00</td></tr>
<tr> <td>percent</td> <td>20%</td></tr>
<tr> <td>float 2</td> <td>13,12</td></tr>
<tr> <td>long int</td> <td>123456789012345</td></tr>
<tr> <td>longer int</td> <td>1,23456789012345E+15</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
test comment2
</td></tr>
<tr> <td>fraction</td> <td>1/4</td></tr>
<tr> <td>date</td> <td>3/9/17</td></tr>
<tr> <td>comment</td> <td>contents<br />
Allison, Timothy B.: Allison, Timothy B.:
test comment
</td></tr>
<tr> <td>hyperlink</td> <td>tika_link</td></tr>
<tr> <td>formula</td> <td>4</td> <td>2</td></tr>
<tr> <td>formulaErr</td> <td>ERROR</td></tr>
<tr> <td>formulaFloat</td> <td>0,5</td> <td>March</td> <td>April</td></tr>
<tr> <td>customFormat1</td> <td> 46/1963</td> <td>merchant1</td> <td>1</td> <td>3</td></tr>
<tr> <td>customFormat2</td> <td> 3/128</td> <td>merchant2</td> <td>2</td> <td>4</td></tr>
<tr> <td>text test</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment1
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment2
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment3
</td></tr>
<tr> <td>the</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment4 (end of row)
</td></tr>
<tr> <td>the</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment5 between cells
</td> <td>quick</td></tr>
<tr> <td>comment6<br />
Allison, Timothy B.: Allison, Timothy B.:
comment6 actually in cell
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment7 end of file
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment8 end of file</td></tr>
</tbody></table>
<p>OddLeftHeader OddCenterHeader OddRightHeader</p>
<p>EvenLeftHeader EvenCenterHeader EvenRightHeader
</p>
<p>FirstPageLeftHeader FirstPageCenterHeader FirstPageRightHeader</p>
<p>OddLeftFooter OddCenterFooter OddRightFooter</p>
<p>EvenLeftFooter EvenCenterFooter EvenRightFooter</p>
<p>FirstPageLeftFooter FirstPageCenterFooter FirstPageRightFooter</p>
<p>test textbox
</p>
<a href="http://lucene.apache.org/">http://lucene.apache.org/</a><p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p />
<p />
<p />
<p />
<p>test WordArt</p>
<p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p />
<p />
<p />
<p />
<p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p />
<p />
<p />
<p />
<a href="http://tika.apache.org/">http://tika.apache.org/</a></div>
<div class="package-entry" /><div class="package-entry" /><div class="package-entry" /></body></html>
2017-07-10 10:17 GMT-03:00 JB Data <jb...@gmail.com>>:
+1.
No regression in my 1.15 env<http://jbigdata.fr/jbigdata/ged-02.html>.
Test docx chart extraction (TIKA-2254): OK.
@JBΔ<http://jbigdata.fr>
2017-07-08 22:29 GMT+02:00 Chris Mattmann <ma...@apache.org>>:
+1 from me SIGS and CHECKSUMS look good.
Thanks Tim!
Cheers,
Chris
LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval \-server; do $HOME/bin/stage_apache_rc tika$type 1.16 https://dist.apache.org/repos/dist/dev/tika/; done
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:-- 5122k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1092 0 --:--:-- --:--:-- --:--:-- 1092
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 96 0 --:--:-- --:--:-- --:--:-- 96
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:-- 8297k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1012 0 --:--:-- --:--:-- --:--:-- 1012
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 46 0 --:--:-- --:--:-- --:--:-- 46
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:-- 4742k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1470 0 --:--:-- --:--:-- --:--:-- 1469
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 65 0 --:--:-- --:--:-- --:--:-- 65
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc tika 1.16-src https://dist.apache.org/repos/dist/dev/tika/
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:-- 5261k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 2129 0 --:--:-- --:--:-- --:--:-- 2127
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 47 0 --:--:-- --:--:-- --:--:-- 47
LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
tika-1.16-src.zip tika-app-1.16.jar tika-eval-1.16.jar tika-server-1.16.jar
tika-1.16-src.zip.asc tika-app-1.16.jar.asc tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
tika-1.16-src.zip.md5 tika-app-1.16.jar.md5 tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs
Verifying Signature for file tika-1.16-src.zip.asc
gpg: assuming signed data in `tika-1.16-src.zip'
gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
Verifying Signature for file tika-app-1.16.jar.asc
gpg: assuming signed data in `tika-app-1.16.jar'
gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
Verifying Signature for file tika-eval-1.16.jar.asc
gpg: assuming signed data in `tika-eval-1.16.jar'
gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
Verifying Signature for file tika-server-1.16.jar.asc
gpg: assuming signed data in `tika-server-1.16.jar'
gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
md5sum: stat '*.tar.gz': No such file or directory
md5sum: stat '*.bz2': No such file or directory
md5sum: stat '*.tgz': No such file or directory
tika-1.16-src.zip: OK
LMC-053601:apache-tika-1.16-rc1 mattmann$
On 7/7/17, 7:40 PM, "Tim Allison" <ta...@apache.org>> wrote:
A candidate for the Tika 1.16 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.16-rc1
The SHA1 checksum of the archive is
e6884af0209ace42bf0b9b59d72c3c5a0052055e
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1025
Please vote on releasing this package as Apache Tika 1.16.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.16
[ ] -1 Do not release this package because...
This is my +1.
Cheers,
Tim
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Luís Filipe Nassif <lf...@gmail.com>.
OK, that is a Locale issue, working around...
2017-07-10 10:24 GMT-03:00 Luís Filipe Nassif <lf...@gmail.com>:
> I got the following failure on Window7, jdk1.8.0_131, in OOXMLParserTest.testXLSBVarious:1537.
> Any ideas?
>
> Failed tests:
> OOXMLParserTest.testXLSBVarious:1537->TikaTest.assertContains:102
> <td>13.1211231321</td> not found in:
> <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <meta name="date" content="2017-03-10T14:58:49Z" />
> <meta name="extended-properties:AppVersion" content="16.0300" />
> <meta name="dc:creator" content="Allison, Timothy B." />
> <meta name="extended-properties:Company" content="" />
> <meta name="dcterms:created" content="2017-03-09T12:24:26Z" />
> <meta name="Last-Modified" content="2017-03-10T14:58:49Z" />
> <meta name="dcterms:modified" content="2017-03-10T14:58:49Z" />
> <meta name="Last-Save-Date" content="2017-03-10T14:58:49Z" />
> <meta name="protected" content="false" />
> <meta name="meta:save-date" content="2017-03-10T14:58:49Z" />
> <meta name="Application-Name" content="Microsoft Excel" />
> <meta name="modified" content="2017-03-10T14:58:49Z" />
> <meta name="Content-Type" content="application/vnd.ms-excel.sheet.binary.macroenabled.12"
> />
> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
> <meta name="X-Parsed-By" content="org.apache.tika.parser.microsoft.ooxml.OOXMLParser"
> />
> <meta name="creator" content="Allison, Timothy B." />
> <meta name="meta:author" content="Allison, Timothy B." />
> <meta name="meta:creation-date" content="2017-03-09T12:24:26Z" />
> <meta name="extended-properties:Application" content="Microsoft Excel" />
> <meta name="meta:last-author" content="Allison, Timothy B." />
> <meta name="Creation-Date" content="2017-03-09T12:24:26Z" />
> <meta name="Last-Author" content="Allison, Timothy B." />
> <meta name="X-TIKA:origResourceName" content="C:\Users\tallison\Desktop\working\xlsb\"
> />
> <meta name="Application-Version" content="16.0300" />
> <meta name="Author" content="Allison, Timothy B." />
> <meta name="publisher" content="" />
> <meta name="dc:publisher" content="" />
> <title></title>
> </head>
> <body><div><h1>mySheet1</h1>
> <table><tbody><tr> <td>String</td> <td>This is a string</td></tr>
> <tr> <td>integer</td> <td>13</td></tr>
> <tr> <td>float</td> <td>13,1211231321</td></tr>
> <tr> <td>currency</td> <td>$ 0,003,,03.00</td></tr>
> <tr> <td>percent</td> <td>20%</td></tr>
> <tr> <td>float 2</td> <td>13,12</td></tr>
> <tr> <td>long int</td> <td>123456789012345</td></tr>
> <tr> <td>longer int</td> <td>1,23456789012345E+15</td> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> test comment2
> </td></tr>
> <tr> <td>fraction</td> <td>1/4</td></tr>
> <tr> <td>date</td> <td>3/9/17</td></tr>
> <tr> <td>comment</td> <td>contents<br />
> Allison, Timothy B.: Allison, Timothy B.:
> test comment
> </td></tr>
> <tr> <td>hyperlink</td> <td>tika_link</td></tr>
> <tr> <td>formula</td> <td>4</td> <td>2</td></tr>
> <tr> <td>formulaErr</td> <td>ERROR</td></tr>
> <tr> <td>formulaFloat</td> <td>0,5</td> <td>March</td> <td>April</td></tr>
> <tr> <td>customFormat1</td> <td> 46/1963</td> <td>merchant1</td>
> <td>1</td> <td>3</td></tr>
> <tr> <td>customFormat2</td> <td> 3/128</td> <td>merchant2</td> <td>2</td>
> <td>4</td></tr>
> <tr> <td>text test</td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment1
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment2
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment3
> </td></tr>
> <tr> <td>the</td> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment4 (end of row)
> </td></tr>
> <tr> <td>the</td> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment5 between cells
> </td> <td>quick</td></tr>
> <tr> <td>comment6<br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment6 actually in cell
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment7 end of file
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment8 end of file</td></tr>
> </tbody></table>
> <p>OddLeftHeader OddCenterHeader OddRightHeader</p>
> <p>EvenLeftHeader EvenCenterHeader EvenRightHeader
> </p>
> <p>FirstPageLeftHeader FirstPageCenterHeader FirstPageRightHeader</p>
> <p>OddLeftFooter OddCenterFooter OddRightFooter</p>
> <p>EvenLeftFooter EvenCenterFooter EvenRightFooter</p>
> <p>FirstPageLeftFooter FirstPageCenterFooter FirstPageRightFooter</p>
> <p>test textbox
> </p>
> <a href="http://lucene.apache.org/">http://lucene.apache.org/
> </a><p>myChartTitle</p>
> <p />
> merchant1 March April 1 3 merchant2 March April 2 4 <p />
> <p />
> <p />
> <p />
> <p>test WordArt</p>
> <p>myChartTitle</p>
> <p />
> merchant1 March April 1 3 merchant2 March April 2 4 <p />
> <p />
> <p />
> <p />
> <p>myChartTitle</p>
> <p />
> merchant1 March April 1 3 merchant2 March April 2 4 <p />
> <p />
> <p />
> <p />
> <a href="http://tika.apache.org/">http://tika.apache.org/</a></div>
> <div class="package-entry" /><div class="package-entry" /><div
> class="package-entry" /></body></html>
>
> 2017-07-10 10:17 GMT-03:00 JB Data <jb...@gmail.com>:
>
>> +1.
>> No regression in my 1.15 env <http://jbigdata.fr/jbigdata/ged-02.html>.
>> Test docx chart extraction (TIKA-2254): OK.
>>
>> @*JB*Δ <http://jbigdata.fr>
>>
>>
>> 2017-07-08 22:29 GMT+02:00 Chris Mattmann <ma...@apache.org>:
>>
>>> +1 from me SIGS and CHECKSUMS look good.
>>>
>>> Thanks Tim!
>>>
>>> Cheers,
>>> Chris
>>>
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval
>>> \-server; do $HOME/bin/stage_apache_rc tika$type 1.16
>>> https://dist.apache.org/repos/dist/dev/tika/; done
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:--
>>> 5122k
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 836 100 836 0 0 1092 0 --:--:-- --:--:--
>>> --:--:-- 1092
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 34 100 34 0 0 96 0 --:--:-- --:--:--
>>> --:--:-- 96
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:--
>>> 8297k
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 836 100 836 0 0 1012 0 --:--:-- --:--:--
>>> --:--:-- 1012
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 34 100 34 0 0 46 0 --:--:-- --:--:--
>>> --:--:-- 46
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:--
>>> 4742k
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 836 100 836 0 0 1470 0 --:--:-- --:--:--
>>> --:--:-- 1469
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 34 100 34 0 0 65 0 --:--:-- --:--:--
>>> --:--:-- 65
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc
>>> tika 1.16-src https://dist.apache.org/repos/dist/dev/tika/
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:--
>>> 5261k
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 836 100 836 0 0 2129 0 --:--:-- --:--:--
>>> --:--:-- 2127
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 34 100 34 0 0 47 0 --:--:-- --:--:--
>>> --:--:-- 47
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
>>> tika-1.16-src.zip tika-app-1.16.jar
>>> tika-eval-1.16.jar tika-server-1.16.jar
>>> tika-1.16-src.zip.asc tika-app-1.16.jar.asc
>>> tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
>>> tika-1.16-src.zip.md5 tika-app-1.16.jar.md5
>>> tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs
>>> Verifying Signature for file tika-1.16-src.zip.asc
>>> gpg: assuming signed data in `tika-1.16-src.zip'
>>> gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID
>>> EF0CF38A
>>> gpg: Good signature from "Tim Allison (ASF signing key) <
>>> tallison@apache.org>"
>>> gpg: WARNING: This key is not certified with a trusted signature!
>>> gpg: There is no indication that the signature belongs to the
>>> owner.
>>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>>> F38A
>>> Verifying Signature for file tika-app-1.16.jar.asc
>>> gpg: assuming signed data in `tika-app-1.16.jar'
>>> gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID
>>> EF0CF38A
>>> gpg: Good signature from "Tim Allison (ASF signing key) <
>>> tallison@apache.org>"
>>> gpg: WARNING: This key is not certified with a trusted signature!
>>> gpg: There is no indication that the signature belongs to the
>>> owner.
>>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>>> F38A
>>> Verifying Signature for file tika-eval-1.16.jar.asc
>>> gpg: assuming signed data in `tika-eval-1.16.jar'
>>> gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID
>>> EF0CF38A
>>> gpg: Good signature from "Tim Allison (ASF signing key) <
>>> tallison@apache.org>"
>>> gpg: WARNING: This key is not certified with a trusted signature!
>>> gpg: There is no indication that the signature belongs to the
>>> owner.
>>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>>> F38A
>>> Verifying Signature for file tika-server-1.16.jar.asc
>>> gpg: assuming signed data in `tika-server-1.16.jar'
>>> gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID
>>> EF0CF38A
>>> gpg: Good signature from "Tim Allison (ASF signing key) <
>>> tallison@apache.org>"
>>> gpg: WARNING: This key is not certified with a trusted signature!
>>> gpg: There is no indication that the signature belongs to the
>>> owner.
>>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>>> F38A
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
>>> md5sum: stat '*.tar.gz': No such file or directory
>>> md5sum: stat '*.bz2': No such file or directory
>>> md5sum: stat '*.tgz': No such file or directory
>>> tika-1.16-src.zip: OK
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$
>>>
>>>
>>>
>>>
>>> On 7/7/17, 7:40 PM, "Tim Allison" <ta...@apache.org> wrote:
>>>
>>>
>>>
>>>
>>> A candidate for the Tika 1.16 release is available at:
>>> https://dist.apache.org/repos/dist/dev/tika/
>>>
>>> The release candidate is a zip archive of the sources in:
>>> https://github.com/apache/tika/tree/1.16-rc1
>>>
>>> The SHA1 checksum of the archive is
>>> e6884af0209ace42bf0b9b59d72c3c5a0052055e
>>>
>>> In addition, a staged maven repository is available here:
>>> https://repository.apache.org/content/repositories/orgapache
>>> tika-1025
>>>
>>>
>>>
>>> Please vote on releasing this package as Apache Tika 1.16.
>>> The vote is open for the next 72 hours and passes if a majority of at
>>> least three +1 Tika PMC votes are cast.
>>>
>>> [ ] +1 Release this package as Apache Tika 1.16
>>> [ ] -1 Do not release this package because...
>>>
>>>
>>> This is my +1.
>>>
>>> Cheers,
>>>
>>> Tim
>>>
>>>
>>>
>>>
>>>
>>>
>>
>
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Luís Filipe Nassif <lf...@gmail.com>.
OK, that is a Locale issue, working around...
2017-07-10 10:24 GMT-03:00 Luís Filipe Nassif <lf...@gmail.com>:
> I got the following failure on Window7, jdk1.8.0_131, in OOXMLParserTest.testXLSBVarious:1537.
> Any ideas?
>
> Failed tests:
> OOXMLParserTest.testXLSBVarious:1537->TikaTest.assertContains:102
> <td>13.1211231321</td> not found in:
> <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <meta name="date" content="2017-03-10T14:58:49Z" />
> <meta name="extended-properties:AppVersion" content="16.0300" />
> <meta name="dc:creator" content="Allison, Timothy B." />
> <meta name="extended-properties:Company" content="" />
> <meta name="dcterms:created" content="2017-03-09T12:24:26Z" />
> <meta name="Last-Modified" content="2017-03-10T14:58:49Z" />
> <meta name="dcterms:modified" content="2017-03-10T14:58:49Z" />
> <meta name="Last-Save-Date" content="2017-03-10T14:58:49Z" />
> <meta name="protected" content="false" />
> <meta name="meta:save-date" content="2017-03-10T14:58:49Z" />
> <meta name="Application-Name" content="Microsoft Excel" />
> <meta name="modified" content="2017-03-10T14:58:49Z" />
> <meta name="Content-Type" content="application/vnd.ms-excel.sheet.binary.macroenabled.12"
> />
> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
> <meta name="X-Parsed-By" content="org.apache.tika.parser.microsoft.ooxml.OOXMLParser"
> />
> <meta name="creator" content="Allison, Timothy B." />
> <meta name="meta:author" content="Allison, Timothy B." />
> <meta name="meta:creation-date" content="2017-03-09T12:24:26Z" />
> <meta name="extended-properties:Application" content="Microsoft Excel" />
> <meta name="meta:last-author" content="Allison, Timothy B." />
> <meta name="Creation-Date" content="2017-03-09T12:24:26Z" />
> <meta name="Last-Author" content="Allison, Timothy B." />
> <meta name="X-TIKA:origResourceName" content="C:\Users\tallison\Desktop\working\xlsb\"
> />
> <meta name="Application-Version" content="16.0300" />
> <meta name="Author" content="Allison, Timothy B." />
> <meta name="publisher" content="" />
> <meta name="dc:publisher" content="" />
> <title></title>
> </head>
> <body><div><h1>mySheet1</h1>
> <table><tbody><tr> <td>String</td> <td>This is a string</td></tr>
> <tr> <td>integer</td> <td>13</td></tr>
> <tr> <td>float</td> <td>13,1211231321</td></tr>
> <tr> <td>currency</td> <td>$ 0,003,,03.00</td></tr>
> <tr> <td>percent</td> <td>20%</td></tr>
> <tr> <td>float 2</td> <td>13,12</td></tr>
> <tr> <td>long int</td> <td>123456789012345</td></tr>
> <tr> <td>longer int</td> <td>1,23456789012345E+15</td> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> test comment2
> </td></tr>
> <tr> <td>fraction</td> <td>1/4</td></tr>
> <tr> <td>date</td> <td>3/9/17</td></tr>
> <tr> <td>comment</td> <td>contents<br />
> Allison, Timothy B.: Allison, Timothy B.:
> test comment
> </td></tr>
> <tr> <td>hyperlink</td> <td>tika_link</td></tr>
> <tr> <td>formula</td> <td>4</td> <td>2</td></tr>
> <tr> <td>formulaErr</td> <td>ERROR</td></tr>
> <tr> <td>formulaFloat</td> <td>0,5</td> <td>March</td> <td>April</td></tr>
> <tr> <td>customFormat1</td> <td> 46/1963</td> <td>merchant1</td>
> <td>1</td> <td>3</td></tr>
> <tr> <td>customFormat2</td> <td> 3/128</td> <td>merchant2</td> <td>2</td>
> <td>4</td></tr>
> <tr> <td>text test</td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment1
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment2
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment3
> </td></tr>
> <tr> <td>the</td> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment4 (end of row)
> </td></tr>
> <tr> <td>the</td> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment5 between cells
> </td> <td>quick</td></tr>
> <tr> <td>comment6<br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment6 actually in cell
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment7 end of file
> </td></tr>
> <tr> <td><br />
> Allison, Timothy B.: Allison, Timothy B.:
> comment8 end of file</td></tr>
> </tbody></table>
> <p>OddLeftHeader OddCenterHeader OddRightHeader</p>
> <p>EvenLeftHeader EvenCenterHeader EvenRightHeader
> </p>
> <p>FirstPageLeftHeader FirstPageCenterHeader FirstPageRightHeader</p>
> <p>OddLeftFooter OddCenterFooter OddRightFooter</p>
> <p>EvenLeftFooter EvenCenterFooter EvenRightFooter</p>
> <p>FirstPageLeftFooter FirstPageCenterFooter FirstPageRightFooter</p>
> <p>test textbox
> </p>
> <a href="http://lucene.apache.org/">http://lucene.apache.org/
> </a><p>myChartTitle</p>
> <p />
> merchant1 March April 1 3 merchant2 March April 2 4 <p />
> <p />
> <p />
> <p />
> <p>test WordArt</p>
> <p>myChartTitle</p>
> <p />
> merchant1 March April 1 3 merchant2 March April 2 4 <p />
> <p />
> <p />
> <p />
> <p>myChartTitle</p>
> <p />
> merchant1 March April 1 3 merchant2 March April 2 4 <p />
> <p />
> <p />
> <p />
> <a href="http://tika.apache.org/">http://tika.apache.org/</a></div>
> <div class="package-entry" /><div class="package-entry" /><div
> class="package-entry" /></body></html>
>
> 2017-07-10 10:17 GMT-03:00 JB Data <jb...@gmail.com>:
>
>> +1.
>> No regression in my 1.15 env <http://jbigdata.fr/jbigdata/ged-02.html>.
>> Test docx chart extraction (TIKA-2254): OK.
>>
>> @*JB*Δ <http://jbigdata.fr>
>>
>>
>> 2017-07-08 22:29 GMT+02:00 Chris Mattmann <ma...@apache.org>:
>>
>>> +1 from me SIGS and CHECKSUMS look good.
>>>
>>> Thanks Tim!
>>>
>>> Cheers,
>>> Chris
>>>
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval
>>> \-server; do $HOME/bin/stage_apache_rc tika$type 1.16
>>> https://dist.apache.org/repos/dist/dev/tika/; done
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:--
>>> 5122k
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 836 100 836 0 0 1092 0 --:--:-- --:--:--
>>> --:--:-- 1092
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 34 100 34 0 0 96 0 --:--:-- --:--:--
>>> --:--:-- 96
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:--
>>> 8297k
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 836 100 836 0 0 1012 0 --:--:-- --:--:--
>>> --:--:-- 1012
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 34 100 34 0 0 46 0 --:--:-- --:--:--
>>> --:--:-- 46
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:--
>>> 4742k
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 836 100 836 0 0 1470 0 --:--:-- --:--:--
>>> --:--:-- 1469
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 34 100 34 0 0 65 0 --:--:-- --:--:--
>>> --:--:-- 65
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc
>>> tika 1.16-src https://dist.apache.org/repos/dist/dev/tika/
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:--
>>> 5261k
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 836 100 836 0 0 2129 0 --:--:-- --:--:--
>>> --:--:-- 2127
>>> % Total % Received % Xferd Average Speed Time Time Time
>>> Current
>>> Dload Upload Total Spent Left
>>> Speed
>>> 100 34 100 34 0 0 47 0 --:--:-- --:--:--
>>> --:--:-- 47
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
>>> tika-1.16-src.zip tika-app-1.16.jar
>>> tika-eval-1.16.jar tika-server-1.16.jar
>>> tika-1.16-src.zip.asc tika-app-1.16.jar.asc
>>> tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
>>> tika-1.16-src.zip.md5 tika-app-1.16.jar.md5
>>> tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs
>>> Verifying Signature for file tika-1.16-src.zip.asc
>>> gpg: assuming signed data in `tika-1.16-src.zip'
>>> gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID
>>> EF0CF38A
>>> gpg: Good signature from "Tim Allison (ASF signing key) <
>>> tallison@apache.org>"
>>> gpg: WARNING: This key is not certified with a trusted signature!
>>> gpg: There is no indication that the signature belongs to the
>>> owner.
>>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>>> F38A
>>> Verifying Signature for file tika-app-1.16.jar.asc
>>> gpg: assuming signed data in `tika-app-1.16.jar'
>>> gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID
>>> EF0CF38A
>>> gpg: Good signature from "Tim Allison (ASF signing key) <
>>> tallison@apache.org>"
>>> gpg: WARNING: This key is not certified with a trusted signature!
>>> gpg: There is no indication that the signature belongs to the
>>> owner.
>>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>>> F38A
>>> Verifying Signature for file tika-eval-1.16.jar.asc
>>> gpg: assuming signed data in `tika-eval-1.16.jar'
>>> gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID
>>> EF0CF38A
>>> gpg: Good signature from "Tim Allison (ASF signing key) <
>>> tallison@apache.org>"
>>> gpg: WARNING: This key is not certified with a trusted signature!
>>> gpg: There is no indication that the signature belongs to the
>>> owner.
>>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>>> F38A
>>> Verifying Signature for file tika-server-1.16.jar.asc
>>> gpg: assuming signed data in `tika-server-1.16.jar'
>>> gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID
>>> EF0CF38A
>>> gpg: Good signature from "Tim Allison (ASF signing key) <
>>> tallison@apache.org>"
>>> gpg: WARNING: This key is not certified with a trusted signature!
>>> gpg: There is no indication that the signature belongs to the
>>> owner.
>>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>>> F38A
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
>>> md5sum: stat '*.tar.gz': No such file or directory
>>> md5sum: stat '*.bz2': No such file or directory
>>> md5sum: stat '*.tgz': No such file or directory
>>> tika-1.16-src.zip: OK
>>> LMC-053601:apache-tika-1.16-rc1 mattmann$
>>>
>>>
>>>
>>>
>>> On 7/7/17, 7:40 PM, "Tim Allison" <ta...@apache.org> wrote:
>>>
>>>
>>>
>>>
>>> A candidate for the Tika 1.16 release is available at:
>>> https://dist.apache.org/repos/dist/dev/tika/
>>>
>>> The release candidate is a zip archive of the sources in:
>>> https://github.com/apache/tika/tree/1.16-rc1
>>>
>>> The SHA1 checksum of the archive is
>>> e6884af0209ace42bf0b9b59d72c3c5a0052055e
>>>
>>> In addition, a staged maven repository is available here:
>>> https://repository.apache.org/content/repositories/orgapache
>>> tika-1025
>>>
>>>
>>>
>>> Please vote on releasing this package as Apache Tika 1.16.
>>> The vote is open for the next 72 hours and passes if a majority of at
>>> least three +1 Tika PMC votes are cast.
>>>
>>> [ ] +1 Release this package as Apache Tika 1.16
>>> [ ] -1 Do not release this package because...
>>>
>>>
>>> This is my +1.
>>>
>>> Cheers,
>>>
>>> Tim
>>>
>>>
>>>
>>>
>>>
>>>
>>
>
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Luís Filipe Nassif <lf...@gmail.com>.
I got the following failure on Window7, jdk1.8.0_131, in
OOXMLParserTest.testXLSBVarious:1537. Any ideas?
Failed tests:
OOXMLParserTest.testXLSBVarious:1537->TikaTest.assertContains:102
<td>13.1211231321</td> not found in:
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="date" content="2017-03-10T14:58:49Z" />
<meta name="extended-properties:AppVersion" content="16.0300" />
<meta name="dc:creator" content="Allison, Timothy B." />
<meta name="extended-properties:Company" content="" />
<meta name="dcterms:created" content="2017-03-09T12:24:26Z" />
<meta name="Last-Modified" content="2017-03-10T14:58:49Z" />
<meta name="dcterms:modified" content="2017-03-10T14:58:49Z" />
<meta name="Last-Save-Date" content="2017-03-10T14:58:49Z" />
<meta name="protected" content="false" />
<meta name="meta:save-date" content="2017-03-10T14:58:49Z" />
<meta name="Application-Name" content="Microsoft Excel" />
<meta name="modified" content="2017-03-10T14:58:49Z" />
<meta name="Content-Type"
content="application/vnd.ms-excel.sheet.binary.macroenabled.12" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By"
content="org.apache.tika.parser.microsoft.ooxml.OOXMLParser" />
<meta name="creator" content="Allison, Timothy B." />
<meta name="meta:author" content="Allison, Timothy B." />
<meta name="meta:creation-date" content="2017-03-09T12:24:26Z" />
<meta name="extended-properties:Application" content="Microsoft Excel" />
<meta name="meta:last-author" content="Allison, Timothy B." />
<meta name="Creation-Date" content="2017-03-09T12:24:26Z" />
<meta name="Last-Author" content="Allison, Timothy B." />
<meta name="X-TIKA:origResourceName"
content="C:\Users\tallison\Desktop\working\xlsb\" />
<meta name="Application-Version" content="16.0300" />
<meta name="Author" content="Allison, Timothy B." />
<meta name="publisher" content="" />
<meta name="dc:publisher" content="" />
<title></title>
</head>
<body><div><h1>mySheet1</h1>
<table><tbody><tr> <td>String</td> <td>This is a string</td></tr>
<tr> <td>integer</td> <td>13</td></tr>
<tr> <td>float</td> <td>13,1211231321</td></tr>
<tr> <td>currency</td> <td>$ 0,003,,03.00</td></tr>
<tr> <td>percent</td> <td>20%</td></tr>
<tr> <td>float 2</td> <td>13,12</td></tr>
<tr> <td>long int</td> <td>123456789012345</td></tr>
<tr> <td>longer int</td> <td>1,23456789012345E+15</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
test comment2
</td></tr>
<tr> <td>fraction</td> <td>1/4</td></tr>
<tr> <td>date</td> <td>3/9/17</td></tr>
<tr> <td>comment</td> <td>contents<br />
Allison, Timothy B.: Allison, Timothy B.:
test comment
</td></tr>
<tr> <td>hyperlink</td> <td>tika_link</td></tr>
<tr> <td>formula</td> <td>4</td> <td>2</td></tr>
<tr> <td>formulaErr</td> <td>ERROR</td></tr>
<tr> <td>formulaFloat</td> <td>0,5</td> <td>March</td> <td>April</td></tr>
<tr> <td>customFormat1</td> <td> 46/1963</td> <td>merchant1</td>
<td>1</td> <td>3</td></tr>
<tr> <td>customFormat2</td> <td> 3/128</td> <td>merchant2</td> <td>2</td>
<td>4</td></tr>
<tr> <td>text test</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment1
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment2
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment3
</td></tr>
<tr> <td>the</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment4 (end of row)
</td></tr>
<tr> <td>the</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment5 between cells
</td> <td>quick</td></tr>
<tr> <td>comment6<br />
Allison, Timothy B.: Allison, Timothy B.:
comment6 actually in cell
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment7 end of file
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment8 end of file</td></tr>
</tbody></table>
<p>OddLeftHeader OddCenterHeader OddRightHeader</p>
<p>EvenLeftHeader EvenCenterHeader EvenRightHeader
</p>
<p>FirstPageLeftHeader FirstPageCenterHeader FirstPageRightHeader</p>
<p>OddLeftFooter OddCenterFooter OddRightFooter</p>
<p>EvenLeftFooter EvenCenterFooter EvenRightFooter</p>
<p>FirstPageLeftFooter FirstPageCenterFooter FirstPageRightFooter</p>
<p>test textbox
</p>
<a href="http://lucene.apache.org/">http://lucene.apache.org/
</a><p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p />
<p />
<p />
<p />
<p>test WordArt</p>
<p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p />
<p />
<p />
<p />
<p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p />
<p />
<p />
<p />
<a href="http://tika.apache.org/">http://tika.apache.org/</a></div>
<div class="package-entry" /><div class="package-entry" /><div
class="package-entry" /></body></html>
2017-07-10 10:17 GMT-03:00 JB Data <jb...@gmail.com>:
> +1.
> No regression in my 1.15 env <http://jbigdata.fr/jbigdata/ged-02.html>.
> Test docx chart extraction (TIKA-2254): OK.
>
> @*JB*Δ <http://jbigdata.fr>
>
>
> 2017-07-08 22:29 GMT+02:00 Chris Mattmann <ma...@apache.org>:
>
>> +1 from me SIGS and CHECKSUMS look good.
>>
>> Thanks Tim!
>>
>> Cheers,
>> Chris
>>
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval
>> \-server; do $HOME/bin/stage_apache_rc tika$type 1.16
>> https://dist.apache.org/repos/dist/dev/tika/; done
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:--
>> 5122k
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 836 100 836 0 0 1092 0 --:--:-- --:--:-- --:--:--
>> 1092
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 34 100 34 0 0 96 0 --:--:-- --:--:-- --:--:--
>> 96
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:--
>> 8297k
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 836 100 836 0 0 1012 0 --:--:-- --:--:-- --:--:--
>> 1012
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 34 100 34 0 0 46 0 --:--:-- --:--:-- --:--:--
>> 46
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:--
>> 4742k
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 836 100 836 0 0 1470 0 --:--:-- --:--:-- --:--:--
>> 1469
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 34 100 34 0 0 65 0 --:--:-- --:--:-- --:--:--
>> 65
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc tika
>> 1.16-src https://dist.apache.org/repos/dist/dev/tika/
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:--
>> 5261k
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 836 100 836 0 0 2129 0 --:--:-- --:--:-- --:--:--
>> 2127
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 34 100 34 0 0 47 0 --:--:-- --:--:-- --:--:--
>> 47
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
>> tika-1.16-src.zip tika-app-1.16.jar
>> tika-eval-1.16.jar tika-server-1.16.jar
>> tika-1.16-src.zip.asc tika-app-1.16.jar.asc
>> tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
>> tika-1.16-src.zip.md5 tika-app-1.16.jar.md5
>> tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs
>> Verifying Signature for file tika-1.16-src.zip.asc
>> gpg: assuming signed data in `tika-1.16-src.zip'
>> gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID EF0CF38A
>> gpg: Good signature from "Tim Allison (ASF signing key) <
>> tallison@apache.org>"
>> gpg: WARNING: This key is not certified with a trusted signature!
>> gpg: There is no indication that the signature belongs to the
>> owner.
>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>> F38A
>> Verifying Signature for file tika-app-1.16.jar.asc
>> gpg: assuming signed data in `tika-app-1.16.jar'
>> gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID EF0CF38A
>> gpg: Good signature from "Tim Allison (ASF signing key) <
>> tallison@apache.org>"
>> gpg: WARNING: This key is not certified with a trusted signature!
>> gpg: There is no indication that the signature belongs to the
>> owner.
>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>> F38A
>> Verifying Signature for file tika-eval-1.16.jar.asc
>> gpg: assuming signed data in `tika-eval-1.16.jar'
>> gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID EF0CF38A
>> gpg: Good signature from "Tim Allison (ASF signing key) <
>> tallison@apache.org>"
>> gpg: WARNING: This key is not certified with a trusted signature!
>> gpg: There is no indication that the signature belongs to the
>> owner.
>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>> F38A
>> Verifying Signature for file tika-server-1.16.jar.asc
>> gpg: assuming signed data in `tika-server-1.16.jar'
>> gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID EF0CF38A
>> gpg: Good signature from "Tim Allison (ASF signing key) <
>> tallison@apache.org>"
>> gpg: WARNING: This key is not certified with a trusted signature!
>> gpg: There is no indication that the signature belongs to the
>> owner.
>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>> F38A
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
>> md5sum: stat '*.tar.gz': No such file or directory
>> md5sum: stat '*.bz2': No such file or directory
>> md5sum: stat '*.tgz': No such file or directory
>> tika-1.16-src.zip: OK
>> LMC-053601:apache-tika-1.16-rc1 mattmann$
>>
>>
>>
>>
>> On 7/7/17, 7:40 PM, "Tim Allison" <ta...@apache.org> wrote:
>>
>>
>>
>>
>> A candidate for the Tika 1.16 release is available at:
>> https://dist.apache.org/repos/dist/dev/tika/
>>
>> The release candidate is a zip archive of the sources in:
>> https://github.com/apache/tika/tree/1.16-rc1
>>
>> The SHA1 checksum of the archive is
>> e6884af0209ace42bf0b9b59d72c3c5a0052055e
>>
>> In addition, a staged maven repository is available here:
>> https://repository.apache.org/content/repositories/orgapachetika-1025
>>
>>
>>
>> Please vote on releasing this package as Apache Tika 1.16.
>> The vote is open for the next 72 hours and passes if a majority of at
>> least three +1 Tika PMC votes are cast.
>>
>> [ ] +1 Release this package as Apache Tika 1.16
>> [ ] -1 Do not release this package because...
>>
>>
>> This is my +1.
>>
>> Cheers,
>>
>> Tim
>>
>>
>>
>>
>>
>>
>
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Luís Filipe Nassif <lf...@gmail.com>.
I got the following failure on Window7, jdk1.8.0_131, in
OOXMLParserTest.testXLSBVarious:1537. Any ideas?
Failed tests:
OOXMLParserTest.testXLSBVarious:1537->TikaTest.assertContains:102
<td>13.1211231321</td> not found in:
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="date" content="2017-03-10T14:58:49Z" />
<meta name="extended-properties:AppVersion" content="16.0300" />
<meta name="dc:creator" content="Allison, Timothy B." />
<meta name="extended-properties:Company" content="" />
<meta name="dcterms:created" content="2017-03-09T12:24:26Z" />
<meta name="Last-Modified" content="2017-03-10T14:58:49Z" />
<meta name="dcterms:modified" content="2017-03-10T14:58:49Z" />
<meta name="Last-Save-Date" content="2017-03-10T14:58:49Z" />
<meta name="protected" content="false" />
<meta name="meta:save-date" content="2017-03-10T14:58:49Z" />
<meta name="Application-Name" content="Microsoft Excel" />
<meta name="modified" content="2017-03-10T14:58:49Z" />
<meta name="Content-Type"
content="application/vnd.ms-excel.sheet.binary.macroenabled.12" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By"
content="org.apache.tika.parser.microsoft.ooxml.OOXMLParser" />
<meta name="creator" content="Allison, Timothy B." />
<meta name="meta:author" content="Allison, Timothy B." />
<meta name="meta:creation-date" content="2017-03-09T12:24:26Z" />
<meta name="extended-properties:Application" content="Microsoft Excel" />
<meta name="meta:last-author" content="Allison, Timothy B." />
<meta name="Creation-Date" content="2017-03-09T12:24:26Z" />
<meta name="Last-Author" content="Allison, Timothy B." />
<meta name="X-TIKA:origResourceName"
content="C:\Users\tallison\Desktop\working\xlsb\" />
<meta name="Application-Version" content="16.0300" />
<meta name="Author" content="Allison, Timothy B." />
<meta name="publisher" content="" />
<meta name="dc:publisher" content="" />
<title></title>
</head>
<body><div><h1>mySheet1</h1>
<table><tbody><tr> <td>String</td> <td>This is a string</td></tr>
<tr> <td>integer</td> <td>13</td></tr>
<tr> <td>float</td> <td>13,1211231321</td></tr>
<tr> <td>currency</td> <td>$ 0,003,,03.00</td></tr>
<tr> <td>percent</td> <td>20%</td></tr>
<tr> <td>float 2</td> <td>13,12</td></tr>
<tr> <td>long int</td> <td>123456789012345</td></tr>
<tr> <td>longer int</td> <td>1,23456789012345E+15</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
test comment2
</td></tr>
<tr> <td>fraction</td> <td>1/4</td></tr>
<tr> <td>date</td> <td>3/9/17</td></tr>
<tr> <td>comment</td> <td>contents<br />
Allison, Timothy B.: Allison, Timothy B.:
test comment
</td></tr>
<tr> <td>hyperlink</td> <td>tika_link</td></tr>
<tr> <td>formula</td> <td>4</td> <td>2</td></tr>
<tr> <td>formulaErr</td> <td>ERROR</td></tr>
<tr> <td>formulaFloat</td> <td>0,5</td> <td>March</td> <td>April</td></tr>
<tr> <td>customFormat1</td> <td> 46/1963</td> <td>merchant1</td>
<td>1</td> <td>3</td></tr>
<tr> <td>customFormat2</td> <td> 3/128</td> <td>merchant2</td> <td>2</td>
<td>4</td></tr>
<tr> <td>text test</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment1
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment2
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment3
</td></tr>
<tr> <td>the</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment4 (end of row)
</td></tr>
<tr> <td>the</td> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment5 between cells
</td> <td>quick</td></tr>
<tr> <td>comment6<br />
Allison, Timothy B.: Allison, Timothy B.:
comment6 actually in cell
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment7 end of file
</td></tr>
<tr> <td><br />
Allison, Timothy B.: Allison, Timothy B.:
comment8 end of file</td></tr>
</tbody></table>
<p>OddLeftHeader OddCenterHeader OddRightHeader</p>
<p>EvenLeftHeader EvenCenterHeader EvenRightHeader
</p>
<p>FirstPageLeftHeader FirstPageCenterHeader FirstPageRightHeader</p>
<p>OddLeftFooter OddCenterFooter OddRightFooter</p>
<p>EvenLeftFooter EvenCenterFooter EvenRightFooter</p>
<p>FirstPageLeftFooter FirstPageCenterFooter FirstPageRightFooter</p>
<p>test textbox
</p>
<a href="http://lucene.apache.org/">http://lucene.apache.org/
</a><p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p />
<p />
<p />
<p />
<p>test WordArt</p>
<p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p />
<p />
<p />
<p />
<p>myChartTitle</p>
<p />
merchant1 March April 1 3 merchant2 March April 2 4 <p />
<p />
<p />
<p />
<a href="http://tika.apache.org/">http://tika.apache.org/</a></div>
<div class="package-entry" /><div class="package-entry" /><div
class="package-entry" /></body></html>
2017-07-10 10:17 GMT-03:00 JB Data <jb...@gmail.com>:
> +1.
> No regression in my 1.15 env <http://jbigdata.fr/jbigdata/ged-02.html>.
> Test docx chart extraction (TIKA-2254): OK.
>
> @*JB*Δ <http://jbigdata.fr>
>
>
> 2017-07-08 22:29 GMT+02:00 Chris Mattmann <ma...@apache.org>:
>
>> +1 from me SIGS and CHECKSUMS look good.
>>
>> Thanks Tim!
>>
>> Cheers,
>> Chris
>>
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval
>> \-server; do $HOME/bin/stage_apache_rc tika$type 1.16
>> https://dist.apache.org/repos/dist/dev/tika/; done
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:--
>> 5122k
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 836 100 836 0 0 1092 0 --:--:-- --:--:-- --:--:--
>> 1092
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 34 100 34 0 0 96 0 --:--:-- --:--:-- --:--:--
>> 96
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:--
>> 8297k
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 836 100 836 0 0 1012 0 --:--:-- --:--:-- --:--:--
>> 1012
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 34 100 34 0 0 46 0 --:--:-- --:--:-- --:--:--
>> 46
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:--
>> 4742k
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 836 100 836 0 0 1470 0 --:--:-- --:--:-- --:--:--
>> 1469
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 34 100 34 0 0 65 0 --:--:-- --:--:-- --:--:--
>> 65
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc tika
>> 1.16-src https://dist.apache.org/repos/dist/dev/tika/
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:--
>> 5261k
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 836 100 836 0 0 2129 0 --:--:-- --:--:-- --:--:--
>> 2127
>> % Total % Received % Xferd Average Speed Time Time Time
>> Current
>> Dload Upload Total Spent Left
>> Speed
>> 100 34 100 34 0 0 47 0 --:--:-- --:--:-- --:--:--
>> 47
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
>> tika-1.16-src.zip tika-app-1.16.jar
>> tika-eval-1.16.jar tika-server-1.16.jar
>> tika-1.16-src.zip.asc tika-app-1.16.jar.asc
>> tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
>> tika-1.16-src.zip.md5 tika-app-1.16.jar.md5
>> tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs
>> Verifying Signature for file tika-1.16-src.zip.asc
>> gpg: assuming signed data in `tika-1.16-src.zip'
>> gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID EF0CF38A
>> gpg: Good signature from "Tim Allison (ASF signing key) <
>> tallison@apache.org>"
>> gpg: WARNING: This key is not certified with a trusted signature!
>> gpg: There is no indication that the signature belongs to the
>> owner.
>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>> F38A
>> Verifying Signature for file tika-app-1.16.jar.asc
>> gpg: assuming signed data in `tika-app-1.16.jar'
>> gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID EF0CF38A
>> gpg: Good signature from "Tim Allison (ASF signing key) <
>> tallison@apache.org>"
>> gpg: WARNING: This key is not certified with a trusted signature!
>> gpg: There is no indication that the signature belongs to the
>> owner.
>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>> F38A
>> Verifying Signature for file tika-eval-1.16.jar.asc
>> gpg: assuming signed data in `tika-eval-1.16.jar'
>> gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID EF0CF38A
>> gpg: Good signature from "Tim Allison (ASF signing key) <
>> tallison@apache.org>"
>> gpg: WARNING: This key is not certified with a trusted signature!
>> gpg: There is no indication that the signature belongs to the
>> owner.
>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>> F38A
>> Verifying Signature for file tika-server-1.16.jar.asc
>> gpg: assuming signed data in `tika-server-1.16.jar'
>> gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID EF0CF38A
>> gpg: Good signature from "Tim Allison (ASF signing key) <
>> tallison@apache.org>"
>> gpg: WARNING: This key is not certified with a trusted signature!
>> gpg: There is no indication that the signature belongs to the
>> owner.
>> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C
>> F38A
>> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
>> md5sum: stat '*.tar.gz': No such file or directory
>> md5sum: stat '*.bz2': No such file or directory
>> md5sum: stat '*.tgz': No such file or directory
>> tika-1.16-src.zip: OK
>> LMC-053601:apache-tika-1.16-rc1 mattmann$
>>
>>
>>
>>
>> On 7/7/17, 7:40 PM, "Tim Allison" <ta...@apache.org> wrote:
>>
>>
>>
>>
>> A candidate for the Tika 1.16 release is available at:
>> https://dist.apache.org/repos/dist/dev/tika/
>>
>> The release candidate is a zip archive of the sources in:
>> https://github.com/apache/tika/tree/1.16-rc1
>>
>> The SHA1 checksum of the archive is
>> e6884af0209ace42bf0b9b59d72c3c5a0052055e
>>
>> In addition, a staged maven repository is available here:
>> https://repository.apache.org/content/repositories/orgapachetika-1025
>>
>>
>>
>> Please vote on releasing this package as Apache Tika 1.16.
>> The vote is open for the next 72 hours and passes if a majority of at
>> least three +1 Tika PMC votes are cast.
>>
>> [ ] +1 Release this package as Apache Tika 1.16
>> [ ] -1 Do not release this package because...
>>
>>
>> This is my +1.
>>
>> Cheers,
>>
>> Tim
>>
>>
>>
>>
>>
>>
>
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by JB Data <jb...@gmail.com>.
+1.
No regression in my 1.15 env <http://jbigdata.fr/jbigdata/ged-02.html>.
Test docx chart extraction (TIKA-2254): OK.
@*JB*Δ <http://jbigdata.fr>
2017-07-08 22:29 GMT+02:00 Chris Mattmann <ma...@apache.org>:
> +1 from me SIGS and CHECKSUMS look good.
>
> Thanks Tim!
>
> Cheers,
> Chris
>
> LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval
> \-server; do $HOME/bin/stage_apache_rc tika$type 1.16
> https://dist.apache.org/repos/dist/dev/tika/; done
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:--
> 5122k
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 836 100 836 0 0 1092 0 --:--:-- --:--:-- --:--:--
> 1092
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 34 100 34 0 0 96 0 --:--:-- --:--:-- --:--:--
> 96
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:--
> 8297k
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 836 100 836 0 0 1012 0 --:--:-- --:--:-- --:--:--
> 1012
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 34 100 34 0 0 46 0 --:--:-- --:--:-- --:--:--
> 46
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:--
> 4742k
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 836 100 836 0 0 1470 0 --:--:-- --:--:-- --:--:--
> 1469
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 34 100 34 0 0 65 0 --:--:-- --:--:-- --:--:--
> 65
> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc tika
> 1.16-src https://dist.apache.org/repos/dist/dev/tika/
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:--
> 5261k
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 836 100 836 0 0 2129 0 --:--:-- --:--:-- --:--:--
> 2127
> % Total % Received % Xferd Average Speed Time Time Time
> Current
> Dload Upload Total Spent Left
> Speed
> 100 34 100 34 0 0 47 0 --:--:-- --:--:-- --:--:--
> 47
> LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
> tika-1.16-src.zip tika-app-1.16.jar
> tika-eval-1.16.jar tika-server-1.16.jar
> tika-1.16-src.zip.asc tika-app-1.16.jar.asc
> tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
> tika-1.16-src.zip.md5 tika-app-1.16.jar.md5
> tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs
> Verifying Signature for file tika-1.16-src.zip.asc
> gpg: assuming signed data in `tika-1.16-src.zip'
> gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID EF0CF38A
> gpg: Good signature from "Tim Allison (ASF signing key) <
> tallison@apache.org>"
> gpg: WARNING: This key is not certified with a trusted signature!
> gpg: There is no indication that the signature belongs to the
> owner.
> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
> Verifying Signature for file tika-app-1.16.jar.asc
> gpg: assuming signed data in `tika-app-1.16.jar'
> gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID EF0CF38A
> gpg: Good signature from "Tim Allison (ASF signing key) <
> tallison@apache.org>"
> gpg: WARNING: This key is not certified with a trusted signature!
> gpg: There is no indication that the signature belongs to the
> owner.
> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
> Verifying Signature for file tika-eval-1.16.jar.asc
> gpg: assuming signed data in `tika-eval-1.16.jar'
> gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID EF0CF38A
> gpg: Good signature from "Tim Allison (ASF signing key) <
> tallison@apache.org>"
> gpg: WARNING: This key is not certified with a trusted signature!
> gpg: There is no indication that the signature belongs to the
> owner.
> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
> Verifying Signature for file tika-server-1.16.jar.asc
> gpg: assuming signed data in `tika-server-1.16.jar'
> gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID EF0CF38A
> gpg: Good signature from "Tim Allison (ASF signing key) <
> tallison@apache.org>"
> gpg: WARNING: This key is not certified with a trusted signature!
> gpg: There is no indication that the signature belongs to the
> owner.
> Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
> LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
> md5sum: stat '*.tar.gz': No such file or directory
> md5sum: stat '*.bz2': No such file or directory
> md5sum: stat '*.tgz': No such file or directory
> tika-1.16-src.zip: OK
> LMC-053601:apache-tika-1.16-rc1 mattmann$
>
>
>
>
> On 7/7/17, 7:40 PM, "Tim Allison" <ta...@apache.org> wrote:
>
>
>
>
> A candidate for the Tika 1.16 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> https://github.com/apache/tika/tree/1.16-rc1
>
> The SHA1 checksum of the archive is
> e6884af0209ace42bf0b9b59d72c3c5a0052055e
>
> In addition, a staged maven repository is available here:
> https://repository.apache.org/content/repositories/orgapachetika-1025
>
>
>
> Please vote on releasing this package as Apache Tika 1.16.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.16
> [ ] -1 Do not release this package because...
>
>
> This is my +1.
>
> Cheers,
>
> Tim
>
>
>
>
>
>
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Chris Mattmann <ma...@apache.org>.
+1 from me SIGS and CHECKSUMS look good.
Thanks Tim!
Cheers,
Chris
LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval \-server; do $HOME/bin/stage_apache_rc tika$type 1.16 https://dist.apache.org/repos/dist/dev/tika/; done
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:-- 5122k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1092 0 --:--:-- --:--:-- --:--:-- 1092
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 96 0 --:--:-- --:--:-- --:--:-- 96
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:-- 8297k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1012 0 --:--:-- --:--:-- --:--:-- 1012
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 46 0 --:--:-- --:--:-- --:--:-- 46
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:-- 4742k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1470 0 --:--:-- --:--:-- --:--:-- 1469
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 65 0 --:--:-- --:--:-- --:--:-- 65
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc tika 1.16-src https://dist.apache.org/repos/dist/dev/tika/
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:-- 5261k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 2129 0 --:--:-- --:--:-- --:--:-- 2127
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 47 0 --:--:-- --:--:-- --:--:-- 47
LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
tika-1.16-src.zip tika-app-1.16.jar tika-eval-1.16.jar tika-server-1.16.jar
tika-1.16-src.zip.asc tika-app-1.16.jar.asc tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
tika-1.16-src.zip.md5 tika-app-1.16.jar.md5 tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs
Verifying Signature for file tika-1.16-src.zip.asc
gpg: assuming signed data in `tika-1.16-src.zip'
gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
Verifying Signature for file tika-app-1.16.jar.asc
gpg: assuming signed data in `tika-app-1.16.jar'
gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
Verifying Signature for file tika-eval-1.16.jar.asc
gpg: assuming signed data in `tika-eval-1.16.jar'
gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
Verifying Signature for file tika-server-1.16.jar.asc
gpg: assuming signed data in `tika-server-1.16.jar'
gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
md5sum: stat '*.tar.gz': No such file or directory
md5sum: stat '*.bz2': No such file or directory
md5sum: stat '*.tgz': No such file or directory
tika-1.16-src.zip: OK
LMC-053601:apache-tika-1.16-rc1 mattmann$
On 7/7/17, 7:40 PM, "Tim Allison" <ta...@apache.org> wrote:
A candidate for the Tika 1.16 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.16-rc1
The SHA1 checksum of the archive is
e6884af0209ace42bf0b9b59d72c3c5a0052055e
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1025
Please vote on releasing this package as Apache Tika 1.16.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.16
[ ] -1 Do not release this package because...
This is my +1.
Cheers,
Tim
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Oleg Tikhonov <ol...@apache.org>.
[x]+1 Release this package as Apache Tika 1.16
Basic tests and build on Ubuntu 17.04 + Java 8 (Oracle).
Thanks,
Oleg
On Wed, Jul 12, 2017 at 11:03 AM, Dave Meikle <dm...@apache.org> wrote:
> On 8 July 2017 at 03:40, Tim Allison <ta...@apache.org> wrote:
>
> >
> > A candidate for the Tika 1.16 release is available at:
> > https://dist.apache.org/repos/dist/dev/tika/
> >
> > The release candidate is a zip archive of the sources in:
> > https://github.com/apache/tika/tree/1.16-rc1
> >
> > The SHA1 checksum of the archive is
> > e6884af0209ace42bf0b9b59d72c3c5a0052055e
> >
> > In addition, a staged maven repository is available here:
> > https://repository.apache.org/content/repositories/orgapachetika-1025
> >
> > Please vote on releasing this package as Apache Tika 1.16.
> > The vote is open for the next 72 hours and passes if a majority of at
> > least three +1 Tika PMC votes are cast.
> >
> > [ ] +1 Release this package as Apache Tika 1.16
> > [ ] -1 Do not release this package because...
> >
> >
> +1 from me. Checksums and signatures good. Built and tested on various
> machines using Java 8. Been run in a production workload and all good.
>
> Cheers,
> Dave
>
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Dave Meikle <dm...@apache.org>.
On 8 July 2017 at 03:40, Tim Allison <ta...@apache.org> wrote:
>
> A candidate for the Tika 1.16 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> https://github.com/apache/tika/tree/1.16-rc1
>
> The SHA1 checksum of the archive is
> e6884af0209ace42bf0b9b59d72c3c5a0052055e
>
> In addition, a staged maven repository is available here:
> https://repository.apache.org/content/repositories/orgapachetika-1025
>
> Please vote on releasing this package as Apache Tika 1.16.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.16
> [ ] -1 Do not release this package because...
>
>
+1 from me. Checksums and signatures good. Built and tested on various
machines using Java 8. Been run in a production workload and all good.
Cheers,
Dave
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Chris Mattmann <ma...@apache.org>.
+1 from me SIGS and CHECKSUMS look good.
Thanks Tim!
Cheers,
Chris
LMC-053601:apache-tika-1.16-rc1 mattmann$ for type in "" \-app \-eval \-server; do $HOME/bin/stage_apache_rc tika$type 1.16 https://dist.apache.org/repos/dist/dev/tika/; done
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 53.5M 100 53.5M 0 0 3992k 0 0:00:13 0:00:13 --:--:-- 5122k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1092 0 --:--:-- --:--:-- --:--:-- 1092
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 96 0 --:--:-- --:--:-- --:--:-- 96
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 41.6M 100 41.6M 0 0 6578k 0 0:00:06 0:00:06 --:--:-- 8297k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1012 0 --:--:-- --:--:-- --:--:-- 1012
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 46 0 --:--:-- --:--:-- --:--:-- 46
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 56.4M 100 56.4M 0 0 3950k 0 0:00:14 0:00:14 --:--:-- 4742k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 1470 0 --:--:-- --:--:-- --:--:-- 1469
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 65 0 --:--:-- --:--:-- --:--:-- 65
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/stage_apache_rc tika 1.16-src https://dist.apache.org/repos/dist/dev/tika/
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 84.2M 100 84.2M 0 0 6563k 0 0:00:13 0:00:13 --:--:-- 5261k
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 836 100 836 0 0 2129 0 --:--:-- --:--:-- --:--:-- 2127
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 34 100 34 0 0 47 0 --:--:-- --:--:-- --:--:-- 47
LMC-053601:apache-tika-1.16-rc1 mattmann$ ls
tika-1.16-src.zip tika-app-1.16.jar tika-eval-1.16.jar tika-server-1.16.jar
tika-1.16-src.zip.asc tika-app-1.16.jar.asc tika-eval-1.16.jar.asc tika-server-1.16.jar.asc
tika-1.16-src.zip.md5 tika-app-1.16.jar.md5 tika-eval-1.16.jar.md5 tika-server-1.16.jar.md5
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_gpg_sigs
Verifying Signature for file tika-1.16-src.zip.asc
gpg: assuming signed data in `tika-1.16-src.zip'
gpg: Signature made Fri Jul 7 19:27:42 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
Verifying Signature for file tika-app-1.16.jar.asc
gpg: assuming signed data in `tika-app-1.16.jar'
gpg: Signature made Fri Jul 7 19:13:16 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
Verifying Signature for file tika-eval-1.16.jar.asc
gpg: assuming signed data in `tika-eval-1.16.jar'
gpg: Signature made Fri Jul 7 19:20:17 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
Verifying Signature for file tika-server-1.16.jar.asc
gpg: assuming signed data in `tika-server-1.16.jar'
gpg: Signature made Fri Jul 7 19:17:53 2017 PDT using RSA key ID EF0CF38A
gpg: Good signature from "Tim Allison (ASF signing key) <ta...@apache.org>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg: There is no indication that the signature belongs to the owner.
Primary key fingerprint: 833C 1CC4 926C 1DDE 29BB 8731 E403 2DC4 EF0C F38A
LMC-053601:apache-tika-1.16-rc1 mattmann$ $HOME/bin/verify_md5_checksums
md5sum: stat '*.tar.gz': No such file or directory
md5sum: stat '*.bz2': No such file or directory
md5sum: stat '*.tgz': No such file or directory
tika-1.16-src.zip: OK
LMC-053601:apache-tika-1.16-rc1 mattmann$
On 7/7/17, 7:40 PM, "Tim Allison" <ta...@apache.org> wrote:
A candidate for the Tika 1.16 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.16-rc1
The SHA1 checksum of the archive is
e6884af0209ace42bf0b9b59d72c3c5a0052055e
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1025
Please vote on releasing this package as Apache Tika 1.16.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.16
[ ] -1 Do not release this package because...
This is my +1.
Cheers,
Tim
Re: [VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Dave Meikle <dm...@apache.org>.
On 8 July 2017 at 03:40, Tim Allison <ta...@apache.org> wrote:
>
> A candidate for the Tika 1.16 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> https://github.com/apache/tika/tree/1.16-rc1
>
> The SHA1 checksum of the archive is
> e6884af0209ace42bf0b9b59d72c3c5a0052055e
>
> In addition, a staged maven repository is available here:
> https://repository.apache.org/content/repositories/orgapachetika-1025
>
> Please vote on releasing this package as Apache Tika 1.16.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.16
> [ ] -1 Do not release this package because...
>
>
+1 from me. Checksums and signatures good. Built and tested on various
machines using Java 8. Been run in a production workload and all good.
Cheers,
Dave
[VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Tim Allison <ta...@apache.org>.
A candidate for the Tika 1.16 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.16-rc1
The SHA1 checksum of the archive is
e6884af0209ace42bf0b9b59d72c3c5a0052055e
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1025
Please vote on releasing this package as Apache Tika 1.16.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.16
[ ] -1 Do not release this package because...
This is my +1.
Cheers,
Tim
[VOTE] Release Apache Tika 1.16 Candidate #1
Posted by Tim Allison <ta...@apache.org>.
A candidate for the Tika 1.16 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.16-rc1
The SHA1 checksum of the archive is
e6884af0209ace42bf0b9b59d72c3c5a0052055e
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1025
Please vote on releasing this package as Apache Tika 1.16.
The vote is open for the next 72 hours and passes if a majority of at
least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.16
[ ] -1 Do not release this package because...
This is my +1.
Cheers,
Tim
Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by Oleg Tikhonov <ol...@gmail.com>.
Cannot reproduce after having done some workarounds ...
On Wed, May 24, 2017 at 3:05 AM, Allison, Timothy B. <ta...@mitre.org>
wrote:
> Hi Oleg,
> What's your error on that unit test?
>
> -----Original Message-----
> From: olegtikhonov@gmail.com [mailto:olegtikhonov@gmail.com] On Behalf Of
> Oleg Tikhonov
> Sent: Tuesday, May 23, 2017 4:33 PM
> To: dev@tika.apache.org
> Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
>
> Also put
> ./tika-dl/src/test/java/org/apache/tika/dl/imagerec/
> DL4JInceptionV3NetTest.java
> @Ignore because I do not have any DL installed on my comp.
>
>
> On Tue, May 23, 2017 at 11:00 PM, Oleg Tikhonov <ol...@apache.org> wrote:
>
> > Hi guys,
> > Here is wrong ...
> > <parent>
> > <groupId>org.apache.tika</groupId>
> > <artifactId>tika-parent</artifactId>
> > <version>1.16-SNAPSHOT</version>
> > <relativePath>tika-parent/pom.xml</relativePath>
> > </parent>
> >
> >
> > If you are cloning the project, the upper level pom contains this.
> > The fix is to change 1.16-SNAPSHOT to 1.15
> >
> > What i did was:
> > git clone https://github.com/apache/tika.git
> >
> > Any suggestions?
> >
> > BR,
> > OLeg
> >
> >
> >
> >
> > On Tue, May 23, 2017 at 3:01 PM, Allison, Timothy B.
> > <ta...@mitre.org>
> > wrote:
> >
> >> I _think_ it is included. See below for the two options for parsing
> >> testZipEncrypted.zip.
> >>
> >> Are you not seeing this behavior? Were you expecting different
> behavior?
> >>
> >>
> >> 1) RecursiveParserWrapper
> >>
> >> List<Metadata> metadataList = getRecursiveMetadata("testZipE
> >> ncrypted.zip");
> >> debug(metadataList);
> >>
> >> yields:
> >>
> >> 0: X-Parsed-By : org.apache.tika.parser.DefaultParser
> >> 0: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
> >> 0: X-TIKA:EXCEPTION:embedded_stream_exception :
> >> org.apache.tika.exception.EncryptedDocumentException: stream
> >> (encrypted.txt) is encrypted
> >> at
> >> org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageP
> >> arser.java:306)
> >> at
> >> org.apache.tika.parser.pkg.PackageParser.parse(PackageParser
> >> .java:230)
> >> at
> >> org.apache.tika.parser.CompositeParser.parse(CompositeParser
> >> .java:280)
> >> at
> >> org.apache.tika.parser.CompositeParser.parse(CompositeParser
> >> .java:280)
> >> at
> >> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectPars
> >> er.java:135)
> >> at
> >> org.apache.tika.parser.RecursiveParserWrapper.parse(Recursiv
> >> eParserWrapper.java:158)
> >> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:
> >> 221)
> >> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:
> >> 213)
> >> at
> >> org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(Zi
> >> pParserTest.java:213)
> >> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> >> at
> >> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAcce
> >> ssorImpl.java:62)
> >> at
> >> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMe
> >> thodAccessorImpl.java:43)
> >> at java.lang.reflect.Method.invoke(Method.java:498)
> >> at
> >> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
> >> FrameworkMethod.java:50)
> >> at
> >> org.junit.internal.runners.model.ReflectiveCallable.run(Refl
> >> ectiveCallable.java:12)
> >> at
> >> org.junit.runners.model.FrameworkMethod.invokeExplosively(Fr
> >> ameworkMethod.java:47)
> >> at
> >> org.junit.internal.runners.statements.InvokeMethod.evaluate(
> >> InvokeMethod.java:17)
> >> at org.junit.internal.runners.statements.RunBefores.evaluate(
> >> RunBefores.java:26)
> >> at org.junit.runners.ParentRunner.runLeaf(
> ParentRunner.java:325)
> >> at
> >> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
> >> 4ClassRunner.java:78)
> >> at
> >> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
> >> 4ClassRunner.java:57)
> >> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> >> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:
> >> 71)
> >> at org.junit.runners.ParentRunner.runChildren(ParentRunner.
> >> java:288)
> >> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:
> >> 58)
> >> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:
> >> 268)
> >> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> >> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
> >> at
> >> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs
> >> (JUnit4IdeaTestRunner.java:68)
> >> at
> >> com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.star
> >> tRunnerWithArgs(IdeaTestRunner.java:51)
> >> at
> >> com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsA
> >> ndStart(JUnitStarter.java:242)
> >> at
> >> com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStart
> >> er.java:70)
> >>
> >> 0: X-TIKA:parse_time_millis : 34
> >> 0: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
> >> <head>
> >> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"
> >> />
> >> <meta name="X-Parsed-By" content="org.apache.tika.
> parser.pkg.PackageParser"
> >> />
> >> <meta name="Content-Type" content="application/zip" />
> >> <title></title> </head> <body><div class="embedded"
> >> id="unencrypted.txt" /> <div
> >> class="package-entry"><h1>unencrypted.txt</h1>
> >> </div>
> >> <p>encrypted.txt</p>
> >> </body></html>
> >> 0: Content-Type : application/zip
> >> 1: date : 2017-03-21T13:07:48Z
> >> 1: X-Parsed-By : org.apache.tika.parser.DefaultParser
> >> 1: X-Parsed-By : org.apache.tika.parser.txt.TXTParser
> >> 1: resourceName : unencrypted.txt
> >> 1: dcterms:modified : 2017-03-21T13:07:48Z
> >> 1: Last-Modified : 2017-03-21T13:07:48Z
> >> 1: Last-Save-Date : 2017-03-21T13:07:48Z
> >> 1: embeddedRelationshipId : unencrypted.txt
> >> 1: meta:save-date : 2017-03-21T13:07:48Z
> >> 1: Content-Encoding : windows-1252
> >> 1: X-TIKA:parse_time_millis : 3
> >> 1: modified : 2017-03-21T13:07:48Z
> >> 1: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
> >> <head>
> >> <meta name="date" content="2017-03-21T13:07:48Z" /> <meta
> >> name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"
> >> />
> >> <meta name="X-Parsed-By" content="org.apache.tika.parser.txt.TXTParser"
> >> />
> >> <meta name="resourceName" content="unencrypted.txt" /> <meta
> >> name="dcterms:modified" content="2017-03-21T13:07:48Z" /> <meta
> >> name="Last-Modified" content="2017-03-21T13:07:48Z" /> <meta
> >> name="Last-Save-Date" content="2017-03-21T13:07:48Z" /> <meta
> >> name="embeddedRelationshipId" content="unencrypted.txt" /> <meta
> >> name="meta:save-date" content="2017-03-21T13:07:48Z" /> <meta
> >> name="Content-Encoding" content="windows-1252" /> <meta
> >> name="modified" content="2017-03-21T13:07:48Z" /> <meta
> >> name="Content-Length" content="13" /> <meta
> >> name="X-TIKA:embedded_resource_path" content="/unencrypted.txt" />
> >> <meta name="Content-Type" content="text/plain; charset=windows-1252"
> >> /> <title></title> </head> <body><p>hello world </p> </body></html>
> >> 1: Content-Length : 13
> >> 1: X-TIKA:embedded_resource_path : /unencrypted.txt
> >> 1: Content-Type : text/plain; charset=windows-1252
> >>
> >> 2) Classic XML:
> >>
> >> XMLResult r = getXML("testZipEncrypted.zip");
> >> for (String n : r.metadata.names()) {
> >> for (String v : r.metadata.getValues(n)) {
> >> System.out.println("meta: "+n + " : "+v);
> >> }
> >> }
> >> System.out.println(r.xml);
> >>
> >> Yields:
> >> meta: X-Parsed-By : org.apache.tika.parser.DefaultParser
> >> meta: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
> >> meta: X-TIKA:EXCEPTION:embedded_stream_exception :
> >> org.apache.tika.exception.EncryptedDocumentException: stream
> >> (encrypted.txt) is encrypted
> >> at
> >> org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageP
> >> arser.java:306)
> >> at
> >> org.apache.tika.parser.pkg.PackageParser.parse(PackageParser
> >> .java:230)
> >> at
> >> org.apache.tika.parser.CompositeParser.parse(CompositeParser
> >> .java:280)
> >> at
> >> org.apache.tika.parser.CompositeParser.parse(CompositeParser
> >> .java:280)
> >> at
> >> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectPars
> >> er.java:135)
> >> at org.apache.tika.TikaTest.getXML(TikaTest.java:205)
> >> at org.apache.tika.TikaTest.getXML(TikaTest.java:191)
> >> at
> >> org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(Zi
> >> pParserTest.java:206)
> >> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> >> at
> >> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAcce
> >> ssorImpl.java:62)
> >> at
> >> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMe
> >> thodAccessorImpl.java:43)
> >> at java.lang.reflect.Method.invoke(Method.java:498)
> >> at
> >> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
> >> FrameworkMethod.java:50)
> >> at
> >> org.junit.internal.runners.model.ReflectiveCallable.run(Refl
> >> ectiveCallable.java:12)
> >> at
> >> org.junit.runners.model.FrameworkMethod.invokeExplosively(Fr
> >> ameworkMethod.java:47)
> >> at
> >> org.junit.internal.runners.statements.InvokeMethod.evaluate(
> >> InvokeMethod.java:17)
> >> at org.junit.internal.runners.statements.RunBefores.evaluate(
> >> RunBefores.java:26)
> >> at org.junit.runners.ParentRunner.runLeaf(
> ParentRunner.java:325)
> >> at
> >> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
> >> 4ClassRunner.java:78)
> >> at
> >> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
> >> 4ClassRunner.java:57)
> >> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> >> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:
> >> 71)
> >> at org.junit.runners.ParentRunner.runChildren(ParentRunner.
> >> java:288)
> >> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:
> >> 58)
> >> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:
> >> 268)
> >> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> >> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
> >> at
> >> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs
> >> (JUnit4IdeaTestRunner.java:68)
> >> at
> >> com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.star
> >> tRunnerWithArgs(IdeaTestRunner.java:51)
> >> at
> >> com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsA
> >> ndStart(JUnitStarter.java:242)
> >> at
> >> com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStart
> >> er.java:70)
> >>
> >> meta: Content-Type : application/zip
> >> <html xmlns="http://www.w3.org/1999/xhtml">
> >> <head>
> >> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"
> >> />
> >> <meta name="X-Parsed-By" content="org.apache.tika.
> parser.pkg.PackageParser"
> >> />
> >> <meta name="Content-Type" content="application/zip" />
> >> <title></title> </head> <body><div class="embedded"
> >> id="unencrypted.txt" /> <div
> >> class="package-entry"><h1>unencrypted.txt</h1>
> >> <p>hello world
> >> </p>
> >>
> >> </div>
> >> <p>encrypted.txt</p>
> >> </body></html>
> >>
> >> -----Original Message-----
> >> From: Aeham Abushwashi [mailto:aeham.abushwashi@exonar.com]
> >> Sent: Tuesday, May 23, 2017 3:47 AM
> >> To: user@tika.apache.org; Tim Allison <ta...@apache.org>
> >> Cc: dev@tika.apache.org
> >> Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
> >>
> >> Thanks Tim and apologies if this isn't the right thread to ask this
> >> question... any reason TIKA-2300 is not included despite
> >> FixVersions=1.15 on the ticket?
> >>
> >> On 22 May 2017 at 20:25, Tim Allison <ta...@apache.org> wrote:
> >>
> >> > A candidate for the Tika 1.15 release is available at:
> >> > https://dist.apache.org/repos/dist/dev/tika/
> >> >
> >> > The release candidate is a zip archive of the sources in:
> >> > https://github.com/apache/tika/tree/1.15-rc1
> >> >
> >> > The SHA1 checksum of the archive is
> >> > e82697a6804373367fbba98d47426ab74e036eb1.
> >> >
> >> > In addition, a staged maven repository is available here:
> >> > https://repository.apache.org/content/repositories/orgapachetika-10
> >> > 22
> >> >
> >> > Please vote on releasing this package as Apache Tika 1.15.
> >> > The vote is open for the next 72 hours and passes if a majority of
> >> > at least three +1 Tika PMC votes are cast.
> >> >
> >> > [ ] +1 Release this package as Apache Tika 1.15 [ ] -1 Do not
> >> > release this package because...
> >> >
> >> > ***This is my first time as release manager. Please kick the tires
> >> > thoroughly.***
> >> >
> >> > This is my +1.
> >> >
> >> > Cheers,
> >> >
> >> > Tim
> >> >
> >>
> >>
> >>
> >> --
> >> Aeham Abushwashi
> >> Head of Engineering
> >> Exonar
> >>
> >> v: video.exonar.com | w: exonar.com <http://www.exonar.com/> |
> twitter:
> >> @exonar <https://twitter.com/exonar>
> >>
> >> GDPR: Why It’s About More Than Regulation: Download the White Paper
> >> Here < https://goo.gl/1cSVzH>
> >>
> >> Trial <https://www.exonar.com/platform/> the capability on your own
> >> organisation's data to understand what you've got, where it is and
> >> who has access to it.
> >>
> >>
> >> Come and meet us for a chat at Infosecurity Europe <
> >> http://www.infosecurityeurope.com/>on stand S07 in the Cyber
> >> Innovation Zone
> >> <http://www.infosecurityeurope.com/visit/whats-on/uk-cyber-
> >> innovation-zone/>
> >>
> >>
> >> Exonar Limited, registered in the UK, registration number 06439969 at
> >> 14 West Mills, Newbury, Berkshire, RG14 5HG. DISCLAIMER: This email
> >> and any attachments to it may be confidential or private. If you have
> >> received it in error, please notify us and delete it from your system.
> >>
> >
> >
>
RE: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by "Allison, Timothy B." <ta...@mitre.org>.
Hi Oleg,
What's your error on that unit test?
-----Original Message-----
From: olegtikhonov@gmail.com [mailto:olegtikhonov@gmail.com] On Behalf Of Oleg Tikhonov
Sent: Tuesday, May 23, 2017 4:33 PM
To: dev@tika.apache.org
Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Also put
./tika-dl/src/test/java/org/apache/tika/dl/imagerec/DL4JInceptionV3NetTest.java
@Ignore because I do not have any DL installed on my comp.
On Tue, May 23, 2017 at 11:00 PM, Oleg Tikhonov <ol...@apache.org> wrote:
> Hi guys,
> Here is wrong ...
> <parent>
> <groupId>org.apache.tika</groupId>
> <artifactId>tika-parent</artifactId>
> <version>1.16-SNAPSHOT</version>
> <relativePath>tika-parent/pom.xml</relativePath>
> </parent>
>
>
> If you are cloning the project, the upper level pom contains this.
> The fix is to change 1.16-SNAPSHOT to 1.15
>
> What i did was:
> git clone https://github.com/apache/tika.git
>
> Any suggestions?
>
> BR,
> OLeg
>
>
>
>
> On Tue, May 23, 2017 at 3:01 PM, Allison, Timothy B.
> <ta...@mitre.org>
> wrote:
>
>> I _think_ it is included. See below for the two options for parsing
>> testZipEncrypted.zip.
>>
>> Are you not seeing this behavior? Were you expecting different behavior?
>>
>>
>> 1) RecursiveParserWrapper
>>
>> List<Metadata> metadataList = getRecursiveMetadata("testZipE
>> ncrypted.zip");
>> debug(metadataList);
>>
>> yields:
>>
>> 0: X-Parsed-By : org.apache.tika.parser.DefaultParser
>> 0: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
>> 0: X-TIKA:EXCEPTION:embedded_stream_exception :
>> org.apache.tika.exception.EncryptedDocumentException: stream
>> (encrypted.txt) is encrypted
>> at
>> org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageP
>> arser.java:306)
>> at
>> org.apache.tika.parser.pkg.PackageParser.parse(PackageParser
>> .java:230)
>> at
>> org.apache.tika.parser.CompositeParser.parse(CompositeParser
>> .java:280)
>> at
>> org.apache.tika.parser.CompositeParser.parse(CompositeParser
>> .java:280)
>> at
>> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectPars
>> er.java:135)
>> at
>> org.apache.tika.parser.RecursiveParserWrapper.parse(Recursiv
>> eParserWrapper.java:158)
>> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:
>> 221)
>> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:
>> 213)
>> at
>> org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(Zi
>> pParserTest.java:213)
>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at
>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAcce
>> ssorImpl.java:62)
>> at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMe
>> thodAccessorImpl.java:43)
>> at java.lang.reflect.Method.invoke(Method.java:498)
>> at
>> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
>> FrameworkMethod.java:50)
>> at
>> org.junit.internal.runners.model.ReflectiveCallable.run(Refl
>> ectiveCallable.java:12)
>> at
>> org.junit.runners.model.FrameworkMethod.invokeExplosively(Fr
>> ameworkMethod.java:47)
>> at
>> org.junit.internal.runners.statements.InvokeMethod.evaluate(
>> InvokeMethod.java:17)
>> at org.junit.internal.runners.statements.RunBefores.evaluate(
>> RunBefores.java:26)
>> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>> at
>> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
>> 4ClassRunner.java:78)
>> at
>> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
>> 4ClassRunner.java:57)
>> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:
>> 71)
>> at org.junit.runners.ParentRunner.runChildren(ParentRunner.
>> java:288)
>> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:
>> 58)
>> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:
>> 268)
>> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
>> at
>> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs
>> (JUnit4IdeaTestRunner.java:68)
>> at
>> com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.star
>> tRunnerWithArgs(IdeaTestRunner.java:51)
>> at
>> com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsA
>> ndStart(JUnitStarter.java:242)
>> at
>> com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStart
>> er.java:70)
>>
>> 0: X-TIKA:parse_time_millis : 34
>> 0: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
>> <head>
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"
>> />
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser"
>> />
>> <meta name="Content-Type" content="application/zip" />
>> <title></title> </head> <body><div class="embedded"
>> id="unencrypted.txt" /> <div
>> class="package-entry"><h1>unencrypted.txt</h1>
>> </div>
>> <p>encrypted.txt</p>
>> </body></html>
>> 0: Content-Type : application/zip
>> 1: date : 2017-03-21T13:07:48Z
>> 1: X-Parsed-By : org.apache.tika.parser.DefaultParser
>> 1: X-Parsed-By : org.apache.tika.parser.txt.TXTParser
>> 1: resourceName : unencrypted.txt
>> 1: dcterms:modified : 2017-03-21T13:07:48Z
>> 1: Last-Modified : 2017-03-21T13:07:48Z
>> 1: Last-Save-Date : 2017-03-21T13:07:48Z
>> 1: embeddedRelationshipId : unencrypted.txt
>> 1: meta:save-date : 2017-03-21T13:07:48Z
>> 1: Content-Encoding : windows-1252
>> 1: X-TIKA:parse_time_millis : 3
>> 1: modified : 2017-03-21T13:07:48Z
>> 1: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
>> <head>
>> <meta name="date" content="2017-03-21T13:07:48Z" /> <meta
>> name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"
>> />
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.txt.TXTParser"
>> />
>> <meta name="resourceName" content="unencrypted.txt" /> <meta
>> name="dcterms:modified" content="2017-03-21T13:07:48Z" /> <meta
>> name="Last-Modified" content="2017-03-21T13:07:48Z" /> <meta
>> name="Last-Save-Date" content="2017-03-21T13:07:48Z" /> <meta
>> name="embeddedRelationshipId" content="unencrypted.txt" /> <meta
>> name="meta:save-date" content="2017-03-21T13:07:48Z" /> <meta
>> name="Content-Encoding" content="windows-1252" /> <meta
>> name="modified" content="2017-03-21T13:07:48Z" /> <meta
>> name="Content-Length" content="13" /> <meta
>> name="X-TIKA:embedded_resource_path" content="/unencrypted.txt" />
>> <meta name="Content-Type" content="text/plain; charset=windows-1252"
>> /> <title></title> </head> <body><p>hello world </p> </body></html>
>> 1: Content-Length : 13
>> 1: X-TIKA:embedded_resource_path : /unencrypted.txt
>> 1: Content-Type : text/plain; charset=windows-1252
>>
>> 2) Classic XML:
>>
>> XMLResult r = getXML("testZipEncrypted.zip");
>> for (String n : r.metadata.names()) {
>> for (String v : r.metadata.getValues(n)) {
>> System.out.println("meta: "+n + " : "+v);
>> }
>> }
>> System.out.println(r.xml);
>>
>> Yields:
>> meta: X-Parsed-By : org.apache.tika.parser.DefaultParser
>> meta: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
>> meta: X-TIKA:EXCEPTION:embedded_stream_exception :
>> org.apache.tika.exception.EncryptedDocumentException: stream
>> (encrypted.txt) is encrypted
>> at
>> org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageP
>> arser.java:306)
>> at
>> org.apache.tika.parser.pkg.PackageParser.parse(PackageParser
>> .java:230)
>> at
>> org.apache.tika.parser.CompositeParser.parse(CompositeParser
>> .java:280)
>> at
>> org.apache.tika.parser.CompositeParser.parse(CompositeParser
>> .java:280)
>> at
>> org.apache.tika.parser.AutoDetectParser.parse(AutoDetectPars
>> er.java:135)
>> at org.apache.tika.TikaTest.getXML(TikaTest.java:205)
>> at org.apache.tika.TikaTest.getXML(TikaTest.java:191)
>> at
>> org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(Zi
>> pParserTest.java:206)
>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at
>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAcce
>> ssorImpl.java:62)
>> at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMe
>> thodAccessorImpl.java:43)
>> at java.lang.reflect.Method.invoke(Method.java:498)
>> at
>> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
>> FrameworkMethod.java:50)
>> at
>> org.junit.internal.runners.model.ReflectiveCallable.run(Refl
>> ectiveCallable.java:12)
>> at
>> org.junit.runners.model.FrameworkMethod.invokeExplosively(Fr
>> ameworkMethod.java:47)
>> at
>> org.junit.internal.runners.statements.InvokeMethod.evaluate(
>> InvokeMethod.java:17)
>> at org.junit.internal.runners.statements.RunBefores.evaluate(
>> RunBefores.java:26)
>> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>> at
>> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
>> 4ClassRunner.java:78)
>> at
>> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
>> 4ClassRunner.java:57)
>> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:
>> 71)
>> at org.junit.runners.ParentRunner.runChildren(ParentRunner.
>> java:288)
>> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:
>> 58)
>> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:
>> 268)
>> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
>> at
>> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs
>> (JUnit4IdeaTestRunner.java:68)
>> at
>> com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.star
>> tRunnerWithArgs(IdeaTestRunner.java:51)
>> at
>> com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsA
>> ndStart(JUnitStarter.java:242)
>> at
>> com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStart
>> er.java:70)
>>
>> meta: Content-Type : application/zip
>> <html xmlns="http://www.w3.org/1999/xhtml">
>> <head>
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"
>> />
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser"
>> />
>> <meta name="Content-Type" content="application/zip" />
>> <title></title> </head> <body><div class="embedded"
>> id="unencrypted.txt" /> <div
>> class="package-entry"><h1>unencrypted.txt</h1>
>> <p>hello world
>> </p>
>>
>> </div>
>> <p>encrypted.txt</p>
>> </body></html>
>>
>> -----Original Message-----
>> From: Aeham Abushwashi [mailto:aeham.abushwashi@exonar.com]
>> Sent: Tuesday, May 23, 2017 3:47 AM
>> To: user@tika.apache.org; Tim Allison <ta...@apache.org>
>> Cc: dev@tika.apache.org
>> Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
>>
>> Thanks Tim and apologies if this isn't the right thread to ask this
>> question... any reason TIKA-2300 is not included despite
>> FixVersions=1.15 on the ticket?
>>
>> On 22 May 2017 at 20:25, Tim Allison <ta...@apache.org> wrote:
>>
>> > A candidate for the Tika 1.15 release is available at:
>> > https://dist.apache.org/repos/dist/dev/tika/
>> >
>> > The release candidate is a zip archive of the sources in:
>> > https://github.com/apache/tika/tree/1.15-rc1
>> >
>> > The SHA1 checksum of the archive is
>> > e82697a6804373367fbba98d47426ab74e036eb1.
>> >
>> > In addition, a staged maven repository is available here:
>> > https://repository.apache.org/content/repositories/orgapachetika-10
>> > 22
>> >
>> > Please vote on releasing this package as Apache Tika 1.15.
>> > The vote is open for the next 72 hours and passes if a majority of
>> > at least three +1 Tika PMC votes are cast.
>> >
>> > [ ] +1 Release this package as Apache Tika 1.15 [ ] -1 Do not
>> > release this package because...
>> >
>> > ***This is my first time as release manager. Please kick the tires
>> > thoroughly.***
>> >
>> > This is my +1.
>> >
>> > Cheers,
>> >
>> > Tim
>> >
>>
>>
>>
>> --
>> Aeham Abushwashi
>> Head of Engineering
>> Exonar
>>
>> v: video.exonar.com | w: exonar.com <http://www.exonar.com/> | twitter:
>> @exonar <https://twitter.com/exonar>
>>
>> GDPR: Why It’s About More Than Regulation: Download the White Paper
>> Here < https://goo.gl/1cSVzH>
>>
>> Trial <https://www.exonar.com/platform/> the capability on your own
>> organisation's data to understand what you've got, where it is and
>> who has access to it.
>>
>>
>> Come and meet us for a chat at Infosecurity Europe <
>> http://www.infosecurityeurope.com/>on stand S07 in the Cyber
>> Innovation Zone
>> <http://www.infosecurityeurope.com/visit/whats-on/uk-cyber-
>> innovation-zone/>
>>
>>
>> Exonar Limited, registered in the UK, registration number 06439969 at
>> 14 West Mills, Newbury, Berkshire, RG14 5HG. DISCLAIMER: This email
>> and any attachments to it may be confidential or private. If you have
>> received it in error, please notify us and delete it from your system.
>>
>
>
Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by Oleg Tikhonov <ol...@apache.org>.
Also put
./tika-dl/src/test/java/org/apache/tika/dl/imagerec/DL4JInceptionV3NetTest.java
@Ignore because I do not have any DL installed on my comp.
On Tue, May 23, 2017 at 11:00 PM, Oleg Tikhonov <ol...@apache.org> wrote:
> Hi guys,
> Here is wrong ...
> <parent>
> <groupId>org.apache.tika</groupId>
> <artifactId>tika-parent</artifactId>
> <version>1.16-SNAPSHOT</version>
> <relativePath>tika-parent/pom.xml</relativePath>
> </parent>
>
>
> If you are cloning the project, the upper level pom contains this.
> The fix is to change 1.16-SNAPSHOT to 1.15
>
> What i did was:
> git clone https://github.com/apache/tika.git
>
> Any suggestions?
>
> BR,
> OLeg
>
>
>
>
> On Tue, May 23, 2017 at 3:01 PM, Allison, Timothy B. <ta...@mitre.org>
> wrote:
>
>> I _think_ it is included. See below for the two options for parsing
>> testZipEncrypted.zip.
>>
>> Are you not seeing this behavior? Were you expecting different behavior?
>>
>>
>> 1) RecursiveParserWrapper
>>
>> List<Metadata> metadataList = getRecursiveMetadata("testZipE
>> ncrypted.zip");
>> debug(metadataList);
>>
>> yields:
>>
>> 0: X-Parsed-By : org.apache.tika.parser.DefaultParser
>> 0: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
>> 0: X-TIKA:EXCEPTION:embedded_stream_exception :
>> org.apache.tika.exception.EncryptedDocumentException: stream
>> (encrypted.txt) is encrypted
>> at org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageP
>> arser.java:306)
>> at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser
>> .java:230)
>> at org.apache.tika.parser.CompositeParser.parse(CompositeParser
>> .java:280)
>> at org.apache.tika.parser.CompositeParser.parse(CompositeParser
>> .java:280)
>> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectPars
>> er.java:135)
>> at org.apache.tika.parser.RecursiveParserWrapper.parse(Recursiv
>> eParserWrapper.java:158)
>> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:
>> 221)
>> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:
>> 213)
>> at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(Zi
>> pParserTest.java:213)
>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAcce
>> ssorImpl.java:62)
>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMe
>> thodAccessorImpl.java:43)
>> at java.lang.reflect.Method.invoke(Method.java:498)
>> at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
>> FrameworkMethod.java:50)
>> at org.junit.internal.runners.model.ReflectiveCallable.run(Refl
>> ectiveCallable.java:12)
>> at org.junit.runners.model.FrameworkMethod.invokeExplosively(Fr
>> ameworkMethod.java:47)
>> at org.junit.internal.runners.statements.InvokeMethod.evaluate(
>> InvokeMethod.java:17)
>> at org.junit.internal.runners.statements.RunBefores.evaluate(
>> RunBefores.java:26)
>> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
>> 4ClassRunner.java:78)
>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
>> 4ClassRunner.java:57)
>> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:
>> 71)
>> at org.junit.runners.ParentRunner.runChildren(ParentRunner.
>> java:288)
>> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:
>> 58)
>> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:
>> 268)
>> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
>> at com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs
>> (JUnit4IdeaTestRunner.java:68)
>> at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.star
>> tRunnerWithArgs(IdeaTestRunner.java:51)
>> at com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsA
>> ndStart(JUnitStarter.java:242)
>> at com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStart
>> er.java:70)
>>
>> 0: X-TIKA:parse_time_millis : 34
>> 0: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
>> <head>
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"
>> />
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser"
>> />
>> <meta name="Content-Type" content="application/zip" />
>> <title></title>
>> </head>
>> <body><div class="embedded" id="unencrypted.txt" />
>> <div class="package-entry"><h1>unencrypted.txt</h1>
>> </div>
>> <p>encrypted.txt</p>
>> </body></html>
>> 0: Content-Type : application/zip
>> 1: date : 2017-03-21T13:07:48Z
>> 1: X-Parsed-By : org.apache.tika.parser.DefaultParser
>> 1: X-Parsed-By : org.apache.tika.parser.txt.TXTParser
>> 1: resourceName : unencrypted.txt
>> 1: dcterms:modified : 2017-03-21T13:07:48Z
>> 1: Last-Modified : 2017-03-21T13:07:48Z
>> 1: Last-Save-Date : 2017-03-21T13:07:48Z
>> 1: embeddedRelationshipId : unencrypted.txt
>> 1: meta:save-date : 2017-03-21T13:07:48Z
>> 1: Content-Encoding : windows-1252
>> 1: X-TIKA:parse_time_millis : 3
>> 1: modified : 2017-03-21T13:07:48Z
>> 1: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
>> <head>
>> <meta name="date" content="2017-03-21T13:07:48Z" />
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"
>> />
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.txt.TXTParser"
>> />
>> <meta name="resourceName" content="unencrypted.txt" />
>> <meta name="dcterms:modified" content="2017-03-21T13:07:48Z" />
>> <meta name="Last-Modified" content="2017-03-21T13:07:48Z" />
>> <meta name="Last-Save-Date" content="2017-03-21T13:07:48Z" />
>> <meta name="embeddedRelationshipId" content="unencrypted.txt" />
>> <meta name="meta:save-date" content="2017-03-21T13:07:48Z" />
>> <meta name="Content-Encoding" content="windows-1252" />
>> <meta name="modified" content="2017-03-21T13:07:48Z" />
>> <meta name="Content-Length" content="13" />
>> <meta name="X-TIKA:embedded_resource_path" content="/unencrypted.txt" />
>> <meta name="Content-Type" content="text/plain; charset=windows-1252" />
>> <title></title>
>> </head>
>> <body><p>hello world
>> </p>
>> </body></html>
>> 1: Content-Length : 13
>> 1: X-TIKA:embedded_resource_path : /unencrypted.txt
>> 1: Content-Type : text/plain; charset=windows-1252
>>
>> 2) Classic XML:
>>
>> XMLResult r = getXML("testZipEncrypted.zip");
>> for (String n : r.metadata.names()) {
>> for (String v : r.metadata.getValues(n)) {
>> System.out.println("meta: "+n + " : "+v);
>> }
>> }
>> System.out.println(r.xml);
>>
>> Yields:
>> meta: X-Parsed-By : org.apache.tika.parser.DefaultParser
>> meta: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
>> meta: X-TIKA:EXCEPTION:embedded_stream_exception :
>> org.apache.tika.exception.EncryptedDocumentException: stream
>> (encrypted.txt) is encrypted
>> at org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageP
>> arser.java:306)
>> at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser
>> .java:230)
>> at org.apache.tika.parser.CompositeParser.parse(CompositeParser
>> .java:280)
>> at org.apache.tika.parser.CompositeParser.parse(CompositeParser
>> .java:280)
>> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectPars
>> er.java:135)
>> at org.apache.tika.TikaTest.getXML(TikaTest.java:205)
>> at org.apache.tika.TikaTest.getXML(TikaTest.java:191)
>> at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(Zi
>> pParserTest.java:206)
>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAcce
>> ssorImpl.java:62)
>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMe
>> thodAccessorImpl.java:43)
>> at java.lang.reflect.Method.invoke(Method.java:498)
>> at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
>> FrameworkMethod.java:50)
>> at org.junit.internal.runners.model.ReflectiveCallable.run(Refl
>> ectiveCallable.java:12)
>> at org.junit.runners.model.FrameworkMethod.invokeExplosively(Fr
>> ameworkMethod.java:47)
>> at org.junit.internal.runners.statements.InvokeMethod.evaluate(
>> InvokeMethod.java:17)
>> at org.junit.internal.runners.statements.RunBefores.evaluate(
>> RunBefores.java:26)
>> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
>> 4ClassRunner.java:78)
>> at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
>> 4ClassRunner.java:57)
>> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:
>> 71)
>> at org.junit.runners.ParentRunner.runChildren(ParentRunner.
>> java:288)
>> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:
>> 58)
>> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:
>> 268)
>> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
>> at com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs
>> (JUnit4IdeaTestRunner.java:68)
>> at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.star
>> tRunnerWithArgs(IdeaTestRunner.java:51)
>> at com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsA
>> ndStart(JUnitStarter.java:242)
>> at com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStart
>> er.java:70)
>>
>> meta: Content-Type : application/zip
>> <html xmlns="http://www.w3.org/1999/xhtml">
>> <head>
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser"
>> />
>> <meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser"
>> />
>> <meta name="Content-Type" content="application/zip" />
>> <title></title>
>> </head>
>> <body><div class="embedded" id="unencrypted.txt" />
>> <div class="package-entry"><h1>unencrypted.txt</h1>
>> <p>hello world
>> </p>
>>
>> </div>
>> <p>encrypted.txt</p>
>> </body></html>
>>
>> -----Original Message-----
>> From: Aeham Abushwashi [mailto:aeham.abushwashi@exonar.com]
>> Sent: Tuesday, May 23, 2017 3:47 AM
>> To: user@tika.apache.org; Tim Allison <ta...@apache.org>
>> Cc: dev@tika.apache.org
>> Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
>>
>> Thanks Tim and apologies if this isn't the right thread to ask this
>> question... any reason TIKA-2300 is not included despite FixVersions=1.15
>> on the ticket?
>>
>> On 22 May 2017 at 20:25, Tim Allison <ta...@apache.org> wrote:
>>
>> > A candidate for the Tika 1.15 release is available at:
>> > https://dist.apache.org/repos/dist/dev/tika/
>> >
>> > The release candidate is a zip archive of the sources in:
>> > https://github.com/apache/tika/tree/1.15-rc1
>> >
>> > The SHA1 checksum of the archive is
>> > e82697a6804373367fbba98d47426ab74e036eb1.
>> >
>> > In addition, a staged maven repository is available here:
>> > https://repository.apache.org/content/repositories/orgapachetika-1022
>> >
>> > Please vote on releasing this package as Apache Tika 1.15.
>> > The vote is open for the next 72 hours and passes if a majority of at
>> > least three +1 Tika PMC votes are cast.
>> >
>> > [ ] +1 Release this package as Apache Tika 1.15 [ ] -1 Do not release
>> > this package because...
>> >
>> > ***This is my first time as release manager. Please kick the tires
>> > thoroughly.***
>> >
>> > This is my +1.
>> >
>> > Cheers,
>> >
>> > Tim
>> >
>>
>>
>>
>> --
>> Aeham Abushwashi
>> Head of Engineering
>> Exonar
>>
>> v: video.exonar.com | w: exonar.com <http://www.exonar.com/> | twitter:
>> @exonar <https://twitter.com/exonar>
>>
>> GDPR: Why It’s About More Than Regulation: Download the White Paper Here <
>> https://goo.gl/1cSVzH>
>>
>> Trial <https://www.exonar.com/platform/> the capability on your own
>> organisation's data to understand what you've got, where it is and who has
>> access to it.
>>
>>
>> Come and meet us for a chat at Infosecurity Europe <
>> http://www.infosecurityeurope.com/>on stand S07 in the Cyber Innovation
>> Zone <http://www.infosecurityeurope.com/visit/whats-on/uk-cyber-
>> innovation-zone/>
>>
>>
>> Exonar Limited, registered in the UK, registration number 06439969 at 14
>> West Mills, Newbury, Berkshire, RG14 5HG. DISCLAIMER: This email and any
>> attachments to it may be confidential or private. If you have received it
>> in error, please notify us and delete it from your system.
>>
>
>
RE: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by "Allison, Timothy B." <ta...@mitre.org>.
Ugh. Thank you! Will re-spin for RC2 shortly.
-----Original Message-----
From: olegtikhonov@gmail.com [mailto:olegtikhonov@gmail.com] On Behalf Of Oleg Tikhonov
Sent: Tuesday, May 23, 2017 4:00 PM
To: dev@tika.apache.org
Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Hi guys,
Here is wrong ...
<parent>
<groupId>org.apache.tika</groupId>
<artifactId>tika-parent</artifactId>
<version>1.16-SNAPSHOT</version>
<relativePath>tika-parent/pom.xml</relativePath>
</parent>
If you are cloning the project, the upper level pom contains this.
The fix is to change 1.16-SNAPSHOT to 1.15
What i did was:
git clone https://github.com/apache/tika.git
Any suggestions?
BR,
OLeg
On Tue, May 23, 2017 at 3:01 PM, Allison, Timothy B. <ta...@mitre.org>
wrote:
> I _think_ it is included. See below for the two options for parsing
> testZipEncrypted.zip.
>
> Are you not seeing this behavior? Were you expecting different behavior?
>
>
> 1) RecursiveParserWrapper
>
> List<Metadata> metadataList = getRecursiveMetadata("
> testZipEncrypted.zip");
> debug(metadataList);
>
> yields:
>
> 0: X-Parsed-By : org.apache.tika.parser.DefaultParser
> 0: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
> 0: X-TIKA:EXCEPTION:embedded_stream_exception : org.apache.tika.exception.EncryptedDocumentException:
> stream (encrypted.txt) is encrypted
> at org.apache.tika.parser.pkg.PackageParser.parseEntry(
> PackageParser.java:306)
> at org.apache.tika.parser.pkg.PackageParser.parse(
> PackageParser.java:230)
> at org.apache.tika.parser.CompositeParser.parse(
> CompositeParser.java:280)
> at org.apache.tika.parser.CompositeParser.parse(
> CompositeParser.java:280)
> at org.apache.tika.parser.AutoDetectParser.parse(
> AutoDetectParser.java:135)
> at org.apache.tika.parser.RecursiveParserWrapper.parse(
> RecursiveParserWrapper.java:158)
> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.
> java:221)
> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.
> java:213)
> at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(
> ZipParserTest.java:213)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(
> NativeMethodAccessorImpl.java:62)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(
> DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
> FrameworkMethod.java:50)
> at org.junit.internal.runners.model.ReflectiveCallable.run(
> ReflectiveCallable.java:12)
> at org.junit.runners.model.FrameworkMethod.invokeExplosively(
> FrameworkMethod.java:47)
> at org.junit.internal.runners.statements.InvokeMethod.
> evaluate(InvokeMethod.java:17)
> at org.junit.internal.runners.statements.RunBefores.
> evaluate(RunBefores.java:26)
> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
> at org.junit.runners.BlockJUnit4ClassRunner.runChild(
> BlockJUnit4ClassRunner.java:78)
> at org.junit.runners.BlockJUnit4ClassRunner.runChild(
> BlockJUnit4ClassRunner.java:57)
> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> at org.junit.runners.ParentRunner.runChildren(
> ParentRunner.java:288)
> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> at org.junit.runners.ParentRunner$2.evaluate(
> ParentRunner.java:268)
> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
> at
> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(
> JUnit4IdeaTestRunner.java:68)
> at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.
> startRunnerWithArgs(IdeaTestRunner.java:51)
> at com.intellij.rt.execution.junit.JUnitStarter.
> prepareStreamsAndStart(JUnitStarter.java:242)
> at com.intellij.rt.execution.junit.JUnitStarter.main(
> JUnitStarter.java:70)
>
> 0: X-TIKA:parse_time_millis : 34
> 0: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <meta name="X-Parsed-By"
> content="org.apache.tika.parser.DefaultParser" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser"
> />
> <meta name="Content-Type" content="application/zip" /> <title></title>
> </head> <body><div class="embedded" id="unencrypted.txt" /> <div
> class="package-entry"><h1>unencrypted.txt</h1>
> </div>
> <p>encrypted.txt</p>
> </body></html>
> 0: Content-Type : application/zip
> 1: date : 2017-03-21T13:07:48Z
> 1: X-Parsed-By : org.apache.tika.parser.DefaultParser
> 1: X-Parsed-By : org.apache.tika.parser.txt.TXTParser
> 1: resourceName : unencrypted.txt
> 1: dcterms:modified : 2017-03-21T13:07:48Z
> 1: Last-Modified : 2017-03-21T13:07:48Z
> 1: Last-Save-Date : 2017-03-21T13:07:48Z
> 1: embeddedRelationshipId : unencrypted.txt
> 1: meta:save-date : 2017-03-21T13:07:48Z
> 1: Content-Encoding : windows-1252
> 1: X-TIKA:parse_time_millis : 3
> 1: modified : 2017-03-21T13:07:48Z
> 1: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <meta name="date" content="2017-03-21T13:07:48Z" /> <meta
> name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
> <meta name="X-Parsed-By"
> content="org.apache.tika.parser.txt.TXTParser" /> <meta
> name="resourceName" content="unencrypted.txt" /> <meta
> name="dcterms:modified" content="2017-03-21T13:07:48Z" /> <meta
> name="Last-Modified" content="2017-03-21T13:07:48Z" /> <meta
> name="Last-Save-Date" content="2017-03-21T13:07:48Z" /> <meta
> name="embeddedRelationshipId" content="unencrypted.txt" /> <meta
> name="meta:save-date" content="2017-03-21T13:07:48Z" /> <meta
> name="Content-Encoding" content="windows-1252" /> <meta
> name="modified" content="2017-03-21T13:07:48Z" /> <meta
> name="Content-Length" content="13" /> <meta
> name="X-TIKA:embedded_resource_path" content="/unencrypted.txt" />
> <meta name="Content-Type" content="text/plain; charset=windows-1252"
> /> <title></title> </head> <body><p>hello world </p> </body></html>
> 1: Content-Length : 13
> 1: X-TIKA:embedded_resource_path : /unencrypted.txt
> 1: Content-Type : text/plain; charset=windows-1252
>
> 2) Classic XML:
>
> XMLResult r = getXML("testZipEncrypted.zip");
> for (String n : r.metadata.names()) {
> for (String v : r.metadata.getValues(n)) {
> System.out.println("meta: "+n + " : "+v);
> }
> }
> System.out.println(r.xml);
>
> Yields:
> meta: X-Parsed-By : org.apache.tika.parser.DefaultParser
> meta: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
> meta: X-TIKA:EXCEPTION:embedded_stream_exception :
> org.apache.tika.exception.EncryptedDocumentException: stream
> (encrypted.txt) is encrypted
> at org.apache.tika.parser.pkg.PackageParser.parseEntry(
> PackageParser.java:306)
> at org.apache.tika.parser.pkg.PackageParser.parse(
> PackageParser.java:230)
> at org.apache.tika.parser.CompositeParser.parse(
> CompositeParser.java:280)
> at org.apache.tika.parser.CompositeParser.parse(
> CompositeParser.java:280)
> at org.apache.tika.parser.AutoDetectParser.parse(
> AutoDetectParser.java:135)
> at org.apache.tika.TikaTest.getXML(TikaTest.java:205)
> at org.apache.tika.TikaTest.getXML(TikaTest.java:191)
> at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(
> ZipParserTest.java:206)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(
> NativeMethodAccessorImpl.java:62)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(
> DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
> FrameworkMethod.java:50)
> at org.junit.internal.runners.model.ReflectiveCallable.run(
> ReflectiveCallable.java:12)
> at org.junit.runners.model.FrameworkMethod.invokeExplosively(
> FrameworkMethod.java:47)
> at org.junit.internal.runners.statements.InvokeMethod.
> evaluate(InvokeMethod.java:17)
> at org.junit.internal.runners.statements.RunBefores.
> evaluate(RunBefores.java:26)
> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
> at org.junit.runners.BlockJUnit4ClassRunner.runChild(
> BlockJUnit4ClassRunner.java:78)
> at org.junit.runners.BlockJUnit4ClassRunner.runChild(
> BlockJUnit4ClassRunner.java:57)
> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> at org.junit.runners.ParentRunner.runChildren(
> ParentRunner.java:288)
> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> at org.junit.runners.ParentRunner$2.evaluate(
> ParentRunner.java:268)
> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
> at
> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(
> JUnit4IdeaTestRunner.java:68)
> at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.
> startRunnerWithArgs(IdeaTestRunner.java:51)
> at com.intellij.rt.execution.junit.JUnitStarter.
> prepareStreamsAndStart(JUnitStarter.java:242)
> at com.intellij.rt.execution.junit.JUnitStarter.main(
> JUnitStarter.java:70)
>
> meta: Content-Type : application/zip
> <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <meta name="X-Parsed-By"
> content="org.apache.tika.parser.DefaultParser" /> <meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser"
> />
> <meta name="Content-Type" content="application/zip" /> <title></title>
> </head> <body><div class="embedded" id="unencrypted.txt" /> <div
> class="package-entry"><h1>unencrypted.txt</h1>
> <p>hello world
> </p>
>
> </div>
> <p>encrypted.txt</p>
> </body></html>
>
> -----Original Message-----
> From: Aeham Abushwashi [mailto:aeham.abushwashi@exonar.com]
> Sent: Tuesday, May 23, 2017 3:47 AM
> To: user@tika.apache.org; Tim Allison <ta...@apache.org>
> Cc: dev@tika.apache.org
> Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
>
> Thanks Tim and apologies if this isn't the right thread to ask this
> question... any reason TIKA-2300 is not included despite
> FixVersions=1.15 on the ticket?
>
> On 22 May 2017 at 20:25, Tim Allison <ta...@apache.org> wrote:
>
> > A candidate for the Tika 1.15 release is available at:
> > https://dist.apache.org/repos/dist/dev/tika/
> >
> > The release candidate is a zip archive of the sources in:
> > https://github.com/apache/tika/tree/1.15-rc1
> >
> > The SHA1 checksum of the archive is
> > e82697a6804373367fbba98d47426ab74e036eb1.
> >
> > In addition, a staged maven repository is available here:
> > https://repository.apache.org/content/repositories/orgapachetika-102
> > 2
> >
> > Please vote on releasing this package as Apache Tika 1.15.
> > The vote is open for the next 72 hours and passes if a majority of
> > at least three +1 Tika PMC votes are cast.
> >
> > [ ] +1 Release this package as Apache Tika 1.15 [ ] -1 Do not
> > release this package because...
> >
> > ***This is my first time as release manager. Please kick the tires
> > thoroughly.***
> >
> > This is my +1.
> >
> > Cheers,
> >
> > Tim
> >
>
>
>
> --
> Aeham Abushwashi
> Head of Engineering
> Exonar
>
> v: video.exonar.com | w: exonar.com <http://www.exonar.com/> | twitter:
> @exonar <https://twitter.com/exonar>
>
> GDPR: Why It’s About More Than Regulation: Download the White Paper
> Here < https://goo.gl/1cSVzH>
>
> Trial <https://www.exonar.com/platform/> the capability on your own
> organisation's data to understand what you've got, where it is and who
> has access to it.
>
>
> Come and meet us for a chat at Infosecurity Europe <http://www.
> infosecurityeurope.com/>on stand S07 in the Cyber Innovation Zone <
> http://www.infosecurityeurope.com/visit/whats-on/uk-cyber-innovation-z
> one/
> >
>
>
> Exonar Limited, registered in the UK, registration number 06439969 at
> 14 West Mills, Newbury, Berkshire, RG14 5HG. DISCLAIMER: This email
> and any attachments to it may be confidential or private. If you have
> received it in error, please notify us and delete it from your system.
>
Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by Oleg Tikhonov <ol...@apache.org>.
Hi guys,
Here is wrong ...
<parent>
<groupId>org.apache.tika</groupId>
<artifactId>tika-parent</artifactId>
<version>1.16-SNAPSHOT</version>
<relativePath>tika-parent/pom.xml</relativePath>
</parent>
If you are cloning the project, the upper level pom contains this.
The fix is to change 1.16-SNAPSHOT to 1.15
What i did was:
git clone https://github.com/apache/tika.git
Any suggestions?
BR,
OLeg
On Tue, May 23, 2017 at 3:01 PM, Allison, Timothy B. <ta...@mitre.org>
wrote:
> I _think_ it is included. See below for the two options for parsing
> testZipEncrypted.zip.
>
> Are you not seeing this behavior? Were you expecting different behavior?
>
>
> 1) RecursiveParserWrapper
>
> List<Metadata> metadataList = getRecursiveMetadata("
> testZipEncrypted.zip");
> debug(metadataList);
>
> yields:
>
> 0: X-Parsed-By : org.apache.tika.parser.DefaultParser
> 0: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
> 0: X-TIKA:EXCEPTION:embedded_stream_exception : org.apache.tika.exception.EncryptedDocumentException:
> stream (encrypted.txt) is encrypted
> at org.apache.tika.parser.pkg.PackageParser.parseEntry(
> PackageParser.java:306)
> at org.apache.tika.parser.pkg.PackageParser.parse(
> PackageParser.java:230)
> at org.apache.tika.parser.CompositeParser.parse(
> CompositeParser.java:280)
> at org.apache.tika.parser.CompositeParser.parse(
> CompositeParser.java:280)
> at org.apache.tika.parser.AutoDetectParser.parse(
> AutoDetectParser.java:135)
> at org.apache.tika.parser.RecursiveParserWrapper.parse(
> RecursiveParserWrapper.java:158)
> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.
> java:221)
> at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.
> java:213)
> at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(
> ZipParserTest.java:213)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(
> NativeMethodAccessorImpl.java:62)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(
> DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
> FrameworkMethod.java:50)
> at org.junit.internal.runners.model.ReflectiveCallable.run(
> ReflectiveCallable.java:12)
> at org.junit.runners.model.FrameworkMethod.invokeExplosively(
> FrameworkMethod.java:47)
> at org.junit.internal.runners.statements.InvokeMethod.
> evaluate(InvokeMethod.java:17)
> at org.junit.internal.runners.statements.RunBefores.
> evaluate(RunBefores.java:26)
> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
> at org.junit.runners.BlockJUnit4ClassRunner.runChild(
> BlockJUnit4ClassRunner.java:78)
> at org.junit.runners.BlockJUnit4ClassRunner.runChild(
> BlockJUnit4ClassRunner.java:57)
> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> at org.junit.runners.ParentRunner.runChildren(
> ParentRunner.java:288)
> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> at org.junit.runners.ParentRunner$2.evaluate(
> ParentRunner.java:268)
> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
> at com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(
> JUnit4IdeaTestRunner.java:68)
> at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.
> startRunnerWithArgs(IdeaTestRunner.java:51)
> at com.intellij.rt.execution.junit.JUnitStarter.
> prepareStreamsAndStart(JUnitStarter.java:242)
> at com.intellij.rt.execution.junit.JUnitStarter.main(
> JUnitStarter.java:70)
>
> 0: X-TIKA:parse_time_millis : 34
> 0: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
> <meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser"
> />
> <meta name="Content-Type" content="application/zip" />
> <title></title>
> </head>
> <body><div class="embedded" id="unencrypted.txt" />
> <div class="package-entry"><h1>unencrypted.txt</h1>
> </div>
> <p>encrypted.txt</p>
> </body></html>
> 0: Content-Type : application/zip
> 1: date : 2017-03-21T13:07:48Z
> 1: X-Parsed-By : org.apache.tika.parser.DefaultParser
> 1: X-Parsed-By : org.apache.tika.parser.txt.TXTParser
> 1: resourceName : unencrypted.txt
> 1: dcterms:modified : 2017-03-21T13:07:48Z
> 1: Last-Modified : 2017-03-21T13:07:48Z
> 1: Last-Save-Date : 2017-03-21T13:07:48Z
> 1: embeddedRelationshipId : unencrypted.txt
> 1: meta:save-date : 2017-03-21T13:07:48Z
> 1: Content-Encoding : windows-1252
> 1: X-TIKA:parse_time_millis : 3
> 1: modified : 2017-03-21T13:07:48Z
> 1: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <meta name="date" content="2017-03-21T13:07:48Z" />
> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
> <meta name="X-Parsed-By" content="org.apache.tika.parser.txt.TXTParser" />
> <meta name="resourceName" content="unencrypted.txt" />
> <meta name="dcterms:modified" content="2017-03-21T13:07:48Z" />
> <meta name="Last-Modified" content="2017-03-21T13:07:48Z" />
> <meta name="Last-Save-Date" content="2017-03-21T13:07:48Z" />
> <meta name="embeddedRelationshipId" content="unencrypted.txt" />
> <meta name="meta:save-date" content="2017-03-21T13:07:48Z" />
> <meta name="Content-Encoding" content="windows-1252" />
> <meta name="modified" content="2017-03-21T13:07:48Z" />
> <meta name="Content-Length" content="13" />
> <meta name="X-TIKA:embedded_resource_path" content="/unencrypted.txt" />
> <meta name="Content-Type" content="text/plain; charset=windows-1252" />
> <title></title>
> </head>
> <body><p>hello world
> </p>
> </body></html>
> 1: Content-Length : 13
> 1: X-TIKA:embedded_resource_path : /unencrypted.txt
> 1: Content-Type : text/plain; charset=windows-1252
>
> 2) Classic XML:
>
> XMLResult r = getXML("testZipEncrypted.zip");
> for (String n : r.metadata.names()) {
> for (String v : r.metadata.getValues(n)) {
> System.out.println("meta: "+n + " : "+v);
> }
> }
> System.out.println(r.xml);
>
> Yields:
> meta: X-Parsed-By : org.apache.tika.parser.DefaultParser
> meta: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
> meta: X-TIKA:EXCEPTION:embedded_stream_exception :
> org.apache.tika.exception.EncryptedDocumentException: stream
> (encrypted.txt) is encrypted
> at org.apache.tika.parser.pkg.PackageParser.parseEntry(
> PackageParser.java:306)
> at org.apache.tika.parser.pkg.PackageParser.parse(
> PackageParser.java:230)
> at org.apache.tika.parser.CompositeParser.parse(
> CompositeParser.java:280)
> at org.apache.tika.parser.CompositeParser.parse(
> CompositeParser.java:280)
> at org.apache.tika.parser.AutoDetectParser.parse(
> AutoDetectParser.java:135)
> at org.apache.tika.TikaTest.getXML(TikaTest.java:205)
> at org.apache.tika.TikaTest.getXML(TikaTest.java:191)
> at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(
> ZipParserTest.java:206)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(
> NativeMethodAccessorImpl.java:62)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(
> DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
> FrameworkMethod.java:50)
> at org.junit.internal.runners.model.ReflectiveCallable.run(
> ReflectiveCallable.java:12)
> at org.junit.runners.model.FrameworkMethod.invokeExplosively(
> FrameworkMethod.java:47)
> at org.junit.internal.runners.statements.InvokeMethod.
> evaluate(InvokeMethod.java:17)
> at org.junit.internal.runners.statements.RunBefores.
> evaluate(RunBefores.java:26)
> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
> at org.junit.runners.BlockJUnit4ClassRunner.runChild(
> BlockJUnit4ClassRunner.java:78)
> at org.junit.runners.BlockJUnit4ClassRunner.runChild(
> BlockJUnit4ClassRunner.java:57)
> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> at org.junit.runners.ParentRunner.runChildren(
> ParentRunner.java:288)
> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> at org.junit.runners.ParentRunner$2.evaluate(
> ParentRunner.java:268)
> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
> at com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(
> JUnit4IdeaTestRunner.java:68)
> at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.
> startRunnerWithArgs(IdeaTestRunner.java:51)
> at com.intellij.rt.execution.junit.JUnitStarter.
> prepareStreamsAndStart(JUnitStarter.java:242)
> at com.intellij.rt.execution.junit.JUnitStarter.main(
> JUnitStarter.java:70)
>
> meta: Content-Type : application/zip
> <html xmlns="http://www.w3.org/1999/xhtml">
> <head>
> <meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
> <meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser"
> />
> <meta name="Content-Type" content="application/zip" />
> <title></title>
> </head>
> <body><div class="embedded" id="unencrypted.txt" />
> <div class="package-entry"><h1>unencrypted.txt</h1>
> <p>hello world
> </p>
>
> </div>
> <p>encrypted.txt</p>
> </body></html>
>
> -----Original Message-----
> From: Aeham Abushwashi [mailto:aeham.abushwashi@exonar.com]
> Sent: Tuesday, May 23, 2017 3:47 AM
> To: user@tika.apache.org; Tim Allison <ta...@apache.org>
> Cc: dev@tika.apache.org
> Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
>
> Thanks Tim and apologies if this isn't the right thread to ask this
> question... any reason TIKA-2300 is not included despite FixVersions=1.15
> on the ticket?
>
> On 22 May 2017 at 20:25, Tim Allison <ta...@apache.org> wrote:
>
> > A candidate for the Tika 1.15 release is available at:
> > https://dist.apache.org/repos/dist/dev/tika/
> >
> > The release candidate is a zip archive of the sources in:
> > https://github.com/apache/tika/tree/1.15-rc1
> >
> > The SHA1 checksum of the archive is
> > e82697a6804373367fbba98d47426ab74e036eb1.
> >
> > In addition, a staged maven repository is available here:
> > https://repository.apache.org/content/repositories/orgapachetika-1022
> >
> > Please vote on releasing this package as Apache Tika 1.15.
> > The vote is open for the next 72 hours and passes if a majority of at
> > least three +1 Tika PMC votes are cast.
> >
> > [ ] +1 Release this package as Apache Tika 1.15 [ ] -1 Do not release
> > this package because...
> >
> > ***This is my first time as release manager. Please kick the tires
> > thoroughly.***
> >
> > This is my +1.
> >
> > Cheers,
> >
> > Tim
> >
>
>
>
> --
> Aeham Abushwashi
> Head of Engineering
> Exonar
>
> v: video.exonar.com | w: exonar.com <http://www.exonar.com/> | twitter:
> @exonar <https://twitter.com/exonar>
>
> GDPR: Why It’s About More Than Regulation: Download the White Paper Here <
> https://goo.gl/1cSVzH>
>
> Trial <https://www.exonar.com/platform/> the capability on your own
> organisation's data to understand what you've got, where it is and who has
> access to it.
>
>
> Come and meet us for a chat at Infosecurity Europe <http://www.
> infosecurityeurope.com/>on stand S07 in the Cyber Innovation Zone <
> http://www.infosecurityeurope.com/visit/whats-on/uk-cyber-innovation-zone/
> >
>
>
> Exonar Limited, registered in the UK, registration number 06439969 at 14
> West Mills, Newbury, Berkshire, RG14 5HG. DISCLAIMER: This email and any
> attachments to it may be confidential or private. If you have received it
> in error, please notify us and delete it from your system.
>
Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by Aeham Abushwashi <ae...@exonar.com>.
You're absolutely right. I was going by the lack of a release note on the
rc1 branch, but looking at the code the change is actually there.
Thanks!
Aeham
Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by Aeham Abushwashi <ae...@exonar.com>.
You're absolutely right. I was going by the lack of a release note on the
rc1 branch, but looking at the code the change is actually there.
Thanks!
Aeham
RE: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by "Allison, Timothy B." <ta...@mitre.org>.
I _think_ it is included. See below for the two options for parsing testZipEncrypted.zip.
Are you not seeing this behavior? Were you expecting different behavior?
1) RecursiveParserWrapper
List<Metadata> metadataList = getRecursiveMetadata("testZipEncrypted.zip");
debug(metadataList);
yields:
0: X-Parsed-By : org.apache.tika.parser.DefaultParser
0: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
0: X-TIKA:EXCEPTION:embedded_stream_exception : org.apache.tika.exception.EncryptedDocumentException: stream (encrypted.txt) is encrypted
at org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageParser.java:306)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:230)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at org.apache.tika.parser.RecursiveParserWrapper.parse(RecursiveParserWrapper.java:158)
at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:221)
at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:213)
at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(ZipParserTest.java:213)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
at com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(JUnit4IdeaTestRunner.java:68)
at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.startRunnerWithArgs(IdeaTestRunner.java:51)
at com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsAndStart(JUnitStarter.java:242)
at com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStarter.java:70)
0: X-TIKA:parse_time_millis : 34
0: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser" />
<meta name="Content-Type" content="application/zip" />
<title></title>
</head>
<body><div class="embedded" id="unencrypted.txt" />
<div class="package-entry"><h1>unencrypted.txt</h1>
</div>
<p>encrypted.txt</p>
</body></html>
0: Content-Type : application/zip
1: date : 2017-03-21T13:07:48Z
1: X-Parsed-By : org.apache.tika.parser.DefaultParser
1: X-Parsed-By : org.apache.tika.parser.txt.TXTParser
1: resourceName : unencrypted.txt
1: dcterms:modified : 2017-03-21T13:07:48Z
1: Last-Modified : 2017-03-21T13:07:48Z
1: Last-Save-Date : 2017-03-21T13:07:48Z
1: embeddedRelationshipId : unencrypted.txt
1: meta:save-date : 2017-03-21T13:07:48Z
1: Content-Encoding : windows-1252
1: X-TIKA:parse_time_millis : 3
1: modified : 2017-03-21T13:07:48Z
1: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="date" content="2017-03-21T13:07:48Z" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.txt.TXTParser" />
<meta name="resourceName" content="unencrypted.txt" />
<meta name="dcterms:modified" content="2017-03-21T13:07:48Z" />
<meta name="Last-Modified" content="2017-03-21T13:07:48Z" />
<meta name="Last-Save-Date" content="2017-03-21T13:07:48Z" />
<meta name="embeddedRelationshipId" content="unencrypted.txt" />
<meta name="meta:save-date" content="2017-03-21T13:07:48Z" />
<meta name="Content-Encoding" content="windows-1252" />
<meta name="modified" content="2017-03-21T13:07:48Z" />
<meta name="Content-Length" content="13" />
<meta name="X-TIKA:embedded_resource_path" content="/unencrypted.txt" />
<meta name="Content-Type" content="text/plain; charset=windows-1252" />
<title></title>
</head>
<body><p>hello world
</p>
</body></html>
1: Content-Length : 13
1: X-TIKA:embedded_resource_path : /unencrypted.txt
1: Content-Type : text/plain; charset=windows-1252
2) Classic XML:
XMLResult r = getXML("testZipEncrypted.zip");
for (String n : r.metadata.names()) {
for (String v : r.metadata.getValues(n)) {
System.out.println("meta: "+n + " : "+v);
}
}
System.out.println(r.xml);
Yields:
meta: X-Parsed-By : org.apache.tika.parser.DefaultParser
meta: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
meta: X-TIKA:EXCEPTION:embedded_stream_exception : org.apache.tika.exception.EncryptedDocumentException: stream (encrypted.txt) is encrypted
at org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageParser.java:306)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:230)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at org.apache.tika.TikaTest.getXML(TikaTest.java:205)
at org.apache.tika.TikaTest.getXML(TikaTest.java:191)
at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(ZipParserTest.java:206)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
at com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(JUnit4IdeaTestRunner.java:68)
at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.startRunnerWithArgs(IdeaTestRunner.java:51)
at com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsAndStart(JUnitStarter.java:242)
at com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStarter.java:70)
meta: Content-Type : application/zip
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser" />
<meta name="Content-Type" content="application/zip" />
<title></title>
</head>
<body><div class="embedded" id="unencrypted.txt" />
<div class="package-entry"><h1>unencrypted.txt</h1>
<p>hello world
</p>
</div>
<p>encrypted.txt</p>
</body></html>
-----Original Message-----
From: Aeham Abushwashi [mailto:aeham.abushwashi@exonar.com]
Sent: Tuesday, May 23, 2017 3:47 AM
To: user@tika.apache.org; Tim Allison <ta...@apache.org>
Cc: dev@tika.apache.org
Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Thanks Tim and apologies if this isn't the right thread to ask this question... any reason TIKA-2300 is not included despite FixVersions=1.15 on the ticket?
On 22 May 2017 at 20:25, Tim Allison <ta...@apache.org> wrote:
> A candidate for the Tika 1.15 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> https://github.com/apache/tika/tree/1.15-rc1
>
> The SHA1 checksum of the archive is
> e82697a6804373367fbba98d47426ab74e036eb1.
>
> In addition, a staged maven repository is available here:
> https://repository.apache.org/content/repositories/orgapachetika-1022
>
> Please vote on releasing this package as Apache Tika 1.15.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.15 [ ] -1 Do not release
> this package because...
>
> ***This is my first time as release manager. Please kick the tires
> thoroughly.***
>
> This is my +1.
>
> Cheers,
>
> Tim
>
--
Aeham Abushwashi
Head of Engineering
Exonar
v: video.exonar.com | w: exonar.com <http://www.exonar.com/> | twitter:
@exonar <https://twitter.com/exonar>
GDPR: Why It’s About More Than Regulation: Download the White Paper Here <https://goo.gl/1cSVzH>
Trial <https://www.exonar.com/platform/> the capability on your own organisation's data to understand what you've got, where it is and who has access to it.
Come and meet us for a chat at Infosecurity Europe <http://www.infosecurityeurope.com/>on stand S07 in the Cyber Innovation Zone <http://www.infosecurityeurope.com/visit/whats-on/uk-cyber-innovation-zone/>
Exonar Limited, registered in the UK, registration number 06439969 at 14 West Mills, Newbury, Berkshire, RG14 5HG. DISCLAIMER: This email and any attachments to it may be confidential or private. If you have received it in error, please notify us and delete it from your system.
RE: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by "Allison, Timothy B." <ta...@mitre.org>.
I _think_ it is included. See below for the two options for parsing testZipEncrypted.zip.
Are you not seeing this behavior? Were you expecting different behavior?
1) RecursiveParserWrapper
List<Metadata> metadataList = getRecursiveMetadata("testZipEncrypted.zip");
debug(metadataList);
yields:
0: X-Parsed-By : org.apache.tika.parser.DefaultParser
0: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
0: X-TIKA:EXCEPTION:embedded_stream_exception : org.apache.tika.exception.EncryptedDocumentException: stream (encrypted.txt) is encrypted
at org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageParser.java:306)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:230)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at org.apache.tika.parser.RecursiveParserWrapper.parse(RecursiveParserWrapper.java:158)
at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:221)
at org.apache.tika.TikaTest.getRecursiveMetadata(TikaTest.java:213)
at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(ZipParserTest.java:213)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
at com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(JUnit4IdeaTestRunner.java:68)
at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.startRunnerWithArgs(IdeaTestRunner.java:51)
at com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsAndStart(JUnitStarter.java:242)
at com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStarter.java:70)
0: X-TIKA:parse_time_millis : 34
0: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser" />
<meta name="Content-Type" content="application/zip" />
<title></title>
</head>
<body><div class="embedded" id="unencrypted.txt" />
<div class="package-entry"><h1>unencrypted.txt</h1>
</div>
<p>encrypted.txt</p>
</body></html>
0: Content-Type : application/zip
1: date : 2017-03-21T13:07:48Z
1: X-Parsed-By : org.apache.tika.parser.DefaultParser
1: X-Parsed-By : org.apache.tika.parser.txt.TXTParser
1: resourceName : unencrypted.txt
1: dcterms:modified : 2017-03-21T13:07:48Z
1: Last-Modified : 2017-03-21T13:07:48Z
1: Last-Save-Date : 2017-03-21T13:07:48Z
1: embeddedRelationshipId : unencrypted.txt
1: meta:save-date : 2017-03-21T13:07:48Z
1: Content-Encoding : windows-1252
1: X-TIKA:parse_time_millis : 3
1: modified : 2017-03-21T13:07:48Z
1: X-TIKA:content : <html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="date" content="2017-03-21T13:07:48Z" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.txt.TXTParser" />
<meta name="resourceName" content="unencrypted.txt" />
<meta name="dcterms:modified" content="2017-03-21T13:07:48Z" />
<meta name="Last-Modified" content="2017-03-21T13:07:48Z" />
<meta name="Last-Save-Date" content="2017-03-21T13:07:48Z" />
<meta name="embeddedRelationshipId" content="unencrypted.txt" />
<meta name="meta:save-date" content="2017-03-21T13:07:48Z" />
<meta name="Content-Encoding" content="windows-1252" />
<meta name="modified" content="2017-03-21T13:07:48Z" />
<meta name="Content-Length" content="13" />
<meta name="X-TIKA:embedded_resource_path" content="/unencrypted.txt" />
<meta name="Content-Type" content="text/plain; charset=windows-1252" />
<title></title>
</head>
<body><p>hello world
</p>
</body></html>
1: Content-Length : 13
1: X-TIKA:embedded_resource_path : /unencrypted.txt
1: Content-Type : text/plain; charset=windows-1252
2) Classic XML:
XMLResult r = getXML("testZipEncrypted.zip");
for (String n : r.metadata.names()) {
for (String v : r.metadata.getValues(n)) {
System.out.println("meta: "+n + " : "+v);
}
}
System.out.println(r.xml);
Yields:
meta: X-Parsed-By : org.apache.tika.parser.DefaultParser
meta: X-Parsed-By : org.apache.tika.parser.pkg.PackageParser
meta: X-TIKA:EXCEPTION:embedded_stream_exception : org.apache.tika.exception.EncryptedDocumentException: stream (encrypted.txt) is encrypted
at org.apache.tika.parser.pkg.PackageParser.parseEntry(PackageParser.java:306)
at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:230)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at org.apache.tika.TikaTest.getXML(TikaTest.java:205)
at org.apache.tika.TikaTest.getXML(TikaTest.java:191)
at org.apache.tika.parser.pkg.ZipParserTest.testZipEncrypted(ZipParserTest.java:206)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
at com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs(JUnit4IdeaTestRunner.java:68)
at com.intellij.rt.execution.junit.IdeaTestRunner$Repeater.startRunnerWithArgs(IdeaTestRunner.java:51)
at com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsAndStart(JUnitStarter.java:242)
at com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStarter.java:70)
meta: Content-Type : application/zip
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta name="X-Parsed-By" content="org.apache.tika.parser.DefaultParser" />
<meta name="X-Parsed-By" content="org.apache.tika.parser.pkg.PackageParser" />
<meta name="Content-Type" content="application/zip" />
<title></title>
</head>
<body><div class="embedded" id="unencrypted.txt" />
<div class="package-entry"><h1>unencrypted.txt</h1>
<p>hello world
</p>
</div>
<p>encrypted.txt</p>
</body></html>
-----Original Message-----
From: Aeham Abushwashi [mailto:aeham.abushwashi@exonar.com]
Sent: Tuesday, May 23, 2017 3:47 AM
To: user@tika.apache.org; Tim Allison <ta...@apache.org>
Cc: dev@tika.apache.org
Subject: Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Thanks Tim and apologies if this isn't the right thread to ask this question... any reason TIKA-2300 is not included despite FixVersions=1.15 on the ticket?
On 22 May 2017 at 20:25, Tim Allison <ta...@apache.org> wrote:
> A candidate for the Tika 1.15 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> https://github.com/apache/tika/tree/1.15-rc1
>
> The SHA1 checksum of the archive is
> e82697a6804373367fbba98d47426ab74e036eb1.
>
> In addition, a staged maven repository is available here:
> https://repository.apache.org/content/repositories/orgapachetika-1022
>
> Please vote on releasing this package as Apache Tika 1.15.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.15 [ ] -1 Do not release
> this package because...
>
> ***This is my first time as release manager. Please kick the tires
> thoroughly.***
>
> This is my +1.
>
> Cheers,
>
> Tim
>
--
Aeham Abushwashi
Head of Engineering
Exonar
v: video.exonar.com | w: exonar.com <http://www.exonar.com/> | twitter:
@exonar <https://twitter.com/exonar>
GDPR: Why It’s About More Than Regulation: Download the White Paper Here <https://goo.gl/1cSVzH>
Trial <https://www.exonar.com/platform/> the capability on your own organisation's data to understand what you've got, where it is and who has access to it.
Come and meet us for a chat at Infosecurity Europe <http://www.infosecurityeurope.com/>on stand S07 in the Cyber Innovation Zone <http://www.infosecurityeurope.com/visit/whats-on/uk-cyber-innovation-zone/>
Exonar Limited, registered in the UK, registration number 06439969 at 14 West Mills, Newbury, Berkshire, RG14 5HG. DISCLAIMER: This email and any attachments to it may be confidential or private. If you have received it in error, please notify us and delete it from your system.
Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by Aeham Abushwashi <ae...@exonar.com>.
Thanks Tim and apologies if this isn't the right thread to ask this
question... any reason TIKA-2300 is not included despite FixVersions=1.15
on the ticket?
On 22 May 2017 at 20:25, Tim Allison <ta...@apache.org> wrote:
> A candidate for the Tika 1.15 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> https://github.com/apache/tika/tree/1.15-rc1
>
> The SHA1 checksum of the archive is
> e82697a6804373367fbba98d47426ab74e036eb1.
>
> In addition, a staged maven repository is available here:
> https://repository.apache.org/content/repositories/orgapachetika-1022
>
> Please vote on releasing this package as Apache Tika 1.15.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.15
> [ ] -1 Do not release this package because...
>
> ***This is my first time as release manager. Please kick the tires
> thoroughly.***
>
> This is my +1.
>
> Cheers,
>
> Tim
>
--
Aeham Abushwashi
Head of Engineering
Exonar
v: video.exonar.com | w: exonar.com <http://www.exonar.com/> | twitter:
@exonar <https://twitter.com/exonar>
GDPR: Why It’s About More Than Regulation: Download the White Paper Here
<https://goo.gl/1cSVzH>
Trial <https://www.exonar.com/platform/> the capability on your own
organisation's data to understand what you've got, where it is and who has
access to it.
Come and meet us for a chat at Infosecurity Europe
<http://www.infosecurityeurope.com/>on stand S07 in the Cyber Innovation
Zone
<http://www.infosecurityeurope.com/visit/whats-on/uk-cyber-innovation-zone/>
Exonar Limited, registered in the UK, registration number 06439969 at 14
West Mills, Newbury, Berkshire, RG14 5HG. DISCLAIMER: This email and any
attachments to it may be confidential or private. If you have received it
in error, please notify us and delete it from your system.
RE: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by "Allison, Timothy B." <ta...@mitre.org>.
I updated my key here: https://people.apache.org/keys/committer/
It will take 24 hours to become visible.
-----Original Message-----
From: Tim Allison [mailto:tallison@apache.org]
Sent: Monday, May 22, 2017 3:25 PM
To: dev@tika.apache.org; user@tika.apache.org
Subject: [VOTE] Release Apache Tika 1.15 Candidate #1
A candidate for the Tika 1.15 release is available at:
https://dist.apache.org/repos/dist/dev/tika/
The release candidate is a zip archive of the sources in:
https://github.com/apache/tika/tree/1.15-rc1
The SHA1 checksum of the archive is
e82697a6804373367fbba98d47426ab74e036eb1.
In addition, a staged maven repository is available here:
https://repository.apache.org/content/repositories/orgapachetika-1022
Please vote on releasing this package as Apache Tika 1.15.
The vote is open for the next 72 hours and passes if a majority of at least three +1 Tika PMC votes are cast.
[ ] +1 Release this package as Apache Tika 1.15 [ ] -1 Do not release this package because...
***This is my first time as release manager. Please kick the tires thoroughly.***
This is my +1.
Cheers,
Tim
Re: [VOTE] Release Apache Tika 1.15 Candidate #1
Posted by Aeham Abushwashi <ae...@exonar.com>.
Thanks Tim and apologies if this isn't the right thread to ask this
question... any reason TIKA-2300 is not included despite FixVersions=1.15
on the ticket?
On 22 May 2017 at 20:25, Tim Allison <ta...@apache.org> wrote:
> A candidate for the Tika 1.15 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
> https://github.com/apache/tika/tree/1.15-rc1
>
> The SHA1 checksum of the archive is
> e82697a6804373367fbba98d47426ab74e036eb1.
>
> In addition, a staged maven repository is available here:
> https://repository.apache.org/content/repositories/orgapachetika-1022
>
> Please vote on releasing this package as Apache Tika 1.15.
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.15
> [ ] -1 Do not release this package because...
>
> ***This is my first time as release manager. Please kick the tires
> thoroughly.***
>
> This is my +1.
>
> Cheers,
>
> Tim
>
--
Aeham Abushwashi
Head of Engineering
Exonar
v: video.exonar.com | w: exonar.com <http://www.exonar.com/> | twitter:
@exonar <https://twitter.com/exonar>
GDPR: Why It’s About More Than Regulation: Download the White Paper Here
<https://goo.gl/1cSVzH>
Trial <https://www.exonar.com/platform/> the capability on your own
organisation's data to understand what you've got, where it is and who has
access to it.
Come and meet us for a chat at Infosecurity Europe
<http://www.infosecurityeurope.com/>on stand S07 in the Cyber Innovation
Zone
<http://www.infosecurityeurope.com/visit/whats-on/uk-cyber-innovation-zone/>
Exonar Limited, registered in the UK, registration number 06439969 at 14
West Mills, Newbury, Berkshire, RG14 5HG. DISCLAIMER: This email and any
attachments to it may be confidential or private. If you have received it
in error, please notify us and delete it from your system.