- [jira] [Commented] (TIKA-3126) Consider new endpoint (metadata + content non recursive) - posted by "Carina Antunes (Jira)" <ji...@apache.org> on 2020/07/01 08:47:00 UTC, 6 replies.
- [jira] [Commented] (TIKA-3097) Out of memory while parsing docx - posted by "suchendra (Jira)" <ji...@apache.org> on 2020/07/02 04:03:00 UTC, 2 replies.
- /rmeta/text - with option to leave out the child files - posted by Nicholas DiPiazza <ni...@gmail.com> on 2020/07/02 13:41:34 UTC, 2 replies.
- [GitHub] [tika] nddipiazza opened a new pull request #323: writeLimit and maxEmbeddedResources for recursive parsing - add header - posted by GitBox <gi...@apache.org> on 2020/07/04 05:07:50 UTC, 0 replies.
- [GitHub] [tika] nddipiazza commented on pull request #315: TIKA-3082 OpenAPI for tika-server - posted by GitBox <gi...@apache.org> on 2020/07/04 17:04:38 UTC, 1 replies.
- [jira] [Commented] (TIKA-3082) OpenAPI for tika-server - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/07/04 17:05:00 UTC, 7 replies.
- [GitHub] [tika] nddipiazza edited a comment on pull request #315: TIKA-3082 OpenAPI for tika-server - posted by GitBox <gi...@apache.org> on 2020/07/04 17:05:01 UTC, 3 replies.
- [GitHub] [tika] lewismc commented on pull request #315: TIKA-3082 OpenAPI for tika-server - posted by GitBox <gi...@apache.org> on 2020/07/04 20:34:48 UTC, 1 replies.
- FW: [EXTERNAL] Unstructured Extraction by tika(Pdf) - posted by Chris Mattmann <ma...@apache.org> on 2020/07/06 15:41:39 UTC, 1 replies.
- Fwd: Unstructured Extraction by tika(Pdf) - posted by vijaya saradhi reddy <ds...@gmail.com> on 2020/07/06 15:46:31 UTC, 2 replies.
- [jira] [Commented] (TIKA-3115) Detect parquet files - posted by "Kaushik Gunasekaran (Jira)" <ji...@apache.org> on 2020/07/07 13:33:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3115) Detect parquet files - posted by "Kaushik Gunasekaran (Jira)" <ji...@apache.org> on 2020/07/07 13:33:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3128) MOV file produces RuntimeException with 1.24.1, used to work with earlier version - posted by "Sameer Apte (Jira)" <ji...@apache.org> on 2020/07/07 14:31:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3128) MOV file produces RuntimeException with 1.24.1, used to work with earlier version 1.19.1 - posted by "Sameer Apte (Jira)" <ji...@apache.org> on 2020/07/07 14:32:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3121) Rename master branch - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/08 20:51:00 UTC, 5 replies.
- tika server - spawned children die over time - posted by Nicholas DiPiazza <ni...@gmail.com> on 2020/07/09 01:54:39 UTC, 3 replies.
- Re: Need some help understanding why this code gets stuck in timeout exceptions - posted by Nicholas DiPiazza <ni...@gmail.com> on 2020/07/09 01:59:58 UTC, 1 replies.
- [GitHub] [tika] nddipiazza commented on pull request #323: Address TIKA-3126 - add headers to control writeLimit and maxEmbeddedResources for recursive parsing - posted by GitBox <gi...@apache.org> on 2020/07/09 02:02:33 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3126) Consider new endpoint (metadata + content non recursive) - posted by "Nicholas DiPiazza (Jira)" <ji...@apache.org> on 2020/07/09 14:16:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3097) Out of memory while parsing docx - posted by "suchendra (Jira)" <ji...@apache.org> on 2020/07/09 14:17:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3129) Tika server - track a "last parsed on" timestamp and provide an endpoint to get it - posted by "Nicholas DiPiazza (Jira)" <ji...@apache.org> on 2020/07/09 15:42:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3130) Add "ICC:" as a namespace ICC metadata - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/09 21:33:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-1570) Seeking a stop method for better use with Apache Commons Daemon - posted by "Michael Davis (Jira)" <ji...@apache.org> on 2020/07/10 15:38:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3121) Rename master branch - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/10 16:33:00 UTC, 0 replies.
- [GitHub] [tika] michaelwda opened a new pull request #324: fix for TIKA-1570 contributed by michaelwda - posted by GitBox <gi...@apache.org> on 2020/07/10 19:29:36 UTC, 0 replies.
- [jira] [Created] (TIKA-3131) PDFParserConfig default values were accidentally swapped - posted by "Clark Perkins (Jira)" <ji...@apache.org> on 2020/07/10 23:14:00 UTC, 0 replies.
- [GitHub] [tika] clarkperkins opened a new pull request #325: TIKA-3131 -- swap default values of averageCharTolerance and spacingT… - posted by GitBox <gi...@apache.org> on 2020/07/10 23:19:07 UTC, 0 replies.
- [jira] [Commented] (TIKA-3131) PDFParserConfig default values were accidentally swapped - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/07/10 23:20:00 UTC, 4 replies.
- [jira] [Updated] (TIKA-3131) PDFParserConfig default values were accidentally swapped - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2020/07/11 08:05:00 UTC, 1 replies.
- [jira] [Updated] (TIKA-3112) NullPointerException at AbstractPDF2XHTML.extractXMPXFA() when using tika-app GUI - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2020/07/11 08:56:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3132) Many missing sub-class-of application/xml (or at least text/plain) for +xml types - posted by "Francisco Tolmasky (Jira)" <ji...@apache.org> on 2020/07/12 20:48:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3132) Many missing sub-class-of application/xml (or at least text/plain) for +xml types - posted by "Francisco Tolmasky (Jira)" <ji...@apache.org> on 2020/07/12 20:50:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3127) When using html parser any empty attribute sets value to attribute name e.g. link gives href="href" - posted by "chenshuming (Jira)" <ji...@apache.org> on 2020/07/13 11:04:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3133) /rmeta endpoint should not hard code writeLimit and maxEmbeddedResources - posted by "Nicholas DiPiazza (Jira)" <ji...@apache.org> on 2020/07/14 15:04:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3133) /rmeta endpoint should not hard code writeLimit and maxEmbeddedResources - posted by "Nicholas DiPiazza (Jira)" <ji...@apache.org> on 2020/07/14 15:04:00 UTC, 0 replies.
- [GitHub] [tika] nddipiazza closed pull request #323: Address TIKA-3126 - add headers to control writeLimit and maxEmbeddedResources for recursive parsing - posted by GitBox <gi...@apache.org> on 2020/07/14 15:04:02 UTC, 0 replies.
- [GitHub] [tika] nddipiazza opened a new pull request #326: TIKA-3133 - writeLimit and maxEmbeddedResources for recursive parsing - add header - posted by GitBox <gi...@apache.org> on 2020/07/14 15:09:45 UTC, 0 replies.
- [jira] [Commented] (TIKA-3133) /rmeta endpoint should not hard code writeLimit and maxEmbeddedResources - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/07/14 15:10:00 UTC, 3 replies.
- [GitHub] [tika] nddipiazza commented on pull request #326: TIKA-3133 - writeLimit and maxEmbeddedResources for recursive parsing - add header - posted by GitBox <gi...@apache.org> on 2020/07/14 15:10:58 UTC, 0 replies.
- [jira] [Created] (TIKA-3134) totalCharsPerPage and unmappedUnicodeCharsPerPage configuration - posted by "Dávid Tóth (Jira)" <ji...@apache.org> on 2020/07/15 13:46:00 UTC, 0 replies.
- [jira] [Closed] (TIKA-3097) Out of memory while parsing docx - posted by "suchendra (Jira)" <ji...@apache.org> on 2020/07/15 14:02:00 UTC, 0 replies.
- [jira] [Closed] (TIKA-3098) Detecting embedded image - posted by "suchendra (Jira)" <ji...@apache.org> on 2020/07/15 14:03:00 UTC, 0 replies.
- [GitHub] [tika] tothd91 opened a new pull request #327: fix for TIKA-3134 contributed by tothd - posted by GitBox <gi...@apache.org> on 2020/07/15 14:36:23 UTC, 0 replies.
- [jira] [Commented] (TIKA-3134) totalCharsPerPage and unmappedUnicodeCharsPerPage configuration - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/07/15 14:37:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3135) No need to spool file for HeifParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/15 17:35:00 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #326: TIKA-3133 - writeLimit and maxEmbeddedResources for recursive parsing - add header - posted by GitBox <gi...@apache.org> on 2020/07/15 17:37:05 UTC, 0 replies.
- [GitHub] [tika] tballison commented on pull request #327: fix for TIKA-3134 contributed by tothd - posted by GitBox <gi...@apache.org> on 2020/07/15 17:41:07 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3133) /rmeta endpoint should not hard code writeLimit and maxEmbeddedResources - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/15 19:02:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3135) No need to spool file for HeifParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/15 19:03:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3126) Consider new endpoint (metadata + content non recursive) - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/15 19:04:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3133) /rmeta endpoint should not hard code writeLimit and maxEmbeddedResources - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/15 19:05:00 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #325: TIKA-3131 -- swap default values of averageCharTolerance and spacingT… - posted by GitBox <gi...@apache.org> on 2020/07/15 19:08:14 UTC, 0 replies.
- [GitHub] [tika] tballison commented on pull request #325: TIKA-3131 -- swap default values of averageCharTolerance and spacingT… - posted by GitBox <gi...@apache.org> on 2020/07/15 19:08:25 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3131) PDFParserConfig default values were accidentally swapped - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/15 19:10:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3135) No need to spool file for HeifParser - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/07/15 20:15:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3130) Add "ICC:" as a namespace ICC metadata - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/07/15 20:15:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3136) Add additional OCR support : EasyOCR - posted by "Kranthi Kiran GV (Jira)" <ji...@apache.org> on 2020/07/16 06:09:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3136) Add additional OCR support : EasyOCR - posted by "Kranthi Kiran GV (Jira)" <ji...@apache.org> on 2020/07/16 06:12:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-3088) java.lang.NullPointerException when converting Open Office presentation (.odp) to html - posted by "Andreas Weber (Jira)" <ji...@apache.org> on 2020/07/16 08:30:00 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-3088) java.lang.NullPointerException when converting Open Office presentation (.odp) to html - posted by "Andreas Weber (Jira)" <ji...@apache.org> on 2020/07/16 08:38:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3137) Enable a metadata filter for the RecursiveParserWrapper - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 15:05:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3138) PDF parser with XFA produce malformed XML - posted by "wiwi (Jira)" <ji...@apache.org> on 2020/07/16 15:17:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3138) PDF parser with XFA produce malformed XML - posted by "wiwi (Jira)" <ji...@apache.org> on 2020/07/16 15:19:00 UTC, 5 replies.
- [jira] [Assigned] (TIKA-3138) PDF parser with XFA produce malformed XML - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 16:02:00 UTC, 0 replies.
- [GitHub] [tika] jendabenda opened a new pull request #328: fix for TIKA-3139 contributed by wiwi - posted by GitBox <gi...@apache.org> on 2020/07/16 16:58:49 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #328: fix for TIKA-3139 contributed by wiwi - posted by GitBox <gi...@apache.org> on 2020/07/16 17:00:24 UTC, 0 replies.
- [jira] [Commented] (TIKA-3138) PDF parser with XFA produce malformed XML - posted by "wiwi (Jira)" <ji...@apache.org> on 2020/07/16 17:05:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3138) PDF parser with XFA produce malformed XML - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 19:37:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3137) Enable a metadata filter for the RecursiveParserWrapper - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 20:00:03 UTC, 4 replies.
- [jira] [Created] (TIKA-3139) Use static AutoDetectParser in tika-server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 20:14:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3139) Use static AutoDetectParser in tika-server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 20:21:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3073) Add gzip in- and out- interceptors to tika-server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 20:23:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3073) Add gzip in- and out- interceptors to tika-server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 20:24:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3069) Unpack with header X-Tika-PDFextractInlineImages does not extract content from image - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 20:26:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3140) Add a metadata filter for tika-eval - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/16 20:47:00 UTC, 0 replies.
- [GitHub] [tika] tballison opened a new pull request #329: TIKA-3140 - posted by GitBox <gi...@apache.org> on 2020/07/16 21:28:47 UTC, 0 replies.
- [jira] [Commented] (TIKA-3140) Add a metadata filter for tika-eval - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/07/16 21:29:00 UTC, 7 replies.
- [jira] [Created] (TIKA-3141) LINUX - Tika shouldn't throw an exception for an empty TIKA_CONFIG environment variable value - posted by "Josh Burchard (Jira)" <ji...@apache.org> on 2020/07/16 22:43:00 UTC, 0 replies.
- JDK 15 is now in Rampdown Phase Two - posted by Rory O'Donnell <ro...@oracle.com> on 2020/07/17 08:36:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-3141) LINUX - Tika shouldn't throw an exception for an empty TIKA_CONFIG environment variable value - posted by "Josh Burchard (Jira)" <ji...@apache.org> on 2020/07/17 13:09:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3104) Detection of memgraph files exported from Xcode - posted by "Parth (Jira)" <ji...@apache.org> on 2020/07/17 14:15:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3104) Detection of memgraph files exported from Xcode - posted by "Parth (Jira)" <ji...@apache.org> on 2020/07/17 14:22:00 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #329: TIKA-3140 - posted by GitBox <gi...@apache.org> on 2020/07/17 17:19:27 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3140) Add a metadata filter for tika-eval - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/17 17:20:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3137) Enable a metadata filter for the RecursiveParserWrapper - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/17 17:20:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3129) Tika server - track a "last parsed on" timestamp and provide an endpoint to get it - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/17 17:24:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3129) Tika server - track a "last parsed on" timestamp and provide an endpoint to get it - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/17 17:25:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3142) Update Jenkins for main branch, maybe turn on more modern jdks - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/17 17:37:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3142) Update Jenkins for main branch, maybe turn on more modern jdks - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/17 19:36:00 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #330: Update urls of some bin files - posted by GitBox <gi...@apache.org> on 2020/07/18 03:14:15 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #331: Fix some test error when jvm's default language is not en - posted by GitBox <gi...@apache.org> on 2020/07/18 03:16:59 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #332: Fix can't del tmp file in windows - posted by GitBox <gi...@apache.org> on 2020/07/18 03:24:55 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #333: Adds github action CI builds on Ubuntu - posted by GitBox <gi...@apache.org> on 2020/07/18 03:57:18 UTC, 0 replies.
- [GitHub] [tika] THausherr commented on pull request #332: Fix can't del tmp file in windows - posted by GitBox <gi...@apache.org> on 2020/07/18 17:01:01 UTC, 1 replies.
- [GitHub] [tika] tothd91 commented on pull request #327: fix for TIKA-3134 contributed by tothd - posted by GitBox <gi...@apache.org> on 2020/07/20 07:20:55 UTC, 0 replies.
- [jira] [Created] (TIKA-3143) Enable custom resources and writers in tika-server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/20 17:02:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3143) Enable custom resources and writers in tika-server - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/20 17:10:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3088) java.lang.NullPointerException when converting Open Office presentation (.odp) to html - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/21 17:31:00 UTC, 0 replies.
- Tika extract images - posted by Shaojun Ni <sh...@theagilehub.net> on 2020/07/22 15:06:44 UTC, 2 replies.
- [jira] [Created] (TIKA-3144) Detecting hprof memory dump files exported from Android Studio - posted by "Parth (Jira)" <ji...@apache.org> on 2020/07/23 16:33:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3144) Detecting hprof memory dump files exported from Android Studio - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/23 17:08:00 UTC, 17 replies.
- [jira] [Comment Edited] (TIKA-3144) Detecting hprof memory dump files exported from Android Studio - posted by "Parth (Jira)" <ji...@apache.org> on 2020/07/23 17:17:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3145) Add a content digester to tika-eval text stats - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/23 21:35:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3146) Add Nutch's TextProfileSignature digest to tika-eval's text stats - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/23 21:39:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3147) String punctuation in lang id component within tika-eval - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/24 21:18:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3145) Add a content digester to tika-eval text stats - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/24 21:21:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3146) Add Nutch's TextProfileSignature digest to tika-eval's text stats - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/24 21:22:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3145) Add a content digester to tika-eval text stats - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/07/24 22:00:00 UTC, 4 replies.
- [jira] [Commented] (TIKA-3146) Add Nutch's TextProfileSignature digest to tika-eval's text stats - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/07/24 22:00:00 UTC, 4 replies.
- [jira] [Created] (TIKA-3148) Remove apache-cxf dependency from tika-parsers - posted by "Rapster (Jira)" <ji...@apache.org> on 2020/07/26 22:40:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3147) String punctuation in lang id component within tika-eval - posted by "Kenneth William Krugler (Jira)" <ji...@apache.org> on 2020/07/27 14:21:01 UTC, 3 replies.
- [jira] [Commented] (TIKA-3148) Remove apache-cxf dependency from tika-parsers - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/27 17:22:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3147) Strip punctuation in lang id component within tika-eval - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/27 17:27:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3147) Strip punctuation in lang id component within tika-eval - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/07/27 17:27:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3147) Strip punctuation in lang id component within tika-eval - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/07/27 17:56:00 UTC, 8 replies.
- [jira] [Commented] (TIKA-3141) LINUX - Tika shouldn't throw an exception for an empty TIKA_CONFIG environment variable value - posted by "chenshuming (Jira)" <ji...@apache.org> on 2020/07/28 09:33:00 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-3141) LINUX - Tika shouldn't throw an exception for an empty TIKA_CONFIG environment variable value - posted by "chenshuming (Jira)" <ji...@apache.org> on 2020/07/28 09:34:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3149) Tikka 1.18 not working with tess4j 3.4.8 on linux - posted by "Vishakha (Jira)" <ji...@apache.org> on 2020/07/28 13:23:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3150) MimeType Regex End of Binary Fails - posted by "David Margolis (Jira)" <ji...@apache.org> on 2020/07/29 17:10:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3150) MimeType Regex End of Binary Fails - posted by "David Margolis (Jira)" <ji...@apache.org> on 2020/07/29 17:12:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3150) MimeType Regex End of Binary File Fails - posted by "David Margolis (Jira)" <ji...@apache.org> on 2020/07/29 18:26:00 UTC, 2 replies.
- [jira] [Created] (TIKA-3151) Update jaxb-runtime and remove activation dependencies & exclusions - posted by "Hans Brende (Jira)" <ji...@apache.org> on 2020/07/29 20:04:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3152) Calling autoDetectParser.parse results in Unexpected RuntimeException on .msg file with large attachment. - posted by "Caleb Postlethwait (Jira)" <ji...@apache.org> on 2020/07/29 21:14:00 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #334: Tika-3141 : add empty environment variable handle - posted by GitBox <gi...@apache.org> on 2020/07/30 11:54:47 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee commented on pull request #332: Fix can't del tmp file in windows - posted by GitBox <gi...@apache.org> on 2020/07/30 12:22:56 UTC, 0 replies.
- PRs on github need reviews - posted by Peter Lee <pe...@apache.org> on 2020/07/30 12:39:12 UTC, 0 replies.
- [GitHub] [tika] keithrbennett commented on a change in pull request #334: Tika-3141 : add empty environment variable handle - posted by GitBox <gi...@apache.org> on 2020/07/30 15:59:48 UTC, 0 replies.
- [GitHub] [tika] THausherr edited a comment on pull request #332: Fix can't del tmp file in windows - posted by GitBox <gi...@apache.org> on 2020/07/30 19:02:12 UTC, 0 replies.