You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Updated] (TIKA-3203) MP4Parser temporary files are not deleted from Tomcat temp folder - posted by "Isabelle Giguere (Jira)" <ji...@apache.org> on 2020/10/01 13:09:00 UTC, 0 replies.
- JDK 16 EA build 18 is now available - posted by Rory O'Donnell <ro...@oracle.com> on 2020/10/02 09:09:23 UTC, 0 replies.
- Looking for a small PDF file with fontbox fonts - posted by Sergey Beryozkin <sb...@gmail.com> on 2020/10/02 17:27:52 UTC, 1 replies.
- [GitHub] [tika] trejkaz commented on pull request #299: fix for TIKA-3003 contributed by cesarsotovalero - posted by GitBox <gi...@apache.org> on 2020/10/07 00:57:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-3003) Remove unused dependencies - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/10/07 00:58:00 UTC, 3 replies.
- [GitHub] [tika] trejkaz edited a comment on pull request #299: fix for TIKA-3003 contributed by cesarsotovalero - posted by GitBox <gi...@apache.org> on 2020/10/07 00:58:16 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee commented on pull request #366: Fix build fail caused by can't find test file - posted by GitBox <gi...@apache.org> on 2020/10/07 02:19:29 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee closed pull request #366: Fix build fail caused by can't find test file - posted by GitBox <gi...@apache.org> on 2020/10/07 02:19:29 UTC, 0 replies.
- [jira] [Commented] (TIKA-3204) License incompliance with xmp-core 6.1.10 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/07 17:09:00 UTC, 2 replies.
- [GitHub] [tika] tballison commented on pull request #299: fix for TIKA-3003 contributed by cesarsotovalero - posted by GitBox <gi...@apache.org> on 2020/10/07 17:29:18 UTC, 1 replies.
- [GitHub] [tika] tballison merged pull request #312: TIKA-3044 add -C/--content cli option using WriteOutContentHandler - posted by GitBox <gi...@apache.org> on 2020/10/07 17:47:47 UTC, 0 replies.
- [jira] [Commented] (TIKA-3044) add -C/--content cli option using WriteOutContentHandler - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/10/07 17:48:00 UTC, 2 replies.
- [jira] [Resolved] (TIKA-3044) add -C/--content cli option using WriteOutContentHandler - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/07 18:07:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3203) MP4Parser temporary files are not deleted from Tomcat temp folder - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/07 19:11:00 UTC, 5 replies.
- [jira] [Comment Edited] (TIKA-3203) MP4Parser temporary files are not deleted from Tomcat temp folder - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/07 19:13:00 UTC, 1 replies.
- [GitHub] [tika] JohnLBergqvist closed pull request #259: TIKA-2783: Add unit tests for org.apache.tika.mime.HexCoDec - posted by GitBox <gi...@apache.org> on 2020/10/08 14:41:52 UTC, 1 replies.
- [jira] [Commented] (TIKA-2783) Add unit tests for org.apache.tika.mime.HexCoDec - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/10/08 14:42:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3207) Invalid language code in TesseractOCRConfig - posted by "Daniel Smyda (Jira)" <ji...@apache.org> on 2020/10/08 21:53:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3207) Invalid language code in TesseractOCRConfig - posted by "Daniel Smyda (Jira)" <ji...@apache.org> on 2020/10/08 21:54:00 UTC, 7 replies.
- [jira] [Commented] (TIKA-3207) Invalid language code in TesseractOCRConfig - posted by "Daniel Smyda (Jira)" <ji...@apache.org> on 2020/10/08 22:16:00 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-3207) Invalid language code in TesseractOCRConfig - posted by "Daniel Smyda (Jira)" <ji...@apache.org> on 2020/10/08 22:17:00 UTC, 2 replies.
- [jira] [Resolved] (TIKA-3207) Invalid language code in TesseractOCRConfig - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/09 15:57:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3203) MP4Parser temporary files are not deleted from Tomcat temp folder - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/09 20:22:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3208) tika-server Detect when using fileUrl header does not close the file handle - posted by "Darren Cooper (Jira)" <ji...@apache.org> on 2020/10/09 20:39:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3208) tika-server Detect when using fileUrl header does not close the file handle - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/09 21:08:00 UTC, 0 replies.
- Fwd: XLSX wrapped in an OLE2 CompObj/Package - should WorkbookFactory handle it? - posted by Tim Allison <ta...@apache.org> on 2020/10/09 21:15:25 UTC, 5 replies.
- [jira] [Commented] (TIKA-3208) tika-server Detect when using fileUrl header does not close the file handle - posted by "Darren Cooper (Jira)" <ji...@apache.org> on 2020/10/09 22:26:00 UTC, 2 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #367: Modify some code use try-with-resources - posted by GitBox <gi...@apache.org> on 2020/10/13 08:12:56 UTC, 0 replies.
- [jira] [Commented] (TIKA-3128) MOV file produces RuntimeException with 1.24.1, used to work with earlier version 1.19.1 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/13 17:11:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3209) Different between PictureRunMapper in POI and PicturesSource in Tika - posted by "Peter Lee (Jira)" <ji...@apache.org> on 2020/10/14 02:17:00 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #368: Modify some code use for-each loop - posted by GitBox <gi...@apache.org> on 2020/10/15 06:38:23 UTC, 0 replies.
- [jira] [Created] (TIKA-3210) tika status endpoint should have a Node UUID - posted by "Nicholas DiPiazza (Jira)" <ji...@apache.org> on 2020/10/16 16:00:03 UTC, 0 replies.
- [jira] [Updated] (TIKA-3210) tika status endpoint should have a Node UUID - posted by "Nicholas DiPiazza (Jira)" <ji...@apache.org> on 2020/10/16 16:00:09 UTC, 0 replies.
- [jira] [Commented] (TIKA-3210) tika status endpoint should have a Node UUID - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/16 16:57:00 UTC, 9 replies.
- [jira] [Comment Edited] (TIKA-3210) tika status endpoint should have a Node UUID - posted by "Nicholas DiPiazza (Jira)" <ji...@apache.org> on 2020/10/16 19:20:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3209) Different between PictureRunMapper in POI and PicturesSource in Tika - posted by "Peter Lee (Jira)" <ji...@apache.org> on 2020/10/19 01:22:00 UTC, 2 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #369: Use IOException instead of IOExceptionWithCause - posted by GitBox <gi...@apache.org> on 2020/10/19 09:09:22 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3210) tika status endpoint should have a Node UUID - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/19 20:14:00 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee commented on pull request #369: Use IOException instead of IOExceptionWithCause - posted by GitBox <gi...@apache.org> on 2020/10/20 01:27:48 UTC, 3 replies.
- [GitHub] [tika] kkrugler commented on pull request #369: Use IOException instead of IOExceptionWithCause - posted by GitBox <gi...@apache.org> on 2020/10/20 03:52:18 UTC, 1 replies.
- [OT] Looking for Apache POI help - posted by Sergey Beryozkin <sb...@gmail.com> on 2020/10/20 11:53:43 UTC, 0 replies.
- Tika 1.25 release date? - posted by Alexander Klimetschek <ak...@adobe.com.INVALID> on 2020/10/21 01:44:06 UTC, 5 replies.
- [GitHub] [tika] tballison commented on pull request #369: Use IOException instead of IOExceptionWithCause - posted by GitBox <gi...@apache.org> on 2020/10/21 13:43:45 UTC, 0 replies.
- [jira] [Created] (TIKA-3211) Junrar does not support Rar5, 7-Zip-JBinding does, so how about implement RarParser using 7-Zip-JBinding? - posted by "tuister (Jira)" <ji...@apache.org> on 2020/10/22 02:12:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3211) Junrar does not support Rar5, 7-Zip-JBinding does, so how about implement RarParser using 7-Zip-JBinding? - posted by "Nick Burch (Jira)" <ji...@apache.org> on 2020/10/22 08:23:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2939) Figure out how to allow OCR'ing of large PDFs via tika-server - posted by "Daniel Coldrick (Jira)" <ji...@apache.org> on 2020/10/22 09:25:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3212) Tika extractor not properly extracting hindi texts - posted by "Adarsh (Jira)" <ji...@apache.org> on 2020/10/22 10:21:00 UTC, 0 replies.
- JDK 16 EA build 21 is available - posted by Rory O'Donnell <ro...@oracle.com> on 2020/10/23 09:38:16 UTC, 0 replies.
- [jira] [Created] (TIKA-3213) Consider migrating universalcharsetdetector to a live fork - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/23 17:50:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3213) Consider migrating universalcharsetdetector to a live fork - posted by "Peter Lee (Jira)" <ji...@apache.org> on 2020/10/24 08:21:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3119) General upgrades for 1.25 - posted by "Thamme Gowda (Jira)" <ji...@apache.org> on 2020/10/25 20:05:00 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #370: Add dependency commons-io to tika-core in main branch - posted by GitBox <gi...@apache.org> on 2020/10/26 03:11:59 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #371: Simplify some loop by use method Collection.addAll and method Arrarys.asList - posted by GitBox <gi...@apache.org> on 2020/10/28 03:22:30 UTC, 0 replies.
- [jira] [Created] (TIKA-3214) Tika Fails to extract content from MS Word - posted by "Sergey Smolyakov (Jira)" <ji...@apache.org> on 2020/10/28 08:54:00 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee opened a new pull request #372: Modify some calls of method Collection.toArray - posted by GitBox <gi...@apache.org> on 2020/10/29 07:03:02 UTC, 0 replies.
- [jira] [Commented] (TIKA-3009) XML Parser reset() detection no working in weblogic 12.2.1.3 - posted by "vigi (Jira)" <ji...@apache.org> on 2020/10/29 10:41:00 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-3009) XML Parser reset() detection no working in weblogic 12.2.1.3 - posted by "vigi (Jira)" <ji...@apache.org> on 2020/10/29 10:54:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3215) Add a detector that calls the 'file' command - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/29 13:29:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3215) Add a detector that calls the 'file' command - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/10/29 16:44:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3216) Add FileProfiler to tika-eval - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/29 20:59:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3216) Add FileProfiler to tika-eval - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/10/30 16:20:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3217) Extract metadata from XMPPDFSchema in PDFs' XMP - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/30 18:35:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3216) Add FileProfiler to tika-eval - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/30 19:09:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3215) Add a detector that calls the 'file' command - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/30 19:09:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3217) Extract metadata from XMPPDFSchema in PDFs' XMP - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/30 19:10:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3217) Extract metadata from XMPPDFSchema in PDFs' XMP - posted by "Hudson (Jira)" <ji...@apache.org> on 2020/10/30 19:34:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3009) XML Parser reset() detection no working in weblogic 12.2.1.3 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2020/10/30 20:08:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3218) Wrong comment for method sortLoadedClasses in ServiceLoaderUtils - posted by "Peter Lee (Jira)" <ji...@apache.org> on 2020/10/31 04:05:00 UTC, 0 replies.
- When calling /rmeta/text, is there a way to time box the request to a certain amount of time? - posted by Nicholas DiPiazza <ni...@gmail.com> on 2020/10/31 18:43:24 UTC, 0 replies.