You are viewing a plain text version of this content. The canonical link for it is here.
- [GitHub] [tika-docker] dameikle commented on a change in pull request #2: set tesseract ocr langauges as docker build args - posted by GitBox <gi...@apache.org> on 2021/01/01 16:25:51 UTC, 0 replies.
- [GitHub] [tika-docker] mhf-ir commented on a change in pull request #2: set tesseract ocr langauges as docker build args - posted by GitBox <gi...@apache.org> on 2021/01/01 16:39:14 UTC, 0 replies.
- [GitHub] [tika-docker] mhf-ir commented on pull request #2: set tesseract ocr langauges as docker build args - posted by GitBox <gi...@apache.org> on 2021/01/01 17:15:35 UTC, 0 replies.
- [GitHub] [tika-docker] mhf-ir edited a comment on pull request #2: set tesseract ocr langauges as docker build args - posted by GitBox <gi...@apache.org> on 2021/01/01 17:16:06 UTC, 0 replies.
- [jira] [Created] (TIKA-3257) RAR files extracted content is not separated from the inner file names - posted by "Yahav Amsalem (Jira)" <ji...@apache.org> on 2021/01/02 20:45:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/04 14:54:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/04 14:54:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/01/04 17:24:00 UTC, 24 replies.
- [jira] [Created] (TIKA-3259) Improve logging for TesseractOCRParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/04 19:11:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/04 22:16:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/04 22:26:00 UTC, 20 replies.
- [jira] [Updated] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/04 22:31:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3261) Text file is parsed by "EmptyParser" but the file does contain what looks like valid text - posted by "Josh Burchard (Jira)" <ji...@apache.org> on 2021/01/04 22:34:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3261) Text file is parsed by "EmptyParser" but the file does contain what looks like valid text - posted by "Josh Burchard (Jira)" <ji...@apache.org> on 2021/01/04 22:35:00 UTC, 3 replies.
- [jira] [Commented] (TIKA-3261) Text file is parsed by "EmptyParser" but the file does contain what looks like valid text - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/05 14:02:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3261) Text file is parsed by "EmptyParser" but the file does contain what looks like valid text - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/05 14:05:00 UTC, 0 replies.
- [jira] [Closed] (TIKA-3261) Text file is parsed by "EmptyParser" but the file does contain what looks like valid text - posted by "Josh Burchard (Jira)" <ji...@apache.org> on 2021/01/05 15:06:01 UTC, 0 replies.
- [jira] [Created] (TIKA-3262) Undo reverse ClassLoader sort in Tika 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/05 15:35:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2548) Add Python Path configuration to TesseractOCRParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/05 19:52:01 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3259) Improve logging for TesseractOCRParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/05 19:56:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3263) WriteLimitReachedException is not public - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/05 20:09:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3263) WriteLimitReachedException is not public - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/05 20:16:00 UTC, 2 replies.
- jira->dev list down? - posted by Tim Allison <ta...@apache.org> on 2021/01/05 22:19:22 UTC, 0 replies.
- [jira] [Commented] (TIKA-3259) Improve logging for TesseractOCRParser - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/05 22:43:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3262) Undo reverse ClassLoader sort in Tika 2.0.0 - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/05 22:43:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3263) WriteLimitReachedException is not public - posted by "Kenneth William Krugler (Jira)" <ji...@apache.org> on 2021/01/05 23:21:01 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-3260) Update rotation.py to work with python3 and a more modern matplotlib - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/05 23:35:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/06 12:48:00 UTC, 11 replies.
- [jira] [Created] (TIKA-3264) Improve the per page OCR heuristics for AUTO mode - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/06 15:44:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3265) Tika 2.0.0 -- improvements to image preprocessing in TesseractOCRParser - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/06 16:47:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/06 19:47:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3265) Tika 2.0.0 -- improvements to image preprocessing in TesseractOCRParser - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/06 19:53:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3267) Method getEnableImageProcessing() in TesseractOCRConfig should be renamed - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/07 17:13:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-3267) Method getEnableImageProcessing() in TesseractOCRConfig should be renamed - posted by "Nick Burch (Jira)" <ji...@apache.org> on 2021/01/07 18:28:00 UTC, 3 replies.
- [jira] [Created] (TIKA-3268) TikaConfig -- throw exception if exclude parser can't be loaded - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/07 18:46:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3268) TikaConfig -- throw exception if exclude parser can't be loaded - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/07 22:40:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-1735) Unsupported AutoCAD drawing version: AC1027 - posted by "Nicholas DiPiazza (Jira)" <ji...@apache.org> on 2021/01/08 19:52:00 UTC, 3 replies.
- [jira] [Commented] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/08 22:46:00 UTC, 4 replies.
- [GitHub] [tika] nddipiazza opened a new pull request #395: TIKA-1735 - ac2017 and add ability to use dwgread if it is installed. - posted by GitBox <gi...@apache.org> on 2021/01/09 21:28:42 UTC, 1 replies.
- [jira] [Commented] (TIKA-3244) General upgrades for 1.26 - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/10 14:00:00 UTC, 9 replies.
- [GitHub] [tika] tballison opened a new pull request #396: TIKA-3266 - posted by GitBox <gi...@apache.org> on 2021/01/11 20:47:23 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #396: TIKA-3266 - posted by GitBox <gi...@apache.org> on 2021/01/11 20:48:24 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3266) Generalize OCRParser so that users can service load custom ocr parsers - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/11 21:10:00 UTC, 0 replies.
- 2.0.0-ALPHA? - posted by Tim Allison <ta...@apache.org> on 2021/01/11 21:21:17 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3178) Tika 2.0.0 -- Add back OSGi bundles for Tika parsers - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/11 21:54:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3269) Update artifact releases for 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/11 21:55:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3269) Update artifact releases for 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/11 22:01:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3270) Render non-text in PDFs for OCR - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/12 15:42:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3271) Change default image resize size in TesseractParser's pre-processing step - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/12 18:15:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3271) Change default image resize size in TesseractParser's pre-processing step - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/12 18:24:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/01/12 19:16:00 UTC, 5 replies.
- [jira] [Updated] (TIKA-3270) Render non-text in PDFs for OCR - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/01/12 19:30:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3267) Method getEnableImageProcessing() in TesseractOCRConfig should be renamed - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/12 20:11:01 UTC, 0 replies.
- [jira] [Created] (TIKA-3272) Improve Rotation handling - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/13 14:26:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3273) Further metadat cleanup for TIka 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/13 14:50:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3273) Further metadata cleanup for TIka 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/13 14:51:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3274) Tika 2.0.0 -- Move parser specific metadata out of tika-core to parser modules - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/13 15:18:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3273) Further metadata cleanup for TIka 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/13 15:47:00 UTC, 0 replies.
- Looking for PR code review for DWG parser changes - posted by Nicholas DiPiazza <ni...@gmail.com> on 2021/01/13 16:28:02 UTC, 2 replies.
- droste.zip - posted by Tim Allison <ta...@apache.org> on 2021/01/13 16:38:31 UTC, 1 replies.
- [jira] [Commented] (TIKA-3273) Further metadata cleanup for TIka 2.0.0 - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/13 17:16:01 UTC, 0 replies.
- OCR Testing - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/13 18:29:30 UTC, 0 replies.
- OCR testing - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/13 18:32:12 UTC, 1 replies.
- [jira] [Resolved] (TIKA-3271) Change default image resize size in TesseractParser's pre-processing step - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/13 21:07:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3258) Run OCR on PDFs with 'auto' mode as default in Tika 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/13 21:08:00 UTC, 0 replies.
- Fwd: Python dependency - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/14 01:17:17 UTC, 0 replies.
- [VOTE] Release Apache Tika 2.0.0-ALPHA Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2021/01/14 01:19:16 UTC, 7 replies.
- Re: Python dependency - posted by Tim Allison <ta...@apache.org> on 2021/01/14 01:28:57 UTC, 1 replies.
- [GitHub] [tika] PeterAlfredLee closed pull request #333: Adds github action CI builds on Ubuntu - posted by GitBox <gi...@apache.org> on 2021/01/14 02:16:52 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee closed pull request #380: Add two tests for `OPCPackageDetector` - posted by GitBox <gi...@apache.org> on 2021/01/14 02:17:02 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee closed pull request #382: Simplify some code in OPCPackageDetector#detect - posted by GitBox <gi...@apache.org> on 2021/01/14 02:17:02 UTC, 0 replies.
- [GitHub] [tika] PeterAlfredLee merged pull request #369: Use IOException instead of IOExceptionWithCause - posted by GitBox <gi...@apache.org> on 2021/01/14 02:18:03 UTC, 0 replies.
- [jira] [Commented] (TIKA-3274) Tika 2.0.0 -- Move parser specific metadata out of tika-core to parser modules - posted by "Nick Burch (Jira)" <ji...@apache.org> on 2021/01/14 09:18:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3272) Improve Rotation handling - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/14 14:00:00 UTC, 0 replies.
- [GitHub] [tika] peterkronenberg opened a new pull request #397: Tika 3272 - posted by GitBox <gi...@apache.org> on 2021/01/14 15:01:45 UTC, 0 replies.
- [jira] [Commented] (TIKA-3226) Add custom connector endpoint - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/14 19:43:00 UTC, 8 replies.
- JDK 16 is now in Rampdown Phase Two - posted by Rory O'Donnell <ro...@oracle.com> on 2021/01/15 09:10:50 UTC, 0 replies.
- Re: Help in tika-python - posted by Chris Mattmann <ma...@apache.org> on 2021/01/15 18:24:27 UTC, 0 replies.
- [RESULT][VOTE] Release Apache Tika 2.0.0-ALPHA Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2021/01/16 15:56:00 UTC, 2 replies.
- Config Tika Server - posted by Nilton Monteiro <al...@hotmail.com> on 2021/01/18 12:08:20 UTC, 2 replies.
- [jira] [Created] (TIKA-3275) Document major changes in 2.x on our wiki - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/18 13:11:00 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 2.0.0-ALPHA released - posted by Tim Allison <ta...@apache.org> on 2021/01/18 13:17:23 UTC, 2 replies.
- site? - posted by Tim Allison <ta...@apache.org> on 2021/01/18 13:19:13 UTC, 1 replies.
- [jira] [Created] (TIKA-3276) Upgrade netCDF-Java library (aka CDM) to latest version - posted by "Sandeep Kulkarni (Jira)" <ji...@apache.org> on 2021/01/18 15:19:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3277) Apache POI 5.0.0 released - posted by "PJ Fanning (Jira)" <ji...@apache.org> on 2021/01/18 16:19:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3277) Apache POI 5.0.0 released - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/19 13:44:00 UTC, 2 replies.
- [jira] [Created] (TIKA-3278) Add core artifacts to release for 2.0.0-BETA - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/19 13:45:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3279) Misleading text for download link - posted by "Sebb (Jira)" <ji...@apache.org> on 2021/01/19 16:28:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3279) Misleading text for download link - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/19 21:57:00 UTC, 0 replies.
- [GitHub] [tika] tballison commented on a change in pull request #397: Tika 3272 - Remove usage of rotation.py and Python dependency - posted by GitBox <gi...@apache.org> on 2021/01/21 14:13:50 UTC, 1 replies.
- [GitHub] [tika] peterkronenberg commented on pull request #397: Tika 3272 - Remove usage of rotation.py and Python dependency - posted by GitBox <gi...@apache.org> on 2021/01/21 15:40:20 UTC, 1 replies.
- [GitHub] [tika] tballison merged pull request #397: Tika 3272 - Remove usage of rotation.py and Python dependency - posted by GitBox <gi...@apache.org> on 2021/01/21 16:39:37 UTC, 1 replies.
- [jira] [Created] (TIKA-3280) server-core not bundled w server-classic in 2.0.0-ALPHA - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/23 12:51:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3244) General upgrades for 1.26 - posted by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2021/01/23 13:09:00 UTC, 1 replies.
- [GitHub] [tika] dameikle opened a new pull request #398: Removed exclusion of tika-server-core from tika-server-classic - posted by GitBox <gi...@apache.org> on 2021/01/25 23:30:31 UTC, 1 replies.
- [GitHub] [tika] dameikle commented on pull request #398: Removed exclusion of tika-server-core from tika-server-classic - posted by GitBox <gi...@apache.org> on 2021/01/25 23:32:31 UTC, 1 replies.
- [GitHub] [tika] tballison commented on pull request #398: Removed exclusion of tika-server-core from tika-server-classic - posted by GitBox <gi...@apache.org> on 2021/01/26 11:48:00 UTC, 0 replies.
- [GitHub] [tika] tballison opened a new pull request #399: TIKA-3226 - posted by GitBox <gi...@apache.org> on 2021/01/26 20:30:34 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #399: TIKA-3226 - posted by GitBox <gi...@apache.org> on 2021/01/26 20:31:55 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3226) Add custom connector endpoint - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/26 20:34:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3093) Enable tika-server to forward parse results to another endpoint - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/26 20:35:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2972) Allow users to specify a list/map of ContentHandlerFactories in tika-config.xml - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/26 20:36:00 UTC, 0 replies.
- [GitHub] [tika] tballison commented on pull request #395: TIKA-1735 - add AC1027 and AC1032 and add ability to use dwgread if it is installed. - posted by GitBox <gi...@apache.org> on 2021/01/26 20:59:31 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3280) server-core not bundled w server-classic in 2.0.0-ALPHA - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/26 22:56:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3281) Clean up RecursiveParserWrapper for 2.0.0 - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/26 22:57:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3282) OneNote Parser breaks non-ASCII Characters - posted by "Adrian Diemer (Jira)" <ji...@apache.org> on 2021/01/27 10:11:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3282) OneNote Parser breaks non-ASCII Characters - posted by "Adrian Diemer (Jira)" <ji...@apache.org> on 2021/01/27 10:18:00 UTC, 4 replies.
- [GitHub] [tika] AdrianD-intrafind opened a new pull request #400: TIKA-3282 fix non-ascii characters in onenote - posted by GitBox <gi...@apache.org> on 2021/01/27 10:25:46 UTC, 0 replies.
- [jira] [Commented] (TIKA-3282) OneNote Parser breaks non-ASCII Characters - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/01/27 10:26:00 UTC, 13 replies.
- [jira] [Commented] (TIKA-3281) Clean up RecursiveParserWrapper for 2.0.0 - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/27 15:49:00 UTC, 0 replies.
- improving javadocs - posted by Tim Allison <ta...@apache.org> on 2021/01/27 16:10:29 UTC, 0 replies.
- [jira] [Created] (TIKA-3283) Add an s3 emitter to tika-pipes - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/27 18:13:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3284) Refactor tika-batch to use the new tika-pipes module - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/27 18:34:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3283) Add an s3 emitter to tika-pipes - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/27 18:34:00 UTC, 0 replies.
- [jira] [Created] (TIKA-3285) Allow s3 fetcher to pull file ranges - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/27 18:37:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3285) Allow s3 fetcher to pull file ranges - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/27 20:44:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3283) Add an s3 emitter to tika-pipes - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/27 20:44:00 UTC, 0 replies.
- [GitHub] [tika] nddipiazza commented on pull request #400: TIKA-3282 fix non-ascii characters in onenote - posted by GitBox <gi...@apache.org> on 2021/01/27 23:16:37 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3282) OneNote Parser breaks non-ASCII Characters - posted by "Adrian Diemer (Jira)" <ji...@apache.org> on 2021/01/28 07:20:01 UTC, 0 replies.
- [GitHub] [tika] tballison merged pull request #400: TIKA-3282 fix non-ascii characters in onenote - posted by GitBox <gi...@apache.org> on 2021/01/28 13:36:56 UTC, 0 replies.
- [GitHub] [tika] tballison commented on pull request #400: TIKA-3282 fix non-ascii characters in onenote - posted by GitBox <gi...@apache.org> on 2021/01/28 13:37:17 UTC, 0 replies.
- FW: {EXTERNAL}Invalid language code - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/28 14:29:51 UTC, 2 replies.
- [jira] [Updated] (TIKA-3286) Tika not issue an error when language file doesn't exist; not supporting script files - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/28 18:56:00 UTC, 13 replies.
- [jira] [Created] (TIKA-3286) Tika not issue an error when language file doesn't exist; not supporting script files - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/28 18:56:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-3286) Tika does not issue an error when language file doesn't exist; not supporting script files - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/28 19:02:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3286) Tika does not issue an error when language file doesn't exist; not supporting script files - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/29 18:06:00 UTC, 2 replies.
- [jira] [Created] (TIKA-3287) Add http fetcher - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/29 18:16:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-3286) Tika does not issue an error when language file doesn't exist; not supporting script files - posted by "Peter Kronenberg (Jira)" <ji...@apache.org> on 2021/01/29 19:06:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-3287) Add http fetcher - posted by "Hudson (Jira)" <ji...@apache.org> on 2021/01/29 21:26:00 UTC, 1 replies.
- [jira] [Created] (TIKA-3288) Allow batching for emitters - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/29 22:01:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-3287) Add http fetcher - posted by "Tim Allison (Jira)" <ji...@apache.org> on 2021/01/29 22:02:00 UTC, 0 replies.