You are viewing a plain text version of this content. The canonical link for it is here.
- RE: {EXTERNAL}OCR on PDFs - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/04 15:15:05 UTC, 4 replies.
- Re: OCR on PDFs - posted by Tim Allison <ta...@apache.org> on 2021/01/04 16:10:31 UTC, 1 replies.
- Page Segmentation Mode - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/04 17:26:10 UTC, 4 replies.
- RE: Using Tesseract with Tika - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/04 17:32:59 UTC, 6 replies.
- Setting parser options - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/05 20:59:00 UTC, 2 replies.
- PDFParser.properties formatting - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/07 17:19:37 UTC, 1 replies.
- ocr examples - posted by Tim Allison <ta...@apache.org> on 2021/01/07 18:28:54 UTC, 0 replies.
- Language detection - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/07 23:28:04 UTC, 2 replies.
- Tika on repository.apache.org - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/08 15:40:58 UTC, 6 replies.
- Problem parsing DOCX - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/08 16:38:12 UTC, 3 replies.
- PDFBox's detectAngles - posted by Tim Allison <ta...@apache.org> on 2021/01/08 18:13:59 UTC, 0 replies.
- tesseract resize option - posted by Tim Allison <ta...@apache.org> on 2021/01/08 18:22:22 UTC, 0 replies.
- RE: {EXTERNAL}tesseract resize option - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/08 18:35:57 UTC, 3 replies.
- TesseractOCRConfig which jar? - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/08 21:24:57 UTC, 3 replies.
- ApplyRotation default? - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/11 14:19:50 UTC, 2 replies.
- OCR_STRATEGY=AUTO - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/11 14:41:04 UTC, 1 replies.
- OCR of other than PDF files - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/11 15:36:54 UTC, 1 replies.
- Turning off ImageProcessing - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/11 17:50:30 UTC, 8 replies.
- Image processing timings - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/12 15:14:31 UTC, 3 replies.
- PDFs and detectAngles - posted by Tim Allison <ta...@apache.org> on 2021/01/12 16:58:57 UTC, 1 replies.
- Rotation script - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/13 01:28:10 UTC, 9 replies.
- Re: [EXTERNAL] Re: Rotation script - posted by Chris Mattmann <ma...@apache.org> on 2021/01/13 02:17:00 UTC, 0 replies.
- Getting language of parsed text - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/13 19:20:45 UTC, 3 replies.
- [VOTE] Release Apache Tika 2.0.0-ALPHA Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2021/01/14 01:19:16 UTC, 0 replies.
- Building with Tika 2.0 - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/15 19:19:07 UTC, 2 replies.
- RE: {EXTERNAL}Building with Tika 2.0 - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/15 19:27:58 UTC, 0 replies.
- [RESULT][VOTE] Release Apache Tika 2.0.0-ALPHA Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2021/01/16 15:56:00 UTC, 1 replies.
- [ANNOUNCE] Apache Tika 2.0.0-ALPHA released - posted by Tim Allison <ta...@apache.org> on 2021/01/18 13:17:23 UTC, 0 replies.
- Tesseract PSM=0 - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/22 14:31:20 UTC, 3 replies.
- Invalid language code - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/27 21:03:17 UTC, 1 replies.
- RE: {EXTERNAL}Invalid language code - posted by Peter Kronenberg <pe...@torch.ai> on 2021/01/28 01:51:03 UTC, 3 replies.