You are viewing a plain text version of this content. The canonical link for it is here.
- Re: regarding the data bank of test PDF files (pdfs_202011) . . . - posted by Albretch Mueller <lb...@gmail.com> on 2022/07/01 02:04:44 UTC, 0 replies.
- Question about Tika-2.4.x configuration in project pom.xml - posted by Luís Filipe Nassif <lf...@gmail.com> on 2022/07/12 17:09:05 UTC, 4 replies.
- Tika 2.4.x how to configure the scientific parsers - posted by Paul Borgermans <pa...@gmail.com> on 2022/07/13 15:32:33 UTC, 1 replies.
- tika-server 2.4.1 'corrupt stream' error scanning attachments, via dovecot fts plugin ? - posted by PGNet Dev <pg...@gmail.com> on 2022/07/15 13:41:16 UTC, 35 replies.
- from pdf to some sort of XMLish ODT kind of file ... - posted by Albretch Mueller <lb...@gmail.com> on 2022/07/18 09:04:07 UTC, 1 replies.
- Version question - posted by "Mark Kerzner SHMsoft, Inc." <ma...@shmsoft.com> on 2022/07/20 03:33:59 UTC, 1 replies.
- adding explicit OCR parser config to tika-server-config-custom.xml disables working OCR image processing? - posted by PGNet Dev <pg...@gmail.com> on 2022/07/23 16:58:48 UTC, 2 replies.
- Datasets for testing large number of attachments - posted by Oscar Rieken Jr via user <us...@tika.apache.org> on 2022/07/25 20:35:24 UTC, 8 replies.
- bug: adding to tika 2.4.2 config.xml truncates metadata return - posted by PGNet Dev <pg...@gmail.com> on 2022/07/26 10:52:04 UTC, 2 replies.
- tika 2.4.1 'Text extraction failed' errors when dovecot+fts 2.3.19.1 passes embedded *.eml (message/rfc822) files ; org.apache.tika.parser.mail.RFC822Parser or dovecot ? - posted by PGNet Dev <pg...@gmail.com> on 2022/07/30 23:51:17 UTC, 0 replies.