You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Project dependencies page - posted by Vadim Roizman <ro...@gmail.com> on 2014/03/01 18:21:16 UTC, 0 replies.
- Re: CSCI ASSIGNMENT QUESTION - posted by Chris Mattmann <ma...@apache.org> on 2014/03/02 06:52:11 UTC, 0 replies.
- Re: Submission to ApacheCon on Tika - posted by Jukka Zitting <ju...@gmail.com> on 2014/03/02 16:59:24 UTC, 2 replies.
- [jira] [Created] (TIKA-1252) Tika is not indexing all authors of a PDF - posted by "Alexandre Madurell (JIRA)" <ji...@apache.org> on 2014/03/03 03:49:20 UTC, 0 replies.
- Tika 1.5 vs 1.4 testing - posted by Hong-Thai Nguyen <Ho...@polyspot.com> on 2014/03/03 14:18:55 UTC, 1 replies.
- [jira] [Created] (TIKA-1253) SLF4J: The requested version 1.5.6 by your slf4j binding is not compatible with [1.6, 1.7] - posted by "sudheshna iyer (JIRA)" <ji...@apache.org> on 2014/03/03 16:01:31 UTC, 0 replies.
- [jira] [Updated] (TIKA-1253) SLF4J: The requested version 1.5.6 by your slf4j binding is not compatible with [1.6, 1.7] - posted by "sudheshna iyer (JIRA)" <ji...@apache.org> on 2014/03/03 17:21:24 UTC, 0 replies.
- [jira] [Commented] (TIKA-1253) SLF4J: The requested version 1.5.6 by your slf4j binding is not compatible with [1.6, 1.7] - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2014/03/03 17:59:22 UTC, 0 replies.
- [jira] [Created] (TIKA-1254) No warning when Tika does not find a parser. - posted by "Ankit Gupta (JIRA)" <ji...@apache.org> on 2014/03/03 19:42:24 UTC, 0 replies.
- [jira] [Commented] (TIKA-1252) Tika is not indexing all authors of a PDF - posted by "Alexandre Madurell (JIRA)" <ji...@apache.org> on 2014/03/03 20:14:25 UTC, 14 replies.
- [jira] [Commented] (TIKA-1254) No warning when Tika does not find a parser. - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/03 21:06:28 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1252) Tika is not indexing all authors of a PDF - posted by "Uwe Schindler (JIRA)" <ji...@apache.org> on 2014/03/03 23:18:34 UTC, 1 replies.
- [jira] [Updated] (TIKA-1252) Tika is not indexing all authors of a PDF - posted by "Alexandre Madurell (JIRA)" <ji...@apache.org> on 2014/03/04 09:25:21 UTC, 1 replies.
- [jira] [Created] (TIKA-1255) WordExtractor - bold hyperlink not closed properly - posted by "Alan Hunter (JIRA)" <ji...@apache.org> on 2014/03/04 15:55:53 UTC, 0 replies.
- [jira] [Updated] (TIKA-1255) WordExtractor - bold hyperlink not closed properly - posted by "Alan Hunter (JIRA)" <ji...@apache.org> on 2014/03/04 15:59:21 UTC, 2 replies.
- [jira] [Issue Comment Deleted] (TIKA-1252) Tika is not indexing all authors of a PDF - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/04 20:51:25 UTC, 0 replies.
- [jira] [Updated] (TIKA-1232) Add PDF version to PDFParser output - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/04 21:02:23 UTC, 2 replies.
- [jira] [Commented] (TIKA-1232) Add PDF version to PDFParser output - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/04 21:02:28 UTC, 5 replies.
- [jira] [Assigned] (TIKA-1252) Tika is not indexing all authors of a PDF - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/05 02:09:45 UTC, 0 replies.
- [jira] [Created] (TIKA-1256) Windows 07 excel ".xlsx" file Tika 1.4 api is detecting wrong mimetype. - posted by "Kavitha (JIRA)" <ji...@apache.org> on 2014/03/05 08:13:42 UTC, 0 replies.
- [jira] [Updated] (TIKA-1256) Windows 07 excel ".xlsx" file Tika 1.4 api is detecting wrong mimetype. - posted by "Kavitha (JIRA)" <ji...@apache.org> on 2014/03/05 08:24:42 UTC, 1 replies.
- [jira] [Updated] (TIKA-623) Add support for Outlook PST - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/05 10:26:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-623) Add support for Outlook PST - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/05 10:30:46 UTC, 4 replies.
- [jira] [Assigned] (TIKA-623) Add support for Outlook PST - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/05 10:30:51 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-623) Add support for Outlook PST - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/05 10:30:57 UTC, 0 replies.
- [jira] [Resolved] (TIKA-623) Add support for Outlook PST - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/05 11:17:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-1256) Windows 07 excel ".xlsx" file Tika 1.4 api is detecting wrong mimetype. - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/05 11:35:42 UTC, 0 replies.
- Searching for Tika Jira issues using Lucene - posted by Michael McCandless <lu...@mikemccandless.com> on 2014/03/05 17:47:52 UTC, 2 replies.
- Using guava on tika ? - posted by Hong-Thai Nguyen <Ho...@polyspot.com> on 2014/03/06 12:41:12 UTC, 4 replies.
- [jira] [Created] (TIKA-1257) MS Word Filter out control characters on ouput - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/06 14:22:44 UTC, 0 replies.
- [jira] [Updated] (TIKA-1257) MS Word Filter out control characters on ouput - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/06 14:24:44 UTC, 2 replies.
- buildbot failure in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2014/03/06 14:26:11 UTC, 1 replies.
- [jira] [Resolved] (TIKA-1257) MS Word Filter out control characters on ouput - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/06 14:26:43 UTC, 0 replies.
- RE: [ANNOUNCE] Apache Tika 1.5 Released - posted by Hong-Thai Nguyen <Ho...@polyspot.com> on 2014/03/06 14:27:23 UTC, 3 replies.
- buildbot success in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2014/03/06 14:41:14 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-1257) MS Word Filter out control characters on ouput - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/06 14:50:46 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1232) Add PDF version to PDFParser output - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/06 17:54:46 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1252) Tika is not indexing all authors of a PDF - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/06 18:14:52 UTC, 0 replies.
- Unconsistent logging in current tika (1.5) - posted by Konstantin Gribov <gr...@gmail.com> on 2014/03/06 21:11:18 UTC, 2 replies.
- [jira] [Updated] (TIKA-1256) MS Office 07 excel ".xlsx" file Tika 1.4 api is detecting wrong mimetype. - posted by "Kavitha (JIRA)" <ji...@apache.org> on 2014/03/07 09:01:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1256) MS Office 07 excel ".xlsx" file Tika 1.4 api is detecting wrong mimetype. - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/07 09:41:42 UTC, 0 replies.
- [jira] [Reopened] (TIKA-623) Add support for Outlook PST - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/07 10:09:47 UTC, 0 replies.
- [jira] [Created] (TIKA-1258) Update NetCDF dependency - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2014/03/07 11:49:45 UTC, 0 replies.
- [jira] [Updated] (TIKA-1258) Update NetCDF dependency - posted by "Konstantin Gribov (JIRA)" <ji...@apache.org> on 2014/03/07 11:51:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-1023) Weird associated to vorbis-java-core tests in vorbis-java-tika - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/09 17:00:44 UTC, 0 replies.
- [jira] [Updated] (TIKA-1251) RuntimeException when parsing word (.doc) documents. Works in Tika 1.4 but not 1.5 - posted by "Vadim Roizman (JIRA)" <ji...@apache.org> on 2014/03/11 07:51:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-1182) Out of memory exception when parsing TTF file - posted by "Magnus Lövgren (JIRA)" <ji...@apache.org> on 2014/03/11 10:32:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-1231) Safely handle null embedded files in PDFs - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/12 01:20:43 UTC, 0 replies.
- [jira] [Commented] (TIKA-1113) Parsing for OGV file results in java.lang.ClassCastException - posted by "Fabian Lange (JIRA)" <ji...@apache.org> on 2014/03/12 13:03:43 UTC, 7 replies.
- [jira] [Commented] (TIKA-1243) Support for 7z archives - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2014/03/13 02:12:44 UTC, 1 replies.
- [jira] [Commented] (TIKA-241) Rar archive support - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2014/03/13 02:30:48 UTC, 1 replies.
- [jira] [Created] (TIKA-1259) More ogg based mime entries - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/13 10:12:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-1259) More ogg based mime entries - posted by "Fabian Lange (JIRA)" <ji...@apache.org> on 2014/03/13 12:21:42 UTC, 1 replies.
- Use of Levenshtein distance to find similar words - posted by Margi Patel <ma...@usc.edu> on 2014/03/16 19:36:53 UTC, 1 replies.
- [jira] [Commented] (TIKA-972) Unexpected RuntimeException from org.apache.tika.parser.pdf.PDFParser . - posted by "Florent Guillaume (JIRA)" <ji...@apache.org> on 2014/03/17 02:24:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-93) OCR support - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2014/03/17 03:30:47 UTC, 5 replies.
- [jira] [Created] (TIKA-1260) Detection result for zero-byte files is text/plain - posted by "Johan van der Knijff (JIRA)" <ji...@apache.org> on 2014/03/17 18:17:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1260) Detection result for zero-byte files is text/plain - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2014/03/17 19:29:47 UTC, 0 replies.
- [jira] [Commented] (TIKA-1112) Parsing for OGV file with invalid checksum - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/18 13:23:44 UTC, 1 replies.
- [jira] [Resolved] (TIKA-1259) More ogg based mime entries - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/18 14:09:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1113) Parsing for OGV file results in java.lang.ClassCastException - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/18 14:09:45 UTC, 0 replies.
- [jira] [Updated] (TIKA-1244) Better parsing of Mbox files - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2014/03/18 23:32:44 UTC, 0 replies.
- [jira] [Created] (TIKA-1261) Commons Compress version should be 1.5 - posted by "Ryan Quam (JIRA)" <ji...@apache.org> on 2014/03/19 16:49:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-1261) Commons Compress version should be 1.5 - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/19 17:03:45 UTC, 1 replies.
- [jira] [Created] (TIKA-1262) parseToString fails to detect content-type / charset for GB2312 text file - posted by "Jeremy McLain (JIRA)" <ji...@apache.org> on 2014/03/20 00:31:48 UTC, 0 replies.
- [jira] [Updated] (TIKA-1262) parseToString fails to detect content-type / charset for GB2312 text file - posted by "Jeremy McLain (JIRA)" <ji...@apache.org> on 2014/03/20 00:33:43 UTC, 7 replies.
- [jira] [Commented] (TIKA-1262) parseToString fails to detect content-type / charset for GB2312 text file - posted by "Jeremy McLain (JIRA)" <ji...@apache.org> on 2014/03/20 01:27:42 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1262) parseToString fails to detect content-type / charset for GB2312 text file - posted by "Jeremy McLain (JIRA)" <ji...@apache.org> on 2014/03/20 01:29:42 UTC, 1 replies.
- [jira] [Issue Comment Deleted] (TIKA-1262) parseToString fails to detect content-type / charset for GB2312 text file - posted by "Jeremy McLain (JIRA)" <ji...@apache.org> on 2014/03/20 01:29:45 UTC, 0 replies.
- [jira] [Updated] (TIKA-1262) parseToString fails to detect content-type / charset - posted by "Jeremy McLain (JIRA)" <ji...@apache.org> on 2014/03/20 02:26:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-1262) parseToString fails to detect content-type / charset - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2014/03/20 03:53:43 UTC, 1 replies.
- [jira] [Closed] (TIKA-1262) parseToString fails to detect content-type / charset - posted by "Jeremy McLain (JIRA)" <ji...@apache.org> on 2014/03/20 18:13:46 UTC, 0 replies.
- [jira] [Created] (TIKA-1263) Atom feed failed to detect - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/03/20 23:27:43 UTC, 0 replies.
- [jira] [Updated] (TIKA-1263) Atom feed failed to detect - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/03/20 23:27:53 UTC, 0 replies.
- [jira] [Commented] (TIKA-1244) Better parsing of Mbox files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/21 12:04:42 UTC, 2 replies.
- [jira] [Resolved] (TIKA-1263) Atom feed failed to detect - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/21 14:38:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/21 20:07:42 UTC, 13 replies.
- [jira] [Updated] (TIKA-1010) Embedded documents in RTF are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/21 20:09:46 UTC, 10 replies.
- [jira] [Comment Edited] (TIKA-1010) Embedded documents in RTF are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/21 20:39:45 UTC, 4 replies.
- [jira] [Updated] (TIKA-1151) Maven Build Should Automatically Produce test-jar Artifacts - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2014/03/24 16:42:54 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1151) Maven Build Should Automatically Produce test-jar Artifacts - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2014/03/24 16:42:54 UTC, 0 replies.
- [jira] [Created] (TIKA-1264) Improve PST file detection - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2014/03/25 15:13:16 UTC, 0 replies.
- [jira] [Commented] (TIKA-1264) Improve PST file detection - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/25 16:28:19 UTC, 3 replies.
- [jira] [Resolved] (TIKA-1261) Commons Compress version should be 1.5 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2014/03/25 17:20:18 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1264) Improve PST file detection - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/25 17:40:15 UTC, 0 replies.
- [jira] [Commented] (TIKA-1165) Autodetect and parse Asciidoc - posted by "David Pilato (JIRA)" <ji...@apache.org> on 2014/03/25 18:42:23 UTC, 0 replies.
- [jira] [Created] (TIKA-1265) Text parsing support for NetCDF - posted by "Ann Burgess (JIRA)" <ji...@apache.org> on 2014/03/26 02:12:14 UTC, 0 replies.
- How should video files with audio be handled by parsers? - posted by Nick Burch <ni...@apache.org> on 2014/03/27 16:34:39 UTC, 5 replies.
- [jira] [Commented] (TIKA-1079) Word document hits AIOOBE in SummaryExtractor.parseSummaries - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/03/27 16:41:16 UTC, 0 replies.
- Parser.parse with file instead of stream - posted by Stefano Fornari <st...@gmail.com> on 2014/03/27 23:07:01 UTC, 2 replies.
- PDF parser (two more questions) - posted by Stefano Fornari <st...@gmail.com> on 2014/03/27 23:21:40 UTC, 9 replies.
- [jira] [Assigned] (TIKA-1010) Embedded documents in RTF are not extracted - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2014/03/28 01:31:18 UTC, 0 replies.
- [jira] [Assigned] (TIKA-1244) Better parsing of Mbox files - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/28 15:29:16 UTC, 0 replies.
- metadata key for original file path? - posted by "Allison, Timothy B." <ta...@mitre.org> on 2014/03/28 16:25:09 UTC, 1 replies.
- How to exclude a mimetype form being indexed in solr using tika? - posted by eShard <zi...@yahoo.com> on 2014/03/28 18:59:04 UTC, 2 replies.
- [PDFParser] XHTML vs plain text (was Re: PDF parser (two more questions)) - posted by Stefano Fornari <st...@gmail.com> on 2014/03/29 14:33:28 UTC, 0 replies.
- [PDFParser] - read limited number of characters - posted by Stefano Fornari <st...@gmail.com> on 2014/03/29 15:31:41 UTC, 0 replies.
- [jira] [Created] (TIKA-1266) Tika OSGI Bundle needs Bundle-ClassPath to work in Equinox - posted by "pm (JIRA)" <ji...@apache.org> on 2014/03/31 13:23:18 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1244) Better parsing of Mbox files - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/03/31 13:59:15 UTC, 0 replies.
- [jira] [Created] (TIKA-1267) Improve Mbox file detection - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2014/03/31 16:43:45 UTC, 0 replies.