You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Created] (TIKA-1200) Upgrade pdfbox 1.8.3 - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/12/02 10:28:35 UTC, 0 replies.
- NonSequentialPDFParser - posted by Hong-Thai Nguyen <Ho...@polyspot.com> on 2013/12/02 15:17:52 UTC, 4 replies.
- [jira] [Resolved] (TIKA-1200) Upgrade pdfbox 1.8.3 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/02 16:03:35 UTC, 0 replies.
- [jira] [Created] (TIKA-1201) Add option for switching to pdfbox NonSequentialPDFParser - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/12/02 16:27:35 UTC, 0 replies.
- [jira] [Updated] (TIKA-1201) Add possibility for switching to pdfbox NonSequentialPDFParser - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/12/02 16:29:41 UTC, 1 replies.
- [jira] [Assigned] (TIKA-1201) Add possibility for switching to pdfbox NonSequentialPDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/02 16:35:36 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1201) Add possibility for switching to pdfbox NonSequentialPDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/03 01:56:36 UTC, 0 replies.
- [jira] [Created] (TIKA-1202) Refactor PDFParser to enable easier parameter setting - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/03 02:01:36 UTC, 0 replies.
- [jira] [Updated] (TIKA-1202) Refactor PDFParser to enable easier parameter setting - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/03 04:41:35 UTC, 0 replies.
- [jira] [Commented] (TIKA-1201) Add possibility for switching to pdfbox NonSequentialPDFParser - posted by "Timo Boehme (JIRA)" <ji...@apache.org> on 2013/12/03 10:22:38 UTC, 0 replies.
- [jira] [Created] (TIKA-1203) Some metadata not extracted from PDF files when NonSequentialPDFParser is used - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/03 17:24:35 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1201) Add possibility for switching to pdfbox NonSequentialPDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/03 17:26:36 UTC, 0 replies.
- [jira] [Commented] (TIKA-1199) Tika extracts weird signs instead of text - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/03 17:40:38 UTC, 1 replies.
- [jira] [Commented] (TIKA-1202) Refactor PDFParser to enable easier parameter setting - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/12/04 09:36:37 UTC, 1 replies.
- [jira] [Commented] (TIKA-1196) JAX-RS server only responds to queries to/from http://localhost - posted by "Rian Stockbower (JIRA)" <ji...@apache.org> on 2013/12/05 03:17:36 UTC, 2 replies.
- [jira] [Commented] (TIKA-1197) Update CXF dependency in Tika Server to CXF 2.7.7 or CXF 2.7.8 - posted by "Sergey Beryozkin (JIRA)" <ji...@apache.org> on 2013/12/05 14:27:36 UTC, 0 replies.
- [jira] [Commented] (TIKA-1198) Consider optionally utilizing CXF JAX-RS Attachment support - posted by "Sergey Beryozkin (JIRA)" <ji...@apache.org> on 2013/12/05 17:32:37 UTC, 4 replies.
- [jira] [Comment Edited] (TIKA-1198) Consider optionally utilizing CXF JAX-RS Attachment support - posted by "Sergey Beryozkin (JIRA)" <ji...@apache.org> on 2013/12/05 17:36:36 UTC, 0 replies.
- [jira] [Commented] (TIKA-1121) Socket server text parsing error on large text files - posted by "Sergey Beryozkin (JIRA)" <ji...@apache.org> on 2013/12/05 17:40:37 UTC, 4 replies.
- [jira] [Reopened] (TIKA-941) Detecting KML / KMZ files - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2013/12/06 17:34:35 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-941) Detecting KML / KMZ files - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2013/12/06 17:34:36 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1202) Refactor PDFParser to enable easier parameter setting - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/06 20:53:37 UTC, 1 replies.
- [jira] [Created] (TIKA-1204) DWFX files detection - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2013/12/09 10:14:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1204) DWFX files detection - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2013/12/09 10:22:08 UTC, 0 replies.
- [jira] [Reopened] (TIKA-1202) Refactor PDFParser to enable easier parameter setting - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/09 20:02:07 UTC, 0 replies.
- [jira] [Resolved] (TIKA-973) PDF form data isn't included in extracted content. - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/10 02:16:07 UTC, 0 replies.
- Tika 715 (invalid xhtml output) - posted by Raymond Wiker <rw...@gmail.com> on 2013/12/11 12:33:27 UTC, 0 replies.
- [jira] [Updated] (TIKA-1205) Allow PDFParser to fallback to other parser if there is an exception - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/11 14:30:07 UTC, 1 replies.
- [jira] [Created] (TIKA-1205) Allow PDFParser to fallback to other parser if there is an exception - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/11 14:30:07 UTC, 0 replies.
- [jira] [Commented] (TIKA-1205) Allow PDFParser to fallback to other parser if there is an exception - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/12/11 14:44:07 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-1121) Socket server text parsing error on large text files - posted by "Mane (JIRA)" <ji...@apache.org> on 2013/12/11 18:45:07 UTC, 1 replies.
- [jira] [Created] (TIKA-1206) rfc822 standard headers - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2013/12/12 16:12:07 UTC, 0 replies.
- [jira] [Commented] (TIKA-994) Type Detection Fault - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/12 19:12:07 UTC, 0 replies.
- [jira] [Created] (TIKA-1207) Parent task for integration of Any23 into Tika - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/12 19:46:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1207) Parent task for integration of Any23 into Tika - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/12 19:48:06 UTC, 0 replies.
- [jira] [Created] (TIKA-1208) Migrate Any23 mime contributions to Tika - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/12 19:48:07 UTC, 0 replies.
- Initial work on Any23 proposal & migration to Tika - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/12/12 19:58:18 UTC, 1 replies.
- [jira] [Commented] (TIKA-1208) Migrate Any23 mime contributions to Tika - posted by "Peter Ansell (JIRA)" <ji...@apache.org> on 2013/12/12 23:17:07 UTC, 5 replies.
- [jira] [Comment Edited] (TIKA-1208) Migrate Any23 mime contributions to Tika - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/13 01:34:07 UTC, 0 replies.
- [jira] [Reopened] (TIKA-973) PDF form data isn't included in extracted content. - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/13 14:23:07 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk ยป Apache Tika application #1050 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/12/13 15:56:39 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #1050 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/12/13 15:57:06 UTC, 0 replies.
- [jira] [Created] (TIKA-1209) Upgrade Tika tests to JUnit 4.X - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/13 19:33:07 UTC, 0 replies.
- Support for marks in InputStream passed to Tika.detect - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/12/13 22:07:42 UTC, 0 replies.
- [jira] [Created] (TIKA-1210) Address tika-parsers o.a.t.mime.TestMimeTypes TODO: Need a test flash file - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/14 16:11:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1209) Upgrade Tika tests to JUnit 4.X - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/14 18:28:06 UTC, 0 replies.
- Switch to JUnit 4.x? - posted by Ken Krugler <kk...@transpac.com> on 2013/12/15 00:39:04 UTC, 5 replies.
- [jira] [Commented] (TIKA-1209) Upgrade Tika tests to JUnit 4.X - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2013/12/15 00:39:07 UTC, 3 replies.
- [jira] [Created] (TIKA-1211) OpenDocument (ODF) parser produces multipe startDocument() events - posted by "Uwe Schindler (JIRA)" <ji...@apache.org> on 2013/12/17 13:50:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1211) OpenDocument (ODF) parser produces multiple startDocument() events - posted by "Uwe Schindler (JIRA)" <ji...@apache.org> on 2013/12/17 13:50:07 UTC, 1 replies.
- [jira] [Commented] (TIKA-1211) OpenDocument (ODF) parser produces multiple startDocument() events - posted by "Uwe Schindler (JIRA)" <ji...@apache.org> on 2013/12/17 13:58:08 UTC, 1 replies.
- [jira] [Commented] (TIKA-1193) Allow access to HtmlParser's HtmlSchema - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2013/12/18 13:42:07 UTC, 0 replies.
- [jira] [Created] (TIKA-1212) Recursive Extraction of Archive File - posted by "Vikram (JIRA)" <ji...@apache.org> on 2013/12/19 06:07:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1212) Recursive Extraction of Archive File - posted by "Vikram (JIRA)" <ji...@apache.org> on 2013/12/19 06:11:08 UTC, 3 replies.
- [jira] [Commented] (TIKA-1212) Recursive Extraction of Archive File - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/12/19 15:46:11 UTC, 1 replies.
- [jira] [Updated] (TIKA-1160) Add support for SolidWorks files - posted by "gunter rombauts (JIRA)" <ji...@apache.org> on 2013/12/19 16:09:07 UTC, 0 replies.
- Tika 1.5 release ? - posted by Hong-Thai Nguyen <Ho...@polyspot.com> on 2013/12/19 18:18:02 UTC, 2 replies.
- [jira] [Resolved] (TIKA-1209) Upgrade Tika tests to JUnit 4.X - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2013/12/19 20:49:07 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1212) Recursive Extraction of Archive File - posted by "Vikram (JIRA)" <ji...@apache.org> on 2013/12/20 11:22:09 UTC, 1 replies.
- [jira] [Updated] (TIKA-1210) Address tika-parsers o.a.t.mime.TestMimeTypes TODO: Need a test flash file - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/20 15:12:09 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1210) Address tika-parsers o.a.t.mime.TestMimeTypes TODO: Need a test flash file - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/20 15:12:10 UTC, 0 replies.
- [jira] [Created] (TIKA-1213) Parsing (extracting content) a single 5Mb pdf file takes 3minutes - posted by "Clemens Wyss (JIRA)" <ji...@apache.org> on 2013/12/22 15:18:50 UTC, 0 replies.
- [jira] [Updated] (TIKA-1213) Parsing (extracting content) a single 5Mb pdf file takes 3minutes - posted by "Clemens Wyss (JIRA)" <ji...@apache.org> on 2013/12/22 15:22:50 UTC, 0 replies.
- [jira] [Commented] (TIKA-1213) Parsing (extracting content) a single 5Mb pdf file takes 3minutes - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2013/12/22 16:54:52 UTC, 4 replies.
- [jira] [Comment Edited] (TIKA-1213) Parsing (extracting content) a single 5Mb pdf file takes 3minutes - posted by "Clemens Wyss (JIRA)" <ji...@apache.org> on 2013/12/23 08:21:51 UTC, 0 replies.
- [jira] [Created] (TIKA-1214) Infinity Loop in Mpeg Stream - posted by "Georg Hartmann (JIRA)" <ji...@apache.org> on 2013/12/23 09:32:50 UTC, 0 replies.
- [jira] [Commented] (TIKA-1152) Process loops infinitely on parsing of a CHM file - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/12/23 10:50:50 UTC, 1 replies.
- [jira] [Commented] (TIKA-1214) Infinity Loop in Mpeg Stream - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2013/12/23 10:54:50 UTC, 1 replies.
- [jira] [Commented] (TIKA-93) OCR support - posted by "frank (JIRA)" <ji...@apache.org> on 2013/12/24 08:48:53 UTC, 1 replies.
- [jira] [Assigned] (TIKA-1211) OpenDocument (ODF) parser produces multiple startDocument() events - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2013/12/24 15:32:52 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1211) OpenDocument (ODF) parser produces multiple startDocument() events - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2013/12/24 16:53:51 UTC, 0 replies.
- [jira] [Updated] (TIKA-1110) Incorrectly declared SUPPORTED_TYPES in ChmParser. - posted by "Vadim Roizman (JIRA)" <ji...@apache.org> on 2013/12/26 20:32:50 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1110) Incorrectly declared SUPPORTED_TYPES in ChmParser. - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/12/27 04:33:52 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1213) Parsing (extracting content) a single 5Mb pdf file takes 3minutes - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/12/27 04:35:53 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1152) Process loops infinitely on parsing of a CHM file - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/12/27 04:48:50 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1210) Address tika-parsers o.a.t.mime.TestMimeTypes TODO: Need a test flash file - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/12/27 05:03:50 UTC, 0 replies.
- [jira] [Assigned] (TIKA-1193) Allow access to HtmlParser's HtmlSchema - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/12/27 05:05:50 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #1054 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/12/27 05:34:33 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #1055 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/12/27 09:41:49 UTC, 0 replies.
- [jira] [Updated] (TIKA-1215) Regression: Unable parse a mp3 file on 1.5 which parsed successfully on 1.4 - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/12/27 11:29:50 UTC, 0 replies.
- [jira] [Created] (TIKA-1215) Regression: Unable parse a mp3 file on 1.5 which parsed successfully on 1.4 - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/12/27 11:29:50 UTC, 0 replies.
- [jira] [Commented] (TIKA-245) Support of CHM Format - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/12/27 16:14:51 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1122) Tika fails to parse chm files - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/12/27 16:14:57 UTC, 0 replies.
- [jira] [Commented] (TIKA-1215) Regression: Unable parse a mp3 file on 1.5 which parsed successfully on 1.4 - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2013/12/27 16:56:50 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-1215) Regression: Unable parse a mp3 file on 1.5 which parsed successfully on 1.4 - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/12/27 17:00:50 UTC, 0 replies.
- [jira] [Commented] (TIKA-1206) rfc822 standard headers - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2013/12/27 17:17:50 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1193) Allow access to HtmlParser's HtmlSchema - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/12/28 02:14:51 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1160) Add support for SolidWorks files - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/12/28 02:40:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-1210) Address tika-parsers o.a.t.mime.TestMimeTypes TODO: Need a test flash file - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/12/28 14:33:50 UTC, 0 replies.
- [jira] [Commented] (TIKA-1086) Tika-bundle 1.3 does not import org.w3c.dom package - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2013/12/28 16:42:50 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #1058 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/12/28 22:53:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-820) Locator is unset for HTML parser - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2013/12/28 23:45:52 UTC, 0 replies.
- [jira] [Resolved] (TIKA-820) Locator is unset for HTML parser - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2013/12/28 23:47:50 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #1059 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/12/29 01:53:56 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #1060 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/12/29 03:45:52 UTC, 0 replies.
- Help on 1.4/1.5 - posted by Stefano Fornari <st...@gmail.com> on 2013/12/29 08:40:38 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-1214) Infinity Loop in Mpeg Stream - posted by "Stefano Fornari (JIRA)" <ji...@apache.org> on 2013/12/29 11:19:51 UTC, 0 replies.
- [DISCUSS] Prepare Release 1.5? - posted by David Meikle <lo...@gmail.com> on 2013/12/29 12:41:15 UTC, 0 replies.