You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Tesseract OCR engine - posted by Alex Ott <al...@gmail.com> on 2011/12/01 09:10:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-793) Invalid ASCII character (65533) when retriving MP3 metadata - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/01 12:03:41 UTC, 3 replies.
- [jira] [Created] (TIKA-796) Tika breaks words of rotated text in PDF documents - posted by "Franz Canaval (Created) (JIRA)" <ji...@apache.org> on 2011/12/01 12:21:39 UTC, 0 replies.
- [jira] [Commented] (TIKA-623) Add support for Outlook PST - posted by "Andrzej Bialecki (Commented) (JIRA)" <ji...@apache.org> on 2011/12/01 12:48:40 UTC, 2 replies.
- [jira] [Commented] (TIKA-796) Tika breaks words of rotated text in PDF documents - posted by "Michael McCandless (Commented) (JIRA)" <ji...@apache.org> on 2011/12/01 12:52:40 UTC, 0 replies.
- tika's beta dependency - posted by ankush chadha <an...@yahoo.com> on 2011/12/01 14:38:40 UTC, 1 replies.
- [jira] [Created] (TIKA-797) MimeType.getExtension for application/vnd.ms-powerpoint returns ppz. I'd expect ppt. - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/12/02 13:07:40 UTC, 0 replies.
- [jira] [Updated] (TIKA-797) MimeType.getExtension for application/vnd.ms-powerpoint returns ppz. I'd expect ppt. - posted by "Antoni Mylka (Updated) (JIRA)" <ji...@apache.org> on 2011/12/02 13:09:39 UTC, 0 replies.
- [jira] [Commented] (TIKA-797) MimeType.getExtension for application/vnd.ms-powerpoint returns ppz. I'd expect ppt. - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/02 13:19:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-797) MimeType.getExtension for application/vnd.ms-powerpoint returns ppz. I'd expect ppt. - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/02 13:19:40 UTC, 0 replies.
- [jira] [Created] (TIKA-798) Distinguish between EMF and WMF - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/12/02 14:43:40 UTC, 0 replies.
- [jira] [Updated] (TIKA-798) Distinguish between EMF and WMF - posted by "Antoni Mylka (Updated) (JIRA)" <ji...@apache.org> on 2011/12/02 14:43:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-762) EXIF extraction from PNG images - posted by "Fabian Lange (Commented) (JIRA)" <ji...@apache.org> on 2011/12/02 16:37:39 UTC, 1 replies.
- [jira] [Created] (TIKA-799) ForkParser does not populate metadata object after completing a parse - posted by "Arthur Meneau (Created) (JIRA)" <ji...@apache.org> on 2011/12/03 03:27:39 UTC, 0 replies.
- [jira] [Commented] (TIKA-798) Distinguish between EMF and WMF - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/05 01:35:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-795) [PATCH] NoSuchMethod - XSLFPowerPointExtractorDecorator.buildXHTML POI - XSLFSlide.getMasterSheet() - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/05 01:39:39 UTC, 0 replies.
- [jira] [Updated] (TIKA-410) textbox content extaction for word documents - posted by "John Mastarone (Updated) (JIRA)" <ji...@apache.org> on 2011/12/05 02:25:39 UTC, 0 replies.
- [jira] [Resolved] (TIKA-410) textbox content extaction for word documents - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/05 04:45:40 UTC, 0 replies.
- [jira] [Commented] (TIKA-410) textbox content extaction for word documents - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/05 04:45:40 UTC, 0 replies.
- News item on publication of Tika in Action? - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/12/05 05:05:48 UTC, 0 replies.
- [jira] [Commented] (TIKA-526) OOXMLParser fails to extract text from within smart tags - posted by "Fabian Lange (Commented) (JIRA)" <ji...@apache.org> on 2011/12/05 11:07:40 UTC, 0 replies.
- [jira] [Created] (TIKA-800) mark/reset not supported from POIFSContainerDetector - posted by "Andrzej Bialecki (Created) (JIRA)" <ji...@apache.org> on 2011/12/05 12:25:39 UTC, 0 replies.
- [jira] [Commented] (TIKA-800) mark/reset not supported from POIFSContainerDetector - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/05 12:41:39 UTC, 4 replies.
- [jira] [Created] (TIKA-801) ContentHandlerDecorator outputs invalid element - posted by "Andrzej Bialecki (Created) (JIRA)" <ji...@apache.org> on 2011/12/05 13:37:39 UTC, 0 replies.
- [jira] [Commented] (TIKA-423) Parse docx and output to text file missing words - posted by "Fabian Lange (Commented) (JIRA)" <ji...@apache.org> on 2011/12/05 13:37:40 UTC, 1 replies.
- [jira] [Commented] (TIKA-801) ContentHandlerDecorator outputs invalid element - posted by "Michael McCandless (Commented) (JIRA)" <ji...@apache.org> on 2011/12/05 14:49:39 UTC, 6 replies.
- Subscribing - posted by Guyot Raphaƫl <cp...@free.fr> on 2011/12/05 22:01:17 UTC, 0 replies.
- [jira] [Created] (TIKA-802) NullPointerException when parsing iWork files - posted by "Arthur Meneau (Created) (JIRA)" <ji...@apache.org> on 2011/12/06 02:13:39 UTC, 0 replies.
- [jira] [Resolved] (TIKA-800) mark/reset not supported from POIFSContainerDetector - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/06 02:15:40 UTC, 0 replies.
- [jira] [Updated] (TIKA-802) NullPointerException when parsing iWork files - posted by "Arthur Meneau (Updated) (JIRA)" <ji...@apache.org> on 2011/12/06 02:17:39 UTC, 1 replies.
- [jira] [Commented] (TIKA-802) NullPointerException when parsing iWork files - posted by "Arthur Meneau (Commented) (JIRA)" <ji...@apache.org> on 2011/12/06 02:17:40 UTC, 4 replies.
- Build failed in Jenkins: Tika-trunk #742 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/06 03:02:02 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #743 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/12/06 17:25:34 UTC, 0 replies.
- [jira] [Issue Comment Edited] (TIKA-801) ContentHandlerDecorator outputs invalid element - posted by "Paul Hill (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2011/12/08 01:44:40 UTC, 1 replies.
- [jira] [Updated] (TIKA-801) ContentHandlerDecorator outputs invalid element - posted by "Paul Hill (Updated) (JIRA)" <ji...@apache.org> on 2011/12/08 01:44:40 UTC, 1 replies.
- [jira] [Created] (TIKA-803) Outlook parser to mark the message body in some special way - posted by "Swapna Vuppala (Created) (JIRA)" <ji...@apache.org> on 2011/12/08 09:59:41 UTC, 0 replies.
- [jira] [Assigned] (TIKA-801) ContentHandlerDecorator outputs invalid element - posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org> on 2011/12/08 12:56:40 UTC, 0 replies.
- [jira] [Updated] (TIKA-682) Creative Suite formats are not supported - posted by "Damon Rand (Updated) (JIRA)" <ji...@apache.org> on 2011/12/09 11:04:40 UTC, 1 replies.
- [jira] [Created] (TIKA-804) Parsing outlook format template (.oft ) - posted by "Babu Gajendran (Created) (JIRA)" <ji...@apache.org> on 2011/12/09 15:28:47 UTC, 0 replies.
- [jira] [Created] (TIKA-805) improvements in XSLFPowerPointExtractorDecorator - posted by "Yegor Kozlov (Created) (JIRA)" <ji...@apache.org> on 2011/12/09 15:48:39 UTC, 0 replies.
- [jira] [Updated] (TIKA-805) improvements in XSLFPowerPointExtractorDecorator - posted by "Yegor Kozlov (Updated) (JIRA)" <ji...@apache.org> on 2011/12/09 15:48:40 UTC, 0 replies.
- [jira] [Created] (TIKA-806) MS Word Detection magics are a bit overzealous - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/12/09 16:06:40 UTC, 0 replies.
- [jira] [Updated] (TIKA-806) MS Word Detection magics are a bit overzealous - posted by "Antoni Mylka (Updated) (JIRA)" <ji...@apache.org> on 2011/12/09 16:42:40 UTC, 3 replies.
- [jira] [Commented] (TIKA-806) MS Word Detection magics are a bit overzealous - posted by "Alex Ott (Commented) (JIRA)" <ji...@apache.org> on 2011/12/09 16:52:39 UTC, 5 replies.
- [jira] [Resolved] (TIKA-801) ContentHandlerDecorator outputs invalid element - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/09 19:50:42 UTC, 0 replies.
- [jira] [Updated] (TIKA-804) Parsing outlook format template (.oft ) - posted by "Babu Gajendran (Updated) (JIRA)" <ji...@apache.org> on 2011/12/10 07:23:46 UTC, 1 replies.
- [jira] [Commented] (TIKA-804) Parsing outlook format template (.oft ) - posted by "Babu Gajendran (Commented) (JIRA)" <ji...@apache.org> on 2011/12/10 07:23:47 UTC, 3 replies.
- [jira] [Created] (TIKA-807) PHP version of Tika - posted by "Ingo Renner (Created) (JIRA)" <ji...@apache.org> on 2011/12/10 13:21:39 UTC, 0 replies.
- Re: Multilingual Tika - posted by Ingo Renner <in...@typo3.org> on 2011/12/10 13:22:03 UTC, 0 replies.
- [jira] [Updated] (TIKA-807) PHP version of Tika - posted by "Ingo Renner (Updated) (JIRA)" <ji...@apache.org> on 2011/12/10 13:23:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-804) Parsing outlook format template (.oft ) - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/11 05:49:40 UTC, 0 replies.
- [jira] [Created] (TIKA-808) Fork Parser doesn't work for PDF files - posted by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2011/12/12 03:14:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-808) Fork Parser doesn't work for PDF files - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/12 03:26:36 UTC, 1 replies.
- [jira] [Created] (TIKA-809) IndexOutOfBoundsException with TikaGUI - posted by "John Mastarone (Created) (JIRA)" <ji...@apache.org> on 2011/12/12 03:56:31 UTC, 0 replies.
- [jira] [Updated] (TIKA-809) IndexOutOfBoundsException with TikaGUI - posted by "John Mastarone (Updated) (JIRA)" <ji...@apache.org> on 2011/12/12 04:04:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-809) IndexOutOfBoundsException with TikaGUI - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/12 04:04:46 UTC, 1 replies.
- [jira] [Resolved] (TIKA-809) IndexOutOfBoundsException with TikaGUI - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/12 04:10:32 UTC, 0 replies.
- [jira] [Commented] (TIKA-682) Creative Suite formats are not supported - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/12 06:58:30 UTC, 1 replies.
- [ANNOUNCE] Welcome Antoni Mylka as Tika committer + PMC member - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/12/12 17:58:42 UTC, 4 replies.
- [ANNOUNCE] Welcome Jerome Charron as Tika committer + PMC member - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/12/12 19:26:05 UTC, 1 replies.
- [jira] [Created] (TIKA-810) Upgrade to PDFbox 1.7.0 as available - posted by "Jeremy Anderson (Created) (JIRA)" <ji...@apache.org> on 2011/12/12 20:29:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-810) Upgrade to PDFbox 1.7.0 as available - posted by "Jeremy Anderson (Updated) (JIRA)" <ji...@apache.org> on 2011/12/12 20:35:30 UTC, 1 replies.
- [jira] [Commented] (TIKA-788) DWG parser infinite loop on possibly corrupt file - posted by "Tim-Christian Mundt (Commented) (JIRA)" <ji...@apache.org> on 2011/12/13 01:07:30 UTC, 0 replies.
- [jira] [Resolved] (TIKA-803) Outlook parser to mark the message body in some special way - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/13 05:15:31 UTC, 0 replies.
- [jira] [Commented] (TIKA-803) Outlook parser to mark the message body in some special way - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/13 05:15:31 UTC, 0 replies.
- [jira] [Commented] (TIKA-805) improvements in XSLFPowerPointExtractorDecorator - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/13 07:11:31 UTC, 1 replies.
- Pushing parsers upstream - posted by Jukka Zitting <ju...@gmail.com> on 2011/12/13 10:42:07 UTC, 13 replies.
- [jira] [Created] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support - posted by "Emmanuel Hugonnet (Created) (JIRA)" <ji...@apache.org> on 2011/12/13 10:53:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support - posted by "Emmanuel Hugonnet (Updated) (JIRA)" <ji...@apache.org> on 2011/12/13 10:55:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/13 11:21:30 UTC, 1 replies.
- JIRA rights. - posted by Antoni Mylka <an...@gmail.com> on 2011/12/13 14:05:25 UTC, 1 replies.
- [jira] [Resolved] (TIKA-806) MS Word Detection magics are a bit overzealous - posted by "Antoni Mylka (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/13 14:37:30 UTC, 0 replies.
- [jira] [Created] (TIKA-812) Improve the detection of Works Spreadsheet 7.0 files - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/12/13 17:21:30 UTC, 0 replies.
- [jira] [Closed] (TIKA-798) Distinguish between EMF and WMF - posted by "Antoni Mylka (Closed) (JIRA)" <ji...@apache.org> on 2011/12/13 17:23:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-812) Improve the detection of Works Spreadsheet 7.0 files - posted by "Antoni Mylka (Updated) (JIRA)" <ji...@apache.org> on 2011/12/13 17:25:30 UTC, 1 replies.
- [jira] [Created] (TIKA-813) Webarchive detection. - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/12/13 19:12:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-813) Webarchive detection. - posted by "Antoni Mylka (Updated) (JIRA)" <ji...@apache.org> on 2011/12/13 19:12:30 UTC, 3 replies.
- [jira] [Created] (TIKA-814) Increase the amount of bytes read by TextDetector - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/12/13 21:33:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-814) Increase the amount of bytes read by TextDetector - posted by "Antoni Mylka (Updated) (JIRA)" <ji...@apache.org> on 2011/12/13 21:33:31 UTC, 0 replies.
- [jira] [Commented] (TIKA-812) Improve the detection of Works Spreadsheet 7.0 files - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/14 01:05:29 UTC, 0 replies.
- [jira] [Closed] (TIKA-791) Fix the detection of protected OOXML files - posted by "Antoni Mylka (Closed) (JIRA)" <ji...@apache.org> on 2011/12/14 14:23:30 UTC, 0 replies.
- [jira] [Issue Comment Edited] (TIKA-810) Upgrade to PDFbox 1.7.0 as available - posted by "Jeremy Anderson (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2011/12/16 17:28:33 UTC, 1 replies.
- [jira] [Commented] (TIKA-810) Upgrade to PDFbox 1.7.0 as available - posted by "Antoni Mylka (Commented) (JIRA)" <ji...@apache.org> on 2011/12/16 18:50:31 UTC, 2 replies.
- [jira] [Created] (TIKA-815) Tika parsers should handle failures more gracefully - posted by "Jerome Lacoste (Created) (JIRA)" <ji...@apache.org> on 2011/12/17 11:36:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-815) Tika parsers should handle failures more gracefully - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/17 11:44:33 UTC, 3 replies.
- [jira] [Closed] (TIKA-812) Improve the detection of Works Spreadsheet 7.0 files - posted by "Antoni Mylka (Closed) (JIRA)" <ji...@apache.org> on 2011/12/19 12:19:30 UTC, 0 replies.
- [jira] [Closed] (TIKA-813) Webarchive detection. - posted by "Antoni Mylka (Closed) (JIRA)" <ji...@apache.org> on 2011/12/19 12:29:30 UTC, 0 replies.
- [jira] [Closed] (TIKA-814) Increase the amount of bytes read by TextDetector - posted by "Antoni Mylka (Closed) (JIRA)" <ji...@apache.org> on 2011/12/19 12:41:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-291) Adobe InDesign support - posted by "Adei Mandaluniz (Commented) (JIRA)" <ji...@apache.org> on 2011/12/19 13:15:30 UTC, 1 replies.
- [jira] [Created] (TIKA-816) (XLS/XLSX) Missing date/time in text content. - posted by "Albert L. (Created) (JIRA)" <ji...@apache.org> on 2011/12/19 16:57:31 UTC, 0 replies.
- [jira] [Updated] (TIKA-816) (XLS/XLSX) Improperly formatted date/time in text content. - posted by "Albert L. (Updated) (JIRA)" <ji...@apache.org> on 2011/12/19 16:59:30 UTC, 0 replies.
- [jira] [Created] (TIKA-817) (PPT/PPTX) Missing date/time in text content. - posted by "Albert L. (Created) (JIRA)" <ji...@apache.org> on 2011/12/19 17:07:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-817) (PPT/PPTX) Missing date/time in text content. - posted by "Albert L. (Commented) (JIRA)" <ji...@apache.org> on 2011/12/19 17:09:30 UTC, 1 replies.
- [jira] [Commented] (TIKA-816) (XLS/XLSX) Improperly formatted date/time in text content. - posted by "Albert L. (Commented) (JIRA)" <ji...@apache.org> on 2011/12/19 19:23:30 UTC, 2 replies.
- [jira] [Created] (TIKA-818) Allow PDFBox to be used with RandomAccessFile vs RandomAccessBuffer to allow for a memory vs performance tradeoff - posted by "Paul Pearcy (Created) (JIRA)" <ji...@apache.org> on 2011/12/19 19:25:30 UTC, 0 replies.
- [jira] [Created] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content - posted by "Albert L. (Created) (JIRA)" <ji...@apache.org> on 2011/12/19 19:49:32 UTC, 0 replies.
- [jira] [Commented] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 02:25:31 UTC, 3 replies.
- [jira] [Commented] (TIKA-700) Upgrade to POI 3.8 as available - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 07:01:33 UTC, 0 replies.
- [jira] [Commented] (TIKA-705) Valid OOXML PPT file hits InvalidFormatException thrown in POI - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 07:23:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-757) Address TODOs when we upgrade to next POI release (3.8 beta 5) - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 07:23:30 UTC, 0 replies.
- [jira] [Resolved] (TIKA-423) Parse docx and output to text file missing words - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/20 07:25:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-818) Allow PDFBox to be used with RandomAccessFile vs RandomAccessBuffer to allow for a memory vs performance tradeoff - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 08:42:31 UTC, 0 replies.
- [jira] [Created] (TIKA-820) Locator is unset for HTML parser - posted by "Daniel Bonniot de Ruisselet (Created) (JIRA)" <ji...@apache.org> on 2011/12/20 10:01:33 UTC, 0 replies.
- [jira] [Updated] (TIKA-820) Locator is unset for HTML parser - posted by "Daniel Bonniot de Ruisselet (Updated) (JIRA)" <ji...@apache.org> on 2011/12/20 10:01:33 UTC, 0 replies.
- [jira] [Commented] (TIKA-820) Locator is unset for HTML parser - posted by "Daniel Bonniot de Ruisselet (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 10:03:31 UTC, 0 replies.
- [jira] [Commented] (TIKA-686) Split tika-parsers into separate components - posted by "Antoni Mylka (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 13:43:30 UTC, 0 replies.
- [jira] [Created] (TIKA-821) Support detecting old MIcrosoft Works Word Processor formats - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/12/20 16:51:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-821) Support detecting old MIcrosoft Works Word Processor formats - posted by "Antoni Mylka (Commented) (JIRA)" <ji...@apache.org> on 2011/12/20 16:57:30 UTC, 0 replies.
- [jira] [Created] (TIKA-822) MediaType fails to parse charset that has quoted value - posted by "peter royal (Created) (JIRA)" <ji...@apache.org> on 2011/12/20 19:44:30 UTC, 0 replies.
- [jira] [Created] (TIKA-823) Detect StarOffice files - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/12/21 00:07:31 UTC, 0 replies.
- [jira] [Updated] (TIKA-823) Detect StarOffice files - posted by "Antoni Mylka (Updated) (JIRA)" <ji...@apache.org> on 2011/12/21 00:09:30 UTC, 0 replies.
- Re: svn commit: r1221323 - in /tika/trunk: tika-core/src/main/resources/org/apache/tika/mime/ tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ tika-parsers/src/test/java/org/apache/tika/detect/ tika-parsers/src/test/java/org/apache/tika/mime/ t... - posted by Nick Burch <ni...@alfresco.com> on 2011/12/21 01:50:01 UTC, 0 replies.
- [jira] [Commented] (TIKA-822) MediaType fails to parse charset that has quoted value - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/21 02:51:30 UTC, 4 replies.
- [jira] [Updated] (TIKA-822) MediaType fails to parse charset that has quoted value - posted by "peter royal (Updated) (JIRA)" <ji...@apache.org> on 2011/12/21 03:01:31 UTC, 0 replies.
- [jira] [Resolved] (TIKA-822) MediaType fails to parse charset that has quoted value - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/21 04:05:33 UTC, 0 replies.
- [jira] [Commented] (TIKA-823) Detect StarOffice files - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/21 04:27:31 UTC, 1 replies.
- [jira] [Closed] (TIKA-823) Detect StarOffice files - posted by "Antoni Mylka (Closed) (JIRA)" <ji...@apache.org> on 2011/12/21 13:03:30 UTC, 0 replies.
- [jira] [Created] (TIKA-825) Extract rel attr with LinkContentHandler - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/21 14:49:30 UTC, 0 replies.
- [jira] [Created] (TIKA-824) Extract rel attr with LinkContentHandler - posted by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2011/12/21 14:49:30 UTC, 0 replies.
- [jira] [Closed] (TIKA-825) Extract rel attr with LinkContentHandler - posted by "Markus Jelsma (Closed) (JIRA)" <ji...@apache.org> on 2011/12/21 14:49:31 UTC, 0 replies.
- [jira] [Updated] (TIKA-824) Extract rel attr with LinkContentHandler - posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org> on 2011/12/21 14:51:30 UTC, 1 replies.
- [jira] [Assigned] (TIKA-824) Extract rel attr with LinkContentHandler - posted by "Chris A. Mattmann (Assigned) (JIRA)" <ji...@apache.org> on 2011/12/21 16:41:32 UTC, 0 replies.
- [jira] [Created] (TIKA-826) TikaException / OfficeXmlFileException with .xlsb files - posted by "John Mastarone (Created) (JIRA)" <ji...@apache.org> on 2011/12/22 05:15:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-826) TikaException / OfficeXmlFileException with .xlsb files - posted by "John Mastarone (Updated) (JIRA)" <ji...@apache.org> on 2011/12/22 05:17:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-676) Boilerpipe fails - posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org> on 2011/12/22 13:08:30 UTC, 0 replies.
- [jira] [Issue Comment Edited] (TIKA-826) TikaException / OfficeXmlFileException with .xlsb files - posted by "John Mastarone (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2011/12/22 14:36:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-826) TikaException / OfficeXmlFileException with .xlsb files - posted by "John Mastarone (Commented) (JIRA)" <ji...@apache.org> on 2011/12/22 14:36:30 UTC, 2 replies.
- Parser stability and ForkParser - posted by Jerome Lacoste <je...@gmail.com> on 2011/12/22 17:18:30 UTC, 2 replies.
- [jira] [Created] (TIKA-827) ForkServer fails to report issues if an exception is not properly serializable - posted by "Jerome Lacoste (Created) (JIRA)" <ji...@apache.org> on 2011/12/23 12:04:30 UTC, 0 replies.
- [jira] [Created] (TIKA-828) TaggedIOException can be passed non Serializable objects - posted by "Jerome Lacoste (Created) (JIRA)" <ji...@apache.org> on 2011/12/23 12:28:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-829) Tika lacks preconditions on its input, causing some potential misuse of the API - posted by "Jerome Lacoste (Updated) (JIRA)" <ji...@apache.org> on 2011/12/23 12:30:30 UTC, 1 replies.
- [jira] [Created] (TIKA-829) Tika lacks preconditions on its input, causing some potential misuse of the API - posted by "Jerome Lacoste (Created) (JIRA)" <ji...@apache.org> on 2011/12/23 12:30:30 UTC, 0 replies.
- [jira] [Created] (TIKA-830) Tika.parseToString() causes ForkParser to try to serialize itself - posted by "Jerome Lacoste (Created) (JIRA)" <ji...@apache.org> on 2011/12/23 12:42:30 UTC, 0 replies.
- [jira] [Created] (TIKA-831) ForkClient doesn't report error due to widening conversion issue - posted by "Jerome Lacoste (Created) (JIRA)" <ji...@apache.org> on 2011/12/23 13:10:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-808) Fork Parser doesn't work for PDF files - posted by "Jerome Lacoste (Updated) (JIRA)" <ji...@apache.org> on 2011/12/23 13:24:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-827) ForkServer fails to report issues if an exception is not properly serializable - posted by "Jerome Lacoste (Updated) (JIRA)" <ji...@apache.org> on 2011/12/23 13:26:31 UTC, 2 replies.
- [jira] [Updated] (TIKA-828) TaggedIOException can be passed non Serializable objects - posted by "Jerome Lacoste (Updated) (JIRA)" <ji...@apache.org> on 2011/12/23 13:28:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-830) Tika.parseToString() causes ForkParser to try to serialize itself - posted by "Jerome Lacoste (Updated) (JIRA)" <ji...@apache.org> on 2011/12/23 13:28:30 UTC, 3 replies.
- [jira] [Updated] (TIKA-831) ForkClient doesn't report error due to widening conversion issue - posted by "Jerome Lacoste (Updated) (JIRA)" <ji...@apache.org> on 2011/12/23 13:30:30 UTC, 0 replies.
- [jira] [Created] (TIKA-832) ForkParser is unfriendly to code that prints things to its output - posted by "Jerome Lacoste (Created) (JIRA)" <ji...@apache.org> on 2011/12/23 14:36:37 UTC, 0 replies.
- [jira] [Updated] (TIKA-832) ForkParser is unfriendly to code that prints things to its output - posted by "Jerome Lacoste (Updated) (JIRA)" <ji...@apache.org> on 2011/12/23 14:36:39 UTC, 4 replies.
- [jira] [Commented] (TIKA-832) ForkParser is unfriendly to code that prints things to its output - posted by "Jerome Lacoste (Commented) (JIRA)" <ji...@apache.org> on 2011/12/23 22:48:30 UTC, 1 replies.
- [jira] [Resolved] (TIKA-828) TaggedIOException can be passed non Serializable objects - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/23 23:54:30 UTC, 0 replies.
- [jira] [Resolved] (TIKA-808) Fork Parser doesn't work for PDF files - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/24 00:36:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-830) Tika.parseToString() causes ForkParser to try to serialize itself - posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org> on 2011/12/24 00:46:30 UTC, 5 replies.
- [VOTE] Release Apache ODF Toolkit 0.5-incubating(RC6) - posted by Devin Han <de...@apache.org> on 2011/12/24 10:11:02 UTC, 0 replies.
- [jira] [Resolved] (TIKA-829) Tika lacks preconditions on its input, causing some potential misuse of the API - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/26 05:06:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-829) Tika lacks preconditions on its input, causing some potential misuse of the API - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/26 05:06:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-827) ForkServer fails to report issues if an exception is not properly serializable - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/26 13:32:30 UTC, 1 replies.
- [jira] [Commented] (TIKA-831) ForkClient doesn't report error due to widening conversion issue - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/26 13:52:31 UTC, 1 replies.
- [jira] [Resolved] (TIKA-831) ForkClient doesn't report error due to widening conversion issue - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/27 03:46:30 UTC, 0 replies.
- [jira] [Created] (TIKA-833) POI Daily beta6 as of 12/27 breaks ExcelParserTest.testExcelParserFormatting() - posted by "Jeremy Anderson (Created) (JIRA)" <ji...@apache.org> on 2011/12/27 15:48:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-833) POI Daily beta6 as of 12/27 breaks ExcelParserTest.testExcelParserFormatting() - posted by "Jeremy Anderson (Updated) (JIRA)" <ji...@apache.org> on 2011/12/27 16:36:31 UTC, 1 replies.
- [VOTE] Release Apache ODF Toolkit 0.5-incubating(RC7) - posted by Devin Han <de...@apache.org> on 2011/12/27 17:45:12 UTC, 1 replies.
- [jira] [Commented] (TIKA-833) POI Daily beta6 as of 12/27 breaks ExcelParserTest.testExcelParserFormatting() - posted by "Jeremy Anderson (Commented) (JIRA)" <ji...@apache.org> on 2011/12/27 18:10:30 UTC, 3 replies.
- [jira] [Created] (TIKA-834) server problem only 1st (-m -j) result is correct additional runs include data from previous runs - posted by "George Kappel (Created) (JIRA)" <ji...@apache.org> on 2011/12/28 23:51:30 UTC, 0 replies.
- InfoQ article on Tika published - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/12/29 00:27:49 UTC, 0 replies.
- [jira] [Updated] (TIKA-834) server problem only 1st result is correct additional runs include data from 1st run - posted by "George Kappel (Updated) (JIRA)" <ji...@apache.org> on 2011/12/29 04:26:32 UTC, 0 replies.
- [jira] [Resolved] (TIKA-793) Invalid ASCII character (65533) when retriving MP3 metadata - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/29 10:13:31 UTC, 0 replies.
- [jira] [Resolved] (TIKA-833) POI Daily beta6 as of 12/27 breaks ExcelParserTest.testExcelParserFormatting() - posted by "Jeremy Anderson (Resolved) (JIRA)" <ji...@apache.org> on 2011/12/29 14:47:32 UTC, 0 replies.
- [jira] [Reopened] (TIKA-833) POI Daily beta6 as of 12/27 breaks ExcelParserTest.testExcelParserFormatting() - posted by "Jeremy Anderson (Reopened) (JIRA)" <ji...@apache.org> on 2011/12/29 17:15:30 UTC, 0 replies.
- [jira] [Closed] (TIKA-833) POI Daily beta6 as of 12/27 breaks ExcelParserTest.testExcelParserFormatting() - posted by "Jeremy Anderson (Closed) (JIRA)" <ji...@apache.org> on 2011/12/29 17:17:30 UTC, 0 replies.
- I would like to join this mailing list - posted by "Lotrowski, Adam" <Ad...@ibi.com> on 2011/12/29 18:42:42 UTC, 1 replies.
- [jira] [Created] (TIKA-835) TNEF parsing unstable - posted by "Rob Tulloh (Created) (JIRA)" <ji...@apache.org> on 2011/12/29 19:21:31 UTC, 0 replies.
- [jira] [Created] (TIKA-836) parsing really slow on some documents - posted by "Rob Tulloh (Created) (JIRA)" <ji...@apache.org> on 2011/12/29 20:27:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-836) parsing really slow on some documents - posted by "Rob Tulloh (Updated) (JIRA)" <ji...@apache.org> on 2011/12/29 21:46:31 UTC, 0 replies.
- [jira] [Commented] (TIKA-835) TNEF parsing unstable - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/12/30 05:21:31 UTC, 3 replies.
- [jira] [Commented] (TIKA-836) parsing really slow on some documents - posted by "Rob Tulloh (Commented) (JIRA)" <ji...@apache.org> on 2011/12/30 05:21:31 UTC, 0 replies.
- arrayindex out of bounds exception - posted by Aami <si...@algotree.com> on 2011/12/30 07:52:33 UTC, 0 replies.
- [jira] [Closed] (TIKA-835) TNEF parsing unstable - posted by "Rob Tulloh (Closed) (JIRA)" <ji...@apache.org> on 2011/12/30 13:35:30 UTC, 0 replies.