You are viewing a plain text version of this content. The canonical link for it is here.
- Re: [jira] Updated: (TIKA-402) Support for Keynote and Pages documents - posted by Alex Ott <al...@gmail.com> on 2010/06/01 09:17:47 UTC, 0 replies.
- [jira] Commented: (TIKA-422) Wrong charset conversion in some RTF documents. - posted by "Leszek Piotrowicz (JIRA)" <ji...@apache.org> on 2010/06/01 16:20:43 UTC, 0 replies.
- [jira] Updated: (TIKA-361) Update OutlookExtractor to match new POI API - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/02 17:37:40 UTC, 5 replies.
- [jira] Created: (TIKA-436) Tika throws RuntimeException when parsing PPTX with null creation date - posted by "rick cameron (JIRA)" <ji...@apache.org> on 2010/06/03 23:51:54 UTC, 0 replies.
- [jira] Updated: (TIKA-436) Tika throws RuntimeException when parsing PPTX with null creation date - posted by "rick cameron (JIRA)" <ji...@apache.org> on 2010/06/03 23:53:54 UTC, 0 replies.
- [jira] Commented: (TIKA-391) Intermittent errors detecting xls files - posted by "Chris Bamford (JIRA)" <ji...@apache.org> on 2010/06/04 10:36:57 UTC, 5 replies.
- PDF text extraction problems - posted by Ehsan Sadeghi <es...@gmail.com> on 2010/06/04 11:51:07 UTC, 0 replies.
- [jira] Commented: (TIKA-436) Tika throws RuntimeException when parsing PPTX with null creation date - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/04 14:58:55 UTC, 0 replies.
- [jira] Commented: (TIKA-419) Allow parser lookup from a custom class loader - posted by "Brad Greenlee (JIRA)" <ji...@apache.org> on 2010/06/04 23:17:03 UTC, 0 replies.
- RE: confirm unsubscribe from dev@tika.apache.org - posted by RAKHI GUPTA <gu...@hotmail.com> on 2010/06/05 01:20:23 UTC, 0 replies.
- RE: Please unsubscribe me. - posted by RAKHI GUPTA <gu...@hotmail.com> on 2010/06/05 01:21:11 UTC, 1 replies.
- Welcome Julien Nioche, new Tika PMC member and committer - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/06/06 00:42:43 UTC, 1 replies.
- [jira] Created: (TIKA-437) OfficeParser: support for write-protected xlsx files - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/06/07 13:26:36 UTC, 0 replies.
- [jira] Updated: (TIKA-437) OfficeParser: support for write-protected xlsx files - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/06/07 13:28:39 UTC, 0 replies.
- [jira] Commented: (TIKA-402) Support for iWork documents - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/07 23:35:44 UTC, 0 replies.
- [jira] Commented: (TIKA-434) Bug in TagSoup causes IOException - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/08 00:26:12 UTC, 0 replies.
- Reg Autodetector Tika Parser - posted by dynamolalit <la...@gmail.com> on 2010/06/08 09:57:51 UTC, 0 replies.
- Re: Reg AutoDetectParser Tika Parser - posted by dynamolalit <la...@gmail.com> on 2010/06/09 09:05:28 UTC, 1 replies.
- [jira] Commented: (TIKA-420) [PATCH] Integration of boilerpipe: Boilerplate Removal and Fulltext Extraction from HTML pages - posted by "Christian Kohlschütter (JIRA)" <ji...@apache.org> on 2010/06/09 13:32:14 UTC, 0 replies.
- Out-of-date mailing list info? - posted by Ken Krugler <kk...@transpac.com> on 2010/06/10 17:01:12 UTC, 1 replies.
- Tika in Action - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/06/12 04:07:24 UTC, 0 replies.
- [jira] Created: (TIKA-438) Parse and return the complete set of custom document properties from MS Office documents - posted by "Mads Hansen (JIRA)" <ji...@apache.org> on 2010/06/13 18:31:31 UTC, 0 replies.
- [jira] Updated: (TIKA-438) Parse and return the complete set of custom document properties from MS Office documents - posted by "Mads Hansen (JIRA)" <ji...@apache.org> on 2010/06/13 18:34:13 UTC, 1 replies.
- [jira] Commented: (TIKA-373) Upgrade to POI 3.7 (or 4.0?) - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/14 11:01:13 UTC, 3 replies.
- [jira] Updated: (TIKA-371) Excel formatting depends on the default locale - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/14 13:35:15 UTC, 0 replies.
- [jira] Created: (TIKA-439) DWGParser (and some others) not used by AutoDetectParser - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/15 16:31:25 UTC, 0 replies.
- [jira] Created: (TIKA-440) [Patch] Fetch the composer information in the MP3 Parser - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/15 18:39:22 UTC, 0 replies.
- [jira] Updated: (TIKA-440) [Patch] Fetch the composer information in the MP3 Parser - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/15 18:41:23 UTC, 0 replies.
- Detecting container formats - posted by Nick Burch <ni...@alfresco.com> on 2010/06/15 19:25:13 UTC, 9 replies.
- [jira] Updated: (TIKA-441) Sometimes, tika not working (crashed) because of null classloader - posted by "Alex Ott (JIRA)" <ji...@apache.org> on 2010/06/15 20:55:23 UTC, 0 replies.
- [jira] Created: (TIKA-441) Sometimes, tika not working (crashed) because of null classloader - posted by "Alex Ott (JIRA)" <ji...@apache.org> on 2010/06/15 20:55:23 UTC, 0 replies.
- Trouble committing to Tika - posted by Jukka Zitting <ju...@gmail.com> on 2010/06/16 00:32:39 UTC, 3 replies.
- [jira] Created: (TIKA-442) Image extractors use inconsistent metadata keys and formats for common features - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/16 16:38:22 UTC, 0 replies.
- Short developerworks article on Tika - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/06/16 18:14:01 UTC, 0 replies.
- [jira] Resolved: (TIKA-441) Sometimes, tika not working (crashed) because of null classloader - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/16 21:10:23 UTC, 0 replies.
- [jira] Commented: (TIKA-441) Sometimes, tika not working (crashed) because of null classloader - posted by "Alex Ott (JIRA)" <ji...@apache.org> on 2010/06/16 21:14:23 UTC, 0 replies.
- [jira] Resolved: (TIKA-440) [Patch] Fetch the composer information in the MP3 Parser - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/16 23:06:23 UTC, 0 replies.
- [jira] Commented: (TIKA-442) Image extractors use inconsistent metadata keys and formats for common features - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/16 23:42:24 UTC, 2 replies.
- [jira] Resolved: (TIKA-439) DWGParser (and some others) not used by AutoDetectParser - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/17 00:45:23 UTC, 0 replies.
- [jira] Created: (TIKA-443) Geographic Information Parser - posted by "Arturo Beltran (JIRA)" <ji...@apache.org> on 2010/06/17 11:26:24 UTC, 0 replies.
- [jira] Commented: (TIKA-443) Geographic Information Parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/06/17 16:14:24 UTC, 12 replies.
- Getting started - posted by Arturo Beltran <ar...@uji.es> on 2010/06/17 16:39:17 UTC, 3 replies.
- [jira] Resolved: (TIKA-308) Improve supertype handling in type registry - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/18 14:08:23 UTC, 0 replies.
- [jira] Resolved: (TIKA-298) CompositeParser.getParser() should use mimetype hierarchy when falling back - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/18 14:13:23 UTC, 0 replies.
- Re: Build with Maven. OutOfMemoryError - posted by hpstricker <st...@epublius.de> on 2010/06/18 18:23:48 UTC, 0 replies.
- Maven/Tika - still out of memory - posted by hpstricker <st...@epublius.de> on 2010/06/18 18:35:34 UTC, 0 replies.
- [jira] Created: (TIKA-444) Tika sites refers to incorrect svn repo URL - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2010/06/21 00:41:24 UTC, 0 replies.
- [jira] Updated: (TIKA-444) Tika sites refers to incorrect svn repo URL - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2010/06/21 00:43:23 UTC, 1 replies.
- [jira] Assigned: (TIKA-444) Tika sites refers to incorrect svn repo URL - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/06/21 02:36:24 UTC, 0 replies.
- [jira] Resolved: (TIKA-444) Tika sites refers to incorrect svn repo URL - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/06/21 02:43:23 UTC, 0 replies.
- [jira] Issue Comment Edited: (TIKA-443) Geographic Information Parser - posted by "Mayank Singh (JIRA)" <ji...@apache.org> on 2010/06/21 11:16:25 UTC, 0 replies.
- svnpubsub for the Tika web site - posted by Jukka Zitting <ju...@gmail.com> on 2010/06/21 12:02:42 UTC, 3 replies.
- [jira] Updated: (TIKA-443) Geographic Information Parser - posted by "Arturo Beltran (JIRA)" <ji...@apache.org> on 2010/06/22 11:33:58 UTC, 0 replies.
- [jira] Resolved: (TIKA-361) Update OutlookExtractor to match new POI API - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/23 18:28:52 UTC, 0 replies.
- [jira] Commented: (TIKA-437) OfficeParser: support for write-protected xlsx files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/23 19:00:53 UTC, 0 replies.
- [jira] Resolved: (TIKA-437) OfficeParser: support for write-protected xlsx files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/25 12:47:50 UTC, 0 replies.
- Limiting the extracted content - posted by "Jana, Kumar Raja" <kj...@ptc.com> on 2010/06/28 15:49:58 UTC, 0 replies.
- [jira] Closed: (TIKA-442) Image extractors use inconsistent metadata keys and formats for common features - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/28 16:00:49 UTC, 0 replies.
- [jira] Closed: (TIKA-316) Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length) - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/28 17:04:51 UTC, 0 replies.
- [jira] Closed: (TIKA-436) Tika throws RuntimeException when parsing PPTX with null creation date - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/28 17:08:50 UTC, 0 replies.
- [jira] Created: (TIKA-445) Geographic metadata namespace - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/28 17:25:49 UTC, 0 replies.
- [jira] Closed: (TIKA-371) Excel formatting depends on the default locale - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/28 17:46:52 UTC, 0 replies.
- [jira] Updated: (TIKA-445) Geographic metadata namespace - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/28 17:52:50 UTC, 0 replies.
- [jira] Updated: (TIKA-373) Upgrade to POI 3.7 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/28 17:54:50 UTC, 0 replies.
- [jira] Commented: (TIKA-445) Geographic metadata namespace - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/28 17:54:54 UTC, 1 replies.
- [jira] Commented: (TIKA-418) RuntimeException while getting content for ppsx, ppsm, pptm, thmx and xps file types - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/28 18:09:50 UTC, 2 replies.
- [jira] Created: (TIKA-446) Upgrade to PDFBox 1.2.0 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/28 19:28:49 UTC, 0 replies.
- [jira] Updated: (TIKA-418) RuntimeException while getting content for ppsx, ppsm, pptm, thmx and xps file types - posted by "Rajiv Kumar (JIRA)" <ji...@apache.org> on 2010/06/29 08:40:49 UTC, 1 replies.
- [jira] Closed: (TIKA-445) Geographic metadata namespace - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/29 13:22:49 UTC, 0 replies.
- [jira] Created: (TIKA-447) Container aware mimetype detection - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/29 17:36:49 UTC, 0 replies.
- [jira] Updated: (TIKA-447) Container aware mimetype detection - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/29 17:36:50 UTC, 0 replies.
- [jira] Created: (TIKA-448) Tika FLVParser hangs - posted by "Jeroen van Vianen (JIRA)" <ji...@apache.org> on 2010/06/29 19:23:51 UTC, 0 replies.
- [jira] Updated: (TIKA-448) Tika FLVParser hangs - posted by "Jeroen van Vianen (JIRA)" <ji...@apache.org> on 2010/06/29 19:23:51 UTC, 1 replies.
- [jira] Commented: (TIKA-448) Tika FLVParser hangs - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/06/29 21:19:50 UTC, 1 replies.
- Re: svn commit: r958942 - in /tika/trunk/tika-parsers/src: main/java/org/apache/tika/parser/html/ main/java/org/apache/tika/parser/image/ main/java/org/apache/tika/parser/jpeg/ test/java/org/apache/tika/parser/html/ test/java/org/apache/tika/parser/j - posted by Jukka Zitting <ju...@gmail.com> on 2010/06/29 23:57:23 UTC, 2 replies.
- [jira] Commented: (TIKA-371) Excel formatting depends on the default locale - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/30 00:21:51 UTC, 0 replies.
- [jira] Resolved: (TIKA-446) Upgrade to PDFBox 1.2.0 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/30 00:28:49 UTC, 0 replies.
- [jira] Created: (TIKA-449) Update parsers to extract geographic metadata - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/30 13:13:49 UTC, 0 replies.
- [jira] Commented: (TIKA-449) Update parsers to extract geographic metadata - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/30 13:13:50 UTC, 1 replies.
- Re: svn commit: r958942 - in /tika/trunk/tika-parsers/src: main/java/org/apache/tika/parser/html/ main/java/org/apache/tika/parser/image/ main/java/org/apache/tika/parser/jpeg/ test/java/org/apache/tika/parser/html/ test/java/org/apache/tika/parser/j - posted by Nick Burch <ni...@alfresco.com> on 2010/06/30 13:22:41 UTC, 0 replies.
- [jira] Resolved: (TIKA-449) Update parsers to extract geographic metadata - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/30 13:22:52 UTC, 0 replies.
- Re: svn commit: r958942 - in /tika/trunk/tika-parsers/src: main/java/org/apache/tika/parser/html/ main/java/org/apache/tika/parser/image/ main/java/org/apache/tika/parser/jpeg/ test/java/org/apache/tika/parser/html/ test/java/org/apache/tika/parser/j - posted by Nick Burch <ni...@alfresco.com> on 2010/06/30 13:23:59 UTC, 0 replies.
- [jira] Created: (TIKA-450) Document our issue tracking workflows - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/06/30 13:31:49 UTC, 0 replies.
- [jira] Created: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/30 13:41:49 UTC, 0 replies.
- [jira] Created: (TIKA-452) Extract custom pdf metadata - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/30 13:50:50 UTC, 0 replies.
- [jira] Resolved: (TIKA-452) Extract custom pdf metadata - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/30 13:54:49 UTC, 0 replies.
- [jira] Commented: (TIKA-452) Extract custom pdf metadata - posted by "Jeremias Maerki (JIRA)" <ji...@apache.org> on 2010/06/30 14:00:51 UTC, 4 replies.
- [jira] Updated: (TIKA-452) Extract custom pdf metadata - posted by "Jeremias Maerki (JIRA)" <ji...@apache.org> on 2010/06/30 14:04:52 UTC, 0 replies.
- [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/06/30 16:58:49 UTC, 0 replies.
- [jira] Commented: (TIKA-408) Word 6.0/7.0 documents support in office parser - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/06/30 18:16:50 UTC, 0 replies.