You are viewing a plain text version of this content. The canonical link for it is here.
- PDFBox bug in 0.8-incubating - posted by Ken Krugler <kk...@transpac.com> on 2010/01/04 22:27:48 UTC, 0 replies.
- [jira] Updated: (TIKA-103) Excel parsing ignores cell formating - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2010/01/05 02:20:54 UTC, 1 replies.
- [jira] Commented: (TIKA-103) Excel parsing ignores cell formating - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2010/01/05 02:26:54 UTC, 2 replies.
- [jira] Assigned: (TIKA-103) Excel parsing ignores cell formating - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2010/01/05 02:48:54 UTC, 0 replies.
- Another shutdown error thrown during parsing - posted by Ken Krugler <kk...@transpac.com> on 2010/01/06 23:00:22 UTC, 1 replies.
- [jira] Created: (TIKA-358) Auto-detection of HTML fails with common auto-generated template - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/06 23:34:54 UTC, 0 replies.
- [jira] Updated: (TIKA-358) Auto-detection of HTML fails with common auto-generated template - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/06 23:36:54 UTC, 0 replies.
- [jira] Created: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/06 23:56:54 UTC, 0 replies.
- [jira] Commented: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/08 12:26:54 UTC, 0 replies.
- [jira] Created: (TIKA-360) Outstanding Improvements to Number/Date Formatting in ExcelParser - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2010/01/08 17:45:09 UTC, 0 replies.
- [jira] Resolved: (TIKA-103) Excel parsing ignores cell formating - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2010/01/08 17:51:13 UTC, 0 replies.
- [jira] Commented: (TIKA-360) Outstanding Improvements to Number/Date Formatting in ExcelParser - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2010/01/08 17:55:20 UTC, 0 replies.
- [jira] Commented: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/08 18:42:54 UTC, 0 replies.
- TIKA-103 - Excel Number/Date Formatting. - posted by Dave Meikle <lo...@gmail.com> on 2010/01/08 19:56:52 UTC, 4 replies.
- Tika Dependency to bouncycastle lib..Tika 0.5 / Tika 0.6-SNAPSHOT... - posted by Karl Heinz Marbaise <kh...@gmx.de> on 2010/01/10 20:50:47 UTC, 1 replies.
- [jira] Commented: (TIKA-148) The ExcelParsing should scan the cell comments - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/11 12:24:54 UTC, 0 replies.
- [jira] Created: (TIKA-361) Update OutlookExtractor to match new POI API - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/11 15:40:54 UTC, 0 replies.
- [jira] Updated: (TIKA-361) Update OutlookExtractor to match new POI API - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/11 15:40:54 UTC, 0 replies.
- [jira] Created: (TIKA-362) Add publisher support - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/11 16:10:54 UTC, 0 replies.
- [jira] Updated: (TIKA-362) Add publisher support - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/11 16:10:54 UTC, 0 replies.
- PDF parser exception - posted by Doug Carter <dc...@mercycorps.org> on 2010/01/12 20:37:52 UTC, 3 replies.
- [jira] Created: (TIKA-363) PDF Content Type seen as application/rdf+xml not appliction/pdf - posted by "Tim Reynolds (JIRA)" <ji...@apache.org> on 2010/01/13 22:23:54 UTC, 0 replies.
- [jira] Commented: (TIKA-316) Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length) - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/01/14 15:20:54 UTC, 0 replies.
- [jira] Created: (TIKA-364) [PATCH] Metadata mark for xlsx documents with protected sheets - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/01/14 15:22:54 UTC, 0 replies.
- [jira] Updated: (TIKA-364) [PATCH] Metadata mark for xlsx documents with protected sheets - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/01/14 15:24:54 UTC, 0 replies.
- Tika command line performance - posted by Doug Carter <dc...@mercycorps.org> on 2010/01/15 20:07:05 UTC, 5 replies.
- [jira] Resolved: (TIKA-327) Parsing "HTML" as DcXML - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/16 06:36:56 UTC, 0 replies.
- [jira] Assigned: (TIKA-357) Increase buffer size for meta tag sniffing - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/16 06:38:54 UTC, 0 replies.
- [jira] Updated: (TIKA-357) Increase buffer size for meta tag sniffing - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/16 06:46:54 UTC, 1 replies.
- [jira] Commented: (TIKA-357) Increase buffer size for meta tag sniffing - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/16 06:59:54 UTC, 3 replies.
- [jira] Issue Comment Edited: (TIKA-357) Increase buffer size for meta tag sniffing - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/16 07:01:57 UTC, 0 replies.
- Hudson build became unstable: Tika-trunk » Apache Tika parsers #252 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/01/16 07:14:22 UTC, 0 replies.
- Hudson build became unstable: Tika-trunk #252 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/01/16 07:14:22 UTC, 0 replies.
- [jira] Commented: (TIKA-327) Parsing "HTML" as DcXML - posted by "Erik Hetzner (JIRA)" <ji...@apache.org> on 2010/01/16 17:49:54 UTC, 0 replies.
- Extracting dublin core metadata in HtmlParser? - posted by Nick Burch <ni...@alfresco.com> on 2010/01/19 14:41:45 UTC, 1 replies.
- [jira] Created: (TIKA-365) Extract more OpenDocument metadata - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/19 17:42:54 UTC, 0 replies.
- [jira] Updated: (TIKA-365) Extract more OpenDocument metadata - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/19 17:44:54 UTC, 3 replies.
- Tika 0.5 API - posted by Stefan Burger <St...@Burger-Pumpen.de> on 2010/01/19 18:36:53 UTC, 1 replies.
- [jira] Created: (TIKA-366) Increase buffer size for mime type sniffing - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/20 03:01:54 UTC, 0 replies.
- [jira] Updated: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/20 03:05:54 UTC, 0 replies.
- [jira] Resolved: (TIKA-366) Increase buffer size for mime type sniffing - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/20 03:05:54 UTC, 0 replies.
- [jira] Updated: (TIKA-323) Make Tika site look like Lucene ecosystem Apache Forrest-built sites - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/20 03:07:54 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk » Apache Tika parsers #253 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/01/20 04:06:49 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk #253 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/01/20 04:06:52 UTC, 0 replies.
- [jira] Created: (TIKA-367) Mime type rootXML equality improvement - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/20 06:33:56 UTC, 0 replies.
- [jira] Updated: (TIKA-367) Mime type rootXML equality improvement - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/20 06:35:56 UTC, 0 replies.
- [jira] Resolved: (TIKA-367) Mime type rootXML equality improvement - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/20 06:45:54 UTC, 0 replies.
- [jira] Resolved: (TIKA-357) Increase buffer size for meta tag sniffing - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/01/20 06:51:55 UTC, 0 replies.
- Hudson build is back to stable: Tika-trunk » Apache Tika parsers #254 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/01/20 07:08:32 UTC, 0 replies.
- Hudson build is back to stable: Tika-trunk #254 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/01/20 07:08:34 UTC, 0 replies.
- [VOTE] Apache Tika 0.6 release candidate #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/01/20 07:56:50 UTC, 11 replies.
- [jira] Created: (TIKA-368) ID3v2 support for mp3 parser - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/20 15:57:54 UTC, 0 replies.
- [jira] Updated: (TIKA-368) ID3v2 support for mp3 parser - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/20 15:59:54 UTC, 2 replies.
- Ogg vorbis metadata? - posted by Nick Burch <ni...@alfresco.com> on 2010/01/21 18:20:40 UTC, 1 replies.
- [jira] Assigned: (TIKA-354) ProfilingHandler should take a length-limiting parameter - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/24 17:50:17 UTC, 0 replies.
- [jira] Commented: (TIKA-354) ProfilingHandler should take a length-limiting parameter - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/24 17:54:17 UTC, 0 replies.
- [jira] Created: (TIKA-369) Improve accuracy of language detection - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/24 19:52:17 UTC, 0 replies.
- [jira] Updated: (TIKA-369) Improve accuracy of language detection - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/24 20:06:17 UTC, 4 replies.
- [jira] Commented: (TIKA-369) Improve accuracy of language detection - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/24 20:40:17 UTC, 0 replies.
- [jira] Issue Comment Edited: (TIKA-369) Improve accuracy of language detection - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/24 20:40:17 UTC, 1 replies.
- [jira] Created: (TIKA-370) Tika pom.xml is missing dependencies on bouncycastle jars needed by PDFBox - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/25 21:53:34 UTC, 0 replies.
- [jira] Commented: (TIKA-370) Tika pom.xml is missing dependencies on bouncycastle jars needed by PDFBox - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/25 21:53:34 UTC, 0 replies.
- [jira] Issue Comment Edited: (TIKA-370) Tika pom.xml is missing dependencies on bouncycastle jars needed by PDFBox - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/01/25 21:55:34 UTC, 0 replies.
- Timeout support with parsers - posted by Ken Krugler <kk...@transpac.com> on 2010/01/25 22:57:11 UTC, 3 replies.
- [jira] Created: (TIKA-371) Excel formatting depends on the default locale - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 10:42:34 UTC, 0 replies.
- [jira] Updated: (TIKA-371) Excel formatting depends on the default locale - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 11:06:34 UTC, 0 replies.
- [jira] Resolved: (TIKA-368) ID3v2 support for mp3 parser - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 12:36:34 UTC, 0 replies.
- [jira] Resolved: (TIKA-365) Extract more OpenDocument metadata - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 12:52:34 UTC, 0 replies.
- [jira] Resolved: (TIKA-363) PDF Content Type seen as application/rdf+xml not appliction/pdf - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 13:18:34 UTC, 0 replies.
- [jira] Created: (TIKA-372) Channel and SampleRate information for MP3 files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/26 16:28:34 UTC, 0 replies.
- [jira] Updated: (TIKA-372) Channel and SampleRate information for MP3 files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/01/26 16:30:34 UTC, 0 replies.
- [jira] Resolved: (TIKA-362) Add publisher support - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 17:26:34 UTC, 0 replies.
- [jira] Created: (TIKA-373) Upgrade to POI 3.7 (or 4.0?) - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 17:38:34 UTC, 0 replies.
- [jira] Resolved: (TIKA-364) [PATCH] Metadata mark for xlsx documents with protected sheets - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 18:26:35 UTC, 0 replies.
- [jira] Commented: (TIKA-372) Channel and SampleRate information for MP3 files - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 18:40:34 UTC, 0 replies.
- [jira] Resolved: (TIKA-356) Wrong Repository URL on the Web-Site - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/26 19:30:34 UTC, 0 replies.
- [jira] Resolved: (TIKA-141) Mime Content Type detection of a web document from its URL. - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/27 00:00:35 UTC, 0 replies.
- [jira] Created: (TIKA-374) AutoDetectParser not thread-safe? - posted by "Adam Rauch (JIRA)" <ji...@apache.org> on 2010/01/27 00:59:37 UTC, 0 replies.
- [jira] Resolved: (TIKA-239) System.err prints from XmlRootExtractor - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/27 17:04:34 UTC, 0 replies.
- [jira] Resolved: (TIKA-374) AutoDetectParser not thread-safe? - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/27 19:16:34 UTC, 0 replies.
- [RESULT] [VOTE] Apache Tika 0.6 release candidate #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/01/27 23:01:19 UTC, 1 replies.
- Character encodings on the web - posted by Jukka Zitting <ju...@gmail.com> on 2010/01/29 13:15:57 UTC, 1 replies.
- [jira] Resolved: (TIKA-199) Improved audio detection and parsing - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/30 20:40:34 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 0.6 released - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/01/31 18:41:05 UTC, 0 replies.