You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] Created: (TIKA-316) Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length) - posted by "Mike Hays (JIRA)" <ji...@apache.org> on 2009/11/03 22:07:32 UTC, 0 replies.
- [jira] Updated: (TIKA-316) Parsing Visio diagrams with tika-app causes TikaException (Found a chunk with a negative length) - posted by "Mike Hays (JIRA)" <ji...@apache.org> on 2009/11/03 22:11:32 UTC, 2 replies.
- Free live video streaming of ApacheCon US 2009 - posted by Michael McCandless <lu...@mikemccandless.com> on 2009/11/04 14:25:25 UTC, 1 replies.
- [jira] Reopened: (TIKA-309) Mime type application/rdf+xml not correctly detected - posted by "Yuan-Fang Li (JIRA)" <ji...@apache.org> on 2009/11/05 01:03:32 UTC, 1 replies.
- [jira] Created: (TIKA-317) Annotation-based Tika configuration - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/07 01:59:32 UTC, 0 replies.
- [jira] Commented: (TIKA-317) Annotation-based Tika configuration - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/07 02:05:32 UTC, 4 replies.
- [jira] Assigned: (TIKA-314) Initial support for JPEG EXIF metadata extraction - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/07 05:21:41 UTC, 1 replies.
- [jira] Resolved: (TIKA-314) Initial support for JPEG EXIF metadata extraction - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/07 05:23:41 UTC, 0 replies.
- 0.5 release - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2009/11/07 05:25:37 UTC, 0 replies.
- [jira] Commented: (TIKA-309) Mime type application/rdf+xml not correctly detected - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/07 05:27:41 UTC, 4 replies.
- [jira] Assigned: (TIKA-309) Mime type application/rdf+xml not correctly detected - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/07 05:27:41 UTC, 0 replies.
- [jira] Resolved: (TIKA-275) Parse context - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/07 05:31:43 UTC, 0 replies.
- [jira] Resolved: (TIKA-209) Language detection is weak. - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/07 05:35:41 UTC, 0 replies.
- [jira] Updated: (TIKA-317) Annotation-based Tika configuration - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/07 05:59:32 UTC, 0 replies.
- [jira] Created: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13 - posted by "Attila Király (JIRA)" <ji...@apache.org> on 2009/11/07 18:31:32 UTC, 0 replies.
- [jira] Updated: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/07 19:03:32 UTC, 1 replies.
- [jira] Updated: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/07 19:03:32 UTC, 0 replies.
- [jira] Updated: (TIKA-298) CompositeParser.getParser() should use mimetype hierarchy when falling back - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/07 19:03:32 UTC, 0 replies.
- [jira] Commented: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13 - posted by "Benson Margulies (JIRA)" <ji...@apache.org> on 2009/11/07 19:59:32 UTC, 0 replies.
- [jira] Commented: (TIKA-94) Speech recognition - posted by "David Woollard (JIRA)" <ji...@apache.org> on 2009/11/09 22:28:32 UTC, 0 replies.
- Tika facade - static or not - posted by Jukka Zitting <ju...@gmail.com> on 2009/11/11 19:21:35 UTC, 8 replies.
- Parse context - class or map? - posted by Jukka Zitting <ju...@gmail.com> on 2009/11/11 19:33:00 UTC, 5 replies.
- [jira] Created: (TIKA-319) HtmlParser - use encoding hint only if charset is supported - posted by "Piotr B. (JIRA)" <ji...@apache.org> on 2009/11/12 09:53:39 UTC, 0 replies.
- [jira] Created: (TIKA-320) Allow disabling language detection in AutoDetectParser - posted by "Erik Hetzner (JIRA)" <ji...@apache.org> on 2009/11/12 20:06:39 UTC, 0 replies.
- [jira] Resolved: (TIKA-320) Allow disabling language detection in AutoDetectParser - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/13 04:20:39 UTC, 0 replies.
- [jira] Resolved: (TIKA-319) HtmlParser - use encoding hint only if charset is supported - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/13 04:51:39 UTC, 0 replies.
- [jira] Resolved: (TIKA-318) Upgrade nekohtml dependency from 1.9.9 to 1.9.13 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/13 04:53:39 UTC, 0 replies.
- [jira] Created: (TIKA-321) Optimize type detection speed - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/13 05:03:39 UTC, 0 replies.
- [jira] Created: (TIKA-322) Improve encoding detection speed and accuracy - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/13 05:11:39 UTC, 0 replies.
- [jira] Commented: (TIKA-271) secure-processing not supported by some JAXP implementations - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2009/11/13 17:59:39 UTC, 1 replies.
- [jira] Issue Comment Edited: (TIKA-309) Mime type application/rdf+xml not correctly detected - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/13 18:17:39 UTC, 2 replies.
- [jira] Resolved: (TIKA-313) patch: ODF improvements for svg:desc, presentation notes - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/13 18:21:39 UTC, 0 replies.
- [jira] Resolved: (TIKA-309) Mime type application/rdf+xml not correctly detected - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/13 23:40:39 UTC, 1 replies.
- Hudson build became unstable: Tika-trunk » Apache Tika parsers #213 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 00:06:39 UTC, 0 replies.
- Hudson build became unstable: Tika-trunk #213 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 00:06:41 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk » Apache Tika parsers #214 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 01:08:05 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk #214 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 01:08:08 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk » Apache Tika parsers #215 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 04:07:50 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk #215 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 04:07:54 UTC, 0 replies.
- Build Unstable - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2009/11/14 04:41:58 UTC, 0 replies.
- Hudson build is back to stable: Tika-trunk » Apache Tika parsers #216 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 05:07:01 UTC, 0 replies.
- Hudson build is back to stable: Tika-trunk #216 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 05:07:04 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk » Apache Tika parent #217 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 06:01:37 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #217 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 06:01:38 UTC, 0 replies.
- Hudson build is back to normal: Tika-trunk » Apache Tika parent #218 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 07:07:29 UTC, 0 replies.
- Hudson build is back to normal: Tika-trunk #218 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/14 07:07:33 UTC, 0 replies.
- [jira] Assigned: (TIKA-209) Language detection is weak. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/14 17:26:39 UTC, 0 replies.
- [jira] Created: (TIKA-323) Make Tika site look like Lucene ecosystem Apache Forrest-built sites - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/14 20:02:39 UTC, 0 replies.
- [VOTE] Apache Tika 0.5 release candidate #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2009/11/14 20:27:20 UTC, 6 replies.
- [jira] Updated: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2009/11/15 18:53:39 UTC, 3 replies.
- [jira] Created: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2009/11/15 18:53:39 UTC, 0 replies.
- [jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2009/11/15 18:57:39 UTC, 3 replies.
- [jira] Issue Comment Edited: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2009/11/15 19:01:39 UTC, 1 replies.
- [jira] Commented: (TIKA-322) Improve encoding detection speed and accuracy - posted by "Luke Nezda (JIRA)" <ji...@apache.org> on 2009/11/15 19:13:39 UTC, 0 replies.
- [jira] Updated: (TIKA-325) tika-parent/pom.xml missing 2007 - posted by "Luke Nezda (JIRA)" <ji...@apache.org> on 2009/11/15 19:53:39 UTC, 1 replies.
- [jira] Created: (TIKA-325) tika-parent/pom.xml missing 2007 - posted by "Luke Nezda (JIRA)" <ji...@apache.org> on 2009/11/15 19:53:39 UTC, 0 replies.
- [jira] Commented: (TIKA-320) Allow disabling language detection in AutoDetectParser - posted by "Erik Hetzner (JIRA)" <ji...@apache.org> on 2009/11/16 20:50:39 UTC, 0 replies.
- [jira] Created: (TIKA-326) Map javax.imageio.IIOException to TikaException - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/17 14:26:39 UTC, 0 replies.
- [jira] Resolved: (TIKA-326) Map javax.imageio.IIOException to TikaException - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/17 14:32:39 UTC, 0 replies.
- [jira] Updated: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X) - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/17 15:51:39 UTC, 2 replies.
- [jira] Commented: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X) - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/17 16:07:39 UTC, 5 replies.
- [jira] Resolved: (TIKA-325) tika-parent/pom.xml missing 2007 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/17 16:47:39 UTC, 0 replies.
- [jira] Created: (TIKA-327) Parsing "HTML" as DcXML - posted by "Erik Hetzner (JIRA)" <ji...@apache.org> on 2009/11/18 02:48:39 UTC, 0 replies.
- [jira] Updated: (TIKA-327) Parsing "HTML" as DcXML - posted by "Erik Hetzner (JIRA)" <ji...@apache.org> on 2009/11/18 02:48:39 UTC, 0 replies.
- [jira] Created: (TIKA-328) Add parser for .flv videos - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/11/19 23:41:39 UTC, 0 replies.
- [jira] Updated: (TIKA-328) Add parser for .flv videos - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/11/19 23:43:39 UTC, 0 replies.
- [RESULT] [VOTE] Apache Tika 0.5 release candidate #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2009/11/20 03:53:59 UTC, 3 replies.
- [jira] Created: (TIKA-329) secure-processing not supported by some JAXP implementations (2) - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2009/11/20 11:31:40 UTC, 0 replies.
- [jira] Updated: (TIKA-329) secure-processing not supported by some JAXP implementations (2) - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2009/11/20 11:35:39 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 0.5 Released - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2009/11/22 16:50:47 UTC, 6 replies.
- Build failed in Hudson: Tika-trunk #226 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/22 18:00:59 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #227 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/22 19:00:54 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #228 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/22 20:00:56 UTC, 5 replies.
- Build failed in Hudson: Tika-trunk #229 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/22 21:01:22 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #230 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/23 01:17:10 UTC, 0 replies.
- Hudson build is back to normal: Tika-trunk #231 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/11/23 03:16:57 UTC, 0 replies.
- [jira] Created: (TIKA-330) Better HWP (Hangul Word Processor) detection pattern - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/23 12:27:39 UTC, 0 replies.
- [jira] Resolved: (TIKA-330) Better HWP (Hangul Word Processor) detection pattern - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/23 12:29:39 UTC, 0 replies.
- [jira] Created: (TIKA-331) Windings font recognition in Tika parsing + spacing issue - posted by "MRIT64 (JIRA)" <ji...@apache.org> on 2009/11/24 19:42:39 UTC, 0 replies.
- [jira] Updated: (TIKA-331) Windings font recognition in Tika parsing + spacing issue - posted by "MRIT64 (JIRA)" <ji...@apache.org> on 2009/11/24 19:58:40 UTC, 1 replies.
- [jira] Commented: (TIKA-331) Windings font recognition in Tika parsing + spacing issue - posted by "MRIT64 (JIRA)" <ji...@apache.org> on 2009/11/24 20:11:39 UTC, 1 replies.
- [jira] Created: (TIKA-332) Use http-equiv meta tag charset info when processing HTML documents - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/11/25 18:50:39 UTC, 0 replies.
- [jira] Created: (TIKA-333) Improve accuracy of charset detection for HTML pages - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/11/25 18:54:42 UTC, 0 replies.
- [jira] Closed: (TIKA-333) Improve accuracy of charset detection for HTML pages - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/11/25 19:38:39 UTC, 0 replies.
- [jira] Commented: (TIKA-332) Use http-equiv meta tag charset info when processing HTML documents - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/11/25 19:38:39 UTC, 0 replies.
- [jira] Updated: (TIKA-332) Use http-equiv meta tag charset info when processing HTML documents - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/11/25 19:40:39 UTC, 0 replies.
- [jira] Created: (TIKA-334) HtmlParser should use CharsetDetector whenever no charset is specified via meta http-equiv tag - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/11/25 19:42:39 UTC, 0 replies.
- [jira] Created: (TIKA-335) TXTParser use of CharsetDetector has several bugs - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/11/25 19:48:39 UTC, 0 replies.
- [jira] Updated: (TIKA-335) TXTParser should use incoming charset - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/11/25 19:50:39 UTC, 1 replies.
- Missing href attribute handling - posted by Ken Krugler <kk...@transpac.com> on 2009/11/25 20:12:36 UTC, 0 replies.
- [jira] Updated: (TIKA-334) HtmlParser should use CharsetDetector whenever no charset is specified via meta http-equiv tag - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/11/25 20:53:39 UTC, 0 replies.
- [jira] Created: (TIKA-336) More issues with RDF mime detection - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/25 23:51:40 UTC, 0 replies.
- [jira] Resolved: (TIKA-336) More issues with RDF mime detection - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2009/11/26 00:45:39 UTC, 0 replies.
- [jira] Created: (TIKA-337) SWF parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2009/11/27 11:21:39 UTC, 0 replies.
- [jira] Updated: (TIKA-337) SWF parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2009/11/27 11:23:39 UTC, 1 replies.
- [jira] Resolved: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X) - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/27 16:12:20 UTC, 0 replies.
- [jira] Created: (TIKA-338) Trying to use -encoding parameter alwyas results in an exception - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2009/11/27 18:16:20 UTC, 0 replies.
- [jira] Closed: (TIKA-338) Trying to use -encoding parameter alwyas results in an exception - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2009/11/27 18:25:22 UTC, 0 replies.
- [jira] Commented: (TIKA-338) Trying to use -encoding parameter alwyas results in an exception - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2009/11/27 18:25:22 UTC, 0 replies.
- [jira] Issue Comment Edited: (TIKA-324) Tika CLI mangles utf-8 content in text (-t) mode (on Mac OS X) - posted by "Peter Wolanin (JIRA)" <ji...@apache.org> on 2009/11/27 18:27:21 UTC, 0 replies.
- hi〗 - posted by katrina hollow <ka...@hotmail.com> on 2009/11/29 09:47:16 UTC, 0 replies.
- [jira] Resolved: (TIKA-337) SWF parser - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/30 00:57:20 UTC, 0 replies.
- [jira] Commented: (TIKA-147) Add Flash parser - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/30 00:59:20 UTC, 1 replies.
- [jira] Commented: (TIKA-335) TXTParser should use incoming charset - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/30 02:23:20 UTC, 0 replies.
- [jira] Resolved: (TIKA-334) HtmlParser should use CharsetDetector whenever no charset is specified via meta http-equiv tag - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/30 02:43:20 UTC, 0 replies.
- [jira] Commented: (TIKA-328) Add parser for .flv videos - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/30 02:45:20 UTC, 0 replies.
- [jira] Resolved: (TIKA-329) secure-processing not supported by some JAXP implementations (2) - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/30 02:49:20 UTC, 0 replies.
- [jira] Commented: (TIKA-327) Parsing "HTML" as DcXML - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/11/30 02:55:20 UTC, 0 replies.