- [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/01 18:21:50 UTC, 12 replies.
- [jira] Created: (TIKA-453) Conflicting Estonian language profile code to ISO 639 - posted by "Janno Veldemann (JIRA)" <ji...@apache.org> on 2010/07/02 15:50:50 UTC, 0 replies.
- [jira] Created: (TIKA-454) Illegal Charset Name crashes HTMLParser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/02 16:54:50 UTC, 0 replies.
- [jira] Updated: (TIKA-454) Illegal Charset Name crashes HTMLParser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/02 17:02:51 UTC, 0 replies.
- [jira] Assigned: (TIKA-454) Illegal Charset Name crashes HTMLParser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/02 17:34:49 UTC, 0 replies.
- [jira] Assigned: (TIKA-408) Word 6.0/7.0 documents support in office parser - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/02 23:51:56 UTC, 0 replies.
- [jira] Commented: (TIKA-408) Word 6.0/7.0 documents support in office parser - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/02 23:54:50 UTC, 0 replies.
- [jira] Commented: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/02 23:56:49 UTC, 0 replies.
- [jira] Resolved: (TIKA-315) Tika appears to skip over an entire section of a Microsoft Word Document - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/02 23:56:50 UTC, 0 replies.
- [jira] Commented: (TIKA-212) Do you have Tika in .NET? - posted by "Kevin Miller (JIRA)" <ji...@apache.org> on 2010/07/04 00:30:50 UTC, 0 replies.
- [jira] Updated: (TIKA-402) Support for iWork documents - posted by "Martijn van Groningen (JIRA)" <ji...@apache.org> on 2010/07/04 21:02:49 UTC, 1 replies.
- [jira] Closed: (TIKA-454) Illegal Charset Name crashes HTMLParser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/05 10:51:50 UTC, 0 replies.
- [jira] Created: (TIKA-455) Zip parser stuck on truncated zip files. - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/05 14:08:52 UTC, 0 replies.
- [jira] Updated: (TIKA-455) Zip parser stuck on truncated zip files. - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/07/05 14:08:54 UTC, 0 replies.
- [jira] Commented: (TIKA-455) Zip parser stuck on truncated zip files. - posted by "Stefan Bodewig (JIRA)" <ji...@apache.org> on 2010/07/05 14:32:50 UTC, 0 replies.
- [jira] Created: (TIKA-456) Support timeouts for parsers - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/05 22:42:51 UTC, 0 replies.
- [jira] Commented: (TIKA-456) Support timeouts for parsers - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/05 22:48:54 UTC, 3 replies.
- [jira] Resolved: (TIKA-455) Zip parser stuck on truncated zip files. - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/07/06 13:50:49 UTC, 1 replies.
- [jira] Reopened: (TIKA-455) Zip parser stuck on truncated zip files. - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/07/06 13:50:50 UTC, 0 replies.
- [jira] Resolved: (TIKA-402) Support for iWork documents - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/07/06 14:19:50 UTC, 1 replies.
- [jira] Commented: (TIKA-402) Support for iWork documents - posted by "Martijn van Groningen (JIRA)" <ji...@apache.org> on 2010/07/06 14:39:50 UTC, 4 replies.
- [jira] Commented: (TIKA-292) PDFBox is too verbose - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/07/06 14:49:49 UTC, 0 replies.
- [jira] Reopened: (TIKA-446) Upgrade to PDFBox 1.2.0 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/07/06 14:57:49 UTC, 0 replies.
- Hudson build became unstable: Tika-trunk » Apache Tika parsers #313 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/06 15:01:02 UTC, 0 replies.
- Hudson build became unstable: Tika-trunk #313 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/06 15:01:04 UTC, 0 replies.
- [jira] Reopened: (TIKA-402) Support for iWork documents - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/07/06 15:19:49 UTC, 0 replies.
- Re: Hudson build became unstable: Tika-trunk » Apache Tika parsers #313 - posted by Jukka Zitting <ju...@gmail.com> on 2010/07/06 15:19:50 UTC, 0 replies.
- Hudson build is back to stable : Tika-trunk » Apache Tika parsers #314 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/06 16:26:47 UTC, 0 replies.
- Hudson build is back to stable : Tika-trunk #314 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/06 16:26:50 UTC, 0 replies.
- Re: [jira] Commented: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED - posted by Oleg Tikhonov <ol...@gmail.com> on 2010/07/06 20:54:04 UTC, 0 replies.
- [jira] Assigned: (TIKA-451) Inconsistent date format for Metadata.CREATION_DATE and Metadata.LAST_MODIFIED - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/06 22:21:52 UTC, 0 replies.
- buildbot success in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2010/07/07 09:17:48 UTC, 2 replies.
- Tika 0.7 And Solr - posted by rohanpatil <ro...@gmail.com> on 2010/07/07 13:01:30 UTC, 2 replies.
- Re: Getting started - posted by Arturo Beltran <ar...@uji.es> on 2010/07/07 13:25:46 UTC, 9 replies.
- Specify HTMLHandler via Context - posted by Julien Nioche <li...@gmail.com> on 2010/07/07 17:08:05 UTC, 1 replies.
- [jira] Created: (TIKA-457) HTMLParser gets an early event - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/07 17:28:50 UTC, 0 replies.
- [jira] Created: (TIKA-458) Specify HTMLHandler via Context - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/07 17:30:50 UTC, 0 replies.
- [jira] Updated: (TIKA-458) Specify HTMLHandler via Context - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/07 17:30:50 UTC, 0 replies.
- [jira] Commented: (TIKA-457) HTMLParser gets an early event - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/07 22:30:52 UTC, 0 replies.
- [jira] Commented: (TIKA-458) Specify HTMLHandler via Context - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/07/08 00:11:52 UTC, 0 replies.
- [jira] Closed: (TIKA-359) Calls to Charset.isSupported() will throw exceptions for invalid charset names - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/08 01:36:50 UTC, 0 replies.
- [jira] Created: (TIKA-459) Improve handling of incorrect charset names in HTTP response header - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/08 02:08:50 UTC, 0 replies.
- [jira] Updated: (TIKA-459) Improve handling of incorrect charset names in HTTP response header - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/08 02:16:50 UTC, 0 replies.
- [jira] Commented: (TIKA-459) Improve handling of incorrect charset names in HTTP response header - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/08 03:09:50 UTC, 0 replies.
- [jira] Created: (TIKA-460) HTMLHandler misses treatment of A elements - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/08 15:54:50 UTC, 0 replies.
- [jira] Updated: (TIKA-460) HTMLHandler misses treatment of A elements - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/08 15:56:49 UTC, 0 replies.
- [jira] Created: (TIKA-461) RFC822 messages not parsed - posted by "Joshua Turner (JIRA)" <ji...@apache.org> on 2010/07/08 16:45:49 UTC, 0 replies.
- buildbot failure in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2010/07/08 19:56:07 UTC, 15 replies.
- [jira] Resolved: (TIKA-459) Improve handling of incorrect charset names in HTTP response header - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/08 19:58:50 UTC, 0 replies.
- [jira] Updated: (TIKA-453) Conflicting Estonian language profile code to ISO 639 - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/09 23:20:50 UTC, 1 replies.
- [jira] Resolved: (TIKA-453) Conflicting Estonian language profile code to ISO 639 - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/09 23:27:50 UTC, 0 replies.
- [jira] Created: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/09 23:39:58 UTC, 0 replies.
- [jira] Assigned: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/09 23:40:00 UTC, 0 replies.
- [jira] Updated: (TIKA-420) [PATCH] Integration of boilerpipe: Boilerplate Removal and Fulltext Extraction from HTML pages - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/10 02:18:51 UTC, 0 replies.
- TIKA-420 patch for boilerplate removal - posted by Ken Krugler <kk...@transpac.com> on 2010/07/10 02:23:27 UTC, 0 replies.
- [jira] Updated: (TIKA-446) Upgrade to PDFBox 1.2.1 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/07/11 06:57:49 UTC, 0 replies.
- [jira] Resolved: (TIKA-446) Upgrade to PDFBox 1.2.1 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/07/11 07:03:07 UTC, 0 replies.
- [jira] Assigned: (TIKA-394) Missing spaces on html parsing - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/11 23:34:50 UTC, 0 replies.
- [jira] Commented: (TIKA-394) Missing spaces on html parsing - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/11 23:39:50 UTC, 0 replies.
- Packages and attributes - posted by Paul Jakubik <pa...@purediscovery.com> on 2010/07/12 17:03:41 UTC, 14 replies.
- Boilerpipe integration - posted by Ken Krugler <kk...@transpac.com> on 2010/07/12 19:34:28 UTC, 0 replies.
- [jira] Resolved: (TIKA-420) [PATCH] Integration of boilerpipe: Boilerplate Removal and Fulltext Extraction from HTML pages - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/12 19:34:51 UTC, 0 replies.
- [jira] Created: (TIKA-463) HtmlParser doesn't extract links from img, map, object, frame, iframe, area, link - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/12 22:15:49 UTC, 0 replies.
- [jira] Commented: (TIKA-463) HtmlParser doesn't extract links from img, map, object, frame, iframe, area, link - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/12 22:47:53 UTC, 6 replies.
- [jira] Commented: (TIKA-460) HTMLHandler misses treatment of A elements - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/13 13:33:49 UTC, 0 replies.
- [jira] Created: (TIKA-464) Contribute a "get Tika parsing up and running in 5 minutes" quick start guide - posted by "Arturo Beltran (JIRA)" <ji...@apache.org> on 2010/07/14 12:41:53 UTC, 0 replies.
- [jira] Created: (TIKA-465) LanguageIdentifier API enhancements - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/14 19:40:51 UTC, 0 replies.
- [jira] Assigned: (TIKA-465) LanguageIdentifier API enhancements - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/14 23:26:53 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #319 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/15 01:17:22 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk » Apache Tika parsers #319 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/15 01:17:22 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk » Apache Tika parsers #320 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/15 12:03:52 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #320 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/15 12:03:55 UTC, 0 replies.
- Hudson build is back to normal : Tika-trunk » Apache Tika parsers #321 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/15 13:09:52 UTC, 0 replies.
- Hudson build is back to normal : Tika-trunk #321 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/15 13:09:54 UTC, 0 replies.
- MediaType.getParameters return type - posted by Ken Krugler <kk...@transpac.com> on 2010/07/15 23:59:52 UTC, 0 replies.
- [jira] Updated: (TIKA-464) Contribute a "get Tika parsing up and running in 5 minutes" quick start guide - posted by "Arturo Beltran (JIRA)" <ji...@apache.org> on 2010/07/16 13:10:50 UTC, 0 replies.
- [jira] Created: (TIKA-466) Feed Parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/16 13:22:51 UTC, 0 replies.
- [jira] Updated: (TIKA-466) Feed Parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/16 13:22:51 UTC, 0 replies.
- [jira] Assigned: (TIKA-466) Feed Parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/16 17:39:51 UTC, 0 replies.
- [jira] Assigned: (TIKA-464) Contribute a "get Tika parsing up and running in 5 minutes" quick start guide - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/16 17:50:52 UTC, 0 replies.
- [jira] Resolved: (TIKA-464) Contribute a "get Tika parsing up and running in 5 minutes" quick start guide - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/16 19:40:51 UTC, 0 replies.
- [jira] Resolved: (TIKA-466) Feed Parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/16 19:53:50 UTC, 0 replies.
- [jira] Updated: (TIKA-463) HtmlParser doesn't extract links from img, map, object, frame, iframe, area, link - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/19 10:15:53 UTC, 1 replies.
- [jira] Commented: (TIKA-147) Add Flash parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/19 16:57:54 UTC, 0 replies.
- Broken link in Tika mainpage - posted by André Ricardo <an...@gmail.com> on 2010/07/19 18:34:22 UTC, 1 replies.
- [jira] Created: (TIKA-467) Link to 5min quick start parser guide wrong - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/20 04:42:51 UTC, 0 replies.
- [jira] Resolved: (TIKA-467) Link to 5min quick start parser guide wrong - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/20 04:44:52 UTC, 0 replies.
- [jira] Updated: (TIKA-358) Auto-detection of HTML fails with common auto-generated template - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/20 04:48:51 UTC, 0 replies.
- [jira] Updated: (TIKA-405) Problems handling Hyperlinks and Tables in Word 97 Docs - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/20 04:50:50 UTC, 0 replies.
- [jira] Updated: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/20 04:50:51 UTC, 0 replies.
- [jira] Updated: (TIKA-456) Support timeouts for parsers - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/20 04:50:51 UTC, 0 replies.
- [jira] Commented: (TIKA-466) Feed Parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/20 20:37:51 UTC, 0 replies.
- [jira] Closed: (TIKA-466) Feed Parser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/07/20 20:37:56 UTC, 0 replies.
- [jira] Created: (TIKA-468) Missing Silde-Count metadata of PPT files - posted by "Łukasz Wiktor (JIRA)" <ji...@apache.org> on 2010/07/21 14:46:51 UTC, 0 replies.
- [jira] Updated: (TIKA-468) Missing Silde-Count metadata for PPT files - posted by "Łukasz Wiktor (JIRA)" <ji...@apache.org> on 2010/07/21 14:51:50 UTC, 0 replies.
- [jira] Created: (TIKA-469) The Parser is not correctly outputting Arabic text documents - posted by "Robert Cullen (JIRA)" <ji...@apache.org> on 2010/07/22 16:52:51 UTC, 0 replies.
- [jira] Created: (TIKA-470) Tika App command line option to list the registered parsers and their supported mime types - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/22 18:30:50 UTC, 0 replies.
- [jira] Commented: (TIKA-470) Tika App command line option to list the registered parsers and their supported mime types - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/07/22 22:07:49 UTC, 0 replies.
- [jira] Created: (TIKA-471) Avoid Charset name bottleneck when multiple threads are using HtmlParser - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/07/24 03:36:50 UTC, 0 replies.
- [jira] Resolved: (TIKA-470) Tika App command line option to list the registered parsers and their supported mime types - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/26 14:21:50 UTC, 0 replies.
- [jira] Commented: (TIKA-447) Container aware mimetype detection - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/28 15:59:16 UTC, 3 replies.
- Build failed in Hudson: Tika-trunk » Apache Tika parsers #327 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/29 16:03:29 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #327 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2010/07/29 16:03:31 UTC, 0 replies.
- [jira] Commented: (TIKA-391) Intermittent errors detecting xls files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/29 19:03:17 UTC, 1 replies.
- [jira] Resolved: (TIKA-391) Intermittent errors detecting xls files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/29 19:05:19 UTC, 0 replies.
- [jira] Created: (TIKA-472) Extract image title, description and author - posted by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/07/30 08:48:16 UTC, 0 replies.
- [jira] Updated: (TIKA-472) Extract image title, description and author - posted by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/07/30 08:50:16 UTC, 0 replies.
- [jira] Commented: (TIKA-424) Avoid ArrayIndexOutOfBoundsException on some mp3 files - posted by "Christophe Gourmelon (JIRA)" <ji...@apache.org> on 2010/07/30 15:32:16 UTC, 1 replies.
- [jira] Issue Comment Edited: (TIKA-424) Avoid ArrayIndexOutOfBoundsException on some mp3 files - posted by "Christophe Gourmelon (JIRA)" <ji...@apache.org> on 2010/07/30 16:33:16 UTC, 0 replies.
- Word95 and earlier versions - posted by "Bracken, Patrick" <Pa...@finra.org> on 2010/07/30 16:43:12 UTC, 1 replies.
- [jira] Resolved: (TIKA-472) Extract image title, description and author - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/31 18:12:16 UTC, 0 replies.
- [jira] Resolved: (TIKA-214) Excel Parsing Issues - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/07/31 19:03:16 UTC, 0 replies.