You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Boilerpipe is nice, but what about readability? - posted by Otis Gospodnetic <og...@yahoo.com> on 2011/01/02 19:55:50 UTC, 1 replies.
- [jira] Created: (TIKA-580) RAR Archive Support Tika - posted by "Maik (JIRA)" <ji...@apache.org> on 2011/01/03 14:16:46 UTC, 0 replies.
- [jira] Resolved: (TIKA-569) More fault-tolerant loading of parsers and detectors - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/03 19:16:45 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk » Apache Tika parent #442 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2011/01/03 20:43:40 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #442 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2011/01/03 20:43:41 UTC, 0 replies.
- [Call for Papers] ICSE Software Engineering for Cloud Computing (SECLOUD) Workshop - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/01/03 22:38:24 UTC, 1 replies.
- [jira] Created: (TIKA-581) Parser fails on files that parsed with v0.7 - posted by "Dennis Adler (JIRA)" <ji...@apache.org> on 2011/01/03 23:58:45 UTC, 0 replies.
- [jira] Updated: (TIKA-581) Parser fails on files that parsed with v0.7 - posted by "Dennis Adler (JIRA)" <ji...@apache.org> on 2011/01/04 00:00:50 UTC, 0 replies.
- [jira] Commented: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures - posted by "Dennis Adler (JIRA)" <ji...@apache.org> on 2011/01/04 01:14:45 UTC, 6 replies.
- [jira] Created: (TIKA-582) Lithuanian language identification - posted by "Žygimantas Medelis (JIRA)" <ji...@apache.org> on 2011/01/04 21:09:48 UTC, 0 replies.
- [jira] Updated: (TIKA-582) Lithuanian language identification - posted by "Žygimantas Medelis (JIRA)" <ji...@apache.org> on 2011/01/04 21:09:49 UTC, 1 replies.
- [jira] Assigned: (TIKA-582) Lithuanian language identification - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2011/01/04 22:12:50 UTC, 0 replies.
- Logging question - posted by Ken Krugler <kk...@transpac.com> on 2011/01/05 04:52:05 UTC, 3 replies.
- [jira] Closed: (TIKA-580) RAR Archive Support Tika - posted by "Maik (JIRA)" <ji...@apache.org> on 2011/01/06 15:52:49 UTC, 0 replies.
- [jira] Updated: (TIKA-375) Improve code quality metrics - posted by "Jeroen Reijn (JIRA)" <ji...@apache.org> on 2011/01/10 23:25:45 UTC, 0 replies.
- [jira] Resolved: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2011/01/12 16:20:46 UTC, 0 replies.
- Hudson build is back to normal : Tika-trunk » Apache Tika parent #443 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2011/01/13 10:59:40 UTC, 0 replies.
- Hudson build is back to normal : Tika-trunk #443 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2011/01/13 10:59:42 UTC, 0 replies.
- [jira] Created: (TIKA-583) Tika 0.8 line break removal is faulty (misses space when concatenating lines) for PDF file - posted by "Dennis Adler (JIRA)" <ji...@apache.org> on 2011/01/14 02:12:45 UTC, 0 replies.
- [jira] Updated: (TIKA-583) Tika 0.8 line break removal is faulty (misses space when concatenating lines) for PDF file - posted by "Dennis Adler (JIRA)" <ji...@apache.org> on 2011/01/14 02:14:45 UTC, 0 replies.
- [jira] Commented: (TIKA-583) Tika 0.8 line break removal is faulty (misses space when concatenating lines) for PDF file - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2011/01/14 17:51:45 UTC, 1 replies.
- [jira] Created: (TIKA-584) Tika parse of some PDF files removes all spaces between words - posted by "Ajay Vohra (JIRA)" <ji...@apache.org> on 2011/01/15 14:11:45 UTC, 0 replies.
- [jira] Commented: (TIKA-584) Tika parse of some PDF files removes all spaces between words - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2011/01/15 20:17:45 UTC, 1 replies.
- [jira] Updated: (TIKA-584) Tika parse of some PDF files removes all spaces between words - posted by "Ajay Vohra (JIRA)" <ji...@apache.org> on 2011/01/17 04:45:43 UTC, 0 replies.
- [jira] Created: (TIKA-585) AudioParser Fails with NPE on fileFormat.properties - posted by "Cyriel Vringer (JIRA)" <ji...@apache.org> on 2011/01/17 15:47:44 UTC, 0 replies.
- [jira] Created: (TIKA-586) Parsing a ms access file (*.mdb) throws an error - posted by "Martijn van Groningen (JIRA)" <ji...@apache.org> on 2011/01/17 22:12:00 UTC, 0 replies.
- [jira] Updated: (TIKA-586) Parsing a ms access file (*.mdb) throws an error - posted by "Martijn van Groningen (JIRA)" <ji...@apache.org> on 2011/01/17 22:13:45 UTC, 0 replies.
- [jira] Commented: (TIKA-586) Parsing a ms access file (*.mdb) throws an error - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2011/01/18 15:30:43 UTC, 0 replies.
- [jira] Resolved: (TIKA-586) Parsing a ms access file (*.mdb) throws an error - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2011/01/18 15:30:44 UTC, 0 replies.
- [jira] Resolved: (TIKA-416) Out-of-process text extraction - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/18 16:34:44 UTC, 0 replies.
- [jira] Issue Comment Edited: (TIKA-416) Out-of-process text extraction - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/18 16:36:43 UTC, 0 replies.
- [jira] Commented: (TIKA-416) Out-of-process text extraction - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2011/01/18 16:50:43 UTC, 0 replies.
- [jira] Created: (TIKA-587) NullPointerException in OutlookExtractor on missing chunks - posted by "Tom Klonikowski (JIRA)" <ji...@apache.org> on 2011/01/18 17:18:45 UTC, 0 replies.
- [jira] Commented: (TIKA-567) Temporary file leak in TikaInputStream - posted by "David Benson (JIRA)" <ji...@apache.org> on 2011/01/18 22:43:43 UTC, 6 replies.
- [jira] Commented: (TIKA-548) PDF content extracted as single line - posted by "Paul Pearcy (JIRA)" <ji...@apache.org> on 2011/01/18 23:28:44 UTC, 0 replies.
- [jira] Updated: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures - posted by "Dennis Adler (JIRA)" <ji...@apache.org> on 2011/01/19 03:22:43 UTC, 0 replies.
- [jira] Resolved: (TIKA-583) Tika 0.8 line break removal is faulty (misses space when concatenating lines) for PDF file - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 11:42:45 UTC, 0 replies.
- [jira] Resolved: (TIKA-567) Temporary file leak in TikaInputStream - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 13:42:44 UTC, 0 replies.
- [jira] Resolved: (TIKA-587) NullPointerException in OutlookExtractor on missing chunks - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 13:46:45 UTC, 0 replies.
- [jira] Resolved: (TIKA-585) AudioParser Fails with NPE on fileFormat.properties - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 13:54:46 UTC, 0 replies.
- [jira] Resolved: (TIKA-584) Tika parse of some PDF files removes all spaces between words - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 13:58:45 UTC, 0 replies.
- [jira] Commented: (TIKA-375) Improve code quality metrics - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 14:08:45 UTC, 0 replies.
- [jira] Resolved: (TIKA-582) Lithuanian language identification - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 14:18:46 UTC, 0 replies.
- [jira] Commented: (TIKA-576) OutofMemory issues while building Tika - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 14:24:45 UTC, 0 replies.
- [jira] Commented: (TIKA-581) Parser fails on files that parsed with v0.7 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 14:48:43 UTC, 0 replies.
- [jira] Resolved: (TIKA-578) XMLParser ContentHandler: multiple endDocument calls - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 15:04:44 UTC, 0 replies.
- [jira] Resolved: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/19 15:12:45 UTC, 0 replies.
- [jira] Reopened: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures - posted by "Dennis Adler (JIRA)" <ji...@apache.org> on 2011/01/19 22:26:43 UTC, 0 replies.
- [jira] Created: (TIKA-588) MIME detection for iWork documents returns application/zip - posted by "Alexander Chow (JIRA)" <ji...@apache.org> on 2011/01/20 17:01:44 UTC, 0 replies.
- [jira] Updated: (TIKA-588) MIME detection for iWork documents returns application/zip - posted by "Alexander Chow (JIRA)" <ji...@apache.org> on 2011/01/20 17:01:45 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #457 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2011/01/21 15:06:19 UTC, 1 replies.
- Hudson build is back to normal : Tika-trunk #458 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2011/01/21 15:55:01 UTC, 0 replies.
- [jira] Created: (TIKA-589) NPE with POI when parsing word docs - posted by "John Wang (JIRA)" <ji...@apache.org> on 2011/01/26 09:52:43 UTC, 0 replies.
- [jira] Commented: (TIKA-589) NPE with POI when parsing word docs - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2011/01/26 13:45:44 UTC, 1 replies.
- [jira] Commented: (TIKA-588) MIME detection for iWork documents returns application/zip - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2011/01/27 18:54:46 UTC, 1 replies.
- [jira] Resolved: (TIKA-588) MIME detection for iWork documents returns application/zip - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2011/01/27 18:56:43 UTC, 0 replies.
- [jira] Created: (TIKA-590) Create facility for deeper introspection of media files - posted by "Andre-John Mas (JIRA)" <ji...@apache.org> on 2011/01/28 22:44:45 UTC, 0 replies.
- [jira] Closed: (TIKA-589) NPE with POI when parsing word docs - posted by "John Wang (JIRA)" <ji...@apache.org> on 2011/01/29 20:03:43 UTC, 0 replies.
- [jira] Commented: (TIKA-590) Create facility for deeper introspection of media files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2011/01/29 20:11:43 UTC, 2 replies.
- [jira] Created: (TIKA-591) Separate launcer process for forking JVMs - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2011/01/31 11:01:15 UTC, 0 replies.