- Tika 1.1 release - posted by Daniel Malmer <da...@markit.com> on 2012/03/01 19:01:56 UTC, 2 replies.
- [jira] [Commented] (TIKA-863) MailContentHandler should not create AutoDetectParser on each call - posted by "Andrzej Bialecki (Commented) (JIRA)" <ji...@apache.org> on 2012/03/02 11:07:59 UTC, 0 replies.
- Fwd: Google Summer of Code 2012 upcoming - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/03/04 18:57:18 UTC, 0 replies.
- [jira] [Updated] (TIKA-861) Parse links in PDF - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:26:57 UTC, 0 replies.
- [jira] [Updated] (TIKA-868) TXT parser does not honour the specified encoding - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:26:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-715) Some parsers produce non-well-formed XHTML SAX events - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:26:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-817) (PPT/PPTX) Missing date/time in text content. - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:26:58 UTC, 0 replies.
- [jira] [Updated] (TIKA-757) Address TODOs when we upgrade to next POI release (3.8 beta 5) - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:26:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-776) ExifTool Embedder - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:26:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-747) Ogg Vorbis and FLAC Parsers - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:26:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-605) Tika GDAL parser - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:26:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-758) Address TODOs when we upgrade to next PDFBox release - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-820) Locator is unset for HTML parser - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-816) (XLS/XLSX) Improperly formatted date/time in text content. - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-539) Encoding detection is too biased by encoding in meta tag - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:02 UTC, 0 replies.
- [jira] [Updated] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:03 UTC, 0 replies.
- [jira] [Updated] (TIKA-754) Automatic line break insertion (BR element) instead of '\n' in XHTMLContentHandler - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:03 UTC, 0 replies.
- [jira] [Updated] (TIKA-775) Embed Capabilities - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:03 UTC, 0 replies.
- [jira] [Updated] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-774) ExifTool Parser - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:04 UTC, 0 replies.
- [jira] [Updated] (TIKA-859) DublinCore Metadata Keys Should be Prefixed and Property Objects - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:04 UTC, 2 replies.
- [jira] [Updated] (TIKA-593) Tika network server - posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 16:27:04 UTC, 4 replies.
- buildbot failure in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2012/03/07 16:30:02 UTC, 0 replies.
- [jira] [Created] (TIKA-869) IdentityHtmlMapper.mapSafeElement() needs to return lower-cased incoming name - posted by "Ken Krugler (Created) (JIRA)" <ji...@apache.org> on 2012/03/07 20:28:56 UTC, 0 replies.
- [jira] [Updated] (TIKA-869) IdentityHtmlMapper.mapSafeElement() needs to return lower-cased incoming name - posted by "Ken Krugler (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 20:32:57 UTC, 0 replies.
- [jira] [Created] (TIKA-870) Allow to use call parseToString with a additional parameter of MaxStringLength, so it can be changed per call - posted by "Shay Banon (Created) (JIRA)" <ji...@apache.org> on 2012/03/07 20:44:57 UTC, 0 replies.
- [jira] [Assigned] (TIKA-870) Allow to use call parseToString with a additional parameter of MaxStringLength, so it can be changed per call - posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org> on 2012/03/07 20:50:57 UTC, 0 replies.
- [jira] [Commented] (TIKA-870) Allow to use call parseToString with a additional parameter of MaxStringLength, so it can be changed per call - posted by "Michael McCandless (Commented) (JIRA)" <ji...@apache.org> on 2012/03/07 20:50:57 UTC, 0 replies.
- [jira] [Updated] (TIKA-870) Allow to use call parseToString with a additional parameter of MaxStringLength, so it can be changed per call - posted by "Michael McCandless (Updated) (JIRA)" <ji...@apache.org> on 2012/03/07 22:30:58 UTC, 0 replies.
- [VOTE] Apache Tika 1.1 release rc #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/03/07 22:35:27 UTC, 9 replies.
- [jira] [Created] (TIKA-871) Text in nested groups within a pptx not parsed - posted by "Curtis Hyder (Created) (JIRA)" <ji...@apache.org> on 2012/03/09 00:57:57 UTC, 0 replies.
- [jira] [Updated] (TIKA-871) Text in nested groups within a pptx not parsed - posted by "Curtis Hyder (Updated) (JIRA)" <ji...@apache.org> on 2012/03/09 00:57:57 UTC, 1 replies.
- [jira] [Created] (TIKA-872) Tika --extract fails for RTF - posted by "Albert L. (Created) (JIRA)" <ji...@apache.org> on 2012/03/09 20:12:57 UTC, 0 replies.
- [jira] [Updated] (TIKA-872) Tika --extract fails for RTF - posted by "Albert L. (Updated) (JIRA)" <ji...@apache.org> on 2012/03/09 20:12:59 UTC, 2 replies.
- [jira] [Created] (TIKA-873) Tika --extract fails for DOC - posted by "Albert L. (Created) (JIRA)" <ji...@apache.org> on 2012/03/09 20:14:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-873) Tika --extract fails for DOC - posted by "Albert L. (Updated) (JIRA)" <ji...@apache.org> on 2012/03/09 20:16:59 UTC, 2 replies.
- [jira] [Commented] (TIKA-873) Tika --extract fails for DOC - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2012/03/09 22:12:57 UTC, 7 replies.
- [jira] [Resolved] (TIKA-870) Allow to use call parseToString with a additional parameter of MaxStringLength, so it can be changed per call - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/11 18:33:01 UTC, 0 replies.
- [jira] [Created] (TIKA-874) Identify FITS (Flexible Image Transport System) files - posted by "Peter May (Created) (JIRA)" <ji...@apache.org> on 2012/03/12 11:59:38 UTC, 0 replies.
- [jira] [Updated] (TIKA-874) Identify FITS (Flexible Image Transport System) files - posted by "Peter May (Updated) (JIRA)" <ji...@apache.org> on 2012/03/12 12:19:38 UTC, 3 replies.
- [jira] [Issue Comment Edited] (TIKA-874) Identify FITS (Flexible Image Transport System) files - posted by "Peter May (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2012/03/12 14:12:40 UTC, 1 replies.
- [jira] [Assigned] (TIKA-874) Identify FITS (Flexible Image Transport System) files - posted by "Chris A. Mattmann (Assigned) (JIRA)" <ji...@apache.org> on 2012/03/12 15:24:38 UTC, 0 replies.
- [jira] [Resolved] (TIKA-874) Identify FITS (Flexible Image Transport System) files - posted by "Chris A. Mattmann (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/12 16:08:47 UTC, 0 replies.
- [jira] [Commented] (TIKA-875) Temporary file leak in ImageParser - posted by "Niels Beekman (Commented) (JIRA)" <ji...@apache.org> on 2012/03/13 10:43:38 UTC, 1 replies.
- [jira] [Created] (TIKA-875) Temporary file leak in ImageParser - posted by "Niels Beekman (Created) (JIRA)" <ji...@apache.org> on 2012/03/13 10:43:38 UTC, 0 replies.
- [jira] [Updated] (TIKA-875) Temporary file leak in ImageParser - posted by "Niels Beekman (Updated) (JIRA)" <ji...@apache.org> on 2012/03/13 10:45:38 UTC, 0 replies.
- [jira] [Assigned] (TIKA-875) Temporary file leak in ImageParser - posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org> on 2012/03/13 14:47:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-875) Temporary file leak in ImageParser - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/13 15:01:38 UTC, 0 replies.
- Please let me join the user group - posted by prince shah <pr...@gmail.com> on 2012/03/14 00:04:55 UTC, 0 replies.
- [jira] [Created] (TIKA-876) Signed pdf parsing - posted by "Fausto Cruzeiro de Moraes (Created) (JIRA)" <ji...@apache.org> on 2012/03/14 22:24:35 UTC, 0 replies.
- [jira] [Commented] (TIKA-876) Signed pdf parsing - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2012/03/15 17:45:39 UTC, 3 replies.
- [jira] [Created] (TIKA-877) Embedded document not extracted (regression) - posted by "Daniel Bonniot de Ruisselet (Created) (JIRA)" <ji...@apache.org> on 2012/03/18 22:00:39 UTC, 0 replies.
- [jira] [Updated] (TIKA-877) Embedded document not extracted (regression) - posted by "Daniel Bonniot de Ruisselet (Updated) (JIRA)" <ji...@apache.org> on 2012/03/18 22:02:40 UTC, 0 replies.
- [jira] [Commented] (TIKA-877) Embedded document not extracted (regression) - posted by "Daniel Bonniot de Ruisselet (Commented) (JIRA)" <ji...@apache.org> on 2012/03/19 21:03:39 UTC, 11 replies.
- [jira] [Created] (TIKA-878) Reuse computed Map inside CompositeParser - posted by "Luis Filipe Nassif (Created) (JIRA)" <ji...@apache.org> on 2012/03/19 23:35:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-878) Reuse computed Map inside CompositeParser - posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org> on 2012/03/19 23:41:38 UTC, 1 replies.
- [jira] [Resolved] (TIKA-878) Reuse computed Map inside CompositeParser - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/20 13:17:39 UTC, 0 replies.
- [jira] [Assigned] (TIKA-877) Embedded document not extracted (regression) - posted by "Maxim Valyanskiy (Assigned) (JIRA)" <ji...@apache.org> on 2012/03/21 09:37:41 UTC, 0 replies.
- [jira] [Issue Comment Edited] (TIKA-877) Embedded document not extracted (regression) - posted by "Daniel Bonniot de Ruisselet (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2012/03/21 10:15:39 UTC, 0 replies.
- [jira] [Resolved] (TIKA-877) Embedded document not extracted (regression) - posted by "Maxim Valyanskiy (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/21 12:43:39 UTC, 0 replies.
- [jira] [Resolved] (TIKA-873) Tika --extract fails for DOC - posted by "Maxim Valyanskiy (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/21 13:31:40 UTC, 0 replies.
- [jira] [Created] (TIKA-879) Detection problem: message/rfc822 file is detected as text/plain. - posted by "Kostya Gribov (Created) (JIRA)" <ji...@apache.org> on 2012/03/21 14:45:39 UTC, 0 replies.
- Pluggable language detection - posted by Julien Nioche <li...@gmail.com> on 2012/03/21 16:51:54 UTC, 5 replies.
- [jira] [Created] (TIKA-880) while integrating microsoft parser it is giving error - posted by "Somenath Mukhopadhyay (Created) (JIRA)" <ji...@apache.org> on 2012/03/22 06:02:24 UTC, 1 replies.
- [jira] [Created] (TIKA-881) HtmlParser sometimes(!) throws IOException while determining Html-Encoding - posted by "Klaus v. Einem (Created) (JIRA)" <ji...@apache.org> on 2012/03/22 13:04:22 UTC, 0 replies.
- [jira] [Updated] (TIKA-881) HtmlParser sometimes(!) throws IOException while determining Html-Encoding - posted by "Klaus v. Einem (Updated) (JIRA)" <ji...@apache.org> on 2012/03/22 13:10:24 UTC, 1 replies.
- [jira] [Commented] (TIKA-593) Tika network server - posted by "Maxim Valyanskiy (Commented) (JIRA)" <ji...@apache.org> on 2012/03/22 14:10:24 UTC, 20 replies.
- [jira] [Issue Comment Edited] (TIKA-881) HtmlParser sometimes(!) throws IOException while determining Html-Encoding - posted by "Klaus v. Einem (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2012/03/22 14:16:22 UTC, 2 replies.
- [jira] [Created] (TIKA-882) IllegalArgumentException: No part found for relationship - posted by "Maxim Valyanskiy (Created) (JIRA)" <ji...@apache.org> on 2012/03/22 15:42:23 UTC, 0 replies.
- [jira] [Resolved] (TIKA-882) IllegalArgumentException: No part found for relationship - posted by "Maxim Valyanskiy (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/22 15:46:22 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #813 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/22 16:06:48 UTC, 0 replies.
- [jira] [Assigned] (TIKA-881) HtmlParser sometimes(!) throws IOException while determining Html-Encoding - posted by "Ken Krugler (Assigned) (JIRA)" <ji...@apache.org> on 2012/03/22 17:44:22 UTC, 0 replies.
- [jira] [Commented] (TIKA-881) HtmlParser sometimes(!) throws IOException while determining Html-Encoding - posted by "Ken Krugler (Commented) (JIRA)" <ji...@apache.org> on 2012/03/22 17:46:22 UTC, 0 replies.
- [jira] [Issue Comment Edited] (TIKA-593) Tika network server - posted by "Maxim Valyanskiy (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2012/03/23 08:59:44 UTC, 3 replies.
- Jenkins build is back to normal : Tika-trunk #814 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/23 10:12:55 UTC, 0 replies.
- [jira] [Created] (TIKA-883) Extract embedded images in PPT - posted by "Maxim Valyanskiy (Created) (JIRA)" <ji...@apache.org> on 2012/03/23 12:49:28 UTC, 0 replies.
- [jira] [Resolved] (TIKA-883) Extract embedded images in PPT - posted by "Maxim Valyanskiy (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/23 13:11:32 UTC, 0 replies.
- [RESULT] [VOTE] Apache Tika 1.1 release rc #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/03/23 20:59:03 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 1.1 released - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/03/23 21:01:56 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #818 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/26 13:28:55 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #819 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/26 14:32:55 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #820 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/26 15:30:47 UTC, 2 replies.
- [jira] [Created] (TIKA-884) Dynamic loading of Parser and Detector services - posted by "Jukka Zitting (Created) (JIRA)" <ji...@apache.org> on 2012/03/27 17:48:26 UTC, 0 replies.
- [jira] [Resolved] (TIKA-884) Dynamic loading of Parser and Detector services - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/27 19:34:25 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #821 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/27 20:20:03 UTC, 2 replies.
- Build failed in Jenkins: Tika-trunk #822 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/27 21:11:35 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #823 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/28 01:16:06 UTC, 0 replies.
- [jira] [Created] (TIKA-885) Possible ConcurrentModificationException while accessing Metadata produced by ParsingReader - posted by "Luis Filipe Nassif (Created) (JIRA)" <ji...@apache.org> on 2012/03/28 01:34:29 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #824 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/28 07:15:26 UTC, 0 replies.
- [jira] [Created] (TIKA-886) OOXMLExtractorFactory can leave files open - posted by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2012/03/28 17:23:25 UTC, 0 replies.
- [jira] [Commented] (TIKA-886) OOXMLExtractorFactory can leave files open - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2012/03/28 17:25:27 UTC, 1 replies.
- [jira] [Resolved] (TIKA-886) OOXMLExtractorFactory can leave files open - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/28 17:27:33 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #825 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/28 18:21:33 UTC, 0 replies.
- [jira] [Created] (TIKA-887) Tika fails to parse some MP3 tags correctly and produces null characters in value - posted by "Jens Hübel (Created) (JIRA)" <ji...@apache.org> on 2012/03/29 09:41:35 UTC, 0 replies.
- [jira] [Commented] (TIKA-887) Tika fails to parse some MP3 tags correctly and produces null characters in value - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2012/03/29 12:30:28 UTC, 1 replies.
- [jira] [Updated] (TIKA-887) Tika fails to parse some MP3 tags correctly and produces null characters in value - posted by "Jens Hübel (Updated) (JIRA)" <ji...@apache.org> on 2012/03/29 13:20:26 UTC, 0 replies.
- [jira] [Resolved] (TIKA-593) Tika network server - posted by "Chris A. Mattmann (Resolved) (JIRA)" <ji...@apache.org> on 2012/03/29 16:42:31 UTC, 0 replies.
- [jira] [Reopened] (TIKA-593) Tika network server - posted by "Chris A. Mattmann (Reopened) (JIRA)" <ji...@apache.org> on 2012/03/29 16:42:31 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #827 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/03/29 17:14:25 UTC, 0 replies.
- [jira] [Created] (TIKA-888) NetCDF parser uses Java 6 JAR file and test/compilation fails with Java 1.5, although TIKA is Java 1.5 - posted by "Uwe Schindler (Created) (JIRA)" <ji...@apache.org> on 2012/03/30 08:22:30 UTC, 0 replies.
- How to put the extracted image in the right place in Display - posted by "som.mukhopadhyay" <so...@googlemail.com> on 2012/03/30 12:32:19 UTC, 0 replies.
- [jira] [Assigned] (TIKA-888) NetCDF parser uses Java 6 JAR file and test/compilation fails with Java 1.5, although TIKA is Java 1.5 - posted by "Chris A. Mattmann (Assigned) (JIRA)" <ji...@apache.org> on 2012/03/30 16:10:36 UTC, 0 replies.
- [jira] [Commented] (TIKA-888) NetCDF parser uses Java 6 JAR file and test/compilation fails with Java 1.5, although TIKA is Java 1.5 - posted by "Chris A. Mattmann (Commented) (JIRA)" <ji...@apache.org> on 2012/03/30 16:20:28 UTC, 7 replies.
- [jira] [Created] (TIKA-889) XHTMLContentHandler wont emit newline when html element matches ENDLINE set - posted by "John Conwell (Created) (JIRA)" <ji...@apache.org> on 2012/03/30 20:54:27 UTC, 0 replies.
- [jira] [Updated] (TIKA-852) Quicktime / MP4 Metadata Parser - posted by "Sebastian Annies (Updated) (JIRA)" <ji...@apache.org> on 2012/03/31 17:52:25 UTC, 0 replies.