You are viewing a plain text version of this content. The canonical link for it is here.
- HTML styles and tags are ignored - posted by andrewtr <an...@compvue.com> on 2012/06/04 14:21:14 UTC, 1 replies.
- CSS styles and - tags been ignored while parsing
- posted by andrewtr <an...@compvue.com> on 2012/06/04 14:24:58 UTC, 0 replies.
- partial file parsing - posted by "K, Baraneetharan" <ba...@hp.com> on 2012/06/05 09:18:18 UTC, 0 replies.
- [jira] [Closed] (TIKA-933) Tika in server mode stops responding and reports NPE over and over in logs - posted by "Rob Tulloh (JIRA)" <ji...@apache.org> on 2012/06/05 22:00:23 UTC, 0 replies.
- TikaInputStream customization - posted by "K, Baraneetharan" <ba...@hp.com> on 2012/06/06 12:30:56 UTC, 3 replies.
- [jira] [Created] (TIKA-938) Index out of bounds exception parsing MS Word 2003 doc - posted by "Tim Barrett (JIRA)" <ji...@apache.org> on 2012/06/07 12:01:23 UTC, 0 replies.
- [jira] [Commented] (TIKA-938) Index out of bounds exception parsing MS Word 2003 doc - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/06/07 12:11:23 UTC, 0 replies.
- Welcome Ray Gauss as a Tika committer/PMC - posted by Nick Burch <ni...@alfresco.com> on 2012/06/08 17:14:40 UTC, 4 replies.
- [jira] [Updated] (TIKA-929) Consistent, namespaced definitions for office file related metadata - posted by "Jörg Ehrlich (JIRA)" <ji...@apache.org> on 2012/06/12 14:44:42 UTC, 2 replies.
- [jira] [Created] (TIKA-939) Windows Media Video file detected as Windows Media Audio - posted by "Emil Burzo (JIRA)" <ji...@apache.org> on 2012/06/13 17:04:16 UTC, 0 replies.
- [jira] [Updated] (TIKA-939) Windows Media Video file detected as Windows Media Audio - posted by "Emil Burzo (JIRA)" <ji...@apache.org> on 2012/06/13 17:05:42 UTC, 1 replies.
- [jira] [Commented] (TIKA-939) Windows Media Video file detected as Windows Media Audio - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/06/13 17:38:42 UTC, 0 replies.
- [jira] [Resolved] (TIKA-939) Windows Media Video file detected as Windows Media Audio - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/06/13 17:38:43 UTC, 0 replies.
- buildbot success in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2012/06/13 17:43:12 UTC, 1 replies.
- Jenkins build is back to stable : Tika-trunk #871 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/06/13 19:25:30 UTC, 0 replies.
- tika pull request: doesn't build with rome 0.9 - change to 1.0 - posted by Git at Apache <gi...@git.apache.org> on 2012/06/19 07:44:45 UTC, 0 replies.
- building tika - rome 0.9 dependency - posted by Pradeep Singh <pr...@gmail.com> on 2012/06/19 07:58:36 UTC, 0 replies.
- Support detecting 7-zip format - posted by Marco Quaranta <mq...@gmail.com> on 2012/06/20 10:42:09 UTC, 1 replies.
- Convert file before Tika processes it? - posted by 122jxgcn <yw...@gmail.com> on 2012/06/21 04:35:58 UTC, 3 replies.
- [jira] [Created] (TIKA-940) Support detecting 7-zip format - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2012/06/21 10:17:43 UTC, 0 replies.
- [jira] [Updated] (TIKA-940) Support detecting 7-zip format - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2012/06/21 10:17:43 UTC, 0 replies.
- [jira] [Created] (TIKA-941) Detecting KML / KMZ files - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2012/06/21 10:43:43 UTC, 0 replies.
- [jira] [Updated] (TIKA-941) Detecting KML / KMZ files - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2012/06/21 10:45:43 UTC, 1 replies.
- [jira] [Resolved] (TIKA-940) Support detecting 7-zip format - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/06/21 19:21:42 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #872 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/06/21 20:07:16 UTC, 0 replies.
- [jira] [Commented] (TIKA-766) Trim down the NetCDF dependency - posted by "john caron (JIRA)" <ji...@apache.org> on 2012/06/23 01:01:43 UTC, 0 replies.
- [jira] [Created] (TIKA-942) HTTP Accept header evaluator - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/24 20:24:43 UTC, 0 replies.
- [jira] [Updated] (TIKA-943) Add parameter to tika-app to supply password for decryption - posted by "Jan Høydahl (JIRA)" <ji...@apache.org> on 2012/06/27 00:51:44 UTC, 0 replies.
- [jira] [Created] (TIKA-943) Add parameter to tika-app to supply password for decryption - posted by "Jan Høydahl (JIRA)" <ji...@apache.org> on 2012/06/27 00:51:44 UTC, 0 replies.
- [jira] [Updated] (TIKA-756) XMP output from Tika CLI - posted by "Jörg Ehrlich (JIRA)" <ji...@apache.org> on 2012/06/28 11:49:45 UTC, 3 replies.
- [jira] [Commented] (TIKA-756) XMP output from Tika CLI - posted by "Jörg Ehrlich (JIRA)" <ji...@apache.org> on 2012/06/28 11:57:44 UTC, 1 replies.
- XMP conversion module for Tika - posted by Joerg Ehrlich <je...@adobe.com> on 2012/06/28 12:14:57 UTC, 1 replies.
- [jira] [Commented] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support - posted by "Miguel Moquillon (JIRA)" <ji...@apache.org> on 2012/06/29 09:46:44 UTC, 4 replies.
- [jira] [Updated] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support - posted by "Emmanuel Hugonnet (JIRA)" <ji...@apache.org> on 2012/06/29 11:05:45 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #873 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/06/29 19:27:13 UTC, 0 replies.
- [jira] [Resolved] (TIKA-932) Upgrade to Commons Compress 1.4.1 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/29 23:21:43 UTC, 0 replies.
- buildbot failure in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2012/06/29 23:21:54 UTC, 1 replies.
- ZipContainerDetector and TikaInputStream.getFile() - posted by Jukka Zitting <ju...@gmail.com> on 2012/06/30 00:07:02 UTC, 0 replies.
- [jira] [Resolved] (TIKA-941) Detecting KML / KMZ files - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 12:35:43 UTC, 0 replies.
- [jira] [Resolved] (TIKA-929) Consistent, namespaced definitions for office file related metadata - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 13:40:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-943) Add parameter to tika-app to supply password for decryption - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 13:53:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-937) RFC822Parser is extracting only the first destination address - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 14:00:46 UTC, 0 replies.
- [jira] [Resolved] (TIKA-934) Tika in server mode stops responding and reports NPE over and over in logs - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 14:15:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-876) Signed pdf parsing - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 15:02:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-871) Text in nested groups within a pptx not parsed - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 15:04:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-908) Adding XMP specification part one namespaces and properties - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 15:23:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-900) Tika fails to detect ISO9660 disk images - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 15:29:43 UTC, 0 replies.
- [jira] [Resolved] (TIKA-747) Ogg Vorbis and FLAC Parsers - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 16:17:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-810) Upgrade to PDFbox 1.7.0 as available - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:06:44 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #882 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/06/30 17:08:28 UTC, 1 replies.
- [jira] [Resolved] (TIKA-758) Address TODOs when we upgrade to next PDFBox release - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:10:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-686) Split tika-parsers into separate components - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:10:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-676) Boilerpipe fails - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:14:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-827) ForkServer fails to report issues if an exception is not properly serializable - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:18:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-815) Tika parsers should handle failures more gracefully - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:22:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-834) server problem only 1st result is correct additional runs include data from 1st run - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:34:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-507) Parser for font files - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:40:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-848) NullPointerException in SecurityHandler.addDictionaryAndSubDictionary(SecurityHandler.java:185) - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:42:44 UTC, 0 replies.
- [jira] [Resolved] (TIKA-847) Add regular expression support to the MagicDetector - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 17:57:43 UTC, 0 replies.
- [jira] [Resolved] (TIKA-860) Make ZIP bomb detection configureable - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 18:03:45 UTC, 0 replies.
- [jira] [Resolved] (TIKA-865) MimeTypes.forName should avoid method-level synchronization - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 18:05:43 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #883 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/06/30 18:09:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-918) iWork Charts not being parsed in all products (Pages, Numbers, Keynote) - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 18:11:45 UTC, 0 replies.
- [jira] [Commented] (TIKA-920) iWork Numbers sheetnames not being parsed into metadata - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 18:11:45 UTC, 0 replies.
- [jira] [Commented] (TIKA-919) iWork Page's cell values not being parsed if calculated via formula - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 18:11:45 UTC, 0 replies.
- [jira] [Commented] (TIKA-921) iWork Numbers - Cell formats which parser is completely ignoring - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 18:11:45 UTC, 0 replies.
- [jira] [Commented] (TIKA-860) Make ZIP bomb detection configureable - posted by "Uwe Schindler (JIRA)" <ji...@apache.org> on 2012/06/30 18:23:43 UTC, 0 replies.
- [jira] [Resolved] (TIKA-832) ForkParser is unfriendly to code that prints things to its output - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/06/30 18:32:43 UTC, 0 replies.
- [jira] [Commented] (TIKA-941) Detecting KML / KMZ files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/06/30 19:20:43 UTC, 0 replies.
- [jira] [Commented] (TIKA-788) DWG parser infinite loop on possibly corrupt file - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/06/30 19:31:06 UTC, 0 replies.
- [jira] [Commented] (TIKA-863) MailContentHandler should not create AutoDetectParser on each call - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/06/30 19:40:45 UTC, 0 replies.
- [jira] [Resolved] (TIKA-863) MailContentHandler should not create AutoDetectParser on each call - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/06/30 19:42:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-937) RFC822Parser is extracting only the first destination address - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2012/06/30 20:37:45 UTC, 1 replies.