You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Build failed in Jenkins: Tika-trunk » Apache Tika OSGi bundle #703 - posted by Jukka Zitting <ju...@gmail.com> on 2011/11/01 01:29:39 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #704 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/01 03:13:53 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk » Apache Tika OSGi bundle #704 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/01 03:13:53 UTC, 0 replies.
- Re: A problem in the right-to-left languages - posted by Ahmad Ajiloo <ah...@gmail.com> on 2011/11/01 11:24:04 UTC, 9 replies.
- Re: location of pdfbox in sources of Tika - posted by Ahmad Ajiloo <ah...@gmail.com> on 2011/11/01 11:32:45 UTC, 1 replies.
- [jira] [Commented] (TIKA-761) Provide version number by CLI argument -V - posted by "Ingo Renner (Commented) (JIRA)" <ji...@apache.org> on 2011/11/01 13:36:32 UTC, 0 replies.
- [jira] [Created] (TIKA-765) add icu dependency - posted by "Robert Muir (Created) (JIRA)" <ji...@apache.org> on 2011/11/01 14:51:32 UTC, 0 replies.
- [jira] [Resolved] (TIKA-763) Update license metadata - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/01 16:23:32 UTC, 0 replies.
- [jira] [Created] (TIKA-766) Trim down the NetCDF dependency - posted by "Jukka Zitting (Created) (JIRA)" <ji...@apache.org> on 2011/11/01 16:27:32 UTC, 0 replies.
- Re: Tika 1.0 RC? - posted by Jukka Zitting <ju...@gmail.com> on 2011/11/01 16:32:30 UTC, 3 replies.
- [jira] [Created] (TIKA-767) Enable controlling of PDFBOX's setSuppressDuplicateOverlappingText from PDFParser - posted by "Michael McCandless (Created) (JIRA)" <ji...@apache.org> on 2011/11/01 23:57:32 UTC, 0 replies.
- [jira] [Updated] (TIKA-767) Enable controlling of PDFBOX's setSuppressDuplicateOverlappingText from PDFParser - posted by "Michael McCandless (Updated) (JIRA)" <ji...@apache.org> on 2011/11/01 23:59:32 UTC, 0 replies.
- [jira] [Created] (TIKA-768) Parser for EDF files - posted by "Jukka Zitting (Created) (JIRA)" <ji...@apache.org> on 2011/11/02 01:51:32 UTC, 0 replies.
- [jira] [Created] (TIKA-769) Upgrade to Commons Compress 1.3 - posted by "Jukka Zitting (Created) (JIRA)" <ji...@apache.org> on 2011/11/02 11:11:32 UTC, 0 replies.
- [jira] [Resolved] (TIKA-769) Upgrade to Commons Compress 1.3 - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/02 14:01:34 UTC, 0 replies.
- [jira] [Created] (TIKA-770) New ODF metadata keys - posted by "Jukka Zitting (Created) (JIRA)" <ji...@apache.org> on 2011/11/02 14:09:33 UTC, 0 replies.
- [jira] [Updated] (TIKA-764) OpenDocumentMetaParser should use common metadata keys for document statistics - posted by "Jukka Zitting (Updated) (JIRA)" <ji...@apache.org> on 2011/11/02 14:09:33 UTC, 0 replies.
- [jira] [Resolved] (TIKA-764) OpenDocumentMetaParser should use common metadata keys for document statistics - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/02 14:09:33 UTC, 0 replies.
- [jira] [Commented] (TIKA-513) Support of Deja Vu (DjVu) format - posted by "Timothy Truckle (Commented) (JIRA)" <ji...@apache.org> on 2011/11/02 15:11:32 UTC, 1 replies.
- [jira] [Created] (TIKA-771) "Hello, World!" in UTF-8/ASCII gets detected as IBM500 - posted by "Jukka Zitting (Created) (JIRA)" <ji...@apache.org> on 2011/11/03 16:09:32 UTC, 0 replies.
- [jira] [Commented] (TIKA-771) "Hello, World!" in UTF-8/ASCII gets detected as IBM500 - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/03 16:35:32 UTC, 0 replies.
- [jira] [Commented] (TIKA-369) Improve accuracy of language detection - posted by "Joseph Vychtrle (Commented) (JIRA)" <ji...@apache.org> on 2011/11/03 19:47:33 UTC, 2 replies.
- Embed and ExifTool Contributions - posted by Ray Gauss II <ra...@rightspro.com> on 2011/11/03 21:32:47 UTC, 0 replies.
- [jira] [Created] (TIKA-772) media type detection fails for html documents, results in text/plain instead of text/html - posted by "Joseph Vychtrle (Created) (JIRA)" <ji...@apache.org> on 2011/11/03 22:15:32 UTC, 0 replies.
- Assist please - posted by NDIAYE Bacar <Ba...@murex.com> on 2011/11/04 11:05:24 UTC, 0 replies.
- [VOTE] Apache Tika 1.0 release rc #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/11/04 16:42:29 UTC, 8 replies.
- [jira] [Resolved] (TIKA-767) Enable controlling of PDFBOX's setSuppressDuplicateOverlappingText from PDFParser - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/04 17:29:51 UTC, 0 replies.
- Multilingual Tika - posted by Jukka Zitting <ju...@gmail.com> on 2011/11/05 01:22:21 UTC, 5 replies.
- [jira] [Commented] (TIKA-529) IBM420 charset detection's isLamAlef is allocation-happy - posted by "Michael McCandless (Commented) (JIRA)" <ji...@apache.org> on 2011/11/05 11:58:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-772) media type detection fails for html documents, results in text/plain instead of text/html - posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org> on 2011/11/05 18:50:51 UTC, 11 replies.
- [jira] [Updated] (TIKA-772) media type detection fails for html documents, results in text/plain instead of text/html - posted by "Joseph Vychtrle (Updated) (JIRA)" <ji...@apache.org> on 2011/11/05 19:12:51 UTC, 2 replies.
- [jira] [Resolved] (TIKA-772) media type detection fails for html documents, results in text/plain instead of text/html - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/05 20:32:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-728) Return RDFa meta tags via Metadata - posted by "Paolo Castagna (Commented) (JIRA)" <ji...@apache.org> on 2011/11/06 08:22:51 UTC, 0 replies.
- [jira] [Created] (TIKA-773) .NET version of Tika - posted by "Jukka Zitting (Created) (JIRA)" <ji...@apache.org> on 2011/11/06 11:44:51 UTC, 0 replies.
- [jira] [Updated] (TIKA-773) .NET version of Tika - posted by "Jukka Zitting (Updated) (JIRA)" <ji...@apache.org> on 2011/11/06 11:46:51 UTC, 0 replies.
- [jira] [Assigned] (TIKA-714) Word art isn't extracted for various doc types - posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org> on 2011/11/06 12:10:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-714) Word art isn't extracted for various doc types - posted by "Michael McCandless (Commented) (JIRA)" <ji...@apache.org> on 2011/11/06 12:12:51 UTC, 0 replies.
- [jira] [Resolved] (TIKA-714) Word art isn't extracted for various doc types - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/06 12:16:51 UTC, 0 replies.
- [jira] [Created] (TIKA-774) ExifTool Parser - posted by "Ray Gauss II (Created) (JIRA)" <ji...@apache.org> on 2011/11/06 20:07:52 UTC, 0 replies.
- [jira] [Updated] (TIKA-774) ExifTool Parser - posted by "Ray Gauss II (Updated) (JIRA)" <ji...@apache.org> on 2011/11/06 20:09:51 UTC, 1 replies.
- [jira] [Commented] (TIKA-697) Tika reports the content type of AR archives as "text/plain" - posted by "PNS (Commented) (JIRA)" <ji...@apache.org> on 2011/11/07 10:35:51 UTC, 4 replies.
- [jira] [Issue Comment Edited] (TIKA-697) Tika reports the content type of AR archives as "text/plain" - posted by "PNS (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2011/11/07 10:39:51 UTC, 4 replies.
- [jira] [Updated] (TIKA-697) Tika reports the content type of AR archives as "text/plain" - posted by "Alex Ott (Updated) (JIRA)" <ji...@apache.org> on 2011/11/07 11:19:51 UTC, 0 replies.
- [jira] [Updated] (TIKA-775) Embed Capabilities - posted by "Ray Gauss II (Updated) (JIRA)" <ji...@apache.org> on 2011/11/08 03:32:51 UTC, 1 replies.
- [jira] [Created] (TIKA-775) Embed Capabilities - posted by "Ray Gauss II (Created) (JIRA)" <ji...@apache.org> on 2011/11/08 03:32:51 UTC, 0 replies.
- [jira] [Created] (TIKA-776) ExifTool Embedder - posted by "Ray Gauss II (Created) (JIRA)" <ji...@apache.org> on 2011/11/08 03:42:51 UTC, 0 replies.
- [jira] [Updated] (TIKA-776) ExifTool Embedder - posted by "Ray Gauss II (Updated) (JIRA)" <ji...@apache.org> on 2011/11/08 03:42:51 UTC, 1 replies.
- [RESULT] [VOTE] Apache Tika 1.0 release rc #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/11/08 07:37:21 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 1.0 released - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/11/08 08:34:29 UTC, 1 replies.
- [jira] [Updated] (TIKA-777) RTF parser incorrectly applies fonts to complete group - posted by "Arjohn Kampman (Updated) (JIRA)" <ji...@apache.org> on 2011/11/08 17:47:51 UTC, 0 replies.
- [jira] [Created] (TIKA-777) RTF parser incorrectly applies fonts to complete group - posted by "Arjohn Kampman (Created) (JIRA)" <ji...@apache.org> on 2011/11/08 17:47:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-612) Specify PDFBox options via ParseContext - posted by "Michael McCandless (Commented) (JIRA)" <ji...@apache.org> on 2011/11/08 17:55:51 UTC, 2 replies.
- [jira] [Closed] (TIKA-679) Proposal for PRT Parser - posted by "Troy Witthoeft (Closed) (JIRA)" <ji...@apache.org> on 2011/11/08 19:19:51 UTC, 0 replies.
- [jira] [Assigned] (TIKA-777) RTF parser incorrectly applies fonts to complete group - posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org> on 2011/11/08 19:37:51 UTC, 0 replies.
- [jira] [Assigned] (TIKA-529) IBM420 charset detection's isLamAlef is allocation-happy - posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org> on 2011/11/08 20:07:52 UTC, 0 replies.
- [jira] [Resolved] (TIKA-529) IBM420 charset detection's isLamAlef is allocation-happy - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/08 20:07:53 UTC, 0 replies.
- [jira] [Resolved] (TIKA-777) RTF parser incorrectly applies fonts to complete group - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/09 00:22:55 UTC, 0 replies.
- [jira] [Updated] (TIKA-612) Specify PDFBox options via ParseContext - posted by "Gregory Kanevsky (Updated) (JIRA)" <ji...@apache.org> on 2011/11/09 04:31:52 UTC, 1 replies.
- [Shameless Self Promotion] Tika in Action permanent discount code - posted by Chris A Mattmann <ch...@gmail.com> on 2011/11/09 11:30:46 UTC, 0 replies.
- [jira] [Created] (TIKA-778) NullPointerException in tika-app, parsing PDF content - posted by "Bastian Mathes (Created) (JIRA)" <ji...@apache.org> on 2011/11/09 14:03:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-775) Embed Capabilities - posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org> on 2011/11/09 23:35:51 UTC, 1 replies.
- [jira] [Commented] (TIKA-774) ExifTool Parser - posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org> on 2011/11/09 23:41:52 UTC, 1 replies.
- Re: Updating CHANGES.txt? - posted by Jukka Zitting <ju...@gmail.com> on 2011/11/10 10:34:03 UTC, 1 replies.
- [jira] [Created] (TIKA-779) Detection of Microsoft Works 2000 Word Processor files - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/11/10 14:37:52 UTC, 0 replies.
- [jira] [Updated] (TIKA-779) Detection of Microsoft Works 2000 Word Processor files - posted by "Antoni Mylka (Updated) (JIRA)" <ji...@apache.org> on 2011/11/10 14:39:51 UTC, 2 replies.
- [jira] [Created] (TIKA-780) Optimize loading of the media type registry - posted by "Jukka Zitting (Created) (JIRA)" <ji...@apache.org> on 2011/11/10 17:52:51 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #717 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/10 21:03:51 UTC, 0 replies.
- Tika-605 GDAL Parser - posted by "Ramirez, Paul M (388J)" <pa...@jpl.nasa.gov> on 2011/11/10 23:05:31 UTC, 1 replies.
- [jira] [Commented] (TIKA-593) Tika network server - posted by "Chris A. Mattmann (Commented) (JIRA)" <ji...@apache.org> on 2011/11/11 02:21:51 UTC, 2 replies.
- [jira] [Updated] (TIKA-593) Tika network server - posted by "Ingo Renner (Updated) (JIRA)" <ji...@apache.org> on 2011/11/11 03:08:52 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #718 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/11 09:15:30 UTC, 0 replies.
- buildbot failure in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2011/11/11 12:27:08 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk » Apache Tika core #719 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/11 13:02:39 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #719 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/11 13:02:40 UTC, 0 replies.
- buildbot success in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2011/11/11 13:23:33 UTC, 0 replies.
- [jira] [Resolved] (TIKA-780) Optimize loading of the media type registry - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/11 13:30:51 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk » Apache Tika core #720 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/11 14:07:51 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #720 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/11 14:07:52 UTC, 1 replies.
- Build failed in Jenkins: Tika-trunk #721 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/11 16:08:32 UTC, 0 replies.
- [jira] [Created] (TIKA-781) RTF parser should ignore most control words in ignore groups - posted by "Arjohn Kampman (Created) (JIRA)" <ji...@apache.org> on 2011/11/11 16:32:52 UTC, 0 replies.
- [jira] [Updated] (TIKA-781) RTF parser should ignore most control words in ignore groups - posted by "Arjohn Kampman (Updated) (JIRA)" <ji...@apache.org> on 2011/11/11 16:34:51 UTC, 1 replies.
- [jira] [Assigned] (TIKA-781) RTF parser should ignore most control words in ignore groups - posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org> on 2011/11/11 19:09:51 UTC, 0 replies.
- [jira] [Resolved] (TIKA-781) RTF parser should ignore most control words in ignore groups - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/11 19:27:51 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #722 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2011/11/11 20:13:16 UTC, 0 replies.
- [jira] [Created] (TIKA-782) Add support for parsing binary data in RTF files - posted by "Arjohn Kampman (Created) (JIRA)" <ji...@apache.org> on 2011/11/11 21:10:51 UTC, 0 replies.
- [jira] [Updated] (TIKA-782) Add support for parsing binary data in RTF files - posted by "Arjohn Kampman (Updated) (JIRA)" <ji...@apache.org> on 2011/11/11 21:10:51 UTC, 3 replies.
- [jira] [Created] (TIKA-783) MD5 and SHA1 values posted on the download page for the .jar do not match actual computed values - posted by "Kelvin Meeks (Created) (JIRA)" <ji...@apache.org> on 2011/11/11 22:02:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-663) JSP files data extraction failed - posted by "Dave Meikle (Commented) (JIRA)" <ji...@apache.org> on 2011/11/14 23:11:51 UTC, 2 replies.
- [jira] [Commented] (TIKA-779) Detection of Microsoft Works 2000 Word Processor files - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/15 10:44:03 UTC, 0 replies.
- [jira] [Resolved] (TIKA-779) Detection of Microsoft Works 2000 Word Processor files - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/15 10:44:03 UTC, 0 replies.
- [jira] [Resolved] (TIKA-783) MD5 and SHA1 values posted on the download page for the .jar do not match actual computed values - posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/15 12:16:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-773) .NET version of Tika - posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org> on 2011/11/15 12:36:51 UTC, 1 replies.
- [jira] [Commented] (TIKA-778) NullPointerException in tika-app, parsing PDF content - posted by "Jukka Zitting (Commented) (JIRA)" <ji...@apache.org> on 2011/11/15 12:40:52 UTC, 1 replies.
- [jira] [Commented] (TIKA-782) Add support for parsing binary data in RTF files - posted by "Arjohn Kampman (Commented) (JIRA)" <ji...@apache.org> on 2011/11/15 20:47:51 UTC, 6 replies.
- [jira] [Assigned] (TIKA-782) Add support for parsing binary data in RTF files - posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org> on 2011/11/15 21:01:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-724) PDF text sometimes has extra space between letters - posted by "Ravish Bhagdev (Commented) (JIRA)" <ji...@apache.org> on 2011/11/17 10:44:52 UTC, 3 replies.
- [jira] [Issue Comment Edited] (TIKA-782) Add support for parsing binary data in RTF files - posted by "Arjohn Kampman (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2011/11/17 15:38:51 UTC, 0 replies.
- [jira] [Resolved] (TIKA-612) Specify PDFBox options via ParseContext - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/17 18:26:52 UTC, 0 replies.
- [jira] [Resolved] (TIKA-782) Add support for parsing binary data in RTF files - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/17 21:35:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-734) Out of memory exception with Xlsx file less than 5 MB - posted by "Anirban Mitra (Commented) (JIRA)" <ji...@apache.org> on 2011/11/17 22:16:52 UTC, 1 replies.
- [jira] [Created] (TIKA-784) Mimetype entry for DITA - posted by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2011/11/18 13:36:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-784) Mimetype entry for DITA - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/18 16:14:52 UTC, 3 replies.
- [jira] [Created] (TIKA-785) TikaCLI should include a --list-detectors option similar to --list-parsers - posted by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2011/11/21 02:02:30 UTC, 0 replies.
- [jira] [Commented] (TIKA-785) TikaCLI should include a --list-detectors option similar to --list-parsers - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/21 02:05:51 UTC, 0 replies.
- [jira] [Resolved] (TIKA-785) TikaCLI should include a --list-detectors option similar to --list-parsers - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/21 02:05:51 UTC, 0 replies.
- [jira] [Resolved] (TIKA-784) Mimetype entry for DITA - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/21 02:25:52 UTC, 0 replies.
- [jira] [Created] (TIKA-786) Tika CLI --detect returns incorrect content-type for files with altered extensions - posted by "John Mastarone (Created) (JIRA)" <ji...@apache.org> on 2011/11/21 03:43:51 UTC, 0 replies.
- [jira] [Commented] (TIKA-786) Tika CLI --detect returns incorrect content-type for files with altered extensions - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/21 11:32:51 UTC, 6 replies.
- [jira] [Resolved] (TIKA-786) Tika CLI --detect returns incorrect content-type for files with altered extensions - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/21 14:16:51 UTC, 0 replies.
- Re: Ogg Vorbis support - posted by Nick Burch <ni...@alfresco.com> on 2011/11/21 17:44:26 UTC, 0 replies.
- [jira] [Created] (TIKA-787) CharsetDetector text buffer is too small to small to correctly detect UTF-8 in HTML page - posted by "Maxim Valyanskiy (Created) (JIRA)" <ji...@apache.org> on 2011/11/23 14:23:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-787) CharsetDetector text buffer is too small to small to correctly detect UTF-8 in HTML page - posted by "Maxim Valyanskiy (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/23 15:13:40 UTC, 0 replies.
- [jira] [Commented] (TIKA-723) Rotated text isn't extracted correctly from PDFs - posted by "John Mastarone (Commented) (JIRA)" <ji...@apache.org> on 2011/11/25 03:58:40 UTC, 2 replies.
- [jira] [Created] (TIKA-788) DWG parser infinite loop on possibly corrupt file - posted by "Stas Shaposhnikov (Created) (JIRA)" <ji...@apache.org> on 2011/11/25 06:38:39 UTC, 0 replies.
- [jira] [Updated] (TIKA-788) DWG parser infinite loop on possibly corrupt file - posted by "Stas Shaposhnikov (Updated) (JIRA)" <ji...@apache.org> on 2011/11/25 07:00:42 UTC, 0 replies.
- [jira] [Created] (TIKA-789) Microsoft Project (MPP) basic support - posted by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2011/11/25 15:19:40 UTC, 0 replies.
- [jira] [Commented] (TIKA-789) Microsoft Project (MPP) basic support - posted by "Alex Ott (Commented) (JIRA)" <ji...@apache.org> on 2011/11/25 15:39:39 UTC, 2 replies.
- [jira] [Created] (TIKA-790) Reduce duplication between POIFSDocumentType (in OfficeParser) and POIFSContainerDetector - posted by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2011/11/25 15:43:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-789) Microsoft Project (MPP) basic support - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/25 16:29:40 UTC, 0 replies.
- [jira] [Commented] (TIKA-790) Reduce duplication between POIFSDocumentType (in OfficeParser) and POIFSContainerDetector - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/25 16:35:40 UTC, 1 replies.
- [jira] [Created] (TIKA-791) Fix the detection of protected OOXML files - posted by "Antoni Mylka (Created) (JIRA)" <ji...@apache.org> on 2011/11/25 16:47:39 UTC, 0 replies.
- [jira] [Updated] (TIKA-791) Fix the detection of protected OOXML files - posted by "Antoni Mylka (Updated) (JIRA)" <ji...@apache.org> on 2011/11/25 16:51:40 UTC, 1 replies.
- [jira] [Commented] (TIKA-791) Fix the detection of protected OOXML files - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/25 16:57:40 UTC, 3 replies.
- Possible re-opening of resolved issue TIKA-738? - posted by John M <jf...@gmail.com> on 2011/11/26 03:25:48 UTC, 4 replies.
- [jira] [Reopened] (TIKA-738) Tika fails to extract text from PDF annotations - posted by "Michael McCandless (Reopened) (JIRA)" <ji...@apache.org> on 2011/11/26 14:27:40 UTC, 0 replies.
- [jira] [Updated] (TIKA-738) Tika fails to extract text from PDF annotations - posted by "Michael McCandless (Updated) (JIRA)" <ji...@apache.org> on 2011/11/26 14:37:40 UTC, 0 replies.
- [jira] [Created] (TIKA-792) NoSuchMethodException "CTMarkupImpl.(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document - posted by "Torsten Krah (Created) (JIRA)" <ji...@apache.org> on 2011/11/26 17:09:40 UTC, 0 replies.
- [jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/26 18:39:39 UTC, 0 replies.
- [jira] [Resolved] (TIKA-738) Tika fails to extract text from PDF annotations - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/26 20:55:40 UTC, 0 replies.
- [jira] [Assigned] (TIKA-778) NullPointerException in tika-app, parsing PDF content - posted by "Michael McCandless (Assigned) (JIRA)" <ji...@apache.org> on 2011/11/26 20:57:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-778) NullPointerException in tika-app, parsing PDF content - posted by "Michael McCandless (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/26 20:59:39 UTC, 0 replies.
- [jira] [Created] (TIKA-793) Invalid ASCII character (65533) when retriving MP3 metadata - posted by "William Seemann (Created) (JIRA)" <ji...@apache.org> on 2011/11/27 09:46:39 UTC, 0 replies.
- [jira] [Updated] (TIKA-793) Invalid ASCII character (65533) when retriving MP3 metadata - posted by "William Seemann (Updated) (JIRA)" <ji...@apache.org> on 2011/11/27 09:48:40 UTC, 0 replies.
- [jira] [Commented] (TIKA-793) Invalid ASCII character (65533) when retriving MP3 metadata - posted by "William Seemann (Commented) (JIRA)" <ji...@apache.org> on 2011/11/27 10:00:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-697) Tika reports the content type of AR archives as "text/plain" - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/28 00:00:42 UTC, 0 replies.
- [jira] [Created] (TIKA-794) Mime magic logic for Little16 is incorrect - posted by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2011/11/28 01:26:40 UTC, 0 replies.
- [jira] [Resolved] (TIKA-794) Mime magic logic for Little16 is incorrect - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/28 01:28:40 UTC, 0 replies.
- [jira] [Commented] (TIKA-794) Mime magic logic for Little16 is incorrect - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/28 01:28:40 UTC, 0 replies.
- review board? - posted by Alex Ott <al...@gmail.com> on 2011/11/28 14:45:20 UTC, 1 replies.
- [jira] [Resolved] (TIKA-790) Reduce duplication between POIFSDocumentType (in OfficeParser) and POIFSContainerDetector - posted by "Nick Burch (Resolved) (JIRA)" <ji...@apache.org> on 2011/11/28 15:00:40 UTC, 0 replies.
- [jira] [Updated] (TIKA-700) Upgrade to POI 3.8 as available - posted by "Nick Burch (Updated) (JIRA)" <ji...@apache.org> on 2011/11/28 20:27:41 UTC, 0 replies.
- [jira] [Created] (TIKA-795) [PATCH] NoSuchMethod - XSLFPowerPointExtractorDecorator.buildXHTML POI - XSLFSlide.getMasterSheet() - posted by "Jeremy Anderson (Created) (JIRA)" <ji...@apache.org> on 2011/11/29 18:17:40 UTC, 0 replies.
- [jira] [Updated] (TIKA-795) [PATCH] NoSuchMethod - XSLFPowerPointExtractorDecorator.buildXHTML POI - XSLFSlide.getMasterSheet() - posted by "Jeremy Anderson (Updated) (JIRA)" <ji...@apache.org> on 2011/11/29 18:23:40 UTC, 2 replies.
- [jira] [Commented] (TIKA-795) [PATCH] NoSuchMethod - XSLFPowerPointExtractorDecorator.buildXHTML POI - XSLFSlide.getMasterSheet() - posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2011/11/29 18:49:39 UTC, 2 replies.
- Tesseract OCR engine - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2011/11/30 07:59:49 UTC, 3 replies.