You are viewing a plain text version of this content. The canonical link for it is here.
- Patches for parser.microsoft.WordExtractor - posted by kildishev <ki...@ispras.ru> on 2013/07/01 14:00:11 UTC, 1 replies.
- [ANNOUNCE] Apache Tika 1.4 Released - posted by Chris Mattmann <ma...@apache.org> on 2013/07/02 08:01:18 UTC, 1 replies.
- [jira] [Created] (TIKA-1139) Modify Tika-1129 to test against a local file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/02 10:40:21 UTC, 0 replies.
- [jira] [Updated] (TIKA-1139) Modify Tika-1129 to test against a local file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/02 10:46:20 UTC, 0 replies.
- [jira] [Created] (TIKA-1140) Better table representation, cell spanning in Word Extractor - posted by "Denis Kildishev (JIRA)" <ji...@apache.org> on 2013/07/02 12:37:19 UTC, 0 replies.
- [jira] [Updated] (TIKA-1140) Better table representation, cell spanning in Word Extractor - posted by "Denis Kildishev (JIRA)" <ji...@apache.org> on 2013/07/02 12:45:19 UTC, 1 replies.
- [jira] [Commented] (TIKA-1053) Upgrade Tika Parsers to use ASM 4.x - posted by "Thomas Mortagne (JIRA)" <ji...@apache.org> on 2013/07/02 14:32:21 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1053) Upgrade Tika Parsers to use ASM 4.x - posted by "Thomas Mortagne (JIRA)" <ji...@apache.org> on 2013/07/02 14:32:21 UTC, 1 replies.
- [jira] [Updated] (TIKA-973) PDF form data isn't included in extracted content. - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/02 15:17:20 UTC, 0 replies.
- [jira] [Commented] (TIKA-998) How to handle row span and Colspan in parsing xls or xlsx files - posted by "Himanshu Agrawal (JIRA)" <ji...@apache.org> on 2013/07/02 16:07:20 UTC, 1 replies.
- [jira] [Resolved] (TIKA-1130) .docx text extract leaves out some portions of text - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2013/07/02 17:14:21 UTC, 1 replies.
- [jira] [Commented] (TIKA-1130) .docx text extract leaves out some portions of text - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/02 19:27:20 UTC, 5 replies.
- [jira] [Created] (TIKA-1141) javascript files that contain " - posted by "David Hara (JIRA)" <ji...@apache.org> on 2013/07/03 01:31:21 UTC, 0 replies.
-
[jira] [Created] (TIKA-1142) 1.3 Version Source Downloads are Missing from all Mirrors - posted by "Anton Spektorov (JIRA)" <ji...@apache.org> on 2013/07/03 08:00:22 UTC, 0 replies.
- [jira] [Updated] (TIKA-1142) 1.3 Version Source Downloads are Missing from all Mirrors - posted by "Anton Spektorov (JIRA)" <ji...@apache.org> on 2013/07/03 08:00:25 UTC, 0 replies.
- [jira] [Commented] (TIKA-1142) 1.3 Version Source Downloads are Missing from all Mirrors - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2013/07/03 08:03:23 UTC, 2 replies.
- [jira] [Resolved] (TIKA-1142) 1.3 Version Source Downloads are Missing from all Mirrors - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2013/07/03 08:03:23 UTC, 0 replies.
- [jira] [Created] (TIKA-1143) Fails to parse some PPT file - posted by "Vincent Massol (JIRA)" <ji...@apache.org> on 2013/07/03 12:26:19 UTC, 0 replies.
- [jira] [Commented] (TIKA-1143) Fails to parse some PPT file - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2013/07/03 12:34:20 UTC, 7 replies.
- [jira] [Updated] (TIKA-1143) Fails to parse some PPT file - posted by "Vincent Massol (JIRA)" <ji...@apache.org> on 2013/07/03 12:53:20 UTC, 0 replies.
- [jira] [Created] (TIKA-1144) Changes in styling mechanism, inner table support and list support for Word Extractor - posted by "Denis Kildishev (JIRA)" <ji...@apache.org> on 2013/07/03 13:36:20 UTC, 0 replies.
- [jira] [Updated] (TIKA-1144) Changes in styling mechanism, inner table support and list support for Word Extractor - posted by "Denis Kildishev (JIRA)" <ji...@apache.org> on 2013/07/03 13:58:21 UTC, 2 replies.
- [jira] [Created] (TIKA-1145) classloaders issue loading resources when extending Tika - posted by "Maciej Lizewski (JIRA)" <ji...@apache.org> on 2013/07/03 17:40:24 UTC, 0 replies.
- [jira] [Commented] (TIKA-1145) classloaders issue loading resources when extending Tika - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2013/07/03 18:00:20 UTC, 11 replies.
- [jira] [Updated] (TIKA-1145) classloaders issue loading resources when extending Tika - posted by "Maciej Lizewski (JIRA)" <ji...@apache.org> on 2013/07/04 10:04:20 UTC, 1 replies.
- [jira] [Resolved] (TIKA-1145) classloaders issue loading resources when extending Tika - posted by "Maciej Lizewski (JIRA)" <ji...@apache.org> on 2013/07/10 12:27:51 UTC, 0 replies.
- [jira] [Updated] (TIKA-1130) .docx text extract leaves out some portions of text - posted by "Daniel Gibby (JIRA)" <ji...@apache.org> on 2013/07/10 17:57:53 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-1130) .docx text extract leaves out some portions of text - posted by "Daniel Gibby (JIRA)" <ji...@apache.org> on 2013/07/10 17:59:49 UTC, 0 replies.
- [jira] [Reopened] (TIKA-1130) .docx text extract leaves out some portions of text - posted by "Daniel Gibby (JIRA)" <ji...@apache.org> on 2013/07/10 18:01:51 UTC, 0 replies.
- MagicDetector don't work for all RFC882 message Types. - posted by Kai-Uwe Schmidt <ku...@bel-it.de> on 2013/07/11 12:04:48 UTC, 9 replies.
- [jira] [Created] (TIKA-1146) MagicDetector don't work for all RFC882 message Types - posted by "Kai-Uwe Schmidt (JIRA)" <ji...@apache.org> on 2013/07/11 13:01:48 UTC, 0 replies.
- [jira] [Updated] (TIKA-1146) MagicDetector don't work for all RFC882 message Types - posted by "Kai-Uwe Schmidt (JIRA)" <ji...@apache.org> on 2013/07/11 13:05:48 UTC, 1 replies.
- [jira] [Created] (TIKA-1147) Passing a File-Based TikaInputStream to ExternalEmbedder Delete - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2013/07/17 22:31:47 UTC, 0 replies.
- [jira] [Updated] (TIKA-1147) File-Based TikaInputStreams are Deleted by ExternalEmbedder.embed - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2013/07/17 22:43:46 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1147) File-Based TikaInputStreams are Deleted by ExternalEmbedder.embed - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2013/07/18 00:08:49 UTC, 0 replies.
- Tika Core and Parsers Test Artifacts - posted by Ray Gauss II <ra...@alfresco.com> on 2013/07/18 14:14:31 UTC, 4 replies.
- [jira] [Commented] (TIKA-1146) MagicDetector don't work for all RFC882 message Types - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2013/07/19 15:48:50 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1146) MagicDetector don't work for all RFC882 message Types - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2013/07/19 15:54:51 UTC, 0 replies.
- [jira] [Created] (TIKA-1148) application/x-msdownload should have text/plain as super type - posted by "Torsten Krah (JIRA)" <ji...@apache.org> on 2013/07/19 16:56:49 UTC, 0 replies.
- [jira] [Commented] (TIKA-1148) application/x-msdownload should have text/plain as super type - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2013/07/19 17:08:48 UTC, 0 replies.
- [jira] [Created] (TIKA-1149) 12% performance improvement by caching in CompositeParser - posted by "Luca Della Toffola (JIRA)" <ji...@apache.org> on 2013/07/22 13:40:47 UTC, 0 replies.
- [jira] [Updated] (TIKA-1149) 12% performance improvement by caching in CompositeParser - posted by "Luca Della Toffola (JIRA)" <ji...@apache.org> on 2013/07/22 13:42:48 UTC, 0 replies.
- [jira] [Commented] (TIKA-1149) 12% performance improvement by caching in CompositeParser - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2013/07/22 14:58:48 UTC, 2 replies.
- [jira] [Created] (TIKA-1150) Extract text from textbox in XLSX - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/22 15:22:48 UTC, 0 replies.
- [jira] [Updated] (TIKA-1150) Extract text from textbox in XLSX - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/22 15:24:48 UTC, 0 replies.
- [jira] [Created] (TIKA-1151) Maven Build Should Automatically Produce test-jar Artifacts - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2013/07/22 21:32:48 UTC, 0 replies.
- [jira] [Created] (TIKA-1152) Process stucks on parsing of a CHM file - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/07/23 15:46:49 UTC, 0 replies.
- [jira] [Updated] (TIKA-1152) Process stucks on parsing of a CHM file - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/07/23 15:46:49 UTC, 3 replies.
- [jira] [Commented] (TIKA-1076) Upgrade to Apache POI 3.9 - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/07/23 16:00:49 UTC, 2 replies.
- [jira] [Created] (TIKA-1153) Upgrade pdfbox to latest 1.8.2 version - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/07/23 16:06:49 UTC, 0 replies.
- [jira] [Commented] (TIKA-1150) Extract text from textbox in XLSX - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/23 16:38:56 UTC, 0 replies.
- [jira] [Closed] (TIKA-1150) Extract text from textbox in XLSX - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/23 16:38:56 UTC, 0 replies.
- [jira] [Commented] (TIKA-1100) cannot extract text in text-box for Excel 2007 file(.xlsx, .xlsm) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/23 16:40:54 UTC, 1 replies.
- [jira] [Updated] (TIKA-1100) cannot extract text in text-box for Excel 2007 file(.xlsx, .xlsm) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2013/07/23 16:44:51 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1149) 12% performance improvement by caching in CompositeParser - posted by "Luca Della Toffola (JIRA)" <ji...@apache.org> on 2013/07/23 17:08:48 UTC, 4 replies.
- [jira] [Updated] (TIKA-1152) Process loops infinitely on parsing of a CHM file - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/07/23 17:30:49 UTC, 2 replies.
- [jira] [Commented] (TIKA-1152) Process loops infinitely on parsing of a CHM file - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2013/07/23 18:14:49 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-1152) Process loops infinitely on parsing of a CHM file - posted by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2013/07/23 18:18:48 UTC, 3 replies.
- [jira] [Created] (TIKA-1154) Tika hangs on format detection of malformed HTML file. - posted by "Andrew Jackson (JIRA)" <ji...@apache.org> on 2013/07/25 11:32:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-1154) Tika hangs on format detection of malformed HTML file. - posted by "Andrew Jackson (JIRA)" <ji...@apache.org> on 2013/07/25 11:33:48 UTC, 0 replies.
- [jira] [Commented] (TIKA-1154) Tika hangs on format detection of malformed HTML file. - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2013/07/25 12:09:54 UTC, 6 replies.
- [jira] [Created] (TIKA-1155) Number Format is converted with an error - posted by "Evgeniy Buyanov (JIRA)" <ji...@apache.org> on 2013/07/25 12:59:50 UTC, 0 replies.
- [jira] [Updated] (TIKA-1155) Number Format is converted with an error - posted by "Evgeniy Buyanov (JIRA)" <ji...@apache.org> on 2013/07/25 13:03:50 UTC, 0 replies.
- [jira] [Updated] (TIKA-985) Support for HTML5 elements - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2013/07/25 17:13:49 UTC, 0 replies.
- [Announce] Welcome Tim Allison as Tika PM member and committer - posted by Nick Burch <ni...@apache.org> on 2013/07/30 11:29:06 UTC, 4 replies.
- [jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document - posted by "Cyrille Levandowski (JIRA)" <ji...@apache.org> on 2013/07/30 18:09:52 UTC, 1 replies.
- Would become a commiter - posted by Hong-Thai Nguyen <Ho...@polyspot.com> on 2013/07/31 11:43:29 UTC, 1 replies.