You are viewing a plain text version of this content. The canonical link for it is here.
- [CANCEL][VOTE] Release Apache Tika 1.19.1 Candidate #1 - posted by Tim Allison <ta...@apache.org> on 2018/10/01 12:44:39 UTC, 0 replies.
- [jira] [Created] (TIKA-2742) Tika 1.19 trigger a dependency on slf4j-log4j12 - posted by "Thomas Mortagne (JIRA)" <ji...@apache.org> on 2018/10/01 16:15:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2743) Replace com.sun.xml.bind:jaxb-impl and jaxb-core by org.glassfish.jaxb:jaxb-runtime and jaxb-core - posted by "Thomas Mortagne (JIRA)" <ji...@apache.org> on 2018/10/01 17:13:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2742) Tika 1.19 trigger a dependency on slf4j-log4j12 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/01 19:02:00 UTC, 3 replies.
- [jira] [Assigned] (TIKA-2743) Replace com.sun.xml.bind:jaxb-impl and jaxb-core by org.glassfish.jaxb:jaxb-runtime and jaxb-core - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/01 19:03:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2743) Replace com.sun.xml.bind:jaxb-impl and jaxb-core by org.glassfish.jaxb:jaxb-runtime and jaxb-core - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/01 19:31:00 UTC, 2 replies.
- [jira] [Created] (TIKA-2744) rss+xml doesnt accept files with .xml extension - posted by "Martin (JIRA)" <ji...@apache.org> on 2018/10/02 10:14:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2723) Issue with parsing .mht container - posted by "Ghenadie (JIRA)" <ji...@apache.org> on 2018/10/02 11:33:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2742) Tika 1.19 trigger a dependency on slf4j-log4j12 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/02 14:50:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2743) Replace com.sun.xml.bind:jaxb-impl and jaxb-core by org.glassfish.jaxb:jaxb-runtime and jaxb-core - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/02 14:51:00 UTC, 0 replies.
- tika-2.x-windows - Build # 325 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/10/02 15:19:03 UTC, 0 replies.
- [jira] [Commented] (TIKA-2473) PCX and DCX image support - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/10/02 15:20:00 UTC, 4 replies.
- [jira] [Commented] (TIKA-2740) Update Python dependency check for TesseractOCR Parser rotation.py script - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/10/02 15:43:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2745) Upgrade to PDFBox 2.0.12 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/02 20:11:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2746) ParsingReader#throwable is not shared but not volatile - posted by "Rohan Padhye (JIRA)" <ji...@apache.org> on 2018/10/03 07:22:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2746) ParsingReader#throwable is shared across threads but not volatile - posted by "Rohan Padhye (JIRA)" <ji...@apache.org> on 2018/10/03 07:23:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2747) Expose custom MAPI properties as a result of the OutlookExtractor metadata - posted by "Vittorio (JIRA)" <ji...@apache.org> on 2018/10/03 15:23:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2748) trivial tika-server bug w -maxFiles in new -spawnChild mode - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/03 19:25:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2748) trivial tika-server bug w -maxFiles in new -spawnChild mode - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/03 19:28:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2735) notes and footer contents are duplicated in extracting text from power point slides - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/03 19:59:00 UTC, 8 replies.
- [jira] [Commented] (TIKA-2734) Tika addes extra characters at the end of text in extracting from excel file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/03 20:08:00 UTC, 6 replies.
- [jira] [Commented] (TIKA-2744) rss+xml doesnt accept files with .xml extension - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/10/03 20:33:00 UTC, 8 replies.
- tika-2.x-windows - Build # 326 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/10/03 20:33:35 UTC, 0 replies.
- [jira] [Commented] (TIKA-2478) RFC822 includes redundant copies of the text - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/10/03 20:34:00 UTC, 2 replies.
- tika-2.x-windows - Build # 327 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/10/04 08:51:01 UTC, 0 replies.
- [jira] [Created] (TIKA-2749) OCR on PDFs should "just work" out of the box - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/04 12:41:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-2749) OCR on PDFs should "just work" out of the box - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/04 12:46:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-2749) OCR on PDFs should "just work" out of the box - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/04 12:47:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-2734) Tika addes extra characters at the end of text in extracting from excel file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/04 15:01:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2745) Upgrade to PDFBox 2.0.12 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/04 20:42:00 UTC, 0 replies.
- tika-2.x-windows - Build # 330 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/10/04 21:16:50 UTC, 0 replies.
- [jira] [Commented] (TIKA-2745) Upgrade to PDFBox 2.0.12 when available - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/10/04 21:17:00 UTC, 2 replies.
- [VOTE] Release Apache Tika 1.19.1 Candidate #2 - posted by Tim Allison <ta...@apache.org> on 2018/10/04 22:03:07 UTC, 7 replies.
- [jira] [Commented] (TIKA-2679) Bump 1.x branch to Java 1.8 - posted by "KIRUBHAAKARAN S (JIRA)" <ji...@apache.org> on 2018/10/05 10:15:00 UTC, 1 replies.
- [jira] [Updated] (TIKA-2747) Expose custom MAPI properties as a result of the OutlookExtractor metadata - posted by "Vittorio (JIRA)" <ji...@apache.org> on 2018/10/05 11:07:00 UTC, 4 replies.
- [jira] [Commented] (TIKA-2747) Expose custom MAPI properties as a result of the OutlookExtractor metadata - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/05 11:41:00 UTC, 3 replies.
- Re: Welcome to the regression vm! - posted by Tim Allison <ta...@apache.org> on 2018/10/05 13:18:05 UTC, 0 replies.
- [jira] [Updated] (TIKA-2734) Tika addes extra characters at the end of text in extracting from excel file - posted by "feng ye (JIRA)" <ji...@apache.org> on 2018/10/05 13:20:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2750) Update regression corpus - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/05 13:25:00 UTC, 0 replies.
- updating data on the regression corpus - posted by Tim Allison <ta...@apache.org> on 2018/10/05 13:29:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-2750) Update regression corpus - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/05 13:34:00 UTC, 4 replies.
- [jira] [Issue Comment Deleted] (TIKA-1358) Add support for newer iWork file formats - posted by "king.wyx (JIRA)" <ji...@apache.org> on 2018/10/07 13:10:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-2696) Support output of Tesseract OSD output for psm mode 0 - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/10/09 16:25:00 UTC, 5 replies.
- [RESULT][VOTE] Release Apache Tika 1.19.1 Candidate #2 - posted by Tim Allison <ta...@apache.org> on 2018/10/09 18:41:13 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/09 19:08:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2730) parseToString fails for a simple mp3 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/09 19:08:00 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 1.19.1 released - posted by Tim Allison <ta...@apache.org> on 2018/10/09 19:57:50 UTC, 0 replies.
- [CVE-2018-11796] Apache Tika Denial of Service via XML Entity Expansion Vulnerability - posted by Tim Allison <ta...@apache.org> on 2018/10/09 20:05:18 UTC, 0 replies.
- [jira] [Commented] (TIKA-2703) Error indexing a xlsx file - posted by "Mario Bisonti (JIRA)" <ji...@apache.org> on 2018/10/11 08:49:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-2703) Error indexing a xlsx file - posted by "Mario Bisonti (JIRA)" <ji...@apache.org> on 2018/10/11 09:46:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2703) Error indexing a xlsx file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/11 17:50:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2751) Upgrade to POI 4.0.1 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/11 17:54:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2752) Tika-App RTFParser crashes with NullPointerException - posted by "Vicky Chawda (JIRA)" <ji...@apache.org> on 2018/10/12 07:07:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2752) Tika-App RTFParser crashes with NullPointerException - posted by "Vicky Chawda (JIRA)" <ji...@apache.org> on 2018/10/12 07:12:00 UTC, 3 replies.
- Fwd: DIH for TikaEntityProcessor - posted by Oleg Tikhonov <ol...@gmail.com> on 2018/10/12 10:42:50 UTC, 0 replies.
- [jira] [Created] (TIKA-2753) ChildProcess does not use the JAVA_HOME - posted by "Julien Massiera (JIRA)" <ji...@apache.org> on 2018/10/12 12:32:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2754) Log file name in tika-server on exception/error - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/12 15:33:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2753) ChildProcess does not use the JAVA_HOME - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/12 15:34:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2753) ChildProcess does not use the JAVA_HOME - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/10/12 16:35:00 UTC, 2 replies.
- [jira] [Resolved] (TIKA-2754) Log file name in tika-server on exception/error - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/12 16:39:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2753) ChildProcess does not use the JAVA_HOME - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/12 16:41:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2731) Unecessary call to System.getProperties() in XMLReaderUtils - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/12 16:46:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2729) add -Djava.awt.headless=true to child process in tika-server - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/12 16:47:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2754) Log file name in tika-server on exception/error - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/10/12 17:35:00 UTC, 2 replies.
- [jira] [Created] (TIKA-2755) Allow Tika to skip extraction of tags in HTML - posted by "Harinder (JIRA)" <ji...@apache.org> on 2018/10/12 18:17:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2755) Allow Tika to skip extraction of tags in HTML - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/12 20:21:00 UTC, 4 replies.
- [jira] [Commented] (TIKA-2752) Tika-App RTFParser crashes with NullPointerException - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/12 20:34:00 UTC, 4 replies.
- [jira] [Comment Edited] (TIKA-2755) Allow Tika to skip extraction of tags in HTML - posted by "Harinder (JIRA)" <ji...@apache.org> on 2018/10/12 20:54:00 UTC, 1 replies.
- [jira] [Commented] (TIKA-2368) Clean up SentimentParser dependencies - posted by "Oleg Tikhonov (JIRA)" <ji...@apache.org> on 2018/10/14 08:14:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2368) Clean up SentimentParser dependencies - posted by "Oleg Tikhonov (JIRA)" <ji...@apache.org> on 2018/10/14 08:15:00 UTC, 0 replies.
- JDK 12 Early Access build 15 is available - posted by Rory O'Donnell <ro...@oracle.com> on 2018/10/15 10:38:08 UTC, 0 replies.
- [jira] [Moved] (TIKA-2756) Switch to commons-lang 3 - posted by "Robert Munteanu (JIRA)" <ji...@apache.org> on 2018/10/16 12:50:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2756) Switch to commons-lang 3 - posted by "Robert Munteanu (JIRA)" <ji...@apache.org> on 2018/10/16 12:53:00 UTC, 9 replies.
- [jira] [Reopened] (TIKA-2734) Tika addes extra characters at the end of text in extracting from excel file - posted by "feng ye (JIRA)" <ji...@apache.org> on 2018/10/16 15:13:00 UTC, 0 replies.
- tika-2.x-windows - Build # 334 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/10/17 13:18:06 UTC, 0 replies.
- [jira] [Created] (TIKA-2757) Add versions-maven-plugin - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/17 13:35:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2757) Add versions-maven-plugin - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/17 13:40:00 UTC, 3 replies.
- tika-2.x-windows - Build # 335 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/10/17 14:18:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2543) No content extraction for application/x-webarchive format - posted by "Rafael Ferreira (JIRA)" <ji...@apache.org> on 2018/10/17 15:10:00 UTC, 5 replies.
- [jira] [Updated] (TIKA-2543) No content extraction for application/x-webarchive format - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/17 16:15:00 UTC, 0 replies.
- tika-2.x-windows - Build # 336 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/10/17 17:17:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-2577) Sonatype Nexus Auditor is reporting that the Bouncy castle version used by Tika 1.17 is vulnerable - posted by "Andrew Pavlin (JIRA)" <ji...@apache.org> on 2018/10/17 18:20:00 UTC, 1 replies.
- [jira] [Resolved] (TIKA-2577) Sonatype Nexus Auditor is reporting that the Bouncy castle version used by Tika 1.17 is vulnerable - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/17 18:35:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2744) rss+xml doesnt accept files with .xml extension - posted by "Martin (JIRA)" <ji...@apache.org> on 2018/10/18 05:54:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2758) Possible error charset detection - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2018/10/18 11:08:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2758) Possible error charset detection - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2018/10/18 11:08:00 UTC, 3 replies.
- [jira] [Created] (TIKA-2759) ScriptsExtractor incorrectly reports Javascript to characters() in SAX ContentHandler - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2018/10/18 11:10:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2759) ScriptsExtractor incorrectly reports Javascript to characters() in SAX ContentHandler - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2018/10/18 11:11:00 UTC, 1 replies.
- [jira] [Created] (TIKA-2760) LinkContentHandler does not report hyperlinks - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2018/10/18 12:30:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2760) LinkContentHandler does not report hyperlinks - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2018/10/18 12:31:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-2744) rss+xml doesnt accept files with .xml extension - posted by "Martin (JIRA)" <ji...@apache.org> on 2018/10/18 12:32:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2760) LinkContentHandler does not report hyperlinks - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2018/10/18 12:35:00 UTC, 2 replies.
- [jira] [Created] (TIKA-2761) XML Structured Text Is Missing Metadata Fields for mp3 files - posted by "Nick Sincaglia (JIRA)" <ji...@apache.org> on 2018/10/18 22:04:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2762) Capture short fields (<150 chars) in EnviParserHeader Metadata - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2018/10/19 23:02:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2762) Capture short fields (<150 chars) in EnviParserHeader Metadata - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/10/19 23:04:01 UTC, 7 replies.
- [jira] [Created] (TIKA-2763) PDFParser - java.io.IOException: Missing root object specification in trailer - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2018/10/20 00:27:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2763) PDFParser - java.io.IOException: Missing root object specification in trailer - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/20 15:05:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2758) Possible error charset detection - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/10/20 15:07:00 UTC, 5 replies.
- [jira] [Comment Edited] (TIKA-2763) PDFParser - java.io.IOException: Missing root object specification in trailer - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/20 15:23:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2758) Possible error charset detection - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2018/10/20 19:52:00 UTC, 1 replies.
- [jira] [Closed] (TIKA-2734) Tika addes extra characters at the end of text in extracting from excel file - posted by "feng ye (JIRA)" <ji...@apache.org> on 2018/10/22 03:50:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2764) Allow configuration to include/not deleted text in WordPerfect 6.x files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/22 16:21:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2761) XML Structured Text Is Missing Metadata Fields for mp3 files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/22 16:34:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2764) Allow configuration to include/not deleted text in WordPerfect 6.x files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/22 16:34:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2759) ScriptsExtractor incorrectly reports Javascript to characters() in SAX ContentHandler - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/22 16:58:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2759) ScriptsExtractor incorrectly reports Javascript to characters() in SAX ContentHandler - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/22 17:00:00 UTC, 4 replies.
- [jira] [Commented] (TIKA-2764) Allow configuration to include/not deleted text in WordPerfect 6.x files - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/10/22 17:12:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-2761) XML Structured Text Is Missing Metadata Fields for mp3 files - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/10/22 17:12:00 UTC, 2 replies.
- [jira] [Resolved] (TIKA-2763) PDFParser - java.io.IOException: Missing root object specification in trailer - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2018/10/24 06:15:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2674) Update some dependencies that are incompatible with Java 11 - posted by "Lukasz Lech (JIRA)" <ji...@apache.org> on 2018/10/24 08:21:00 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-2756) Switch to commons-lang 3 - posted by "Dietrich Travkin (JIRA)" <ji...@apache.org> on 2018/10/24 12:07:00 UTC, 2 replies.
- [jira] [Created] (TIKA-2765) Regression extracting text from corrupted docx files - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2018/10/24 13:49:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2765) Regression extracting text from corrupted docx files - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2018/10/24 13:49:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2765) Regression extracting text from corrupted docx files - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2018/10/24 13:50:00 UTC, 4 replies.
- [jira] [Created] (TIKA-2766) Be able to extract raw values from excel, not formatted - posted by "JTB Development (JIRA)" <ji...@apache.org> on 2018/10/25 12:43:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2767) Problem with import xlsx - posted by "ionut hodor (JIRA)" <ji...@apache.org> on 2018/10/26 10:01:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2767) Problem with import xlsx - posted by "ionut hodor (JIRA)" <ji...@apache.org> on 2018/10/26 10:03:00 UTC, 2 replies.
- [jira] [Updated] (TIKA-2767) Problem with import xlsx with null cells - posted by "ionut hodor (JIRA)" <ji...@apache.org> on 2018/10/26 10:15:00 UTC, 1 replies.
- [jira] [Updated] (TIKA-2750) Update regression corpus - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/26 15:36:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2599) Hyperlink surrounded by Italics not closed Properly - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/10/29 21:44:00 UTC, 7 replies.
- [jira] [Assigned] (TIKA-2599) Hyperlink surrounded by Italics not closed Properly - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/10/29 21:48:01 UTC, 0 replies.
- [jira] [Updated] (TIKA-2599) Hyperlink surrounded by Italics not closed Properly - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/10/29 21:48:01 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2599) Hyperlink surrounded by Italics not closed Properly - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/10/29 22:09:01 UTC, 0 replies.
- [jira] [Commented] (TIKA-2479) Handle empty cells in tables uniformly - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/10/29 22:22:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-2767) Problem with import xlsx with null cells - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/10/29 22:24:00 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2630) Wrong height and width metadata for JPEG images - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/10/29 23:24:00 UTC, 0 replies.
- [jira] [Commented] (TIKA-2630) Wrong height and width metadata for JPEG images - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2018/10/29 23:53:00 UTC, 7 replies.
- [jira] [Created] (TIKA-2768) While parsing pdf documents with PDFParser, the marking for bold characters is lost - posted by "Phanindra Ramesh (JIRA)" <ji...@apache.org> on 2018/10/30 10:50:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2768) While parsing pdf documents with PDFParser, the marking for bold characters is lost - posted by "Phanindra Ramesh (JIRA)" <ji...@apache.org> on 2018/10/30 10:51:00 UTC, 0 replies.
- [jira] [Created] (TIKA-2769) Error while using tika-app on some docs - posted by "IvanSorokin (JIRA)" <ji...@apache.org> on 2018/10/30 11:40:01 UTC, 0 replies.
- [jira] [Updated] (TIKA-2769) Error while using tika-app on some docs - posted by "IvanSorokin (JIRA)" <ji...@apache.org> on 2018/10/30 11:42:00 UTC, 2 replies.
- [jira] [Commented] (TIKA-2769) Error while using tika-app on some docs - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/30 14:21:00 UTC, 7 replies.
- [jira] [Commented] (TIKA-2751) Upgrade to POI 4.0.1 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/30 15:39:00 UTC, 0 replies.
- tika-2.x-windows - Build # 339 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/10/31 04:09:09 UTC, 0 replies.
- [jira] [Commented] (TIKA-2766) Be able to extract raw values from excel, not formatted - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/10/31 11:29:00 UTC, 0 replies.
- [jira] [Updated] (TIKA-2735) notes and footer contents are duplicated in extracting text from power point slides - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/31 17:52:00 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2735) notes and footer contents are duplicated in extracting text from power point slides - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/31 17:53:00 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2735) notes and footer contents are duplicated in extracting text from power point slides - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/10/31 17:53:00 UTC, 0 replies.