You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Updated] (TIKA-1751) Use java.nio.file.Path in TikaConfig - posted by "Yaniv Kunda (JIRA)" <ji...@apache.org> on 2015/10/01 00:19:04 UTC, 2 replies.
- [jira] [Updated] (TIKA-1758) BatchCommandLineBuilder fails on systems with whitespace in path - posted by "Yaniv Kunda (JIRA)" <ji...@apache.org> on 2015/10/01 00:25:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-1758) BatchCommandLineBuilder fails on systems with whitespace in path - posted by "Yaniv Kunda (JIRA)" <ji...@apache.org> on 2015/10/01 00:26:04 UTC, 1 replies.
- [jira] [Resolved] (TIKA-1757) tika-batch tests fail on systems with whitespace or special chars in folder name - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/01 02:26:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1758) BatchCommandLineBuilder fails on systems with whitespace in path - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/01 02:27:04 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1758) BatchCommandLineBuilder fails on systems with whitespace in path - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/01 02:28:04 UTC, 0 replies.
- [jira] [Commented] (TIKA-1757) tika-batch tests fail on systems with whitespace or special chars in folder name - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/01 02:48:04 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1756) Update forbiddenapis to v2.0 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/01 15:26:26 UTC, 0 replies.
- [jira] [Commented] (TIKA-1744) Use java.nio.file.Path in TikaInputStream - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/01 15:37:26 UTC, 1 replies.
- Re: svn commit: r1706077 - /tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/gdal/TestGDALParser.java - posted by Tyler Palsulich <tp...@gmail.com> on 2015/10/01 15:39:34 UTC, 2 replies.
- [jira] [Commented] (TIKA-1756) Update forbiddenapis to v2.0 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/01 15:50:27 UTC, 0 replies.
- [jira] [Created] (TIKA-1759) Extract contributor metadata from supporting file formats - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/01 15:59:26 UTC, 0 replies.
- [jira] [Updated] (TIKA-1759) Extract contributor metadata from supporting file formats - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/01 16:00:43 UTC, 1 replies.
- [jira] [Commented] (TIKA-1759) Extract contributor metadata from supporting file formats - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/01 16:02:27 UTC, 6 replies.
- [jira] [Updated] (TIKA-1706) Bring back commons-io to tika-core - posted by "Yaniv Kunda (JIRA)" <ji...@apache.org> on 2015/10/01 16:57:26 UTC, 2 replies.
- [jira] [Created] (TIKA-1760) PDF index fulltext fails. - posted by "Arkady Zalkowitsch (JIRA)" <ji...@apache.org> on 2015/10/01 23:41:26 UTC, 0 replies.
- [jira] [Updated] (TIKA-1760) PDF index fulltext fails. - posted by "Arkady Zalkowitsch (JIRA)" <ji...@apache.org> on 2015/10/01 23:44:26 UTC, 1 replies.
- [jira] [Commented] (TIKA-1760) PDF index fulltext fails. - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/02 12:00:29 UTC, 2 replies.
- [jira] [Created] (TIKA-1761) Error Parsing PPT (97-2003) files with password protection against modification which were created using Office 2013 - posted by "Andriy Budzinskyy (JIRA)" <ji...@apache.org> on 2015/10/02 12:13:26 UTC, 0 replies.
- [jira] [Updated] (TIKA-1761) Error Parsing PPT (97-2003) files with password protection against modification which were created using Office 2013 - posted by "Andriy Budzinskyy (JIRA)" <ji...@apache.org> on 2015/10/02 12:13:27 UTC, 0 replies.
- [jira] [Assigned] (TIKA-1761) Error Parsing PPT (97-2003) files with password protection against modification which were created using Office 2013 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/02 12:46:26 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1760) PDF index fulltext fails. - posted by "Arkady Zalkowitsch (JIRA)" <ji...@apache.org> on 2015/10/02 22:46:26 UTC, 1 replies.
- [jira] [Commented] (TIKA-1285) Upgrade to PDFBox 2.0.0 when available - posted by "Arkady Zalkowitsch (JIRA)" <ji...@apache.org> on 2015/10/03 00:04:28 UTC, 12 replies.
- [jira] [Commented] (TIKA-1743) NetworkParser can create Unbounded Number of Threads - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2015/10/04 03:43:27 UTC, 0 replies.
- [jira] [Created] (TIKA-1762) Create Executor Service from TikaConfig - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2015/10/04 03:43:27 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1285) Upgrade to PDFBox 2.0.0 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/05 15:15:27 UTC, 0 replies.
- [jira] [Created] (TIKA-1763) StringIndexOutOfBoundsException in ImageMetadataExtractor - posted by "Joseph North (JIRA)" <ji...@apache.org> on 2015/10/05 19:07:26 UTC, 0 replies.
- [GitHub] tika pull request: Fix for TIKA-1763 - posted by jrnorth <gi...@git.apache.org> on 2015/10/05 19:22:39 UTC, 1 replies.
- [jira] [Commented] (TIKA-1763) StringIndexOutOfBoundsException in ImageMetadataExtractor - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2015/10/05 19:23:26 UTC, 3 replies.
- [jira] [Commented] (TIKA-1741) Include CTAKESConfig.properties within tika-parsers resources by default - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/06 01:52:27 UTC, 1 replies.
- [jira] [Commented] (TIKA-1737) PDFBox 1.8.10 is still a basket case - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/06 02:11:26 UTC, 1 replies.
- Tika Executor Service Config - posted by Bob Paulin <bo...@bobpaulin.com> on 2015/10/07 00:39:23 UTC, 0 replies.
- [jira] [Created] (TIKA-1764) Provide information on failed document parsing in ParsingEmbeddedDocumentExtractor - posted by "Odilo Oehmichen (JIRA)" <ji...@apache.org> on 2015/10/07 13:52:26 UTC, 0 replies.
- [jira] [Commented] (TIKA-1764) Provide information on failed document parsing in ParsingEmbeddedDocumentExtractor - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/07 14:35:26 UTC, 3 replies.
- [jira] [Created] (TIKA-1765) Some doc and docx store multiple authors as semi-colon delimited list - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/07 15:45:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-1765) Some doc and docx store multiple authors as semi-colon delimited list - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/07 20:53:26 UTC, 1 replies.
- [jira] [Updated] (TIKA-1741) Include CTAKESConfig.properties within tika-parsers resources by default - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/07 21:19:27 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1741) Include CTAKESConfig.properties within tika-parsers resources by default - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/10/07 21:20:27 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1765) Some doc and docx store multiple authors as semi-colon delimited list - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/08 03:32:26 UTC, 0 replies.
- [jira] [Commented] (TIKA-1761) Error Parsing PPT (97-2003) files with password protection against modification which were created using Office 2013 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/08 03:42:27 UTC, 3 replies.
- [jira] [Resolved] (TIKA-1755) Make ppt and pptx paragraph/div breaks more consistent - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/08 04:24:26 UTC, 0 replies.
- [jira] [Commented] (TIKA-1755) Make ppt and pptx paragraph/div breaks more consistent - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/08 04:48:27 UTC, 0 replies.
- Educational Website seeking NLP Engineers - posted by es...@monkeytaleslearning.com on 2015/10/08 05:39:03 UTC, 0 replies.
- Revised Posting Please - Educational Website seeking NLP Engineers - posted by es...@monkeytaleslearning.com on 2015/10/08 05:42:50 UTC, 0 replies.
- [jira] [Closed] (TIKA-1763) StringIndexOutOfBoundsException in ImageMetadataExtractor - posted by "Joseph North (JIRA)" <ji...@apache.org> on 2015/10/08 06:53:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-1736) Bouncy Castle version binary incompatibility - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/09 00:47:26 UTC, 1 replies.
- [jira] [Resolved] (TIKA-1736) Bouncy Castle version binary incompatibility - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/09 01:35:26 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1764) Provide information on failed document parsing in ParsingEmbeddedDocumentExtractor - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/09 13:57:26 UTC, 0 replies.
- [jira] [Created] (TIKA-1766) Insecure repository reference - posted by "Ben McCann (JIRA)" <ji...@apache.org> on 2015/10/11 01:53:05 UTC, 0 replies.
- [jira] [Updated] (TIKA-1753) Improper word concatenation when extracting pdf - posted by "Ben McCann (JIRA)" <ji...@apache.org> on 2015/10/11 02:26:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1766) Insecure repository reference - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2015/10/11 14:52:05 UTC, 1 replies.
- Now, Apache Tika is a Perl module! - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/10/11 17:04:46 UTC, 0 replies.
- [jira] [Closed] (TIKA-1766) Insecure repository reference - posted by "Ben McCann (JIRA)" <ji...@apache.org> on 2015/10/11 18:32:05 UTC, 0 replies.
- [jira] [Created] (TIKA-1767) Values of .doc dropdowns are not parsed correctly - posted by "Matthew Williams (JIRA)" <ji...@apache.org> on 2015/10/12 10:10:05 UTC, 0 replies.
- ISO 19115 as a metadata model for Tika? - posted by Martin Desruisseaux <ma...@geomatys.com> on 2015/10/12 12:27:50 UTC, 8 replies.
- [jira] [Created] (TIKA-1768) Document headers and footers in metadata - posted by "Aeham Abushwashi (JIRA)" <ji...@apache.org> on 2015/10/12 18:28:05 UTC, 0 replies.
- [jira] [Updated] (TIKA-1768) Document headers and footers in metadata - posted by "Aeham Abushwashi (JIRA)" <ji...@apache.org> on 2015/10/12 18:29:06 UTC, 7 replies.
- [jira] [Created] (TIKA-1769) External parsers can't be used when using tika-bundle - posted by "Joseph North (JIRA)" <ji...@apache.org> on 2015/10/12 21:58:06 UTC, 0 replies.
- Issue with tika-core & tika-parsers - posted by Ravi Kishan Telu <ra...@tvarana.com> on 2015/10/13 07:55:48 UTC, 1 replies.
- [GitHub] tika pull request: lower priority on magic for application/xhtml+x... - posted by jeremybmerrill <gi...@git.apache.org> on 2015/10/13 22:04:35 UTC, 1 replies.
- [jira] [Created] (TIKA-1770) AutoDetectParser wrongly detects plain text as images/audio - posted by "Ziqi (JIRA)" <ji...@apache.org> on 2015/10/14 12:49:05 UTC, 0 replies.
- [jira] [Updated] (TIKA-1770) AutoDetectParser wrongly detects plain text as images/audio - posted by "Ziqi (JIRA)" <ji...@apache.org> on 2015/10/14 12:50:05 UTC, 0 replies.
- Apache Tika Jar Issues - posted by Telu Ravi Kishan <ra...@gmail.com> on 2015/10/14 14:23:00 UTC, 1 replies.
- Tika Tesseract configuration - posted by Aditya Dhulipala <ad...@usc.edu> on 2015/10/14 15:52:55 UTC, 2 replies.
- [jira] [Updated] (TIKA-1762) Create Executor Service from TikaConfig - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2015/10/15 03:35:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1762) Create Executor Service from TikaConfig - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2015/10/15 04:09:05 UTC, 5 replies.
- [jira] [Resolved] (TIKA-1762) Create Executor Service from TikaConfig - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2015/10/15 04:51:06 UTC, 0 replies.
- [jira] [Created] (TIKA-1771) lower magic priority xhtml magic priority to ensure emails detected as message/rfc822 - posted by "Jeremy B. Merrill (JIRA)" <ji...@apache.org> on 2015/10/15 22:34:05 UTC, 0 replies.
- [jira] [Updated] (TIKA-1771) lower magic priority xhtml magic priority to ensure emails detected as message/rfc822 - posted by "Jeremy B. Merrill (JIRA)" <ji...@apache.org> on 2015/10/15 22:35:05 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1762) Create Executor Service from TikaConfig - posted by "Bob Paulin (JIRA)" <ji...@apache.org> on 2015/10/16 00:45:05 UTC, 0 replies.
- [jira] [Created] (TIKA-1772) Mimetype of VTT files - posted by "Alexander Widera (JIRA)" <ji...@apache.org> on 2015/10/16 08:43:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1753) Improper word concatenation when extracting pdf - posted by "Ben McCann (JIRA)" <ji...@apache.org> on 2015/10/16 08:50:05 UTC, 0 replies.
- [jira] [Closed] (TIKA-1753) Improper word concatenation when extracting pdf - posted by "Ben McCann (JIRA)" <ji...@apache.org> on 2015/10/16 08:51:05 UTC, 0 replies.
- [GitHub] tika pull request: fix for TIKA-1772 contributed by wiedsche - posted by wiedsche <gi...@git.apache.org> on 2015/10/16 09:16:36 UTC, 1 replies.
- [jira] [Commented] (TIKA-1772) Mimetype of VTT files - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/16 09:17:05 UTC, 6 replies.
- [jira] [Created] (TIKA-1773) No XML Metadata output for JP2 files - posted by "Andreas Hirtzel (JIRA)" <ji...@apache.org> on 2015/10/16 11:20:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1773) No XML Metadata output for JP2 files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2015/10/16 11:58:05 UTC, 2 replies.
- [jira] [Updated] (TIKA-1773) No XML Metadata output for JP2 files - posted by "Andreas Hirtzel (JIRA)" <ji...@apache.org> on 2015/10/16 12:15:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1772) Mimetype of VTT files - posted by "Alexander Widera (JIRA)" <ji...@apache.org> on 2015/10/16 12:22:05 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1772) Mimetype of VTT files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2015/10/16 15:46:07 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1772) Mimetype of VTT files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2015/10/16 15:47:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1358) Add support for newer iWork file formats - posted by "Ben Summers (JIRA)" <ji...@apache.org> on 2015/10/16 18:25:05 UTC, 0 replies.
- [jira] [Created] (TIKA-1774) org.xml.sax.SAXException: Namespace http://www.w3.org/1999/xhtml not declared - posted by "Steve K (JIRA)" <ji...@apache.org> on 2015/10/18 17:33:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1774) org.xml.sax.SAXException: Namespace http://www.w3.org/1999/xhtml not declared - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2015/10/18 18:33:05 UTC, 1 replies.
- [jira] [Assigned] (TIKA-1771) lower magic priority xhtml magic priority to ensure emails detected as message/rfc822 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:19:05 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1771) lower magic priority xhtml magic priority to ensure emails detected as message/rfc822 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:22:05 UTC, 0 replies.
- [jira] [Updated] (TIKA-988) We don't extract a placeholder for a Word document embedded in an Excel document - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1508) Add uniformity to parser parameter configuration - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1709) Tika Server doesn't handle multi-part attachments or form-encoded inputs - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1208) Migrate Any23 mime contributions to Tika - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1688) Tika Version in Metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1425) Automatic batching of Microsoft service calls - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1059) Better Handling of InterruptedException in ExternalParser and ExternalEmbedder - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1513) Add mime detection and parsing for dbf files - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1745) Add methods accepting java.nio.file.Path to org.apache.tika.Tika and org.apache.tika.parser.ParsingReader - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 2 replies.
- [jira] [Updated] (TIKA-1674) Add example to show how to extract embedded files - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1390) Create tika-example module - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:06 UTC, 0 replies.
- [jira] [Updated] (TIKA-1456) Visual Sentiment API parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1516) Downgrade Rome dependency to 0.9 to avoid nasty NPE - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1505) chmparser breaks down when extracting from file of CHM format v3 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-891) Use POST in addition to PUT on method calls in tika-server - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1746) modify TikaFileTypeDetector to use new detect method accepting java.nio.file.Path - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1329) Add RecursiveParserWrapper aka Jukka's (and Nick's) RecursiveMetadataParser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1697) Parser Implementation for AkomaNtoso Legal XML Documents - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1598) Parser Implementation for Streaming Video - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-894) Add webapp mode for Tika Server, simplifies deployment - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1696) Language Identification with Text Processing Toolkit from MITLL - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1616) Tika Parser for GIBS Metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1705) Update ASM dependency to 5.0.4 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1657) Allow easier XML serialization of TikaConfig - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1343) Create a Tika Translator implementation that uses JoshuaDecoder - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1724) Create parser for .obo file format. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1276) Missing embedded dependencies in tika-bundle - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1367) Tika documentation should list tika-parsers parser dependencies - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1106) CLAVIN Integration - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1328) Translate Metadata and Content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1672) Integrate tika-java7 component - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:07 UTC, 0 replies.
- [jira] [Updated] (TIKA-1318) Use of Deprecated Word6Extractor.getParagraphText() Method - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1395) Create embedded image extraction example - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-987) Embedded drawing (SHAPE MERGEFORMAT) sometimes not extracted - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1738) ForkClient does not always delete temporary bootstrap jar - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1609) Leverage Google's LibPhonenumber for enhanced phone number extraction and metadata modeling - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1436) improvement to PDFParser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1640) Make ExternalParser support aliases for key names in extracted metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1108) Represent individual slides in pptx - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1379) error in Tika().detect for xml files with xades signature - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1607) Introduce new arbitrary object key/values data structure for persistence of Tika Metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1220) Parser implementration for IFC files - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1301) Establish TikaServer on Apache hosted VM - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1518) Docker with Tika Server - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-715) Some parsers produce non-well-formed XHTML SAX events - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1540) New Tika plugin for image based feature extraction using computer vision techniques - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1726) Augment public methods that use a java.io.File with methods that use a java.nio.file.Path - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-985) Support for HTML5 elements - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1366) Update some of Tika Server services to support JAX-RS 2.0 AsyncResponse - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:08 UTC, 0 replies.
- [jira] [Updated] (TIKA-1417) Create Extract Embedded Images from PDFs Example - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1577) NetCDF Data Extraction - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1465) Implement extraction of non-global variables from netCDF3 and netCDF4 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1295) Make some Dublin Core items multi-valued - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1435) Update rome dependency to 1.5 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-1308) Support in memory parse mode(don't create temp file): to support run Tika in GAE - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-774) ExifTool Parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-539) Encoding detection is too biased by encoding in meta tag - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-776) ExifTool Embedder - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-980) MicrodataContentHandler for Apache Tika - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/18 21:44:09 UTC, 0 replies.
- [DISCUSS] 1.11 RC #1 today - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/10/18 21:45:34 UTC, 0 replies.
- [jira] [Commented] (TIKA-1771) lower magic priority xhtml magic priority to ensure emails detected as message/rfc822 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/18 21:48:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1672) Integrate tika-java7 component - posted by "Yaniv Kunda (JIRA)" <ji...@apache.org> on 2015/10/18 22:00:06 UTC, 0 replies.
- [jira] [Created] (TIKA-1775) Failed to load Main-Class manifest attribute from tika-app-1.10.jar - posted by "QiaoMan (JIRA)" <ji...@apache.org> on 2015/10/19 06:21:05 UTC, 0 replies.
- [jira] [Assigned] (TIKA-1745) Add methods accepting java.nio.file.Path to org.apache.tika.Tika and org.apache.tika.parser.ParsingReader - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/19 07:25:05 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1745) Add methods accepting java.nio.file.Path to org.apache.tika.Tika and org.apache.tika.parser.ParsingReader - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/19 07:27:05 UTC, 0 replies.
- [jira] [Assigned] (TIKA-1746) modify TikaFileTypeDetector to use new detect method accepting java.nio.file.Path - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/19 07:28:05 UTC, 1 replies.
- [jira] [Resolved] (TIKA-1746) modify TikaFileTypeDetector to use new detect method accepting java.nio.file.Path - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/19 07:30:05 UTC, 0 replies.
- [jira] [Assigned] (TIKA-1751) Use java.nio.file.Path in TikaConfig - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/19 07:30:05 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1751) Use java.nio.file.Path in TikaConfig - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/19 07:34:05 UTC, 0 replies.
- [jira] [Assigned] (TIKA-1726) Augment public methods that use a java.io.File with methods that use a java.nio.file.Path - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/19 07:35:05 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1726) Augment public methods that use a java.io.File with methods that use a java.nio.file.Path - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/10/19 07:37:05 UTC, 0 replies.
- [jira] [Closed] (TIKA-1775) Failed to load Main-Class manifest attribute from tika-app-1.10.jar - posted by "QiaoMan (JIRA)" <ji...@apache.org> on 2015/10/19 07:47:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1751) Use java.nio.file.Path in TikaConfig - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/19 07:49:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1745) Add methods accepting java.nio.file.Path to org.apache.tika.Tika and org.apache.tika.parser.ParsingReader - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/19 07:49:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1746) modify TikaFileTypeDetector to use new detect method accepting java.nio.file.Path - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/19 07:49:05 UTC, 0 replies.
- [VOTE] Apache Tika 1.11 Release Candidate #1 - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/10/19 16:23:23 UTC, 6 replies.
- [jira] [Created] (TIKA-1776) tika stop converting at this pdf document - posted by "tranquillo (JIRA)" <ji...@apache.org> on 2015/10/20 09:03:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-1776) tika stop converting at this pdf document - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/20 14:47:27 UTC, 1 replies.
- [jira] [Commented] (TIKA-1379) error in Tika().detect for xml files with xades signature - posted by "Alessandro De Angelis (JIRA)" <ji...@apache.org> on 2015/10/20 16:31:27 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1776) tika stop converting at this pdf document - posted by "tranquillo (JIRA)" <ji...@apache.org> on 2015/10/20 19:07:27 UTC, 0 replies.
- [jira] [Closed] (TIKA-1776) tika stop converting at this pdf document - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/21 03:37:27 UTC, 0 replies.
- Re: Questions about using the Tika - posted by "Cao, Renzhi (MU-Student)" <rc...@mail.missouri.edu> on 2015/10/21 14:45:25 UTC, 0 replies.
- [jira] [Created] (TIKA-1777) Regression in spacing around differently formatted runs in PPT - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/21 16:11:27 UTC, 0 replies.
- [jira] [Created] (TIKA-1778) Regression in spacing around differently formatted runs in PPT - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/21 16:13:27 UTC, 0 replies.
- [jira] [Updated] (TIKA-1778) Regression in spacing around differently formatted runs in PPT - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/21 16:14:27 UTC, 0 replies.
- [jira] [Closed] (TIKA-1777) Regression in spacing around differently formatted runs in PPT - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/21 16:16:27 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1707) Upgrade to Apache POI 3.13 Beta 2 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/21 16:28:27 UTC, 3 replies.
- [jira] [Commented] (TIKA-1707) Upgrade to Apache POI 3.13 Beta 2 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/21 16:28:27 UTC, 3 replies.
- [jira] [Created] (TIKA-1779) different outputs between cmd & srv version - posted by "tranquillo (JIRA)" <ji...@apache.org> on 2015/10/21 23:14:27 UTC, 0 replies.
- [jira] [Updated] (TIKA-1707) Upgrade to Apache POI 3.13 Beta 2 - posted by "Andreas Beeker (JIRA)" <ji...@apache.org> on 2015/10/22 00:57:27 UTC, 1 replies.
- [jira] [Created] (TIKA-1780) Not common regression in AIOOBE for some ppts - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/22 02:55:27 UTC, 0 replies.
- [jira] [Updated] (TIKA-1780) Not common regression in AIOOBE for some ppts - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/22 02:55:27 UTC, 1 replies.
- [jira] [Commented] (TIKA-1780) Not common regression in AIOOBE for some ppts - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/22 02:59:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-1779) different outputs between cmd & srv version - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/22 03:03:27 UTC, 2 replies.
- [jira] [Resolved] (TIKA-1778) Regression in spacing around differently formatted runs in PPT - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/22 16:12:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-1777) Regression in spacing around differently formatted runs in PPT - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/10/22 16:53:27 UTC, 1 replies.
- [jira] [Created] (TIKA-1781) Tika generates broken XML file - posted by "tranquillo (JIRA)" <ji...@apache.org> on 2015/10/22 21:49:27 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1779) different outputs between cmd & srv version - posted by "tranquillo (JIRA)" <ji...@apache.org> on 2015/10/22 22:01:27 UTC, 2 replies.
- [jira] [Commented] (TIKA-1781) Tika generates broken XML file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/23 00:09:27 UTC, 0 replies.
- [GitHub] tika pull request: Add parse(file, metadata) method - posted by thammegowda <gi...@git.apache.org> on 2015/10/23 08:43:45 UTC, 1 replies.
- Need Help on enabling Basic Authentication in Apache Tika - posted by Rahul Khandelwal <ra...@gmail.com> on 2015/10/23 11:26:02 UTC, 0 replies.
- [RESULT] [VOTE] Apache Tika 1.11 Release Candidate #1 - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/10/25 19:22:46 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 1.11 release - posted by Chris Mattmann <ma...@apache.org> on 2015/10/26 06:49:16 UTC, 0 replies.
- [jira] [Created] (TIKA-1782) XHTMLContentHandler doesn't pass attributes of html element - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/10/26 15:46:27 UTC, 0 replies.
- [jira] [Updated] (TIKA-1782) XHTMLContentHandler doesn't pass attributes of html element - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/10/26 15:51:28 UTC, 0 replies.
- [jira] [Commented] (TIKA-1782) XHTMLContentHandler doesn't pass attributes of html element - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/26 16:49:27 UTC, 7 replies.
- [jira] [Resolved] (TIKA-1782) XHTMLContentHandler doesn't pass attributes of html element - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/27 13:48:27 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1782) XHTMLContentHandler doesn't pass attributes of html element - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/27 14:49:27 UTC, 0 replies.
- [jira] [Created] (TIKA-1783) Make the pdf parser extract text by area/region - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/10/28 20:54:27 UTC, 0 replies.
- Tika website links - posted by "André Warnier (tomcat)" <aw...@ice-sa.com> on 2015/10/29 18:24:25 UTC, 6 replies.
- [jira] [Commented] (TIKA-1769) External parsers can't be used when using tika-bundle - posted by "Joseph North (JIRA)" <ji...@apache.org> on 2015/10/29 21:35:27 UTC, 1 replies.
- [jira] [Created] (TIKA-1784) Use of ThreadLocal in Tika causes memory leaks and warnings in Tomcat - posted by "Aleksandr Dubinsky (JIRA)" <ji...@apache.org> on 2015/10/30 10:58:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-1784) Use of ThreadLocal in Tika causes memory leaks and warnings in Tomcat - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/30 14:46:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-1443) Add a junk text detector to Tika - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/30 15:08:27 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-1443) Add a junk text detector to Tika - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/10/30 15:22:27 UTC, 0 replies.
- [GitHub] tika pull request: NamedEntityParser - posted by thammegowda <gi...@git.apache.org> on 2015/10/31 05:53:46 UTC, 0 replies.