- [jira] [Commented] (TIKA-1991) Incorporate latest version of bouncy castle library - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/01 00:22:12 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1991) Incorporate latest version of bouncy castle library - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/01 00:22:12 UTC, 0 replies.
- [jira] [Commented] (TIKA-1986) support parser parameters with type (int, double, etc) in configuration XML file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/01 00:25:12 UTC, 32 replies.
- [jira] [Created] (TIKA-1992) Check for duplicate inline images via COSStream not name in PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/02 00:42:59 UTC, 0 replies.
- [jira] [Created] (TIKA-1993) Image Recognition with Tika - posted by "Thamme Gowda (JIRA)" <ji...@apache.org> on 2016/06/02 01:33:59 UTC, 0 replies.
- [jira] [Commented] (TIKA-1992) Check for duplicate inline images via COSStream not name in PDFParser - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/02 02:04:59 UTC, 2 replies.
- tika-2.x-windows - Build # 10 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/02 02:16:19 UTC, 0 replies.
- [jira] [Assigned] (TIKA-1994) Integrate OCR with PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/02 15:27:59 UTC, 0 replies.
- [jira] [Created] (TIKA-1994) Integrate OCR with PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/02 15:27:59 UTC, 0 replies.
- [jira] [Updated] (TIKA-1994) Integrate OCR with PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/02 15:29:59 UTC, 0 replies.
- [jira] [Commented] (TIKA-1994) Integrate OCR with PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/02 16:07:59 UTC, 11 replies.
- [jira] [Commented] (TIKA-1984) Add configurability for language detection to BasicContentHandlerFactory - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2016/06/02 22:48:59 UTC, 1 reply.
- [jira] [Comment Edited] (TIKA-1994) Integrate OCR with PDFParser - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2016/06/03 01:09:59 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1994) Integrate OCR with PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/03 18:53:59 UTC, 0 replies.
- [jira] [Created] (TIKA-1995) Improve OCR Strategy options for the PDFParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/03 18:55:59 UTC, 0 replies.
- tika-2.x-windows - Build # 11 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/03 19:16:21 UTC, 0 replies.
- Re: Updates on SentimentAnalysisParser - posted by Anthony Beylerian <an...@gmail.com> on 2016/06/04 13:19:22 UTC, 0 replies.
- [jira] [Commented] (TIKA-1821) Problem in Tika().detect for xml file signed in CADES - posted by "Michele Andreano (JIRA)" <ji...@apache.org> on 2016/06/06 15:08:21 UTC, 0 replies.
- [jira] [Created] (TIKA-1996) Upgrade to PDFBox 2.0.2 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/06 18:52:21 UTC, 0 replies.
- [jira] [Created] (TIKA-1997) Problem in Tika().detect for xml file signed in CADES - posted by "Michele Andreano (JIRA)" <ji...@apache.org> on 2016/06/07 08:06:20 UTC, 0 replies.
- [jira] [Updated] (TIKA-1997) Problem in Tika().detect for xml file signed in CADES - posted by "Michele Andreano (JIRA)" <ji...@apache.org> on 2016/06/07 08:09:21 UTC, 3 replies.
- [jira] [Commented] (TIKA-1997) Problem in Tika().detect for xml file signed in CADES - posted by "Michele Andreano (JIRA)" <ji...@apache.org> on 2016/06/07 08:09:21 UTC, 0 replies.
- [jira] [Created] (TIKA-1998) jhighlight license concerns - posted by "Daniel Gratzl (JIRA)" <ji...@apache.org> on 2016/06/07 10:56:20 UTC, 0 replies.
- [jira] [Closed] (TIKA-1998) jhighlight license concerns - posted by "Daniel Gratzl (JIRA)" <ji...@apache.org> on 2016/06/07 11:07:20 UTC, 0 replies.
- [jira] [Created] (TIKA-1999) org.apache.tika.sax.ToXMLContentHandler$ElementInfo.getPrefix(ToXMLContentHandler.java:58) - posted by "Egbert (JIRA)" <ji...@apache.org> on 2016/06/07 14:44:21 UTC, 0 replies.
- [jira] [Updated] (TIKA-1999) org.apache.tika.sax.ToXMLContentHandler$ElementInfo.getPrefix(ToXMLContentHandler.java:58) - posted by "Egbert (JIRA)" <ji...@apache.org> on 2016/06/07 14:53:21 UTC, 0 replies.
- Profiler for OpenNLP - posted by Anthony Beylerian <an...@gmail.com> on 2016/06/07 16:36:12 UTC, 3 replies.
- [jira] [Assigned] (TIKA-1999) org.apache.tika.sax.ToXMLContentHandler$ElementInfo.getPrefix(ToXMLContentHandler.java:58) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/08 00:14:21 UTC, 0 replies.
- [jira] [Commented] (TIKA-1999) org.apache.tika.sax.ToXMLContentHandler$ElementInfo.getPrefix(ToXMLContentHandler.java:58) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/08 00:17:21 UTC, 9 replies.
- [jira] [Commented] (TIKA-1817) Extracts entire file content for ASCII DXF files - posted by "Zoltan Toth (JIRA)" <ji...@apache.org> on 2016/06/08 05:24:21 UTC, 0 replies.
- [jira] [Created] (TIKA-2000) Author profile parser - posted by "Anthony Beylerian (JIRA)" <ji...@apache.org> on 2016/06/08 10:29:21 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1999) org.apache.tika.sax.ToXMLContentHandler$ElementInfo.getPrefix(ToXMLContentHandler.java:58) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/08 15:56:21 UTC, 0 replies.
- tika-2.x-windows - Build # 12 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/08 16:16:35 UTC, 0 replies.
- tika-2.x-windows - Build # 13 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/08 18:16:31 UTC, 0 replies.
- [jira] [Updated] (TIKA-2000) Author profile parser - posted by "Anthony Beylerian (JIRA)" <ji...@apache.org> on 2016/06/09 10:41:21 UTC, 5 replies.
- [jira] [Created] (TIKA-2001) Parsing XML outputs empty string - posted by "George L. Yermulnik (JIRA)" <ji...@apache.org> on 2016/06/09 11:43:20 UTC, 0 replies.
- [jira] [Commented] (TIKA-2001) Parsing XML outputs empty string - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2016/06/09 12:09:21 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-2001) Parsing XML outputs empty string - posted by "George L. Yermulnik (JIRA)" <ji...@apache.org> on 2016/06/09 15:40:21 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1966) Issue in parsing iWorksDocument with Apache Tika - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/09 17:38:21 UTC, 0 replies.
- [jira] [Commented] (TIKA-1358) Add support for newer iWork file formats - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/09 17:42:21 UTC, 12 replies.
- Re: About tika-python error - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2016/06/10 15:08:29 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-1358) Add support for newer iWork file formats - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/10 20:21:21 UTC, 4 replies.
- [jira] [Created] (TIKA-2002) ExternalParser.check(...) hangs since STDOUT and STDERR buffers are not being emptied - posted by "Thamme Gowda (JIRA)" <ji...@apache.org> on 2016/06/11 01:43:21 UTC, 0 replies.
- [GitHub] tika pull request #125: TIKA-1993: ObjectRecognitionParser + Tensorflow imag... - posted by thammegowda <gi...@git.apache.org> on 2016/06/12 03:26:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-1993) Image Recognition with Tika - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/06/12 03:27:21 UTC, 3 replies.
- [jira] [Updated] (TIKA-1988) Tika parser for extracting text based features - posted by "Madhav Sharan (JIRA)" <ji...@apache.org> on 2016/06/12 22:16:20 UTC, 1 reply.
- [jira] [Updated] (TIKA-1358) Add support for newer iWork file formats - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/13 12:41:21 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1996) Upgrade to PDFBox 2.0.2 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/13 13:25:20 UTC, 0 replies.
- $PROJECT_NAME - Build # $BUILD_NUMBER - $BUILD_STATUS - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/13 13:36:01 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1986) support parser parameters with type (int, double, etc) in configuration XML file - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/13 14:06:21 UTC, 3 replies.
- tika-2.x-windows - Build # 14 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/13 14:16:44 UTC, 0 replies.
- [jira] [Commented] (TIKA-1996) Upgrade to PDFBox 2.0.2 when available - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/13 14:17:21 UTC, 4 replies.
- [jira] [Updated] (TIKA-2003) Tika 1.13 gpg signature not validating. - posted by "STEPHEN DURHAM (JIRA)" <ji...@apache.org> on 2016/06/13 16:05:21 UTC, 0 replies.
- [jira] [Created] (TIKA-2003) Tika 1.13 gpg signature not validating. - posted by "STEPHEN DURHAM (JIRA)" <ji...@apache.org> on 2016/06/13 16:05:21 UTC, 0 replies.
- [jira] [Commented] (TIKA-2003) Tika 1.13 gpg signature not validating. - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2016/06/13 16:50:20 UTC, 2 replies.
- Build step 'Execute shell' marked build as failure in tika-2.x-windows Jenkins build - posted by lewis john mcgibbney <le...@apache.org> on 2016/06/13 16:54:37 UTC, 0 replies.
- [jira] [Created] (TIKA-2004) Add mime detection for Windows Media Metafile, PRONOM: application/x-puid-fmt-584 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/14 17:15:09 UTC, 0 replies.
- [jira] [Created] (TIKA-2006) Add mime detection for vcalendar - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/14 18:01:27 UTC, 0 replies.
- [jira] [Created] (TIKA-2005) Add mime detection for vcard - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/14 18:01:27 UTC, 0 replies.
- [jira] [Updated] (TIKA-2006) Add mime detection for vCalendar and iCalendar - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/14 18:49:27 UTC, 0 replies.
- [jira] [Created] (TIKA-2007) Tika 1.13 uses vulnerable version of jackson-core: CVE-2016-3720 - posted by "Goetz Neumann (JIRA)" <ji...@apache.org> on 2016/06/14 21:11:30 UTC, 0 replies.
- [jira] [Updated] (TIKA-2007) Tika 1.13 uses vulnerable version of jackson-core: CVE-2016-3720 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 11:19:09 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2006) Add mime detection for vCalendar and iCalendar - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 11:19:09 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2005) Add mime detection for vcard - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 11:30:10 UTC, 0 replies.
- [jira] [Updated] (TIKA-2004) Add mime detection for Windows Media Metafile, PRONOM: application/x-puid-fmt-584 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 11:46:09 UTC, 0 replies.
- [jira] [Commented] (TIKA-2004) Add mime detection for Windows Media Metafile, PRONOM: application/x-puid-fmt-584 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 11:50:09 UTC, 4 replies.
- [jira] [Resolved] (TIKA-2004) Add mime detection for Windows Media Metafile, PRONOM: application/x-puid-fmt-584 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 12:31:09 UTC, 0 replies.
- [jira] [Created] (TIKA-2008) Add mime detection (and parser?) for MSOffice Owner File (PRONOM fmt/473) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 12:35:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-2008) Add mime detection (and parser?) for MSOffice Owner File (PRONOM fmt/473) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 12:35:09 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2003) Tika 1.13 gpg signature not validating. - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2016/06/15 13:17:09 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2008) Add mime detection (and parser?) for MSOffice Owner File (PRONOM fmt/473) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 13:24:09 UTC, 0 replies.
- [jira] [Created] (TIKA-2009) Add magic for djvu - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 14:02:09 UTC, 0 replies.
- [jira] [Created] (TIKA-2010) Unable to get value when header is incorrect - posted by "Florent Valdelievre (JIRA)" <ji...@apache.org> on 2016/06/15 14:05:09 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2009) Add magic for djvu - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 14:08:09 UTC, 0 replies.
- [jira] [Updated] (TIKA-2006) Add magic for vCalendar and iCalendar - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 14:23:10 UTC, 0 replies.
- [jira] [Commented] (TIKA-2010) Unable to get value when header is incorrect - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2016/06/15 14:27:09 UTC, 2 replies.
- [jira] [Updated] (TIKA-2010) Unable to get value when header is incorrect - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2016/06/15 14:28:09 UTC, 0 replies.
- [jira] [Created] (TIKA-2011) Add mime detection for Endnote Import File (PRONOM: fmt/328) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 14:38:09 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2011) Add mime detection for Endnote Import File (PRONOM: fmt/328) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/15 14:46:09 UTC, 0 replies.
- tika-2.x-windows - Build # 15 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/15 17:24:07 UTC, 0 replies.
- [jira] [Commented] (TIKA-2011) Add mime detection for Endnote Import File (PRONOM: fmt/328) - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/15 18:53:09 UTC, 0 replies.
- [jira] [Commented] (TIKA-2006) Add magic for vCalendar and iCalendar - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/15 18:53:09 UTC, 1 reply.
- [jira] [Commented] (TIKA-2008) Add mime detection (and parser?) for MSOffice Owner File (PRONOM fmt/473) - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/15 18:53:09 UTC, 1 reply.
- [jira] [Commented] (TIKA-2009) Add magic for djvu - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/15 18:53:09 UTC, 1 reply.
- tika-2.x - Build # 111 - Failure - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/15 19:14:44 UTC, 0 replies.
- [jira] [Created] (TIKA-2012) PGP key missing from KEYS - posted by "Ryan Kimbrell (JIRA)" <ji...@apache.org> on 2016/06/16 02:31:05 UTC, 0 replies.
- [jira] [Closed] (TIKA-2012) PGP key missing from KEYS - posted by "Ryan Kimbrell (JIRA)" <ji...@apache.org> on 2016/06/16 03:08:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-2012) PGP key missing from KEYS - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2016/06/16 08:50:05 UTC, 0 replies.
- [jira] [Created] (TIKA-2013) Upgrade to POI 3.15-beta2 when available - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/16 11:00:11 UTC, 0 replies.
- [jira] [Created] (TIKA-2014) Unable to parse doc file - posted by "Richa Garg (JIRA)" <ji...@apache.org> on 2016/06/17 05:49:05 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2014) Unable to parse doc file - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2016/06/17 10:26:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-1836) Convertion DOC->TXT failed due to POI issue - posted by "Richa Garg (JIRA)" <ji...@apache.org> on 2016/06/17 10:41:05 UTC, 2 replies.
- Fwd: [jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available - posted by Lewis John Mcgibbney <le...@gmail.com> on 2016/06/17 14:46:59 UTC, 3 replies.
- doubling of body tag in HTMLParser? - posted by "Allison, Timothy B." <ta...@mitre.org> on 2016/06/17 18:09:05 UTC, 1 reply.
- [jira] [Reopened] (TIKA-995) XHTMLContentHandler doesn't pass attributes of body element - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/17 18:19:05 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-995) XHTMLContentHandler doesn't pass attributes of body element - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/17 18:23:05 UTC, 0 replies.
- Sentiment Analysis Parser updates - posted by Anastasija Mensikova <me...@gmail.com> on 2016/06/17 21:28:45 UTC, 7 replies.
- [jira] [Created] (TIKA-2015) MAPIMessage String fileName constructor leaves file open - posted by "Tim Barrett (JIRA)" <ji...@apache.org> on 2016/06/18 08:45:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-2000) Author profile parser - posted by "Anthony Beylerian (JIRA)" <ji...@apache.org> on 2016/06/19 14:50:05 UTC, 0 replies.
- [jira] [Commented] (TIKA-2015) MAPIMessage String fileName constructor leaves file open - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2016/06/19 22:18:05 UTC, 0 replies.
- regression corpus/vm discussions - posted by "Allison, Timothy B." <ta...@mitre.org> on 2016/06/23 14:12:09 UTC, 1 reply.
- [jira] [Created] (TIKA-2016) A parser that combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. - posted by "Anastasija Mensikova (JIRA)" <ji...@apache.org> on 2016/06/23 19:12:16 UTC, 0 replies.
- [jira] [Created] (TIKA-2017) Tika Server Cannot handle large files - posted by "Harshavardhan Manjunatha (JIRA)" <ji...@apache.org> on 2016/06/23 22:59:16 UTC, 0 replies.
- [jira] [Updated] (TIKA-2017) Tika Server Cannot handle large files - posted by "Harshavardhan Manjunatha (JIRA)" <ji...@apache.org> on 2016/06/23 23:01:16 UTC, 0 replies.
- [jira] [Commented] (TIKA-2017) Tika Server Cannot handle large files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2016/06/23 23:09:16 UTC, 2 replies.
- [vm] mimes of files in our corpus - posted by "Allison, Timothy B." <ta...@mitre.org> on 2016/06/24 11:09:42 UTC, 2 replies.
- [jira] [Comment Edited] (TIKA-2017) Tika Server Cannot handle large files - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/24 11:18:16 UTC, 0 replies.
- [jira] [Updated] (TIKA-2017) Tika Server Cannot handle large files; add option for metadata only - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/24 11:26:16 UTC, 0 replies.
- [jira] [Created] (TIKA-2018) Attempt to get Title from Full text if not present in MetaData ( Application/Pdf ) - posted by "Florent Valdelievre (JIRA)" <ji...@apache.org> on 2016/06/24 13:17:16 UTC, 0 replies.
- [jira] [Commented] (TIKA-2018) Attempt to get Title from Full text if not present in MetaData ( Application/Pdf ) - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/24 13:46:16 UTC, 3 replies.
- [jira] [Created] (TIKA-2019) WordMLParser and SpreadsheetMLParser incorrectly concatenate tokens with ToTextHandler - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/24 13:54:16 UTC, 0 replies.
- [jira] [Updated] (TIKA-2019) WordMLParser and SpreadsheetMLParser incorrectly concatenate tokens with ToTextHandler - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/24 14:00:22 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2019) WordMLParser and SpreadsheetMLParser incorrectly concatenate tokens with ToTextHandler - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/24 14:22:16 UTC, 0 replies.
- [jira] [Created] (TIKA-2020) Tika 2.0 - remove AbstractParser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/24 14:29:16 UTC, 0 replies.
- [jira] [Commented] (TIKA-2019) WordMLParser and SpreadsheetMLParser incorrectly concatenate tokens with ToTextHandler - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/24 14:49:16 UTC, 2 replies.
- [jira] [Updated] (TIKA-2020) Tika 2.0 - remove AbstractParser's 3 parameter parse - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/24 15:13:16 UTC, 2 replies.
- tika-2.x-windows - Build # 16 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/24 15:16:28 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2020) Tika 2.0 - remove AbstractParser's 3 parameter parse - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/24 15:47:16 UTC, 0 replies.
- tika-2.x-windows - Build # 17 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/24 16:16:25 UTC, 0 replies.
- [jira] [Commented] (TIKA-2020) Tika 2.0 - remove AbstractParser's 3 parameter parse - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/24 16:17:16 UTC, 1 reply.
- [jira] [Created] (TIKA-2021) Improving accuracy of Tesseract parser - posted by "Zarana Parekh (JIRA)" <ji...@apache.org> on 2016/06/24 21:08:16 UTC, 0 replies.
- [GitHub] tika pull request #126: fix for TIKA-2021 contributed by Zarana Parekh - posted by Zarana-Parekh <gi...@git.apache.org> on 2016/06/25 02:32:24 UTC, 0 replies.
- [jira] [Commented] (TIKA-2021) Improving accuracy of Tesseract parser - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/06/25 02:33:16 UTC, 0 replies.
- [jira] [Updated] (TIKA-2021) Improving accuracy of Tesseract parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2016/06/25 02:37:16 UTC, 2 replies.
- [jira] [Assigned] (TIKA-2021) Improving accuracy of Tesseract parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2016/06/25 02:37:16 UTC, 0 replies.
- [jira] [Created] (TIKA-2022) Add applefile parser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/25 10:49:37 UTC, 0 replies.
- [jira] [Commented] (TIKA-2022) Add applefile parser - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/27 13:46:52 UTC, 5 replies.
- [jira] [Resolved] (TIKA-2022) Add applefile parser - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/27 14:23:52 UTC, 0 replies.
- [jira] [Resolved] (TIKA-1644) Mime type diffs between 1.8 and 1.9-rc1 - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/27 14:47:52 UTC, 0 replies.
- tika-2.x-windows - Build # 18 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/27 15:16:24 UTC, 0 replies.
- [GitHub] tika pull request #127: creation of TIKA-2016 contributed by amensiko - posted by amensiko <gi...@git.apache.org> on 2016/06/27 16:39:27 UTC, 0 replies.
- [jira] [Commented] (TIKA-2016) A parser that combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/06/27 16:39:51 UTC, 1 reply.
- [jira] [Updated] (TIKA-2016) A parser that combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2016/06/27 16:42:52 UTC, 2 replies.
- [jira] [Created] (TIKA-2023) Clean up RTFParser to use EndianUtils when extracting embedded objects - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/27 16:42:52 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2016) A parser that combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2016/06/27 16:42:52 UTC, 0 replies.
- [jira] [Updated] (TIKA-2023) Clean up RTFParser to use EndianUtils when extracting embedded objects - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/27 16:49:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-2017) Tika Server Cannot handle large files; add option for metadata only - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/27 16:53:52 UTC, 0 replies.
- [jira] [Commented] (TIKA-1715) Save embedded images into another location - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/27 17:05:52 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-1715) Save embedded images into another location - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/27 17:05:52 UTC, 0 replies.
- Metadata key for "original file location/name"? - posted by "Allison, Timothy B." <ta...@mitre.org> on 2016/06/27 17:08:16 UTC, 1 reply.
- tika-2.x-windows - Build # 19 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/27 17:16:26 UTC, 0 replies.
- [jira] [Commented] (TIKA-2023) Clean up RTFParser to use EndianUtils when extracting embedded objects - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/27 17:16:52 UTC, 2 replies.
- [jira] [Created] (TIKA-2024) Extract original filename/path when possible - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/27 17:20:52 UTC, 0 replies.
- [jira] [Created] (TIKA-2025) Extraction of long sequences of digits from Excel spreadsheets using Tika 1.13 doesn’t yield the expected results - posted by "Aeham Abushwashi (JIRA)" <ji...@apache.org> on 2016/06/27 19:42:52 UTC, 0 replies.
- [jira] [Updated] (TIKA-2025) Extraction of long sequences of digits from Excel spreadsheets using Tika 1.13 doesn’t yield the expected results - posted by "Aeham Abushwashi (JIRA)" <ji...@apache.org> on 2016/06/27 19:43:52 UTC, 0 replies.
- [jira] [Assigned] (TIKA-2025) Extraction of long sequences of digits from Excel spreadsheets using Tika 1.13 doesn’t yield the expected results - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/27 20:07:52 UTC, 0 replies.
- [jira] [Updated] (TIKA-1768) Document headers and footers in metadata - posted by "Aeham Abushwashi (JIRA)" <ji...@apache.org> on 2016/06/27 20:08:52 UTC, 0 replies.
- [jira] [Created] (TIKA-2026) Handle embedded comp_obj/ oleObject.bin files stored in PPT/X - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/28 12:56:57 UTC, 0 replies.
- [jira] [Commented] (TIKA-2026) Handle embedded comp_obj/ oleObject.bin files stored in PPT/X - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/28 12:58:57 UTC, 1 reply.
- [jira] [Updated] (TIKA-2026) Handle embedded comp_obj/ oleObject.bin files stored in PPT/X - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/28 12:58:57 UTC, 2 replies.
- [jira] [Commented] (TIKA-2024) Extract original filename/path when possible - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/28 15:44:57 UTC, 3 replies.
- [jira] [Updated] (TIKA-2026) Handle OLE 2.0 embedded non-Office document in PPT/X and XLSX - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/28 15:50:57 UTC, 1 reply.
- [jira] [Commented] (TIKA-2026) Handle OLE 2.0 embedded non-Office document in PPT/X and XLSX - posted by "Hudson (JIRA)" <ji...@apache.org> on 2016/06/28 17:44:57 UTC, 2 replies.
- [jira] [Resolved] (TIKA-2026) Handle OLE 2.0 embedded non-Office document in PPT/X and XLSX - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/29 00:59:10 UTC, 0 replies.
- tika-2.x-windows - Build # 20 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/29 01:16:33 UTC, 0 replies.
- tika-2.x-windows - Build # 21 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/29 11:16:22 UTC, 0 replies.
- tika-2.x-windows - Build # 22 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/29 12:16:20 UTC, 0 replies.
- [jira] [Resolved] (TIKA-2024) Extract original filename/path when possible - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/29 12:18:37 UTC, 0 replies.
- tika-2.x-windows - Build # 23 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/29 14:16:22 UTC, 0 replies.
- [jira] [Commented] (TIKA-2025) Extraction of long sequences of digits from Excel spreadsheets using Tika 1.13 doesn’t yield the expected results - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/29 18:53:05 UTC, 3 replies.
- [jira] [Comment Edited] (TIKA-2025) Extraction of long sequences of digits from Excel spreadsheets using Tika 1.13 doesn’t yield the expected results - posted by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/29 18:56:06 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-2018) Attempt to get Title from Full text if not present in MetaData ( Application/Pdf ) - posted by "Joeran (JIRA)" <ji...@apache.org> on 2016/06/30 09:16:10 UTC, 0 replies.
- [GitHub] tika pull request #124: TIKA-1978 Invocation of java.net.URL.equals(Object),... - posted by asfgit <gi...@git.apache.org> on 2016/06/30 19:26:11 UTC, 0 replies.
- [jira] [Commented] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL) - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/06/30 19:27:10 UTC, 2 replies.
- tika-2.x-windows - Build # 24 - Still Failing - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/06/30 20:16:24 UTC, 0 replies.
- Tika-Python: parsing PDFs and showing analytics - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2016/06/30 22:06:42 UTC, 0 replies.