You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] Updated: (TIKA-446) Upgrade to PDFBox 1.3.1 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/01 00:05:23 UTC, 0 replies.
- [jira] Resolved: (TIKA-446) Upgrade to PDFBox 1.3.1 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/01 00:11:23 UTC, 0 replies.
- [jira] Commented: (TIKA-517) java.io.UnsupportedEncodingException with Russian, Chinese, ... document - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/01 00:53:23 UTC, 1 replies.
- [jira] Commented: (TIKA-536) Updated site layout - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/01 00:59:23 UTC, 0 replies.
- [jira] Commented: (TIKA-531) xmpTPg:NPages creates invalid XML - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/01 01:17:23 UTC, 1 replies.
- Hudson build is still unstable: Tika-trunk » Apache Tika parsers #395 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 02:03:42 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk #395 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 02:03:43 UTC, 5 replies.
- [jira] Resolved: (TIKA-490) Support for adding language profiles dynamically - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 06:21:23 UTC, 0 replies.
- [jira] Updated: (TIKA-524) Unification of HTML output from Office, OOXML and Open Document parsers - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 06:23:23 UTC, 0 replies.
- [jira] Updated: (TIKA-508) HtmlParser link processing should skip usemap and codebase attributes - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 06:25:23 UTC, 0 replies.
- [jira] Updated: (TIKA-497) HtmlHandler should fix up incorrect capitalization of names in attributes before putting into metadata - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 06:25:26 UTC, 1 replies.
- [jira] Updated: (TIKA-538) Add method get file extension from MimeTypes - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 06:29:28 UTC, 0 replies.
- [jira] Updated: (TIKA-526) OOXMLParser fails to extract text from within smart tags - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 06:29:31 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk » Apache Tika parsers #396 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 06:30:36 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk #396 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 06:30:37 UTC, 0 replies.
- [jira] Updated: (TIKA-390) Missing Header/Footer text for ODT documents - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 06:31:23 UTC, 0 replies.
- [jira] Updated: (TIKA-525) Mismatched start and end elements in HtmlParser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 06:31:25 UTC, 0 replies.
- [jira] Updated: (TIKA-533) Mis-detection of zip files as application/vnd.apple.iwork - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 06:31:26 UTC, 0 replies.
- [jira] Updated: (TIKA-539) Encoding detection is too biased by encoding in meta tag - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 07:13:24 UTC, 1 replies.
- [jira] Resolved: (TIKA-503) Add a ContentHandler for collecting links from parser output - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 07:15:28 UTC, 0 replies.
- [jira] Resolved: (TIKA-531) xmpTPg:NPages creates invalid XML - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 07:17:26 UTC, 0 replies.
- [jira] Updated: (TIKA-503) Add a ContentHandler for collecting links from parser output - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/01 07:19:23 UTC, 0 replies.
- 0.8 release: latest status - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/11/01 07:22:39 UTC, 7 replies.
- Hudson build is still unstable: Tika-trunk » Apache Tika parsers #397 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 08:01:16 UTC, 0 replies.
- Hudson build is still unstable: Tika-trunk #397 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 08:01:18 UTC, 0 replies.
- [jira] Commented: (TIKA-373) Upgrade to POI 3.7 - posted by "Attila Király (JIRA)" <ji...@apache.org> on 2010/11/01 10:13:23 UTC, 0 replies.
- Java 6 (Was: Hudson build is still unstable: Tika-trunk #395) - posted by Jukka Zitting <jz...@adobe.com> on 2010/11/01 14:39:03 UTC, 0 replies.
- Hudson build is back to stable : Tika-trunk » Apache Tika parsers #398 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 15:49:29 UTC, 0 replies.
- Hudson build is back to stable : Tika-trunk #398 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 15:49:31 UTC, 0 replies.
- Hudson build is back to normal : Tika-trunk » Apache Tika application #400 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 22:07:39 UTC, 0 replies.
- Hudson build is back to normal : Tika-trunk #400 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 22:07:42 UTC, 0 replies.
- [jira] Resolved: (TIKA-373) Upgrade to POI 3.7 - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/01 23:52:24 UTC, 0 replies.
- [jira] Created: (TIKA-541) Use commons-cli in lieu of writing our own option parser - posted by "Hasan Diwan (JIRA)" <ji...@apache.org> on 2010/11/02 07:58:23 UTC, 0 replies.
- [jira] Updated: (TIKA-541) Use commons-cli in lieu of writing our own option parser - posted by "Hasan Diwan (JIRA)" <ji...@apache.org> on 2010/11/02 08:00:26 UTC, 0 replies.
- [jira] Issue Comment Edited: (TIKA-517) java.io.UnsupportedEncodingException with Russian, Chinese, ... document - posted by "Dominique Béjean (JIRA)" <ji...@apache.org> on 2010/11/02 13:02:25 UTC, 1 replies.
- [jira] Closed: (TIKA-517) java.io.UnsupportedEncodingException with Russian, Chinese, ... document - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/02 14:12:25 UTC, 0 replies.
- [jira] Updated: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers - posted by "Jan Høydahl (JIRA)" <ji...@apache.org> on 2010/11/03 02:06:24 UTC, 1 replies.
- [jira] Commented: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers - posted by "Jan Høydahl (JIRA)" <ji...@apache.org> on 2010/11/03 02:18:24 UTC, 0 replies.
- [jira] Updated: (TIKA-527) Allow override mapping mime<-->parsers through config - posted by "Jan Høydahl (JIRA)" <ji...@apache.org> on 2010/11/03 02:34:24 UTC, 0 replies.
- Build problem with trunk? - posted by Benson Margulies <bi...@gmail.com> on 2010/11/04 13:21:45 UTC, 2 replies.
- [jira] Created: (TIKA-542) Publish Javadoc on tika.apache.org - posted by "Benson Margulies (JIRA)" <ji...@apache.org> on 2010/11/04 13:33:53 UTC, 0 replies.
- Boilerpipe is nice, but what about readability? - posted by Benson Margulies <bi...@gmail.com> on 2010/11/04 14:02:10 UTC, 0 replies.
- [jira] Created: (TIKA-543) Remove rome 1.0 dependency on java.net repository - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/04 14:47:44 UTC, 0 replies.
- [jira] Commented: (TIKA-543) Remove rome 1.0 dependency on java.net repository - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/04 14:59:43 UTC, 2 replies.
- [jira] Commented: (TIKA-466) Feed Parser - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/04 14:59:54 UTC, 0 replies.
- Charset SPI - posted by Benson Margulies <bi...@gmail.com> on 2010/11/04 15:08:48 UTC, 2 replies.
- [jira] Resolved: (TIKA-542) Publish Javadoc on tika.apache.org - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/04 15:41:41 UTC, 0 replies.
- [jira] Closed: (TIKA-531) xmpTPg:NPages creates invalid XML - posted by "Sjoerd Smeets (JIRA)" <ji...@apache.org> on 2010/11/04 16:32:44 UTC, 0 replies.
- [jira] Commented: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/04 17:09:44 UTC, 1 replies.
- [jira] Commented: (TIKA-540) extract text from .docx footnotes - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/05 14:03:42 UTC, 0 replies.
- [jira] Resolved: (TIKA-540) extract text from .docx footnotes - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/05 14:05:42 UTC, 0 replies.
- [jira] Updated: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/05 14:34:42 UTC, 0 replies.
- [jira] Resolved: (TIKA-462) Add Boilerpipe 1.0.4 to Maven central and remove java.net repository from parser pom - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/05 14:54:41 UTC, 0 replies.
- [jira] Created: (TIKA-544) AutoDetectParser ignores charset in Content-Type metadata - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/05 21:14:47 UTC, 0 replies.
- [jira] Closed: (TIKA-544) AutoDetectParser ignores charset in Content-Type metadata - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/05 22:30:46 UTC, 0 replies.
- [jira] Commented: (TIKA-539) Encoding detection is too biased by encoding in meta tag - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/05 22:36:42 UTC, 3 replies.
- [jira] Issue Comment Edited: (TIKA-539) Encoding detection is too biased by encoding in meta tag - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/05 22:36:45 UTC, 2 replies.
- My ApacheConNA 2010 slides - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/11/06 20:52:15 UTC, 0 replies.
- [jira] Updated: (TIKA-543) Remove rome 1.0 dependency on java.net repository - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/07 00:48:22 UTC, 0 replies.
- [jira] Resolved: (TIKA-543) Remove rome 1.0 dependency on java.net repository - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/07 00:48:32 UTC, 0 replies.
- [jira] Assigned: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/07 00:54:23 UTC, 0 replies.
- [jira] Resolved: (TIKA-537) Command line option --list-parsers should list 2nd level parsers below CompositeParsers - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/07 01:00:22 UTC, 0 replies.
- [jira] Updated: (TIKA-523) Add application/ms-tnef as alias to application/vnd.ms-tnef - posted by "Jan Høydahl (JIRA)" <ji...@apache.org> on 2010/11/07 14:29:06 UTC, 0 replies.
- [jira] Assigned: (TIKA-523) Add application/ms-tnef as alias to application/vnd.ms-tnef - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/07 18:20:09 UTC, 0 replies.
- [jira] Resolved: (TIKA-523) Add application/ms-tnef as alias to application/vnd.ms-tnef - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/07 18:32:22 UTC, 0 replies.
- [jira] Updated: (TIKA-487) ContainerAwareDetector doesn't support truncated Open XML files - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/08 01:38:06 UTC, 0 replies.
- [jira] Updated: (TIKA-518) Attribute values are not indexed - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/08 01:40:06 UTC, 0 replies.
- [jira] Updated: (TIKA-521) OutOfMemoryError Parsing XSLX File - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/08 01:40:07 UTC, 5 replies.
- [jira] Updated: (TIKA-530) InvalidFormatException on a PackagePart in OOXML - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/08 01:40:08 UTC, 0 replies.
- [jira] Updated: (TIKA-471) Avoid Charset name bottleneck when multiple threads are using HtmlParser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/08 01:42:10 UTC, 0 replies.
- [ANNOUNCE] Welcome Maxim Valyanskiy as Tika PMC/Committer - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/11/08 08:20:52 UTC, 1 replies.
- [jira] Created: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. - posted by "samraj (JIRA)" <ji...@apache.org> on 2010/11/08 08:47:24 UTC, 0 replies.
- [jira] Created: (TIKA-546) Add ability to create language profiles to tika-app - posted by "Jan Høydahl (JIRA)" <ji...@apache.org> on 2010/11/08 09:39:10 UTC, 0 replies.
- [jira] Updated: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/08 14:57:09 UTC, 2 replies.
- [jira] Resolved: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/08 14:57:11 UTC, 0 replies.
- [jira] Reopened: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. - posted by "samraj (JIRA)" <ji...@apache.org> on 2010/11/09 04:38:19 UTC, 0 replies.
- [jira] Commented: (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/09 04:44:07 UTC, 5 replies.
- [jira] Resolved: (TIKA-510) Use POI API for text extraction from XSLF shape - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/09 12:23:07 UTC, 0 replies.
- [jira] Resolved: (TIKA-511) NPE when POI is configured to prefer event extractors - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/09 12:23:08 UTC, 0 replies.
- [jira] Created: (TIKA-547) Can't extract PDF text - posted by "Igor Spasic (JIRA)" <ji...@apache.org> on 2010/11/09 15:06:09 UTC, 0 replies.
- [jira] Updated: (TIKA-547) Can't extract PDF text - posted by "Igor Spasic (JIRA)" <ji...@apache.org> on 2010/11/09 15:08:10 UTC, 1 replies.
- [jira] Commented: (TIKA-547) Can't extract PDF text - posted by "Daan de Wit (JIRA)" <ji...@apache.org> on 2010/11/09 15:12:07 UTC, 3 replies.
- [jira] Resolved: (TIKA-547) Can't extract PDF text - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/09 15:38:12 UTC, 0 replies.
- [jira] Commented: (TIKA-461) RFC822 messages not parsed - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/11/09 17:03:07 UTC, 6 replies.
- XML parsing hang - posted by Ken Krugler <kk...@transpac.com> on 2010/11/09 19:35:51 UTC, 0 replies.
- [VOTE] Apache Tika 0.8 Release Candidate #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/11/09 22:29:59 UTC, 1 replies.
- buildbot failure in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2010/11/10 16:58:48 UTC, 5 replies.
- [jira] Commented: (TIKA-482) Refactor image and jpeg parsers for access to MetadataExtractor API - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/11/10 17:02:14 UTC, 2 replies.
- buildbot success in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2010/11/10 17:06:31 UTC, 2 replies.
- Re: ReviewBoard instance - posted by Jukka Zitting <ju...@gmail.com> on 2010/11/10 20:23:10 UTC, 1 replies.
- [jira] Commented: (TIKA-392) RTF parser smashes words together in subsequent table cells - posted by "Thiago Souza (JIRA)" <ji...@apache.org> on 2010/11/10 20:30:14 UTC, 0 replies.
- tika and plain text -- bug or feature? - posted by qubit <la...@yahoo.com> on 2010/11/10 21:34:57 UTC, 4 replies.
- OOPS -- my mistake, text/plain issues - posted by qubit <la...@yahoo.com> on 2010/11/11 06:11:37 UTC, 0 replies.
- Single line in extracted PDF contents - posted by Staffan <so...@gmail.com> on 2010/11/11 10:14:52 UTC, 1 replies.
- Re: svn commit: r1033937 - in /tika/trunk: tika-core/src/main/java/org/apache/tika/extractor/ tika-parsers/src/main/java/org/apache/tika/parser/microsoft/ tika-parsers/src/main/java/org/apache/tika/parser/pkg/ - posted by Jukka Zitting <ju...@gmail.com> on 2010/11/11 15:05:56 UTC, 4 replies.
- [jira] Created: (TIKA-548) PDF content extracted as single line - posted by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/11/11 20:39:15 UTC, 0 replies.
- [jira] Updated: (TIKA-548) PDF content extracted as single line - posted by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/11/11 20:41:13 UTC, 1 replies.
- MS Lectures on office file formats - posted by Alex Ott <al...@gmail.com> on 2010/11/12 11:05:25 UTC, 1 replies.
- [jira] Created: (TIKA-549) There is no support for extracting OLE-shapes from PPT - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/12 12:51:13 UTC, 0 replies.
- [jira] Resolved: (TIKA-549) There is no support for extracting OLE-shapes from PPT - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/12 13:07:13 UTC, 0 replies.
- [jira] Created: (TIKA-550) Add stable filenames for extracted embedded files from Office binaries - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/12 13:17:13 UTC, 0 replies.
- [jira] Updated: (TIKA-550) Add stable filenames for extracted embedded files from Office binaries - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/12 13:17:14 UTC, 0 replies.
- [jira] Resolved: (TIKA-550) Add stable filenames for extracted embedded files from Office binaries - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/12 13:33:13 UTC, 0 replies.
- [jira] Created: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/12 13:57:13 UTC, 0 replies.
- [jira] Updated: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/12 13:59:14 UTC, 0 replies.
- [jira] Commented: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 - posted by "Maxim Valyanskiy (JIRA)" <ji...@apache.org> on 2010/11/12 14:01:17 UTC, 3 replies.
- [jira] Issue Comment Edited: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05 - posted by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/11/12 15:19:15 UTC, 0 replies.
- [jira] Created: (TIKA-552) Further improvements to Word .doc and .docx parsing - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/11/12 17:40:13 UTC, 0 replies.
- [jira] Commented: (TIKA-552) Further improvements to Word .doc and .docx parsing - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/11/12 17:46:13 UTC, 0 replies.
- [jira] Created: (TIKA-553) Automatic license header checks - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/12 21:03:13 UTC, 0 replies.
- [jira] Resolved: (TIKA-553) Automatic license header checks - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/12 22:28:14 UTC, 0 replies.
- [RESULT] [VOTE] Apache Tika 0.8 Release Candidate #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/11/13 04:45:01 UTC, 0 replies.
- [ANNOUNCE] Apache Tika 0.8 released - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/11/13 08:09:22 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk » Apache Tika OSGi bundle #416 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/13 09:33:15 UTC, 0 replies.
- Build failed in Hudson: Tika-trunk #416 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/13 09:33:18 UTC, 0 replies.
- Hudson build is back to normal : Tika-trunk » Apache Tika OSGi bundle #417 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/13 14:57:18 UTC, 0 replies.
- Hudson build is back to normal : Tika-trunk #417 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/13 14:57:20 UTC, 0 replies.
- RecursiveMetadata and MetadataDiscussion - some long-term input - posted by Leo Sauermann <le...@gnowsis.com> on 2010/11/14 10:13:35 UTC, 1 replies.
- Re: RecursiveMetadata and MetadataDiscussion - some long-term input - if you need RDF call xesam or aperture - posted by Leo Sauermann <le...@gnowsis.com> on 2010/11/15 16:15:37 UTC, 2 replies.
- Supported Document Format web page out of date - posted by Paul Jakubik <pa...@purediscovery.com> on 2010/11/17 22:25:49 UTC, 0 replies.
- the PDF content regression - posted by Staffan <so...@gmail.com> on 2010/11/18 08:32:01 UTC, 0 replies.
- [jira] Commented: (TIKA-422) Wrong charset conversion in some RTF documents. - posted by "Piotr Bartosiewicz (JIRA)" <ji...@apache.org> on 2010/11/18 08:42:14 UTC, 0 replies.
- [jira] Resolved: (TIKA-548) PDF content extracted as single line - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/18 19:12:14 UTC, 0 replies.
- [jira] Commented: (TIKA-548) PDF content extracted as single line - posted by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/11/18 20:36:15 UTC, 3 replies.
- [jira] Created: (TIKA-554) ParseUtils.getStringContent needs an option to set the write limit that can be passed into the BodyContentHandler - posted by "Grant Ingersoll (JIRA)" <ji...@apache.org> on 2010/11/18 21:31:13 UTC, 0 replies.
- [jira] Commented: (TIKA-554) ParseUtils.getStringContent needs an option to set the write limit that can be passed into the BodyContentHandler - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/18 21:41:14 UTC, 0 replies.
- [jira] Updated: (TIKA-554) ParseUtils.getStringContent needs an option to set the write limit that can be passed into the BodyContentHandler - posted by "Grant Ingersoll (JIRA)" <ji...@apache.org> on 2010/11/18 21:43:15 UTC, 0 replies.
- [jira] Updated: (TIKA-369) Improve accuracy of language detection - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/20 22:48:28 UTC, 0 replies.
- [jira] Issue Comment Edited: (TIKA-521) OutOfMemoryError Parsing XSLX File - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/11/22 16:27:13 UTC, 0 replies.
- [jira] Created: (TIKA-555) image/bmp mime type does not exist - posted by "Erik Hetzner (JIRA)" <ji...@apache.org> on 2010/11/23 03:12:13 UTC, 0 replies.
- [jira] Updated: (TIKA-555) image/bmp mime type does not exist - posted by "Erik Hetzner (JIRA)" <ji...@apache.org> on 2010/11/23 03:14:14 UTC, 0 replies.
- [jira] Created: (TIKA-556) Problems with the NetCDF jar - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/11/23 16:44:13 UTC, 0 replies.
- [jira] Commented: (TIKA-556) Problems with the NetCDF jar - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/23 16:48:15 UTC, 5 replies.
- [jira] Assigned: (TIKA-556) Problems with the NetCDF jar - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/24 07:15:14 UTC, 0 replies.
- [jira] Resolved: (TIKA-556) Problems with the NetCDF jar - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2010/11/24 07:48:16 UTC, 0 replies.
- [jira] Created: (TIKA-557) Extract text file PDF error - posted by "Them Ta (JIRA)" <ji...@apache.org> on 2010/11/25 05:06:16 UTC, 0 replies.
- [jira] Updated: (TIKA-557) Extract text file PDF error - posted by "Them Ta (JIRA)" <ji...@apache.org> on 2010/11/25 05:12:14 UTC, 1 replies.
- [jira] Resolved: (TIKA-557) Extract text file PDF error - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/11/25 11:25:13 UTC, 0 replies.
- [jira] Created: (TIKA-558) Problems/inconsistency with jar edu.ucar:netcdf:4.2 used by Tika 0.8 - posted by "Guest (JIRA)" <ji...@apache.org> on 2010/11/25 13:22:09 UTC, 0 replies.
- [jira] Updated: (TIKA-558) Problems/inconsistency with jar edu.ucar:netcdf:4.2 used by Tika 0.8 - posted by "Guest (JIRA)" <ji...@apache.org> on 2010/11/25 13:25:05 UTC, 0 replies.
- [jira] Resolved: (TIKA-558) Problems/inconsistency with jar edu.ucar:netcdf:4.2 used by Tika 0.8 - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/11/25 14:51:23 UTC, 0 replies.
- [jira] Commented: (TIKA-557) Extract text file PDF error - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2010/11/25 15:51:13 UTC, 0 replies.
- [jira] Created: (TIKA-559) [PDF Parser] New paragraph not taken into account sometime - posted by "Antoine L. (JIRA)" <ji...@apache.org> on 2010/11/25 16:55:14 UTC, 0 replies.
- [jira] Updated: (TIKA-559) [PDF Parser] New paragraph not taken into account sometime - posted by "Antoine L. (JIRA)" <ji...@apache.org> on 2010/11/25 16:57:14 UTC, 0 replies.
- [jira] Commented: (TIKA-559) [PDF Parser] New paragraph not taken into account sometime - posted by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/11/25 17:34:24 UTC, 0 replies.
- Furthering Along TIKA-461 - posted by Benjamin Douglas <bb...@basistech.com> on 2010/11/25 19:01:45 UTC, 1 replies.
- [jira] Created: (TIKA-560) Improve detection of .mht, Foxmail, and OOXML files - posted by "Antoni Mylka (JIRA)" <ji...@apache.org> on 2010/11/25 20:11:14 UTC, 0 replies.
- [jira] Updated: (TIKA-560) Improve detection of .mht, Foxmail, and OOXML files - posted by "Antoni Mylka (JIRA)" <ji...@apache.org> on 2010/11/25 20:19:14 UTC, 0 replies.
- [jira] Created: (TIKA-561) Support EMLX file detection - posted by "Antoni Mylka (JIRA)" <ji...@apache.org> on 2010/11/25 20:35:15 UTC, 0 replies.
- [jira] Updated: (TIKA-561) Support EMLX file detection - posted by "Antoni Mylka (JIRA)" <ji...@apache.org> on 2010/11/25 20:41:15 UTC, 0 replies.
- [jira] Closed: (TIKA-559) [PDF Parser] New paragraph not taken into account sometime - posted by "Antoine L. (JIRA)" <ji...@apache.org> on 2010/11/26 10:16:15 UTC, 0 replies.
- [jira] Commented: (TIKA-560) Improve detection of .mht, Foxmail, and OOXML files - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/11/26 17:47:15 UTC, 3 replies.
- [jira] Updated: (TIKA-461) RFC822 messages not parsed - posted by "Benjamin Douglas (JIRA)" <ji...@apache.org> on 2010/11/30 09:47:10 UTC, 3 replies.
- [jira] Issue Comment Edited: (TIKA-560) Improve detection of .mht, Foxmail, and OOXML files - posted by "Antoni Mylka (JIRA)" <ji...@apache.org> on 2010/11/30 21:50:11 UTC, 0 replies.
- [jira] Commented: (TIKA-389) Garbled metadata when dealing with encrypted PDF files. - posted by "Michel Tremblay (JIRA)" <ji...@apache.org> on 2010/11/30 23:56:11 UTC, 0 replies.