You are viewing a plain text version of this content. The canonical link for it is here.
- buildbot failure in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2012/07/01 13:17:03 UTC, 2 replies.
- JAX-RS overhead in tika-server - posted by Jukka Zitting <ju...@gmail.com> on 2012/07/01 14:09:51 UTC, 12 replies.
- Re: svn commit: r1355877 - in /tika/trunk: ./ tika-dll/ tika-dll/src/ tika-dll/src/main/ tika-dll/src/main/csharp/ tika-dll/src/main/csharp/Apache/ - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/01 18:29:48 UTC, 1 replies.
- [jira] [Created] (TIKA-944) Extend tika-server API to be consistent with tika-app CLI - posted by "Jason Judge (JIRA)" <ji...@apache.org> on 2012/07/01 18:45:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-944) Extend tika-server API to be consistent with tika-app CLI - posted by "Jason Judge (JIRA)" <ji...@apache.org> on 2012/07/01 18:50:47 UTC, 1 replies.
- Re: svn commit: r1355947 - /tika/trunk/tika-parent/pom.xml - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/01 18:53:14 UTC, 0 replies.
- [jira] [Comment Edited] (TIKA-944) Extend tika-server API to be consistent with tika-app CLI - posted by "Jason Judge (JIRA)" <ji...@apache.org> on 2012/07/01 19:02:44 UTC, 0 replies.
- [jira] [Updated] (TIKA-872) Tika --extract fails for RTF - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:50 UTC, 2 replies.
- [jira] [Updated] (TIKA-906) Headers, footers, and footnotes not extracted from Pages documents - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:51 UTC, 2 replies.
- [jira] [Updated] (TIKA-774) ExifTool Parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:51 UTC, 1 replies.
- [jira] [Updated] (TIKA-757) Address TODOs when we upgrade to next POI release (3.8 beta 5) - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:51 UTC, 1 replies.
- [jira] [Updated] (TIKA-715) Some parsers produce non-well-formed XHTML SAX events - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:51 UTC, 1 replies.
- [jira] [Updated] (TIKA-868) TXT parser does not honour the specified encoding - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:51 UTC, 2 replies.
- [jira] [Updated] (TIKA-754) Automatic line break insertion (BR element) instead of '\n' in XHTMLContentHandler - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:52 UTC, 1 replies.
- [jira] [Updated] (TIKA-776) ExifTool Embedder - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:52 UTC, 1 replies.
- [jira] [Updated] (TIKA-891) Use POST in addition to PUT on method calls in tika-server - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:52 UTC, 1 replies.
- [jira] [Updated] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:52 UTC, 1 replies.
- [jira] [Updated] (TIKA-775) Embed Capabilities - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:52 UTC, 1 replies.
- [jira] [Updated] (TIKA-539) Encoding detection is too biased by encoding in meta tag - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:53 UTC, 1 replies.
- [jira] [Updated] (TIKA-605) Tika GDAL parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:53 UTC, 1 replies.
- [jira] [Updated] (TIKA-820) Locator is unset for HTML parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:53 UTC, 1 replies.
- [jira] [Updated] (TIKA-817) (PPT/PPTX) Missing date/time in text content. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:00:54 UTC, 1 replies.
- [jira] [Created] (TIKA-945) Upgrade tika-server to CXF 2.6.1 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/01 23:03:42 UTC, 0 replies.
- [jira] [Commented] (TIKA-758) Address TODOs when we upgrade to next PDFBox release - posted by "Michael McCandless (JIRA)" <ji...@apache.org> on 2012/07/01 23:05:46 UTC, 0 replies.
- [jira] [Reopened] (TIKA-758) Address TODOs when we upgrade to next PDFBox release - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/01 23:22:46 UTC, 0 replies.
- [jira] [Resolved] (TIKA-757) Address TODOs when we upgrade to next POI release (3.8 beta 5) - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/01 23:24:44 UTC, 0 replies.
- [jira] [Created] (TIKA-946) Improve how the PPTX parser uses XLSF from POI - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/01 23:30:52 UTC, 0 replies.
- [jira] [Resolved] (TIKA-513) Support of Deja Vu (DjVu) format - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/02 00:09:43 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #887 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/02 01:12:20 UTC, 0 replies.
- [jira] [Commented] (TIKA-930) Consolidation of Some Tika Core Properties - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/02 02:03:59 UTC, 1 replies.
- [jira] [Updated] (TIKA-756) XMP output from Tika CLI - posted by "Jörg Ehrlich (JIRA)" <ji...@apache.org> on 2012/07/02 13:57:22 UTC, 3 replies.
- [jira] [Commented] (TIKA-756) XMP output from Tika CLI - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/02 14:18:22 UTC, 3 replies.
- Build failed in Jenkins: Tika-trunk #888 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/02 18:57:40 UTC, 3 replies.
- [jira] [Created] (TIKA-947) AbstractMetadataHandler addMetadata Does not Check Property.isMultiValuePermitted - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/02 20:39:23 UTC, 0 replies.
- [jira] [Resolved] (TIKA-947) AbstractMetadataHandler addMetadata Does not Check Property.isMultiValuePermitted - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/02 20:59:23 UTC, 0 replies.
- [jira] [Resolved] (TIKA-930) Consolidation of Some Tika Core Properties - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/03 05:25:02 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #889 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/03 08:57:27 UTC, 1 replies.
- [jira] [Commented] (TIKA-915) Image geodata being rounded to integers - posted by "Emmanuel Hugonnet (JIRA)" <ji...@apache.org> on 2012/07/03 13:14:25 UTC, 3 replies.
- Build failed in Jenkins: Tika-trunk #890 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/03 15:04:09 UTC, 0 replies.
- buildbot success in ASF Buildbot on tika-trunk - posted by bu...@apache.org on 2012/07/03 17:52:55 UTC, 1 replies.
- Jenkins build is back to normal : Tika-trunk #891 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/03 21:58:53 UTC, 0 replies.
- [jira] [Created] (TIKA-948) Embedded PDF extracted incorrectly as MS Works file from Word 97-2003 doc - posted by "Michael McCandless (JIRA)" <ji...@apache.org> on 2012/07/04 01:32:34 UTC, 0 replies.
- [jira] [Updated] (TIKA-948) Embedded PDF extracted incorrectly as MS Works file from Word 97-2003 doc - posted by "Michael McCandless (JIRA)" <ji...@apache.org> on 2012/07/04 01:39:33 UTC, 1 replies.
- [jira] [Created] (TIKA-949) Mimetype magic needed for mapping formats such as XMind Pro and MindMapper - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/04 17:04:34 UTC, 0 replies.
- [jira] [Resolved] (TIKA-949) Mimetype magic needed for mapping formats such as XMind Pro and MindMapper - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/04 17:08:36 UTC, 0 replies.
- [jira] [Assigned] (TIKA-948) Embedded PDF extracted incorrectly as MS Works file from Word 97-2003 doc - posted by "Michael McCandless (JIRA)" <ji...@apache.org> on 2012/07/05 18:22:34 UTC, 0 replies.
- [jira] [Created] (TIKA-950) Wrong Office Open XML detection in ZipContainerDetector - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2012/07/06 12:32:33 UTC, 0 replies.
- [jira] [Updated] (TIKA-950) Wrong Office Open XML detection in ZipContainerDetector - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2012/07/06 12:36:33 UTC, 2 replies.
- [jira] [Commented] (TIKA-950) Wrong Office Open XML detection in ZipContainerDetector - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/06 13:37:34 UTC, 5 replies.
- [jira] [Commented] (TIKA-948) Embedded PDF extracted incorrectly as MS Works file from Word 97-2003 doc - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/06 18:44:34 UTC, 6 replies.
- [jira] [Comment Edited] (TIKA-948) Embedded PDF extracted incorrectly as MS Works file from Word 97-2003 doc - posted by "Alex Ott (JIRA)" <ji...@apache.org> on 2012/07/06 19:50:34 UTC, 0 replies.
- [jira] [Created] (TIKA-951) Bundle activation policy for Eclipse - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/06 23:16:34 UTC, 0 replies.
- [jira] [Resolved] (TIKA-951) Bundle activation policy for Eclipse - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/06 23:28:35 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #895 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/07 00:18:21 UTC, 0 replies.
- Jenkins build is back to normal : Tika-trunk #896 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/07 03:16:12 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #897 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/07 14:15:48 UTC, 0 replies.
- [jira] [Resolved] (TIKA-561) Support EMLX file detection - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/07 18:05:34 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #898 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/07 18:08:37 UTC, 0 replies.
- [jira] [Resolved] (TIKA-322) Improve encoding detection speed and accuracy - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/07 21:46:33 UTC, 0 replies.
- [jira] [Resolved] (TIKA-482) Refactor image and jpeg parsers for access to MetadataExtractor API - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/07 21:48:35 UTC, 0 replies.
- [jira] [Resolved] (TIKA-530) InvalidFormatException on a PackagePart in OOXML - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/07 23:32:36 UTC, 0 replies.
- [jira] [Resolved] (TIKA-518) Attribute values are not indexed - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/07 23:34:34 UTC, 0 replies.
- [jira] [Commented] (TIKA-885) Possible ConcurrentModificationException while accessing Metadata produced by ParsingReader - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2012/07/08 01:26:33 UTC, 1 replies.
- [jira] [Comment Edited] (TIKA-885) Possible ConcurrentModificationException while accessing Metadata produced by ParsingReader - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2012/07/08 02:04:33 UTC, 0 replies.
- [jira] [Resolved] (TIKA-471) Avoid Charset name bottleneck when multiple threads are using HtmlParser - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/08 14:07:34 UTC, 0 replies.
- [jira] [Resolved] (TIKA-502) Add programming language mime-types - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/08 16:08:34 UTC, 0 replies.
- [jira] [Resolved] (TIKA-458) Specify HTMLHandler via Context - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/08 16:14:34 UTC, 0 replies.
- [jira] [Resolved] (TIKA-430) Automatically let all valid XHTML 1.0 attributes through from HTML documents - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/08 16:17:35 UTC, 0 replies.
- [jira] [Resolved] (TIKA-242) Incremental configuration AutoDetectParser - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/08 16:21:35 UTC, 0 replies.
- [jira] [Commented] (TIKA-456) Support timeouts for parsers - posted by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2012/07/08 21:11:35 UTC, 1 replies.
- [jira] [Resolved] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly. - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/09 00:47:34 UTC, 0 replies.
- [jira] [Commented] (TIKA-906) Headers, footers, and footnotes not extracted from Pages documents - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2012/07/09 00:49:34 UTC, 3 replies.
- [jira] [Resolved] (TIKA-906) Headers, footers, and footnotes not extracted from Pages documents - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2012/07/09 00:51:34 UTC, 0 replies.
- FYI: text/plain and text/html media types now come with charset info - posted by Jukka Zitting <ju...@gmail.com> on 2012/07/09 01:17:57 UTC, 0 replies.
- [jira] [Resolved] (TIKA-892) Tika does not use the HTML5 meta charset tag when determining charset - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/09 01:29:34 UTC, 0 replies.
- [jira] [Resolved] (TIKA-815) Tika parsers should handle failures more gracefully - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/09 01:43:34 UTC, 0 replies.
- [jira] [Resolved] (TIKA-754) Automatic line break insertion (BR element) instead of '\n' in XHTMLContentHandler - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/09 16:54:34 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #899 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/09 21:15:08 UTC, 0 replies.
- [jira] [Created] (TIKA-952) HTML meta tags ignored for encoding detection - posted by "Tomas Safarik (JIRA)" <ji...@apache.org> on 2012/07/10 14:16:33 UTC, 0 replies.
- [jira] [Resolved] (TIKA-945) Upgrade tika-server to CXF 2.6.1 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/10 19:40:36 UTC, 0 replies.
- [jira] [Commented] (TIKA-945) Upgrade tika-server to CXF 2.6.1 - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/10 19:40:36 UTC, 0 replies.
- [VOTE] Apache Tika 1.2 release rc #1 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/10 22:29:48 UTC, 12 replies.
- Build failed in Jenkins: Tika-trunk #900 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/11 01:14:15 UTC, 0 replies.
- Build failed in Jenkins: Tika-trunk #901 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/11 11:02:39 UTC, 0 replies.
- Tika build error using Maven - posted by 122jxgcn <yw...@gmail.com> on 2012/07/11 13:42:44 UTC, 1 replies.
- [jira] [Created] (TIKA-953) Tika failed to recognize non-ustar Tar file? - posted by "Jing Li (JIRA)" <ji...@apache.org> on 2012/07/13 10:37:33 UTC, 0 replies.
- [jira] [Commented] (TIKA-953) Tika failed to recognize non-ustar Tar file? - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/13 11:43:34 UTC, 5 replies.
- [jira] [Created] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file - posted by "Rob Tulloh (JIRA)" <ji...@apache.org> on 2012/07/13 18:20:33 UTC, 0 replies.
- [jira] [Updated] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file - posted by "Rob Tulloh (JIRA)" <ji...@apache.org> on 2012/07/13 18:22:33 UTC, 0 replies.
- [jira] [Commented] (TIKA-954) Tika throws OOM and GC limited exceeded on Microsoft docx file - posted by "Rob Tulloh (JIRA)" <ji...@apache.org> on 2012/07/13 18:22:35 UTC, 6 replies.
- Fixing the problem of TIKA-895 and TIKA-914</a> - posted by John M <jf...@gmail.com> on 2012/07/14 23:18:30 UTC, 4 replies.<br/> - <a href="?thread=g2x4wxo05b6f9yf80jqdyvwjxobhct83">[jira] [Comment Edited] (TIKA-950) Wrong Office Open XML detection in ZipContainerDetector</a> - posted by "Marco Quaranta (JIRA)" <ji...@apache.org> on 2012/07/16 11:38:34 UTC, 0 replies.<br/> - <a href="?thread=lldz2z9l38dz4fcm96r9j80dqy37hwpr">[RESULT] [VOTE] Apache Tika 1.2 release rc #1</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/16 14:01:12 UTC, 0 replies.<br/> - <a href="?thread=nq3qdfxvtj1phljog99v38vydo355kn8">[jira] [Created] (TIKA-955) Unable to extract "Track Changes" metadata from a microsoft word document</a> - posted by "Priya Kujur (JIRA)" <ji...@apache.org> on 2012/07/16 19:03:35 UTC, 0 replies.<br/> - <a href="?thread=4n40h3nh7wybvrnj1op8yfsssc2sxhds">[ANNOUNCE] Apache Tika 1.2 released</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/17 07:00:08 UTC, 0 replies.<br/> - <a href="?thread=rynqg71lcr21h9fkstkx6b2n7t87gqdt">Can't build javadocs for 1.2 API site docs</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/17 07:39:18 UTC, 6 replies.<br/> - <a href="?thread=yoz3ss2lx2jb87xkkhfrchh0yr5rz8zf">Build failed in Jenkins: Tika-trunk #902</a> - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/17 12:03:53 UTC, 0 replies.<br/> - <a href="?thread=xcy7s7hq6w03wv1wd9fl1fcl37dghs2y">[jira] [Created] (TIKA-956) Embedded docs in Word doc are not inlined (text is always added to the end)</a> - posted by "Michael McCandless (JIRA)" <ji...@apache.org> on 2012/07/17 17:03:33 UTC, 0 replies.<br/> - <a href="?thread=gpb5xb5kpsfl7kgb0nosqq38ydgrvl66">[jira] [Created] (TIKA-957) Mimetype magic entry for NITF images</a> - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/19 00:39:33 UTC, 0 replies.<br/> - <a href="?thread=gtmr2gcj0b0t70k60xr5m037jx54h4zk">[jira] [Resolved] (TIKA-957) Mimetype magic entry for NITF images</a> - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/19 00:39:35 UTC, 0 replies.<br/> - <a href="?thread=566dt1rvm5cyl01fjwl281fx03pj5yzs">Build failed in Jenkins: Tika-trunk #903</a> - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/19 01:07:11 UTC, 0 replies.<br/> - <a href="?thread=2yz6v39zmk20k3rzkqxqyxv1hf4cp17q">[jira] [Created] (TIKA-958) MIME magic for HDF4 and HDF5</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:50:33 UTC, 0 replies.<br/> - <a href="?thread=w4gfkolm69r14k4dnnj5hbm6wtfz2yr9">[jira] [Updated] (TIKA-889) XHTMLContentHandler wont emit newline when html element matches ENDLINE set</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:50:35 UTC, 0 replies.<br/> - <a href="?thread=1y0h452zkmfkkpsltorqkdw9hh17nxck">[jira] [Updated] (TIKA-869) IdentityHtmlMapper.mapSafeElement() needs to return lower-cased incoming name</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:52:35 UTC, 0 replies.<br/> - <a href="?thread=nhk3bs40jt6836f03sq2y3ms936fc1gt">[jira] [Updated] (TIKA-911) Converted PDF document contains question marks in place of spaces and inconsistent case</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:52:35 UTC, 0 replies.<br/> - <a href="?thread=p0plv6034qgvskj4dmdxm87x3wzzmhf3">[jira] [Updated] (TIKA-944) Extend tika-server API to be consistent with tika-app CLI</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:52:35 UTC, 0 replies.<br/> - <a href="?thread=dh72nno4ql7db0bgj8t0k7ybfltch6yj">[jira] [Updated] (TIKA-758) Address TODOs when we upgrade to next PDFBox release</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:52:35 UTC, 0 replies.<br/> - <a href="?thread=65rz49q5vkkgv7rvykrv0dn74dgwr8cs">[jira] [Updated] (TIKA-671) Support for FB2 (fiction book document) format</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:52:35 UTC, 1 replies.<br/> - <a href="?thread=gyb29pb3xw6s4qd8l34vkcqotp33ry5v">[jira] [Updated] (TIKA-568) Language Detection isReasonablyCertain() hides valuable information</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:54:35 UTC, 0 replies.<br/> - <a href="?thread=x0vq5sdpt8n3kryx2q65ghmw0gx0gzkw">[jira] [Updated] (TIKA-771) "Hello, World!" in UTF-8/ASCII gets detected as IBM500</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:54:35 UTC, 0 replies.<br/> - <a href="?thread=whmg8v0chq1ssdqsozyhchondty4f3n9">[jira] [Updated] (TIKA-728) Return RDFa meta tags via Metadata</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:54:35 UTC, 0 replies.<br/> - <a href="?thread=g3xx60tgvqktlf00kptl9627bnm72kzc">[jira] [Updated] (TIKA-676) Boilerpipe fails</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 17:56:34 UTC, 0 replies.<br/> - <a href="?thread=jzc29d75nsvz75q87o2175593o3n1kg5">[DISCUSS] Tika Hardener?</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/19 18:43:19 UTC, 0 replies.<br/> - <a href="?thread=65l4qtop9fkd37mbhp0kj6cc2xrsfyy4">[jira] [Commented] (TIKA-815) Tika parsers should handle failures more gracefully</a> - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2012/07/19 18:43:34 UTC, 0 replies.<br/> - <a href="?thread=95r6vf11d05krckvbk2f6l32db5lnwrx">[jira] [Resolved] (TIKA-950) Wrong Office Open XML detection in ZipContainerDetector</a> - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/19 19:52:35 UTC, 0 replies.<br/> - <a href="?thread=sdhkfnyw4hdd8yw25cqlxf7x1qx0vkbw">Fwd: Call for Papers for ApacheCon Europe 2012 now open!</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/20 01:08:15 UTC, 0 replies.<br/> - <a href="?thread=34mb88l62hmrxkqzqb3jbb1rgvjx2wzs">[jira] [Created] (TIKA-959) Unwanted diamond with a question mark</a> - posted by "Qbcvparser (JIRA)" <ji...@apache.org> on 2012/07/20 12:16:33 UTC, 0 replies.<br/> - <a href="?thread=nfpj1fv1f5g0cpf3k92bbxklmlyqpw5n">[jira] [Commented] (TIKA-959) Unwanted diamond with a question mark</a> - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/20 12:22:35 UTC, 1 replies.<br/> - <a href="?thread=35pqn566bl6fkpdgrm11h1x3mb42ctbt">[DISCUSS] Including tika-server WAR in 1.3 artifacts?</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/20 17:32:12 UTC, 1 replies.<br/> - <a href="?thread=p9wo0bn6jqb6qpys6o0krs46njzd2g4c">[jira] [Commented] (TIKA-956) Embedded docs in Word doc are not inlined (text is always added to the end)</a> - posted by "Michael McCandless (JIRA)" <ji...@apache.org> on 2012/07/20 18:55:35 UTC, 0 replies.<br/> - <a href="?thread=0oqzv409yv244zpf6ddp87msj9wcf4tg">[jira] [Created] (TIKA-960) Duplicate letters in text extracted from PDF files</a> - posted by "Christof Luick (JIRA)" <ji...@apache.org> on 2012/07/23 16:09:34 UTC, 0 replies.<br/> - <a href="?thread=lkgzhdj6nzfmznl63dml24czowrwkxkj">[jira] [Commented] (TIKA-960) Duplicate letters in text extracted from PDF files</a> - posted by "Dave Meikle (JIRA)" <ji...@apache.org> on 2012/07/23 19:37:34 UTC, 0 replies.<br/> - <a href="?thread=k61vh1tgzm0or4bl7qmc78bmhnhpnxz2">[jira] [Closed] (TIKA-952) HTML meta tags ignored for encoding detection</a> - posted by "Tomas Safarik (JIRA)" <ji...@apache.org> on 2012/07/24 21:49:36 UTC, 0 replies.<br/> - <a href="?thread=6d9n2qkc7g440n9h5hc1g37m1rq6z61p">[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.</a> - posted by "Tomas Safarik (JIRA)" <ji...@apache.org> on 2012/07/24 21:53:35 UTC, 0 replies.<br/> - <a href="?thread=b7mo2okgzlqcvkrmr83kwrcynb2czvg2">How to counting images and videos in an Embedded document?</a> - posted by chraj007 <ch...@gmail.com> on 2012/07/25 12:31:23 UTC, 4 replies.<br/> - <a href="?thread=lm2dvybxot8hp8q388150s0g6oztx8vl">[DISCUSS] Any23 Graduation to TLP</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/26 16:22:55 UTC, 0 replies.<br/> - <a href="?thread=j8ypv82f79w5t6hfghk9wmw14ojj32yb">[jira] [Updated] (TIKA-953) Tika failed to recognize non-ustar Tar file?</a> - posted by "Jing Li (JIRA)" <ji...@apache.org> on 2012/07/27 04:31:34 UTC, 0 replies.<br/> - <a href="?thread=mot9swbrm82yb99vtd42wrq2h7bx04p7">[jira] [Commented] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/30 01:27:36 UTC, 0 replies.<br/> - <a href="?thread=1djz8ofn7jxpm5bqr8j6zwoyrtw8zt5g">Build failed in Jenkins: Tika-trunk #904</a> - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/30 02:18:54 UTC, 0 replies.<br/> - <a href="?thread=r9pvgyndv5qbynlj8b2nh8cfm82r3x51">[jira] [Resolved] (TIKA-953) Tika failed to recognize non-ustar Tar file?</a> - posted by "Jing Li (JIRA)" <ji...@apache.org> on 2012/07/30 10:07:34 UTC, 0 replies.<br/> - <a href="?thread=oq0s1pd1ksnnbcvqrw5bwc5203gb8jdf">[jira] [Resolved] (TIKA-811) Upgrade metadatExtractor version for OpenJDK 7 support</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/30 14:33:34 UTC, 0 replies.<br/> - <a href="?thread=sstoksdfg7cxfrcllf3bhtxwzr2nqrcy">[jira] [Resolved] (TIKA-915) Image geodata being rounded to integers</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/30 14:35:34 UTC, 1 replies.<br/> - <a href="?thread=4pzsllhnxjzx4prgsd9dmyfvs7lrhpbq">[jira] [Created] (TIKA-961) No whitespace added if BoilerpipeContentHandler.setIncludeMarkup(true)</a> - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/30 15:03:33 UTC, 0 replies.<br/> - <a href="?thread=dvrq8837b4z45zd1ynjmdfy7r15p3d52">[jira] [Updated] (TIKA-956) Embedded docs in Word doc are not inlined (text is always added to the end)</a> - posted by "Michael McCandless (JIRA)" <ji...@apache.org> on 2012/07/30 15:07:34 UTC, 0 replies.<br/> - <a href="?thread=rpfzxzx2jztkl1st179rl7konj5thzzc">[jira] [Updated] (TIKA-961) No whitespace added if BoilerpipeContentHandler.setIncludeMarkup(true)</a> - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/30 16:01:34 UTC, 0 replies.<br/> - <a href="?thread=p43oxwkll1cffrlt8rqbdq945w15xryn">[jira] [Comment Edited] (TIKA-961) No whitespace added if BoilerpipeContentHandler.setIncludeMarkup(true)</a> - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/30 16:03:35 UTC, 0 replies.<br/> - <a href="?thread=ms435sqr84obvrp3fh9bbv9wh65z86so">[jira] [Reopened] (TIKA-915) Image geodata being rounded to integers</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/30 20:17:35 UTC, 0 replies.<br/> - <a href="?thread=z9mn5psl2d41fvb99dcsj7ftqkmy0njo">[jira] [Created] (TIKA-962) Backwards Compatibility for Metadata.LAST_AUTHOR is Broken</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/30 20:57:34 UTC, 0 replies.<br/> - <a href="?thread=0wl92v70x4xycmso138ds5gmhsbn4n1n">[jira] [Commented] (TIKA-962) Backwards Compatibility for Metadata.LAST_AUTHOR is Broken</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/30 21:01:35 UTC, 0 replies.<br/> - <a href="?thread=h3cgqb6l89bh2tp2cwrm0nk7rzyyr9cx">[jira] [Created] (TIKA-963) Backwards Compatibility for Metadata.DATE is Incorrect</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/30 21:41:34 UTC, 0 replies.<br/> - <a href="?thread=bhh0j1z96z6rv3c3nzgzy69yd85832bm">[jira] [Commented] (TIKA-963) Backwards Compatibility for Metadata.DATE is Incorrect</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/30 21:47:35 UTC, 0 replies.<br/> - <a href="?thread=hfrxxcvqv2t9ft1gl0zyoppr3lmyl3k0">[ANNOUNCE] Welcome Sergey Beryozkin as Apache Tika PMC member and committer</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/30 22:54:44 UTC, 0 replies.<br/> - <a href="?thread=125c8mvkxh0njyxg00d8j578j5kjvro2">[jira] [Created] (TIKA-964) Ability to specify bind address</a> - posted by "Vitaliy Filippov (JIRA)" <ji...@apache.org> on 2012/07/31 01:12:35 UTC, 0 replies.<br/> - <a href="?thread=nvlwzssznlmvnp45o2fhbjcbyx3w0xgg">[jira] [Updated] (TIKA-964) Ability to specify bind address</a> - posted by "Vitaliy Filippov (JIRA)" <ji...@apache.org> on 2012/07/31 01:22:36 UTC, 0 replies.<br/> - <a href="?thread=zwqtvjjq8s2bxwbxq26owxnr6nr8r10r">[jira] [Updated] (TIKA-709) Tika network server does not print anything in response to, for example, Word documents</a> - posted by "Vitaliy Filippov (JIRA)" <ji...@apache.org> on 2012/07/31 01:24:34 UTC, 0 replies.<br/> - <a href="?thread=vtvozqh2b11c600l2qpl4llyt5xdl9zd">[jira] [Reopened] (TIKA-709) Tika network server does not print anything in response to, for example, Word documents</a> - posted by "Vitaliy Filippov (JIRA)" <ji...@apache.org> on 2012/07/31 01:24:34 UTC, 0 replies.<br/> - <a href="?thread=d41sqztjx34j7lhdd2mts65dnn4c4bov">Custom parser error</a> - posted by 122jxgcn <yw...@gmail.com> on 2012/07/31 10:48:25 UTC, 3 replies.<br/> - <a href="?thread=4tys00dgs80k3dpdk1l021zp045n52ht">[jira] [Created] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/31 13:50:33 UTC, 0 replies.<br/> - <a href="?thread=127xbs7dmkorwoy6hchzddqbp7ojd2vd">[ANNOUNCE] Welcome Ingo Renner as Tika PMC member and committer</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/31 15:44:28 UTC, 0 replies.<br/> - <a href="?thread=lddvlozm7o8qnv8ty5zmb1vmhjb2lcq6">[jira] [Commented] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files</a> - posted by "Nick Burch (JIRA)" <ji...@apache.org> on 2012/07/31 16:09:35 UTC, 3 replies.<br/> - <a href="?thread=ohvpbkp1gbx38o0qgrjmxo49p1f2tlsx">[jira] [Created] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar</a> - posted by "Gary Karasiuk (JIRA)" <ji...@apache.org> on 2012/07/31 16:09:35 UTC, 0 replies.<br/> - <a href="?thread=fbvkk00hqmozl8l75mqrdpxss5ky84bk">[jira] [Commented] (TIKA-966) org.apache.tika.Tika missing from tika-bundle-1.2.jar</a> - posted by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2012/07/31 16:21:35 UTC, 2 replies.<br/> - <a href="?thread=rl7bqrxogvvmf39mlw7ts6rhxbkzglm4">[ANNOUNCE] Welcome Jörg Ehrlich as new Tika PMC member and committer</a> - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/31 17:35:13 UTC, 1 replies.<br/> - <a href="?thread=k41y8t4hhd5tjxsg5r16vhxyfoq1rvt3">[jira] [Comment Edited] (TIKA-965) Text Detection Fails on Mostly Non-ASCII UTF-8 Files</a> - posted by "Ray Gauss II (JIRA)" <ji...@apache.org> on 2012/07/31 20:18:35 UTC, 0 replies.<br/> </body> </html>