You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by bo...@apache.org on 2015/12/29 00:10:28 UTC
svn commit: r1722027 [1/27] - in /tika/branches/2.x: tika-parser-test/
tika-parser-test/src/ tika-parser-test/src/main/
tika-parser-test/src/main/java/ tika-parser-test/src/main/resources/
tika-parser-test/src/main/resources/META-INF/ tika-parser-test/...
Author: bob
Date: Mon Dec 28 23:10:16 2015
New Revision: 1722027
URL: http://svn.apache.org/viewvc?rev=1722027&view=rev
Log:
TIKA-1818 - Decouple test documents from parsers so they can be reused.
Added:
tika/branches/2.x/tika-parser-test/
tika/branches/2.x/tika-parser-test/pom.xml
tika/branches/2.x/tika-parser-test/src/
tika/branches/2.x/tika-parser-test/src/main/
tika/branches/2.x/tika-parser-test/src/main/java/
tika/branches/2.x/tika-parser-test/src/main/resources/
tika/branches/2.x/tika-parser-test/src/main/resources/META-INF/
tika/branches/2.x/tika-parser-test/src/main/resources/META-INF/services/
tika/branches/2.x/tika-parser-test/src/main/resources/META-INF/services/org.apache.tika.parser.Parser
tika/branches/2.x/tika-parser-test/src/main/resources/log4j.properties
tika/branches/2.x/tika-parser-test/src/main/resources/org/
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklist.xml
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklistsub.xml
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-detector-blacklist.xml
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-default.xml
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty-default.xml
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty.xml
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-composite.xml
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-default.xml
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/mime/
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/mime/custom-mimetypes.xml
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/get-models.sh
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-date.bin (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-location.bin (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-organization.bin (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-person.bin (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/regex/
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/regex/ner-regex.txt
tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/tika-config.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/AutoDetectParser.class (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/Doc1_ole.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/EmbeddedDocument.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/EmbeddedOutlook.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/EmbeddedPDF.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/GLDAS_CLM10SUBP_3H.A19790202.0000.001.grb (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/NUTCH-1997.cbor
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/NullHeader.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/TIKA-216.tgz (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/WFPC2u5780205r_c0fx.fits
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/Zamora2010.dif
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/active_layer_arcss_grid_barrow_alaska_2012.dif
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/big-preamble.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/boilerplate-whitespace.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/boilerplate.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/breidamerkurjokull_radar_profiles_2009.mat (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/carbon_isotopic_values_of_alkanes_extracted_from_paleosols.dif
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/IMJPCL.CHM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/IMJPCLE.CHM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/IMTCEN.CHM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/admin.chm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/cmak_ops.CHM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/comexp.CHM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/gpedit.CHM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/tcpip.CHM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/chm/wmicontrol.CHM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/complex.mbox
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/egyl03.gdas.200811.00Z.grb2 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/english.cp500.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/envi_test_header.hdr
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/footnotes.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/gdas1.forecmwf.2014062612.grib2 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/headerPic.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/headers.mbox
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/jxl.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/moby.zip (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/embedded_then_npe.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/example.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/fake_oom.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/heavy_hang.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/nothing_bad.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/null_pointer.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/null_pointer_no_msg.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/real_oom.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/sleep.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/sleep_interruptible.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/mock/sleep_not_interruptible.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/multiline.mbox
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/pictures.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/protect.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/protectedFile.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/protectedSheets.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/quoted.mbox
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/resume.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/rsstest.rss
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/russian.cp866.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/sampleFile.iso19139
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/simple.mbox
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/sresa1b_ncar_ccsm3_0_run1_200001.nc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/tableHeaders.numbers (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/tableNames.numbers (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents-spanned.z01 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents-spanned.zip (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents.7z (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents.cpio (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents.rar (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents.tar (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents.tar.Z (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents.tbz2 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents.tgz (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-documents.zip (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-outlook.msg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-outlook2003.msg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test-zip-of-zip.zip (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test.fb2
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test.hdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test.he5 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test1.swf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test2.swf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test3.swf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test7Z_protected_passTika.7z (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testACCESS.mdb (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAFM.afm
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAIFF.aif (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAMR-WB.amr (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAMR.amr (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAPK.apk (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testARofSND.ar (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testARofText.ar
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testASF.asf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testASiCE.asice (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testASiCS.asics (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testATOM.atom
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAU.au (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAccess2.accdb (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAccess2_2000.mdb (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAccess2_2002-2003.mdb (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAccess2_encrypted.accdb (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAccess_V1997.mdb (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testAnnotations.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBDB_btree_2.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBDB_btree_3.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBDB_btree_4.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBDB_btree_5.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBDB_hash_2.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBDB_hash_3.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBDB_hash_4.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBDB_hash_5.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBIBTEX.bib
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBMP.bmp (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBMPfp.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBPG.bpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBPG_GEO.bpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBPG_commented.bpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBPG_commented_xnviewmp026.bpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBinControlWord.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testBulletPoints.key (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testC.c
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testCADKEY.prt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testCADKEY2.prt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testCOREL.shw (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testCPP.cpp
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testCSS.css
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testCSV.csv
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testChm.chm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testChm2.chm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testChm3.chm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testComment.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testComment.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testComment.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testComment.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testComment.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testComment.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testComment.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testComment.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testControlCharacters.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDITA.dita
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDITA.ditamap
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDITA2.dita
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDOCX_Thumbnail.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDOTM.dotm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWG2000.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWG2004.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWG2004_no_header.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWG2007.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWG2010.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWG2010_custom_props.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech2004.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech2004DX.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech2005.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech2006.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech2007.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech2008.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech2009.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech2010.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech2011.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDWGmech6.dwg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDetached.p7s (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testDocumentLink.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEAR.ear (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEMF.emf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEMLX.emlx
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEPUB.epub (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL-charts.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL-formats.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL-formats.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL.strict.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL.xlsb (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_1img.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_1img.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_4.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_5.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_95.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_custom_props.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_custom_props.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_embeded.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_embeded.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_headers_footers.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_headers_footers.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_protected_passtika.xls (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_protected_passtika.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEXCEL_textbox.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testEmbedded.zip (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testException1.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testException2.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testExtraSpaces.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testFITS.fits
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testFLAC.flac (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testFLAC.oga (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testFLV.flv (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testFOXMAIL.box
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testFontAfterBufferedText.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testFooter.ods (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testFooter.odt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testFreeBSD-x86-64 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testGIF.gif (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testGROOVY.groovy
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testGroupWiseEml.eml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testH.h
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testHTML.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testHTMLNoisyMetaEncoding_1.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testHTMLNoisyMetaEncoding_2.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testHTMLNoisyMetaEncoding_3.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testHTMLNoisyMetaEncoding_4.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testHTML_utf8.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testHWP_3.0.hwp (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testHWP_5.0.hwp (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testINDD.indd (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testIPA.ipa (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testISATab_BII-I-1/
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testISATab_BII-I-1/a_bii-s-2_metabolite profiling_NMR spectroscopy.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testISATab_BII-I-1/a_metabolome.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testISATab_BII-I-1/a_microarray.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testISATab_BII-I-1/a_proteome.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testISATab_BII-I-1/a_transcriptome.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testISATab_BII-I-1/i_investigation.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testISATab_BII-I-1/s_BII-S-1.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testISATab_BII-I-1/s_BII-S-2.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJAR.jar (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJAR_with_HTML.jar (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJAR_with_PEHDR.jar (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJAVA.java
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJAVAPROPS.properties
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJNILIB.jnilib (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG.jp2 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG_EXIF.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG_EXIF_emptyDateTime.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG_GEO.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG_GEO_2.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG_commented.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG_commented_pspcs2mac.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG_commented_xnviewmp026.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJPEG_oddTagComponent.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJS.js
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testJournalParser.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testKML.kml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testKMZ.kmz (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testKeynote.key (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testLinux-arm-32le (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testLinux-mips-32be (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testLinux-mips-32le (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testLinux-ppc-32be (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testLinux-x86-32 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testLinux-x86-64 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testLotusEml.eml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMATLAB.m
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMATLAB_barcast.m
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMATLAB_wtsgaus.m
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMHTMLFirefox.mhtml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMID.mid (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMKV.mkv (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMP3i18n.mp3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMP3id3v1.mp3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMP3id3v1_v2.mp3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMP3id3v2.mp3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMP3id3v24.mp3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMP3lyrics.mp3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMP3noid3.mp3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMP3truncated.mp3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMP4.m4a (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMSG.msg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMSG_att_doc.msg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMSG_att_msg.msg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMSG_chinese.msg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMSG_forwarded.msg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMYSQL.MYD (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMYSQL.MYI (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMYSQL.frm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMasterFooter.odp (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testMasterSlideTable.key (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testNPEOpenDocument.odt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testNakedUTF16BOM.mp3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testNumbers.numbers (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testNumbersCharts.numbers (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOCR.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOCR.jpg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOCR.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOCR.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOCTET_header.dbase3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testODFwithOOo3.odt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testODT-TIKA-6000.odt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOPUS.opus (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOpenOffice2.odf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOpenOffice2.odt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOptionalHyphen.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOptionalHyphen.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOptionalHyphen.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOptionalHyphen.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOptionalHyphen.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOptionalHyphen.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testOverlappingText.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPBM.pbm
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF-custommetadata.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDFEmbeddingAndEmbedded.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDFFileEmbInAnnotation.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDFPackage.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDFTripleLangTitle.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDFTwoTextBoxes.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDFVarious.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_PDFEncodedStringInXMP.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_Version.10.x.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_Version.11.x.PDFA-1b.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_Version.4.x.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_Version.5.x.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_Version.6.x.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_Version.7.x.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_Version.8.x.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_Version.9.x.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_acroform3.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_bom.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_bookmarks.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_childAttachments.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_multiFormatEmbFiles.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_no_extract_no_accessibility_owner_empty.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_no_extract_no_accessibility_owner_user.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_no_extract_yes_accessibility_owner_empty.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_no_extract_yes_accessibility_owner_user.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_protected.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPDF_twoAuthors.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPGM.pgm
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPICT.pct (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPNG.png (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPM.ppm
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT.potm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT.ppsm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT.ppsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT.pptm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT.thmx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT.xps (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPTX_Thumbnail.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_2imgs.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_2imgs.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_autodate.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_autodate.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_comment.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_comment.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_custom_props.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_custom_props.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_embedded2.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_embedded_two_slides.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_embeded.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_embeded.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_masterFooter.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_masterFooter.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_masterText.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_masterText.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_masterText2.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_masterText2.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_protected_passtika.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_protected_passtika.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_various.ppt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPPT_various.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPROJECT2003.mpp (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPROJECT2007.mpp (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPSD.psd (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPSD2.psd (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPST.pst (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPUBLISHER.pub (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPageNumber.pdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPages.pages (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPagesComments.pages (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPagesHeadersFootersAlphaLower.pages (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPagesHeadersFootersAlphaUpper.pages (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPagesHeadersFootersFootnotes.pages (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPagesHeadersFootersRomanLower.pages (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPagesHeadersFootersRomanUpper.pages (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPagesLayout.pages (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPagesPwdProtected.pages (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPhoneNumberExtractor.odt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testPopupAnnotation.pdf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testQUATTRO.qpw (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testQUATTRO.wb3 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRDF.rdf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822-CC-BCC
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822-big
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822-limitedheaders
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822-multipart
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822_base64
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822_encrypted_zip
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822_i18nheaders
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822_normal_zip
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822_oddfrom
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRFC822_quoted
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTF-ms932.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTF.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFBoldItalic.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFControls.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFCorruptListOverride.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFEmbeddedFiles.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFEmbeddedLink.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFHexEscapeInsideWord.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFHyperlink.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFIgnoredControlWord.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFInvalidUnicode.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFJapanese.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFListLibreOffice.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFListMicrosoftWord.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFListOverride.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFNewlines.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFRegularImages.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFTableCellSeparation.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFTableCellSeparation2.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFUmlautSpaces.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFUmlautSpaces2.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFUnicodeGothic.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFUnicodeUCNControlWordCharacterDoubling.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFVarious.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFWindowsCodepage1250.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFWithCurlyBraces.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFWord2010CzechCharacters.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testRTFWordPadCzechCharacters.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testSQLITE3.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testSVG.svg
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testSVG.svgz (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testSolaris-x86-32 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testSqlite3b.db (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testStarOffice-5.2-calc.sdc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testStarOffice-5.2-draw.sda (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testStarOffice-5.2-impress.sdd (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testStarOffice-5.2-writer.sdw (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testStyles.odt (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTIFF.tif (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTXT-tika.axx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTXT.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTXT.zlib (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTXT.zlib0 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTXT.zlib5 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTXT.zlib9 (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTXTNonASCIIUTF8.txt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTables.key (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTextBoxes.key (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testThunderbirdEml.eml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTinyPE.exe (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testTrueType3.ttf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testUserDefinedCharset.mhtml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVISIO.vsd (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVISIO.vsdm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVISIO.vsdx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVISIO.vssm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVISIO.vssx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVISIO.vstm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVISIO.vstx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVORBIS.ogg (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVORCalcTemplate.vor (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVORDrawTemplate.vor (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVORImpressTemplate.vor (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testVORWriterTemplate.vor (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWAR.war (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWAV.wav (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWEBARCHIVE.webarchive
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWEBM.webm (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWEBP.webp (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWINMAIL.dat (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWMA.wma (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWMF.wmf (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWMV.wmv (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD6.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_1img.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_1img.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_3imgs.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_3imgs.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_bold_character_runs.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_bold_character_runs.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_bold_character_runs2.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_bold_character_runs2.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_closingSmartQInHyperLink.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_custom_props.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_custom_props.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_embedded_pdf.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_embedded_pdf.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_embedded_rtf.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_embeded.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_embeded.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_header_hyperlink.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_missing_ooxml_bean1.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_missing_text.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_multi_authors.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_multi_authors.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_no_format.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_no_format.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_null_style.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_numbered_list.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_numbered_list.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_override_list_numbering.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_override_list_numbering.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_protected_passtika.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_protected_passtika.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_tabular_symbol.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_text_box.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_various.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORD_various.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORKS.wps (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORKS2000.wps (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORKSSpreadsheet7.0.xlr (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORKSWordProcessor3.0.wps (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWORKSWordProcessor4.0.wps (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWebVTT.vtt
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWebp_Alpha_Lossless.webp (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWebp_Alpha_Lossy.webp (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWindows-x86-32.exe (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testWordArt.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testXHTML.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testXLSX_Thumbnail.xlsx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testXML.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testXML2.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testXML3.xml
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test_TIKA-1251.doc (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test_embedded_package.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test_embedded_zip.pptx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test_list_override.rtf
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test_mat_text.mat (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test_recursive_embedded.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/test_recursive_embedded_npe.docx (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testiBooks.ibooks (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testsolidworksAssembly2013SP2.SLDASM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testsolidworksAssembly2014SP0.SLDASM (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testsolidworksDrawing2013SP2.SLDDRW (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testsolidworksDrawing2014SP0.SLDDRW (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testsolidworksPart2013SP2.SLDPRT (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/testsolidworksPart2014SP0.SLDPRT (with props)
tika/branches/2.x/tika-parser-test/src/main/resources/test-documents/tika434.html
tika/branches/2.x/tika-parser-test/src/main/resources/test-properties/
tika/branches/2.x/tika-parser-test/src/main/resources/test-properties/StringsConfig-full.properties
tika/branches/2.x/tika-parser-test/src/main/resources/test-properties/StringsConfig-partial.properties
tika/branches/2.x/tika-parser-test/src/main/resources/test-properties/TesseractOCRConfig-full.properties
tika/branches/2.x/tika-parser-test/src/main/resources/test-properties/TesseractOCRConfig-partial.properties
Removed:
tika/branches/2.x/tika-parsers/src/test/resources/
Modified:
tika/branches/2.x/tika-parsers/pom.xml
Added: tika/branches/2.x/tika-parser-test/pom.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/pom.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/pom.xml (added)
+++ tika/branches/2.x/tika-parser-test/pom.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,68 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<project xmlns="http://maven.apache.org/POM/4.0.0"
+ xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
+ xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
+ <modelVersion>4.0.0</modelVersion>
+ <parent>
+ <groupId>org.apache.tika</groupId>
+ <artifactId>tika-parent</artifactId>
+ <version>2.0-SNAPSHOT</version>
+ </parent>
+
+ <artifactId>tika-parser-test</artifactId>
+ <name>Apache Tika Multimedia Module</name>
+ <url>http://tika.apache.org/</url>
+
+ <dependencies>
+ <dependency>
+ <groupId>commons-io</groupId>
+ <artifactId>commons-io</artifactId>
+ <version>${commons.io.version}</version>
+ </dependency>
+ <dependency>
+ <groupId>${project.groupId}</groupId>
+ <artifactId>tika-core</artifactId>
+ <version>${project.version}</version>
+ </dependency>
+ </dependencies><profiles>
+ <profile>
+ <id>testSetup</id>
+ <activation>
+ <!-- auto activate -->
+ <file>
+ <missing>${basedir}/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-person.bin</missing>
+ </file>
+ </activation>
+ <dependencies>
+ <dependency>
+ <groupId>org.apache.maven</groupId>
+ <artifactId>maven-model</artifactId>
+ <version>3.3.3</version>
+ </dependency>
+ </dependencies>
+ <build>
+ <plugins>
+ <plugin>
+ <groupId>org.codehaus.groovy.maven</groupId>
+ <artifactId>gmaven-plugin</artifactId>
+ <executions>
+ <execution>
+ <id>testSetup</id>
+ <phase>generate-test-resources</phase>
+ <goals>
+ <goal>execute</goal>
+ </goals>
+ <configuration>
+ <source>${basedir}/src/main/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy</source>
+ </configuration>
+ </execution>
+ </executions>
+ </plugin>
+ </plugins>
+ </build>
+ </profile>
+ </profiles>
+
+
+
+</project>
\ No newline at end of file
Added: tika/branches/2.x/tika-parser-test/src/main/resources/META-INF/services/org.apache.tika.parser.Parser
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/META-INF/services/org.apache.tika.parser.Parser?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/META-INF/services/org.apache.tika.parser.Parser (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/META-INF/services/org.apache.tika.parser.Parser Mon Dec 28 23:10:16 2015
@@ -0,0 +1 @@
+org.apache.tika.parser.mock.MockParser
\ No newline at end of file
Added: tika/branches/2.x/tika-parser-test/src/main/resources/log4j.properties
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/log4j.properties?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/log4j.properties (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/log4j.properties Mon Dec 28 23:10:16 2015
@@ -0,0 +1,24 @@
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements. See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License. You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+#info,debug, error,fatal ...
+log4j.rootLogger=info,stdout
+
+#console
+log4j.appender.stdout=org.apache.log4j.ConsoleAppender
+log4j.appender.stdout.layout=org.apache.log4j.PatternLayout
+
+# Pattern to output the caller's file name and line number.
+log4j.appender.stdout.layout.ConversionPattern=%5p [%t] (%F:%L) - %m%n
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklist.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklist.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklist.xml (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklist.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,29 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!--
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+<properties>
+ <parsers>
+ <parser class="org.apache.tika.parser.DefaultParser">
+ <mime-exclude>image/jpeg</mime-exclude>
+ <mime-exclude>application/pdf</mime-exclude>
+ <parser-exclude class="org.apache.tika.parser.executable.ExecutableParser"/>
+ </parser>
+ <parser class="org.apache.tika.parser.EmptyParser">
+ <mime>application/pdf</mime>
+ </parser>
+ </parsers>
+</properties>
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklistsub.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklistsub.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklistsub.xml (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1558-blacklistsub.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,24 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!--
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+<properties>
+ <parsers>
+ <parser class="org.apache.tika.parser.DefaultParser">
+ <parser-exclude class="org.apache.tika.parser.xml.XMLParser"/>
+ </parser>
+ </parsers>
+</properties>
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-detector-blacklist.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-detector-blacklist.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-detector-blacklist.xml (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-detector-blacklist.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,31 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!--
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+<properties>
+ <!-- Explicitly request default parsers -->
+ <parsers/>
+ <detectors>
+ <!-- All detectors except built-in container ones -->
+ <detector class="org.apache.tika.detect.DefaultDetector">
+ <detector-exclude class="org.apache.tika.parser.pkg.ZipContainerDetector"/>
+ <detector-exclude class="org.apache.tika.parser.microsoft.POIFSContainerDetector"/>
+ </detector>
+ <!-- One other detector, to check ordering -->
+ <detector class="org.apache.tika.detect.EmptyDetector">
+ </detector>
+ </detectors>
+</properties>
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-default.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-default.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-default.xml (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-default.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,24 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!--
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+<properties>
+ <!-- Explicitly request default parsers and translators -->
+ <parsers/>
+ <detectors/>
+ <!-- Explicitly request the default Translator -->
+ <translator class="org.apache.tika.language.translate.DefaultTranslator"/>
+</properties>
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty-default.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty-default.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty-default.xml (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty-default.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,22 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!--
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+<properties>
+ <!-- As Translators don't support Composites, Empty used -->
+ <translator class="org.apache.tika.language.translate.EmptyTranslator"/>
+ <translator class="org.apache.tika.language.translate.DefaultTranslator"/>
+</properties>
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty.xml (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1702-translator-empty.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,20 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!--
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+<properties>
+ <translator class="org.apache.tika.language.translate.EmptyTranslator"/>
+</properties>
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-composite.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-composite.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-composite.xml (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-composite.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,25 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!--
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+<properties>
+ <parsers/>
+ <detectors>
+ <detector class="org.apache.tika.parser.microsoft.POIFSContainerDetector"/>
+ <detector class="org.apache.tika.mime.MimeTypes"/>
+ </detectors>
+ <translator class="org.apache.tika.language.translate.DefaultTranslator"/>
+</properties>
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-default.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-default.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-default.xml (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/config/TIKA-1708-detector-default.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,26 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!--
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+<properties>
+ <parsers/>
+ <detectors>
+ <detector class="org.apache.tika.detect.DefaultDetector">
+ <detector-exclude class="org.apache.tika.parser.pkg.ZipContainerDetector"/>
+ </detector>
+ </detectors>
+ <translator class="org.apache.tika.language.translate.DefaultTranslator"/>
+</properties>
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/mime/custom-mimetypes.xml
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/mime/custom-mimetypes.xml?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/mime/custom-mimetypes.xml (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/mime/custom-mimetypes.xml Mon Dec 28 23:10:16 2015
@@ -0,0 +1,23 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!--
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+<mime-info>
+ <mime-type type="application/mock+xml">
+ <root-XML localName="mock"/>
+ <sub-class-of type="application/xml"/>
+ </mime-type>
+</mime-info>
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy Mon Dec 28 23:10:16 2015
@@ -0,0 +1,93 @@
+
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements. See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License. You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+/*
+ * This file downloads Apache OpenNLP NER models for testing the NamedEntityParser
+ */
+
+import org.apache.commons.io.IOUtils
+
+/**
+ * Copies input stream to output stream, additionally printing the progress.
+ * NOTE: this is optimized for large content
+ * @param inStr source stream
+ * @param outStr target stream
+ * @param totalLength the total length of the content (used to calculate progress)
+ * @return
+ */
+def copyWithProgress(InputStream inStr, OutputStream outStr, long totalLength){
+ int PROGRESS_DELAY = 1000;
+ byte[] buffer = new byte[1024 * 4]
+ long count = 0
+ int len
+ long tt = System.currentTimeMillis()
+ while ((len = inStr.read(buffer)) > 0) {
+ outStr.write(buffer, 0, len)
+ count += len
+ if (System.currentTimeMillis() - tt > PROGRESS_DELAY) {
+ println "${count * 100.0/totalLength}% : $count bytes of $totalLength"
+ tt = System.currentTimeMillis()
+ }
+ }
+ println "Copy complete. "
+ IOUtils.closeQuietly(inStr)
+ IOUtils.closeQuietly(outStr)
+}
+
+/**
+ * Downloads file
+ * @param urlStr url of file
+ * @param file path to store file
+ * @return
+ */
+def downloadFile(String urlStr, File file) {
+ println "GET : $urlStr -> $file"
+ urlConn = new URL(urlStr).openConnection()
+ contentLength = urlConn.getContentLengthLong()
+
+ file.getParentFile().mkdirs()
+ inStream = urlConn.getInputStream()
+ outStream = new FileOutputStream(file)
+ copyWithProgress(inStream, outStream, contentLength)
+ IOUtils.closeQuietly(outStream)
+ IOUtils.closeQuietly(inStream)
+ println "Download Complete.."
+}
+
+
+def urlPrefix = "http://opennlp.sourceforge.net/models-1.5"
+def prefixPath = "src/main/resources/org/apache/tika/parser/ner/opennlp/"
+
+// detecting proper path for test resources
+if (new File("tika-parser-test").exists() && new File("tika-app").exists() ) {
+ // running from parent maven project, but resources should go to sub-module
+ prefixPath = "tika-parser-test/" + prefixPath
+}
+
+def modelFiles = //filePath : url
+ [ (prefixPath + "ner-person.bin"): (urlPrefix + "/en-ner-person.bin"),
+ (prefixPath + "ner-location.bin"): (urlPrefix + "/en-ner-location.bin"),
+ (prefixPath + "ner-organization.bin"): (urlPrefix + "/en-ner-organization.bin"),
+ (prefixPath + "ner-date.bin"): (urlPrefix + "/en-ner-date.bin")]
+
+for (def entry : modelFiles) {
+ File file = new File(entry.key)
+ if (!file.exists()) {
+ downloadFile(entry.value, file)
+ }
+}
\ No newline at end of file
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/get-models.sh
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/get-models.sh?rev=1722027&view=auto
==============================================================================
--- tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/get-models.sh (added)
+++ tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/get-models.sh Mon Dec 28 23:10:16 2015
@@ -0,0 +1,26 @@
+#!/usr/bin/env bash
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements. See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License. You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+echo "Getting OpenNLP NER models"
+wget "http://opennlp.sourceforge.net/models-1.5/en-ner-person.bin" -O ner-person.bin
+wget "http://opennlp.sourceforge.net/models-1.5/en-ner-location.bin" -O ner-location.bin
+wget "http://opennlp.sourceforge.net/models-1.5/en-ner-organization.bin" -O ner-organization.bin
+
+# Additional 4
+wget "http://opennlp.sourceforge.net/models-1.5/en-ner-date.bin" -O ner-date.bin
+wget "http://opennlp.sourceforge.net/models-1.5/en-ner-money.bin" -O ner-money.bin
+wget "http://opennlp.sourceforge.net/models-1.5/en-ner-time.bin" -O ner-time.bin
+wget "http://opennlp.sourceforge.net/models-1.5/en-ner-percentage.bin" -O ner-percentage.bin
\ No newline at end of file
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-date.bin
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-date.bin?rev=1722027&view=auto
==============================================================================
Binary file - no diff available.
Propchange: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-date.bin
------------------------------------------------------------------------------
svn:mime-type = application/octet-stream
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-location.bin
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-location.bin?rev=1722027&view=auto
==============================================================================
Binary file - no diff available.
Propchange: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-location.bin
------------------------------------------------------------------------------
svn:mime-type = application/octet-stream
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-organization.bin
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-organization.bin?rev=1722027&view=auto
==============================================================================
Binary file - no diff available.
Propchange: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-organization.bin
------------------------------------------------------------------------------
svn:mime-type = application/octet-stream
Added: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-person.bin
URL: http://svn.apache.org/viewvc/tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-person.bin?rev=1722027&view=auto
==============================================================================
Binary file - no diff available.
Propchange: tika/branches/2.x/tika-parser-test/src/main/resources/org/apache/tika/parser/ner/opennlp/ner-person.bin
------------------------------------------------------------------------------
svn:mime-type = application/octet-stream