You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/02/20 00:08:14 UTC

[jira] [Created] (NUTCH-1945) Test for XLSX parser

Sebastian Nagel created NUTCH-1945:
--------------------------------------

             Summary: Test for XLSX parser
                 Key: NUTCH-1945
                 URL: https://issues.apache.org/jira/browse/NUTCH-1945
             Project: Nutch
          Issue Type: Test
          Components: parser
    Affects Versions: 1.10, 2.3.1
            Reporter: Sebastian Nagel
            Priority: Minor
             Fix For: 1.11, 2.3.1


Add a test for Excel spreadsheets (xlsx) files: because the are formally also zip files (as well as other composite files) the MIME type detection is crucial also for parsing, cf. NUTCH-1605 and NUTCH-1925.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)