You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Prasanth Iyer (JIRA)" <ji...@apache.org> on 2014/11/17 20:12:36 UTC

[jira] [Created] (TIKA-1478) Build a parser to extract data from .dif format

Prasanth Iyer created TIKA-1478:
-----------------------------------

             Summary: Build a parser to extract data from .dif format
                 Key: TIKA-1478
                 URL: https://issues.apache.org/jira/browse/TIKA-1478
             Project: Tika
          Issue Type: New Feature
          Components: metadata, mime, parser
    Affects Versions: 1.6
            Reporter: Prasanth Iyer


An initial crawl of the Acadis website (https://www.aoncadis.org/home.htm) revealed that a number of the files on this website are of the .dif type. Currently, Tika categorizes these files as text/plain since it does not have a parser for this type of file. The need is to provide metadata support and to build a parser for this kind of file.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)