Posted to dev@tika.apache.org by "Prasanth Iyer (JIRA)" <ji...@apache.org> on 2014/11/17 20:12:36 UTC
[jira] [Created] (TIKA-1478) Build a parser to extract data from .dif format
Prasanth Iyer created TIKA-1478:
-----------------------------------
Summary: Build a parser to extract data from .dif format
Key: TIKA-1478
URL: https://issues.apache.org/jira/browse/TIKA-1478
Project: Tika
Issue Type: New Feature
Components: metadata, mime, parser
Affects Versions: 1.6
Reporter: Prasanth Iyer
An initial crawl of the Acadis website (https://www.aoncadis.org/home.htm) revealed that a number of its files have the .dif extension. Tika currently categorizes these files as text/plain because it has no parser for this file type. We need to add metadata support and build a parser for .dif files.
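
As a rough illustration of the kind of metadata extraction requested, here is a minimal sketch. It assumes (this issue does not confirm it) that the .dif files are XML documents in the Directory Interchange Format, with elements such as Entry_ID and Entry_Title. The class name DifSketch, the extract helper, and the sample document are hypothetical, and the code uses only the JDK's XML parser rather than Tika's parser API.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;

public class DifSketch {
    // Assumed field names; a real parser would cover many more elements.
    private static final String[] FIELDS = {"Entry_ID", "Entry_Title"};

    // Hypothetical helper: copies the first occurrence of each assumed
    // field from a DIF XML document into an ordered map.
    static Map<String, String> extract(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // Namespace awareness lets us match elements by local name below.
        factory.setNamespaceAware(true);
        Document doc = factory.newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

        Map<String, String> metadata = new LinkedHashMap<>();
        for (String field : FIELDS) {
            // "*" matches the element regardless of its namespace.
            NodeList nodes = doc.getElementsByTagNameNS("*", field);
            if (nodes.getLength() > 0) {
                metadata.put(field, nodes.item(0).getTextContent().trim());
            }
        }
        return metadata;
    }

    public static void main(String[] args) throws Exception {
        // Made-up sample document, not taken from the Acadis site.
        String sample = "<DIF><Entry_ID>example-id</Entry_ID>"
                + "<Entry_Title>Example title</Entry_Title></DIF>";
        System.out.println(extract(sample));
    }
}
```

In an actual Tika parser, the extracted values would presumably be set on a Metadata object and the format registered under its own MIME type so that detection no longer falls back to text/plain.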
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)