Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/08/05 16:47:20 UTC
[jira] [Created] (TIKA-2049) Add parser for vcal
Tim Allison created TIKA-2049:
---------------------------------
Summary: Add parser for vcal
Key: TIKA-2049
URL: https://issues.apache.org/jira/browse/TIKA-2049
Project: Tika
Issue Type: Improvement
Reporter: Tim Allison
Priority: Trivial
vcal files can contain embedded HTML. We used to detect them as HTML and extract their content roughly correctly. Now they are correctly detected as vcal, but because that type is a subclass of text/plain, we're getting HTML markup in the extracted text.
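To illustrate the symptom, here is a minimal sketch in plain Java. It is not Tika code: the class name, the method names, and the sample calendar are all made up for illustration, and it assumes the HTML sits in a DESCRIPTION property, which the report above does not specify. The tag stripping is a naive regex; a real parser would presumably hand the embedded HTML to a proper HTML parser instead.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; none of these names exist in Tika.
class VcalHtmlSketch {
    // Matches anything that looks like an HTML tag (naive, for the sketch).
    private static final Pattern TAG = Pattern.compile("<[^>]*>");

    // What a text/plain handler effectively does: pass everything through,
    // so any embedded markup ends up in the extracted text.
    static String asPlainText(String vcal) {
        return vcal;
    }

    // Replace tags with spaces, then collapse runs of whitespace.
    static String stripMarkup(String text) {
        String noTags = TAG.matcher(text).replaceAll(" ");
        return noTags.replaceAll("\\s+", " ").trim();
    }

    // Find the first DESCRIPTION property and return its value without markup.
    // Assumption: one property per line; line folding is ignored here.
    static String extractDescription(String vcal) {
        for (String line : vcal.split("\\r?\\n")) {
            if (line.startsWith("DESCRIPTION")) {
                int colon = line.indexOf(':');
                if (colon >= 0) {
                    return stripMarkup(line.substring(colon + 1));
                }
            }
        }
        return "";
    }
}
```

With a made-up sample whose DESCRIPTION value is `<html><body><p>Bring the <b>draft</b> report</p></body></html>`, `asPlainText` returns the markup untouched, while `extractDescription` returns `Bring the draft report`.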
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)