Posted to dev@tika.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/08/14 10:29:16 UTC

[jira] Resolved: (TIKA-460) HTMLHandler misses treatment of A elements

     [ https://issues.apache.org/jira/browse/TIKA-460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche resolved TIKA-460.
--------------------------------

    Resolution: Fixed

Committed revision 985444

The A elements are now processed correctly when using the IdentityMapper. I have added <A> to the list of safe elements in the DefaultHTMLMapper.

Ken - the A element still gets special treatment, so the safe attributes you added in

{code}
put("a", attrSet("rel", "name"));
{code}

are still not used. Since A was not in the list of safe elements, these attributes were not used anyway.
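
To illustrate that last point, here is a minimal sketch of a mapper whose attribute table is only consulted once the element itself is accepted as safe. It is not the Tika code: the class MapperSketch and the method keptAttributes are hypothetical names made up for this example; only attrSet and the "a" entry mirror the snippet above. It also leaves out the special treatment of A in the handler, which is why the attributes remain unused in the real code even after this fix.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in for an HTML mapper; NOT the actual Tika class.
class MapperSketch {

    // Lower-case names of the elements that are passed through.
    private final Set<String> safeElements;

    // Attributes allowed per safe element, keyed by lower-case element name.
    private final Map<String, Set<String>> safeAttributes = new HashMap<>();

    MapperSketch(String... safeElements) {
        this.safeElements = new HashSet<>(Arrays.asList(safeElements));
        // Same entry as in the snippet quoted above.
        safeAttributes.put("a", attrSet("rel", "name"));
    }

    static Set<String> attrSet(String... attrs) {
        return new HashSet<>(Arrays.asList(attrs));
    }

    // Returns the attributes that survive, or null if the element is dropped.
    List<String> keptAttributes(String element, String... attributes) {
        String name = element.toLowerCase(Locale.ROOT);
        if (!safeElements.contains(name)) {
            // Element is not safe: the attribute table is never consulted.
            return null;
        }
        Set<String> allowed =
                safeAttributes.getOrDefault(name, Collections.<String>emptySet());
        List<String> kept = new ArrayList<>();
        for (String attribute : attributes) {
            String lower = attribute.toLowerCase(Locale.ROOT);
            if (allowed.contains(lower)) {
                kept.add(lower);
            }
        }
        return kept;
    }
}
```

With only "p" registered as safe, new MapperSketch("p").keptAttributes("a", "rel", "href") returns null, so the "a" attribute entry has no effect. With "a" registered as well, the same call returns a list containing only "rel".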

I still think that we should delegate the logic to the mappers, as suggested in TIKA-463, but in the meantime this fix lets us get to the A elements when using the IdentityMapper and simplifies the code a bit.

> HTMLHandler misses treatment of A elements 
> -------------------------------------------
>
>                 Key: TIKA-460
>                 URL: https://issues.apache.org/jira/browse/TIKA-460
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.7
>            Reporter: Julien Nioche
>            Assignee: Julien Nioche
>             Fix For: 0.8
>
>         Attachments: TIKA-460.patch
>
>
> The A elements should be processed before any other safe element; otherwise they are never processed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.