You are viewing a plain text version of this content. The canonical link for it is here.
Posted to droids-dev@incubator.apache.org by "Bertil Chapuis (JIRA)" <ji...@apache.org> on 2009/07/27 13:16:14 UTC

[jira] Created: (DROIDS-59) Link extraction problem

Link extraction problem
-----------------------

                 Key: DROIDS-59
                 URL: https://issues.apache.org/jira/browse/DROIDS-59
             Project: Droids
          Issue Type: Bug
            Reporter: Bertil Chapuis
            Priority: Minor
         Attachments: LinkExtractor.diff

I have a problem with the link extractor for the following html structure:

<a href="..."><img src="..." ></img></a>

In this situation, the characters() method of the SAX handler is never called for the <a/> tag. In consequence, all these links are skipped.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Closed: (DROIDS-59) Link extraction problem

Posted by "Thorsten Scherler (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/DROIDS-59?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thorsten Scherler closed DROIDS-59.
-----------------------------------

    Resolution: Fixed

Thanks Bertli for the patch. 

Committed revision 801086.

> Link extraction problem
> -----------------------
>
>                 Key: DROIDS-59
>                 URL: https://issues.apache.org/jira/browse/DROIDS-59
>             Project: Droids
>          Issue Type: Bug
>            Reporter: Bertil Chapuis
>            Priority: Minor
>         Attachments: LinkExtractor.diff
>
>
> I have a problem with the link extractor for the following html structure:
> <a href="..."><img src="..." ></img></a>
> In this situation, the characters() method of the SAX handler is never called for the <a/> tag. In consequence, all these links are skipped.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (DROIDS-59) Link extraction problem

Posted by "Bertil Chapuis (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/DROIDS-59?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bertil Chapuis updated DROIDS-59:
---------------------------------

    Attachment: LinkExtractor.diff

> Link extraction problem
> -----------------------
>
>                 Key: DROIDS-59
>                 URL: https://issues.apache.org/jira/browse/DROIDS-59
>             Project: Droids
>          Issue Type: Bug
>            Reporter: Bertil Chapuis
>            Priority: Minor
>         Attachments: LinkExtractor.diff
>
>
> I have a problem with the link extractor for the following html structure:
> <a href="..."><img src="..." ></img></a>
> In this situation, the characters() method of the SAX handler is never called for the <a/> tag. In consequence, all these links are skipped.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.