You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Canan Girgin (JIRA)" <ji...@apache.org> on 2014/01/15 15:19:21 UTC

[jira] [Comment Edited] (NUTCH-1703) Nutch ignores alt text of images

    [ https://issues.apache.org/jira/browse/NUTCH-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13872106#comment-13872106 ] 

Canan Girgin edited comment on NUTCH-1703 at 1/15/14 2:18 PM:
--------------------------------------------------------------

ok. A new patch Patch had been added which contains TestDOMContentUtils class. (NUTCH_1703.patch_v2)


was (Author: dandelion):
ok. A new patch Patch had been added which contains TestDOMContentUtils class. (NUTCH_1703.patch_v1)

> Nutch ignores alt text of images
> --------------------------------
>
>                 Key: NUTCH-1703
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1703
>             Project: Nutch
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 2.2.1
>            Reporter: Canan Girgin
>             Fix For: 2.3, 1.8
>
>         Attachments: NUTCH_1703.patch, NUTCH_1703_v2.patch
>
>
> If you put image as link alt text of that image is equivalent to the anchor text of text link. During content parse nutch does not give image alt text and  anchor text for that link is empty.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)