Posted to dev@any23.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2018/01/10 02:07:00 UTC

[jira] [Commented] (ANY23-132) Handle value-title class in microformat

    [ https://issues.apache.org/jira/browse/ANY23-132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16319567#comment-16319567 ] 

Lewis John McGibbney commented on ANY23-132:
--------------------------------------------

[~sebastienrichard] the offending class seems to be HTMLDocument.java, around the following lines. Can you confirm?
https://github.com/apache/any23/blob/master/core/src/main/java/org/apache/any23/extractor/html/HTMLDocument.java#L72-L81
Thanks

> Handle value-title class in microformat
> ---------------------------------------
>
>                 Key: ANY23-132
>                 URL: https://issues.apache.org/jira/browse/ANY23-132
>             Project: Apache Any23
>          Issue Type: Bug
>          Components: core
>            Reporter: Sebastien Richard
>             Fix For: 2.2
>
>
> http://microformats.org/wiki/value-class-pattern
> Proposed patch:
> 72,78c72,73
> <         // first check if there are value-title inside
> <         List<Node> values = DomUtils.findAllByClassName(node, "value-title");
> <         if (!values.isEmpty()) {
> <             return new TextField(values.get(0).getAttributes().getNamedItem("title").getNodeValue(), node); 
> <         }
> <         // next value
> <         values = DomUtils.findAllByClassName(node, "value");
> ---
> >         // first check if there are values inside
> >         List<Node> values = DomUtils.findAllByClassName(node, "value");


