You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@any23.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2018/01/10 02:07:00 UTC
[jira] [Commented] (ANY23-132) Handle value-title class in
microformat
[ https://issues.apache.org/jira/browse/ANY23-132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16319567#comment-16319567 ]
Lewis John McGibbney commented on ANY23-132:
--------------------------------------------
[~sebastienrichard] the offending classes seems to be HTMLDocument.java around the following line, can you confirm?
https://github.com/apache/any23/blob/master/core/src/main/java/org/apache/any23/extractor/html/HTMLDocument.java#L72-L81
Thanks
> Handle value-title class in microformat
> ---------------------------------------
>
> Key: ANY23-132
> URL: https://issues.apache.org/jira/browse/ANY23-132
> Project: Apache Any23
> Issue Type: Bug
> Components: core
> Reporter: Sebastien Richard
> Fix For: 2.2
>
>
> http://microformats.org/wiki/value-class-pattern
> Proposed patch:
> 72,78c72,73
> < // first check if there are value-title inside
> < List<Node> values = DomUtils.findAllByClassName(node, "value-title");
> < if (!values.isEmpty()) {
> < return new TextField(values.get(0).getAttributes().getNamedItem("title").getNodeValue(), node);
> < }
> < // next value
> < values = DomUtils.findAllByClassName(node, "value");
> ---
> > // first check if there are values inside
> > List<Node> values = DomUtils.findAllByClassName(node, "value");
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)