You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@any23.apache.org by "Hans Brende (JIRA)" <ji...@apache.org> on 2018/04/16 05:06:00 UTC
[jira] [Created] (ANY23-351) NullPointerException in HCardExtractor
Hans Brende created ANY23-351:
---------------------------------
Summary: NullPointerException in HCardExtractor
Key: ANY23-351
URL: https://issues.apache.org/jira/browse/ANY23-351
Project: Apache Any23
Issue Type: Bug
Components: microformats
Affects Versions: 2.3
Reporter: Hans Brende
When extracting from the url: https://cambridgewi.com/make-cambridge-home/char/V/
I get the following NullPointerException, which kills the entire extraction process:
{code}
java.lang.NullPointerException
at org.apache.any23.extractor.html.HTMLDocument.readUrlField(HTMLDocument.java:119)
at org.apache.any23.extractor.html.HTMLDocument.getPluralUrlField(HTMLDocument.java:288)
at org.apache.any23.extractor.html.HCardExtractor.addLogo(HCardExtractor.java:267)
at org.apache.any23.extractor.html.HCardExtractor.extractEntity(HCardExtractor.java:130)
at org.apache.any23.extractor.html.EntityBasedMicroformatExtractor.extract(EntityBasedMicroformatExtractor.java:66)
at org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:102)
at org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:44)
at org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:480)
at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:259)
at org.apache.any23.Any23.extract(Any23.java:302)
at org.apache.any23.Any23.extract(Any23.java:437)
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)