You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@any23.apache.org by "Hans Brende (JIRA)" <ji...@apache.org> on 2018/04/16 05:06:00 UTC

[jira] [Created] (ANY23-351) NullPointerException in HCardExtractor

Hans Brende created ANY23-351:
---------------------------------

             Summary: NullPointerException in HCardExtractor
                 Key: ANY23-351
                 URL: https://issues.apache.org/jira/browse/ANY23-351
             Project: Apache Any23
          Issue Type: Bug
          Components: microformats
    Affects Versions: 2.3
            Reporter: Hans Brende


When extracting from the url: https://cambridgewi.com/make-cambridge-home/char/V/

I get the following NullPointerException, which kills the entire extraction process:

{code}
java.lang.NullPointerException
	at org.apache.any23.extractor.html.HTMLDocument.readUrlField(HTMLDocument.java:119)
	at org.apache.any23.extractor.html.HTMLDocument.getPluralUrlField(HTMLDocument.java:288)
	at org.apache.any23.extractor.html.HCardExtractor.addLogo(HCardExtractor.java:267)
	at org.apache.any23.extractor.html.HCardExtractor.extractEntity(HCardExtractor.java:130)
	at org.apache.any23.extractor.html.EntityBasedMicroformatExtractor.extract(EntityBasedMicroformatExtractor.java:66)
	at org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:102)
	at org.apache.any23.extractor.html.MicroformatExtractor.run(MicroformatExtractor.java:44)
	at org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:480)
	at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:259)
	at org.apache.any23.Any23.extract(Any23.java:302)
	at org.apache.any23.Any23.extract(Any23.java:437)
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)