You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@any23.apache.org by "Hans Brende (JIRA)" <ji...@apache.org> on 2018/08/03 14:45:00 UTC

[jira] [Created] (ANY23-382) Distinguish between fatal and recoverable json-ld parsing errors

Hans Brende created ANY23-382:
---------------------------------

             Summary: Distinguish between fatal and recoverable json-ld parsing errors
                 Key: ANY23-382
                 URL: https://issues.apache.org/jira/browse/ANY23-382
             Project: Apache Any23
          Issue Type: Bug
          Components: extractors
    Affects Versions: 2.3
            Reporter: Hans Brende
             Fix For: 2.3


I've fixed several recoverable json-ld parsing errors, but there are also errors of the (probably) non-recoverable kind, e.g., 

{noformat}
com.fasterxml.jackson.core.JsonParseException: Unexpected end-of-input: was expecting closing quote for a string value
    at [Source: java.io.BufferedReader@36bc415e; line: 1, column: 33786]
    at com.fasterxml.jackson.core.JsonParser._constructError(JsonParser.java:1369)
    at com.fasterxml.jackson.core.base.ParserMinimalBase._reportError(ParserMinimalBase.java:532)
    at com.fasterxml.jackson.core.base.ParserMinimalBase._reportInvalidEOF(ParserMinimalBase.java:465)
    at com.fasterxml.jackson.core.json.ReaderBasedJsonParser._finishString2(ReaderBasedJsonParser.java:1524)
    at com.fasterxml.jackson.core.json.ReaderBasedJsonParser._finishString(ReaderBasedJsonParser.java:1512)
    at com.fasterxml.jackson.core.json.ReaderBasedJsonParser.getText(ReaderBasedJsonParser.java:205)
    at com.fasterxml.jackson.databind.deser.std.UntypedObjectDeserializer.deserialize(UntypedObjectDeserializer.java:59)
    at com.fasterxml.jackson.databind.deser.std.UntypedObjectDeserializer.mapObject(UntypedObjectDeserializer.java:225)
    at com.fasterxml.jackson.databind.deser.std.UntypedObjectDeserializer.deserialize(UntypedObjectDeserializer.java:51)
    at com.fasterxml.jackson.databind.deser.std.CollectionDeserializer.deserialize(CollectionDeserializer.java:230)
    at com.fasterxml.jackson.databind.deser.std.CollectionDeserializer.deserialize(CollectionDeserializer.java:207)
    at com.fasterxml.jackson.databind.deser.std.CollectionDeserializer.deserialize(CollectionDeserializer.java:23)
    at com.fasterxml.jackson.databind.ObjectMapper._readValue(ObjectMapper.java:2860)
    at com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:1569)
    at com.fasterxml.jackson.core.JsonParser.readValueAs(JsonParser.java:1276)
    at com.github.jsonldjava.utils.JsonUtils.fromJsonParser(JsonUtils.java:194)
    at com.github.jsonldjava.utils.JsonUtils.fromReader(JsonUtils.java:173)
    at com.github.jsonldjava.utils.JsonUtils.fromInputStream(JsonUtils.java:154)
    at com.github.jsonldjava.utils.JsonUtils.fromInputStream(JsonUtils.java:111)
    at org.eclipse.rdf4j.rio.jsonld.JSONLDParser.parse(JSONLDParser.java:71)	... 13 more
{noformat}

caused by a truncated document, or:

{noformat}
com.fasterxml.jackson.core.JsonParseException: Unexpected character ('"' (code 34)): was expecting comma to separate OBJECT entries
at [Source: java.io.BufferedReader@591fd34d; line: 1, column: 230]
at com.fasterxml.jackson.core.JsonParser._constructError(JsonParser.java:1369)
at com.fasterxml.jackson.core.base.ParserMinimalBase._reportError(ParserMinimalBase.java:532)
at com.fasterxml.jackson.core.base.ParserMinimalBase._reportUnexpectedChar(ParserMinimalBase.java:453)
at com.fasterxml.jackson.core.json.ReaderBasedJsonParser.nextToken(ReaderBasedJsonParser.java:590)
at com.fasterxml.jackson.databind.deser.std.UntypedObjectDeserializer.mapObject(UntypedObjectDeserializer.java:226)
at com.fasterxml.jackson.databind.deser.std.UntypedObjectDeserializer.deserialize(UntypedObjectDeserializer.java:51)
at com.fasterxml.jackson.databind.deser.std.UntypedObjectDeserializer.mapArray(UntypedObjectDeserializer.java:172)
at com.fasterxml.jackson.databind.deser.std.UntypedObjectDeserializer.deserialize(UntypedObjectDeserializer.java:53)
at com.fasterxml.jackson.databind.deser.std.MapDeserializer._readAndBindStringMap(MapDeserializer.java:432)
at com.fasterxml.jackson.databind.deser.std.MapDeserializer.deserialize(MapDeserializer.java:314)
at com.fasterxml.jackson.databind.deser.std.MapDeserializer.deserialize(MapDeserializer.java:26)
at com.fasterxml.jackson.databind.ObjectMapper._readValue(ObjectMapper.java:2860)
at com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:1569)
at com.fasterxml.jackson.core.JsonParser.readValueAs(JsonParser.java:1276)
at com.github.jsonldjava.utils.JsonUtils.fromJsonParser(JsonUtils.java:196)
at com.github.jsonldjava.utils.JsonUtils.fromReader(JsonUtils.java:173)
at com.github.jsonldjava.utils.JsonUtils.fromInputStream(JsonUtils.java:154)
at com.github.jsonldjava.utils.JsonUtils.fromInputStream(JsonUtils.java:111)
at org.eclipse.rdf4j.rio.jsonld.JSONLDParser.parse(JSONLDParser.java:71)	... 13 more
{noformat}

caused by the following JSON:

{noformat}
{
	  "@context": "http://schema.org",
	  "@type": "Organization",
	  "url": "http://osl.ugr.es",
	  "contactPoint": [{
	    "@type": "ContactPoint",
	    "email": "osl@ugr.es",
		"name": "Jornadas de Software Libre"
	    "contactType": "Organizing committee",
		"url": "http://osl.ugr.es":
	  }]
	}
{noformat}

Errors of the recoverable kind should be fixed, whereas errors of the non-recoverable kind should be reported as fatal errors without causing the rest of the extraction process to fail.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)