Posted to dev@any23.apache.org by lewismc <gi...@git.apache.org> on 2017/12/12 21:55:28 UTC

[GitHub] any23 pull request #49: ANY23-314 Service fails to return extraction in case...

GitHub user lewismc opened a pull request:

    https://github.com/apache/any23/pull/49

    ANY23-314 Service fails to return extraction in case of extraction error

    This issue primarily addresses https://issues.apache.org/jira/browse/ANY23-314
    Additional changes are trivial code cleanups.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/lewismc/any23 ANY23-314

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/any23/pull/49.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #49
    
----
commit d0e627a957c6ba5ec59ff40ba5a73cf7e52dd1d4
Author: Lewis John McGibbney <le...@gmail.com>
Date:   2017-12-12T21:51:48Z

    ANY23-314 Service fails to return extraction in case of extraction error

----


---

[GitHub] any23 issue #49: ANY23-314 Service fails to return extraction in case of ext...

Posted by lewismc <gi...@git.apache.org>.
GitHub user lewismc commented on the issue:

    https://github.com/apache/any23/pull/49
  
    Any comments folks?


---

[GitHub] any23 issue #49: ANY23-314 Service fails to return extraction in case of ext...

Posted by lewismc <gi...@git.apache.org>.
GitHub user lewismc commented on the issue:

    https://github.com/apache/any23/pull/49
  
    When a parse and/or extraction error occurs and the extraction is therefore unsuccessful, the service now returns the following result.
    As you can see, the partial extraction is now included at the bottom of the servlet response, which is much better (i.e. more forgiving) than a plain stack trace and error message.
    ```
    Failed to fully parse input. The extraction result, at the bottom of this response, if any, will contain extractions only up until the extraction error.
    ================================================================
    
    ------------ BEGIN Exception context ------------
    ExtractionContext(urn:x-any23:html-rdfa11:root-extraction-result-id:http://any23.apache.org/)
    Errors {
    	ERROR: 	'The entity "copy" was referenced, but not declared.' 	(-1,-1)
    }
    ------------ END   Exception context ------------
    
    org.apache.any23.extractor.ExtractionException: Error while parsing RDF document.
    	at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:109)
    	at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:41)
    	at org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:467)
    	at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:256)
    	at org.apache.any23.Any23.extract(Any23.java:300)
    	at org.apache.any23.Any23.extract(Any23.java:452)
    	at org.apache.any23.servlet.WebResponder.runExtraction(WebResponder.java:117)
    	at org.apache.any23.servlet.Servlet.doGet(Servlet.java:82)
    	at javax.servlet.http.HttpServlet.service(HttpServlet.java:624)
    	at javax.servlet.http.HttpServlet.service(HttpServlet.java:731)
    	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:303)
    	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
    	at org.apache.tomcat.websocket.server.WsFilter.doFilter(WsFilter.java:52)
    	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:241)
    	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
    	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:218)
    	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:122)
    	at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:505)
    	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:169)
    	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:103)
    	at org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:956)
    	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:116)
    	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:442)
    	at org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Processor.java:1083)
    	at org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:640)
    	at org.apache.tomcat.util.net.JIoEndpoint$SocketProcessor.run(JIoEndpoint.java:318)
    	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    	at org.apache.tomcat.util.threads.TaskThread$WrappingRunnable.run(TaskThread.java:61)
    	at java.lang.Thread.run(Thread.java:748)
    Caused by: org.eclipse.rdf4j.rio.RDFParseException: org.xml.sax.SAXParseException; lineNumber: 306; columnNumber: 55; The entity "copy" was referenced, but not declared.
    	at org.semarglproject.rdf4j.rdf.rdfa.RDF4JRDFaParser.parse(RDF4JRDFaParser.java:111)
    	at org.semarglproject.rdf4j.rdf.rdfa.RDF4JRDFaParser.parse(RDF4JRDFaParser.java:95)
    	at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:105)
    	... 29 more
    Caused by: org.semarglproject.rdf.ParseException: org.xml.sax.SAXParseException; lineNumber: 306; columnNumber: 55; The entity "copy" was referenced, but not declared.
    	at org.semarglproject.rdf.rdfa.RdfaParser.processException(RdfaParser.java:1141)
    	at org.semarglproject.source.XmlSource.process(XmlSource.java:50)
    	at org.semarglproject.source.StreamProcessor.processInternal(StreamProcessor.java:87)
    	at org.semarglproject.source.BaseStreamProcessor.process(BaseStreamProcessor.java:167)
    	at org.semarglproject.source.BaseStreamProcessor.process(BaseStreamProcessor.java:154)
    	at org.semarglproject.rdf4j.rdf.rdfa.RDF4JRDFaParser.parse(RDF4JRDFaParser.java:109)
    	... 31 more
    Caused by: org.xml.sax.SAXParseException; lineNumber: 306; columnNumber: 55; The entity "copy" was referenced, but not declared.
    	at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
    	at org.semarglproject.source.XmlSource.process(XmlSource.java:48)
    	... 35 more
    ================================================================
    <data>
    <![CDATA[
    @prefix sindice: <http://vocab.sindice.net/> .
    
    <http://any23.apache.org/> <http://vocab.sindice.net/any23#Date-Revision-yyyymmdd> "20171101"@en ;
    	<http://vocab.sindice.net/any23#Content-Language> "en"@en ;
    	<http://vocab.sindice.net/any23#viewport> "width=device-width, initial-scale=1.0"@en ;
    	<http://vocab.sindice.net/any23#author> "The Apache Software Foundation"@en .
    @prefix dcterms: <http://purl.org/dc/terms/> .
    
    <http://any23.apache.org/> dcterms:title "Apache Any23 – Apache Any23 - Introduction"@en .
    ]]>
    </data>
    
    ```
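
For readers who want the pattern in isolation, here is a minimal sketch of the behaviour described above: collect output as extraction proceeds, and if extraction fails, report the error first and then append whatever was collected. It is not the code from this pull request. `PartialExtractionDemo`, `DemoExtractionException`, `extract` and `respond` are hypothetical names, the "extractor" just copies input lines, and wrapping the successful case in the same `<data>` envelope is an assumption.

```java
import java.io.PrintWriter;
import java.io.StringWriter;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PartialExtractionDemo {

    static final String SEPARATOR =
            "================================================================";

    /** Hypothetical stand-in for an extraction failure (not Any23's ExtractionException). */
    static class DemoExtractionException extends Exception {
        DemoExtractionException(String message) {
            super(message);
        }
    }

    /**
     * Hypothetical extractor: copies each input line into 'sink' and fails on
     * the first line that contains the undeclared entity "&copy;".
     */
    static void extract(List<String> lines, List<String> sink) throws DemoExtractionException {
        for (String line : lines) {
            if (line.contains("&copy;")) {
                throw new DemoExtractionException(
                        "The entity \"copy\" was referenced, but not declared.");
            }
            sink.add(line);
        }
    }

    /** Builds the response body; on failure the partial output is still appended. */
    static String respond(List<String> lines) {
        List<String> sink = new ArrayList<>();
        StringBuilder body = new StringBuilder();
        try {
            extract(lines, sink);
        } catch (DemoExtractionException e) {
            // Report the failure first: message, then the stack trace.
            body.append("Failed to fully parse input. The extraction result, ")
                .append("at the bottom of this response, if any, will contain ")
                .append("extractions only up until the extraction error.\n");
            body.append(SEPARATOR).append('\n');
            StringWriter trace = new StringWriter();
            e.printStackTrace(new PrintWriter(trace, true));
            body.append(trace.toString());
            body.append(SEPARATOR).append('\n');
        }
        // Whatever reached the sink before the error is returned either way.
        body.append("<data>\n<![CDATA[\n");
        for (String statement : sink) {
            body.append(statement).append('\n');
        }
        body.append("]]>\n</data>\n");
        return body.toString();
    }

    public static void main(String[] args) {
        System.out.print(respond(Arrays.asList("stmt-one", "stmt-two &copy;", "stmt-three")));
    }
}
```

The point of the sketch is only that the sink is filled outside the error path, so a failure partway through does not discard the statements already produced.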


---

[GitHub] any23 pull request #49: ANY23-314 Service fails to return extraction in case...

Posted by asfgit <gi...@git.apache.org>.
GitHub user asfgit closed the pull request at:

    https://github.com/apache/any23/pull/49


---