You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@stanbol.apache.org by Johannes Goslar <jo...@dkd.de> on 2013/12/17 17:20:58 UTC

dot escaping in sparql queries

Hello,
our stanbol (full-war checked out from today) has problems when I feed the enhancer queries containing html tags containing points.
Our referenced site/sesame will complain about wrongly (not escaped) dots.
For example this text sent to the enhancer:
	<b>from 9 a.m. - 4 p.m.</b>
Will generate this request:

http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT+%7B+%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.%0A++?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.%0A++?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.%0A++?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.%0A++?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7+.%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.%0A++%3Chttp://stanbol.apache.org/ontology/entityhub/query%23QueryResultSet%3E+%3Chttp://stanbol.apache.org/ontology/entityhub/query%23queryResult%3E+?id+.+%0A%7D+WHERE+%7B+%0A++%7B+%0A++++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7++%0A++++++FILTER(regex(str(?v_7),%22%5Ep%5C.m%5C.$%22,%22i%22)+%7C%7C+regex(str(?v_7),%22%5E%3C/b%3E$%22,%22i%22)+%26%26+((lang(?v_7)+%3D+%22%22)+%7C%7C+(lang(?v_7)+%3D+%22en%22)))+.+%0A++%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.+%7D+%0A%7D+%0ALIMIT+12+%0A&format=application/rdf%2Bxml

Or 

http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT { 
  ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 .
  ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 .
  ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 .
  ?id <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank> ?v_3 .
  ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 .
  ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 .
  ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7 .
  ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 .
  <http://stanbol.apache.org/ontology/entityhub/query#QueryResultSet> <http://stanbol.apache.org/ontology/entityhub/query#queryResult> ?id . 
} WHERE { 
  { 
    ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7  
      FILTER(regex(str(?v_7),"^p\.m\.$","i") || regex(str(?v_7),"^</b>$","i") && ((lang(?v_7) = "") || (lang(?v_7) = "en"))) . 
  } 
  OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 . } 
  OPTIONAL { ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 . } 
  OPTIONAL { ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 . } 
  OPTIONAL { ?id <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank> ?v_3 . } 
  OPTIONAL { ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 . } 
  OPTIONAL { ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 . } 
  OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 . } 
} 
LIMIT 12 
&format=application/rdf+xml

If I change the contained \. to \\. it will not error.

If I remove the <b></b> it will not error as well.

Kind Regards
Johannes

-- 
Johannes Goslar

dkd Internet Service GmbH 
development // kommunikation // design 
Kaiserstraße 73 
60329 Frankfurt am Main 

Kontakt: 
- email: johannes.goslar@dkd.de 
- fon: +49 69 2475218-0 
- fax: +49 69 2475218-99
- web: http://www.dkd.de
- social media: http://social.dkd.de

Aktuelle Projekte:
- http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
- http://www.ellen-wille.de - Launch Website (TYPO3)
- http://www.vgf-ffm.de - Relaunch Website (TYPO3)

Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski 
Registergericht: Amtsgericht Frankfurt am Main 
Registernummer: HRB 45590




Re: dot escaping in sparql queries

Posted by Johannes Goslar <jo...@dkd.de>.
Hi Rupert,
yes, at the moment I have a Sesame db for testing. But I am still in the exploration phase, so I can switch if I find better ways.
I have seen the Sesame files in the commits, so I did update to trunk. Sadly I am not a Java expert, is there any guidance on how to create these services/which is the best to adapt?
 
It looks related to STANBOL-877.
As soon as I got my debugging setup, I will try to look into it.

Cheers
Johannes
-- 
Johannes Goslar

dkd Internet Service GmbH 
development // kommunikation // design 
Kaiserstraße 73 
60329 Frankfurt am Main 

Kontakt: 
- email: johannes.goslar@dkd.de 
- fon: +49 69 2475218-0 
- fax: +49 69 2475218-99
- web: http://www.dkd.de
- social media: http://social.dkd.de

Aktuelle Projekte:
- http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
- http://www.ellen-wille.de - Launch Website (TYPO3)
- http://www.vgf-ffm.de - Relaunch Website (TYPO3)

Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski 
Registergericht: Amtsgericht Frankfurt am Main 
Registernummer: HRB 45590



On 18.12.2013, at 10:26, Rupert Westenthaler <ru...@gmail.com> wrote:

> Hi Johannes,
> 
> Do you use Sesame? There is already a SesameYard that implements the
> Entityhub Yard interface based on a Sesame repository. This was added
> as part of the Apache Marmotta - Apache Stanbol integration
> (STANBOL-1165) [1]. However there is - not yet a actual SesameYard for
> a specific Sesame Repository implementation. The SesameYard is just a
> PoJo that is intended to be configured by a OSGI Component and
> registered as OSGI service.
> 
> Regarding your encoding issue: I will take a look at string encoding
> of SPARQL queries. Maybe this is even related to STANBOL-877 [2]
> 
> 
> best
> Rupert
> 
> 
> [1] https://issues.apache.org/jira/browse/STANBOL-1165
> [2] https://issues.apache.org/jira/browse/STANBOL-877
> 
> 
> On Wed, Dec 18, 2013 at 9:17 AM, Johannes Goslar <jo...@dkd.de> wrote:
>> Hi Rupert,
>> no, at the moment it is configured as a referenced site with accessUri and queryUri. (no index)
>> But I was wondering, if there is some kind of local bridge so we can avoid the HTTP overhead.
>> 
>> Kind Regards
>> Johannes
>> 
>> --
>> Johannes Goslar
>> 
>> dkd Internet Service GmbH
>> development // kommunikation // design
>> Kaiserstraße 73
>> 60329 Frankfurt am Main
>> 
>> Kontakt:
>> - email: johannes.goslar@dkd.de
>> - fon: +49 69 2475218-0
>> - fax: +49 69 2475218-99
>> - web: http://www.dkd.de
>> - social media: http://social.dkd.de
>> 
>> Aktuelle Projekte:
>> - http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
>> - http://www.ellen-wille.de - Launch Website (TYPO3)
>> - http://www.vgf-ffm.de - Relaunch Website (TYPO3)
>> 
>> Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski
>> Registergericht: Amtsgericht Frankfurt am Main
>> Registernummer: HRB 45590
>> 
>> 
>> 
>> On 17.12.2013, at 21:12, Rupert Westenthaler <ru...@gmail.com> wrote:
>> 
>>> Hi Johannes,
>>> 
>>> Thanks for the Report! Just to be sure, what service do you call? A
>>> find query on a Entityhub Site backed by ClerezzaYard? Or do you
>>> directly call the SPARQL endpoint of Stanbol?
>>> 
>>> best
>>> Rupert
>>> 
>>> On Tue, Dec 17, 2013 at 5:20 PM, Johannes Goslar <jo...@dkd.de> wrote:
>>>> Hello,
>>>> our stanbol (full-war checked out from today) has problems when I feed the
>>>> enhancer queries containing html tags containing points.
>>>> Our referenced site/sesame will complain about wrongly (not escaped) dots.
>>>> For example this text sent to the enhancer:
>>>> <b>from 9 a.m. - 4 p.m.</b>
>>>> Will generate this request:
>>>> 
>>>> http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT+%7B+%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.%0A++?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.%0A++?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.%0A++?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.%0A++?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7+.%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.%0A++%3Chttp://stanbol.apache.org/ontology/entityhub/query%23QueryResultSet%3E+%3Chttp://stanbol.apache.org/ontology/entityhub/query%23queryResult%3E+?id+.+%0A%7D+WHERE+%7B+%0A++%7B+%0A++++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7++%0A++++++FILTER(regex(str(?v_7),%22%5Ep%5C.m%5C.$%22,%22i%22)+%7C%7C+regex(str(?v_7),%22%5E%3C/b%3E$%22,%22i%22)+%26%26+((lang(?v_7)+%3D+%22%22)+%7C%7C+(lang(?v_7)+%3D+%22en%22)))+.+%0A++%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.+%7D+%0A%7D+%0ALIMIT+12+%0A&format=application/rdf%2Bxml
>>>> 
>>>> Or
>>>> 
>>>> http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT {
>>>> ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 .
>>>> ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 .
>>>> ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 .
>>>> ?id <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank>
>>>> ?v_3 .
>>>> ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 .
>>>> ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 .
>>>> ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7 .
>>>> ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 .
>>>> <http://stanbol.apache.org/ontology/entityhub/query#QueryResultSet>
>>>> <http://stanbol.apache.org/ontology/entityhub/query#queryResult> ?id .
>>>> } WHERE {
>>>> {
>>>>   ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7
>>>>     FILTER(regex(str(?v_7),"^p\.m\.$","i") ||
>>>> regex(str(?v_7),"^</b>$","i") && ((lang(?v_7) = "") || (lang(?v_7) = "en")))
>>>> .
>>>> }
>>>> OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 . }
>>>> OPTIONAL { ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 . }
>>>> OPTIONAL { ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 . }
>>>> OPTIONAL { ?id
>>>> <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank> ?v_3 . }
>>>> OPTIONAL { ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 . }
>>>> OPTIONAL { ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 . }
>>>> OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 . }
>>>> }
>>>> LIMIT 12
>>>> &format=application/rdf+xml
>>>> 
>>>> If I change the contained \. to \\. it will not error.
>>>> 
>>>> If I remove the <b></b> it will not error as well.
>>>> 
>>>> Kind Regards
>>>> Johannes
>>>> 
>>>> --
>>>> Johannes Goslar
>>>> 
>>>> dkd Internet Service GmbH
>>>> development // kommunikation // design
>>>> Kaiserstraße 73
>>>> 60329 Frankfurt am Main
>>>> 
>>>> Kontakt:
>>>> - email: johannes.goslar@dkd.de
>>>> - fon: +49 69 2475218-0
>>>> - fax: +49 69 2475218-99
>>>> - web: http://www.dkd.de
>>>> - social media: http://social.dkd.de
>>>> 
>>>> Aktuelle Projekte:
>>>> - http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
>>>> - http://www.ellen-wille.de - Launch Website (TYPO3)
>>>> - http://www.vgf-ffm.de - Relaunch Website (TYPO3)
>>>> 
>>>> Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski
>>>> Registergericht: Amtsgericht Frankfurt am Main
>>>> Registernummer: HRB 45590
>>>> 
>>>> 
>>>> 
>>> 
>>> 
>>> 
>>> --
>>> | Rupert Westenthaler             rupert.westenthaler@gmail.com
>>> | Bodenlehenstraße 11                             ++43-699-11108907
>>> | A-5500 Bischofshofen
>> 
> 
> 
> 
> -- 
> | Rupert Westenthaler             rupert.westenthaler@gmail.com
> | Bodenlehenstraße 11                             ++43-699-11108907
> | A-5500 Bischofshofen


Re: dot escaping in sparql queries

Posted by Rupert Westenthaler <ru...@gmail.com>.
Hi Johannes,

Do you use Sesame? There is already a SesameYard that implements the
Entityhub Yard interface based on a Sesame repository. This was added
as part of the Apache Marmotta - Apache Stanbol integration
(STANBOL-1165) [1]. However there is - not yet a actual SesameYard for
a specific Sesame Repository implementation. The SesameYard is just a
PoJo that is intended to be configured by a OSGI Component and
registered as OSGI service.

Regarding your encoding issue: I will take a look at string encoding
of SPARQL queries. Maybe this is even related to STANBOL-877 [2]


best
Rupert


[1] https://issues.apache.org/jira/browse/STANBOL-1165
[2] https://issues.apache.org/jira/browse/STANBOL-877


On Wed, Dec 18, 2013 at 9:17 AM, Johannes Goslar <jo...@dkd.de> wrote:
> Hi Rupert,
> no, at the moment it is configured as a referenced site with accessUri and queryUri. (no index)
> But I was wondering, if there is some kind of local bridge so we can avoid the HTTP overhead.
>
> Kind Regards
> Johannes
>
> --
> Johannes Goslar
>
> dkd Internet Service GmbH
> development // kommunikation // design
> Kaiserstraße 73
> 60329 Frankfurt am Main
>
> Kontakt:
> - email: johannes.goslar@dkd.de
> - fon: +49 69 2475218-0
> - fax: +49 69 2475218-99
> - web: http://www.dkd.de
> - social media: http://social.dkd.de
>
> Aktuelle Projekte:
> - http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
> - http://www.ellen-wille.de - Launch Website (TYPO3)
> - http://www.vgf-ffm.de - Relaunch Website (TYPO3)
>
> Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski
> Registergericht: Amtsgericht Frankfurt am Main
> Registernummer: HRB 45590
>
>
>
> On 17.12.2013, at 21:12, Rupert Westenthaler <ru...@gmail.com> wrote:
>
>> Hi Johannes,
>>
>> Thanks for the Report! Just to be sure, what service do you call? A
>> find query on a Entityhub Site backed by ClerezzaYard? Or do you
>> directly call the SPARQL endpoint of Stanbol?
>>
>> best
>> Rupert
>>
>> On Tue, Dec 17, 2013 at 5:20 PM, Johannes Goslar <jo...@dkd.de> wrote:
>>> Hello,
>>> our stanbol (full-war checked out from today) has problems when I feed the
>>> enhancer queries containing html tags containing points.
>>> Our referenced site/sesame will complain about wrongly (not escaped) dots.
>>> For example this text sent to the enhancer:
>>> <b>from 9 a.m. - 4 p.m.</b>
>>> Will generate this request:
>>>
>>> http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT+%7B+%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.%0A++?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.%0A++?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.%0A++?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.%0A++?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7+.%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.%0A++%3Chttp://stanbol.apache.org/ontology/entityhub/query%23QueryResultSet%3E+%3Chttp://stanbol.apache.org/ontology/entityhub/query%23queryResult%3E+?id+.+%0A%7D+WHERE+%7B+%0A++%7B+%0A++++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7++%0A++++++FILTER(regex(str(?v_7),%22%5Ep%5C.m%5C.$%22,%22i%22)+%7C%7C+regex(str(?v_7),%22%5E%3C/b%3E$%22,%22i%22)+%26%26+((lang(?v_7)+%3D+%22%22)+%7C%7C+(lang(?v_7)+%3D+%22en%22)))+.+%0A++%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.+%7D+%0A%7D+%0ALIMIT+12+%0A&format=application/rdf%2Bxml
>>>
>>> Or
>>>
>>> http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT {
>>>  ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 .
>>>  ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 .
>>>  ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 .
>>>  ?id <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank>
>>> ?v_3 .
>>>  ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 .
>>>  ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 .
>>>  ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7 .
>>>  ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 .
>>>  <http://stanbol.apache.org/ontology/entityhub/query#QueryResultSet>
>>> <http://stanbol.apache.org/ontology/entityhub/query#queryResult> ?id .
>>> } WHERE {
>>>  {
>>>    ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7
>>>      FILTER(regex(str(?v_7),"^p\.m\.$","i") ||
>>> regex(str(?v_7),"^</b>$","i") && ((lang(?v_7) = "") || (lang(?v_7) = "en")))
>>> .
>>>  }
>>>  OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 . }
>>>  OPTIONAL { ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 . }
>>>  OPTIONAL { ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 . }
>>>  OPTIONAL { ?id
>>> <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank> ?v_3 . }
>>>  OPTIONAL { ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 . }
>>>  OPTIONAL { ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 . }
>>>  OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 . }
>>> }
>>> LIMIT 12
>>> &format=application/rdf+xml
>>>
>>> If I change the contained \. to \\. it will not error.
>>>
>>> If I remove the <b></b> it will not error as well.
>>>
>>> Kind Regards
>>> Johannes
>>>
>>> --
>>> Johannes Goslar
>>>
>>> dkd Internet Service GmbH
>>> development // kommunikation // design
>>> Kaiserstraße 73
>>> 60329 Frankfurt am Main
>>>
>>> Kontakt:
>>> - email: johannes.goslar@dkd.de
>>> - fon: +49 69 2475218-0
>>> - fax: +49 69 2475218-99
>>> - web: http://www.dkd.de
>>> - social media: http://social.dkd.de
>>>
>>> Aktuelle Projekte:
>>> - http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
>>> - http://www.ellen-wille.de - Launch Website (TYPO3)
>>> - http://www.vgf-ffm.de - Relaunch Website (TYPO3)
>>>
>>> Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski
>>> Registergericht: Amtsgericht Frankfurt am Main
>>> Registernummer: HRB 45590
>>>
>>>
>>>
>>
>>
>>
>> --
>> | Rupert Westenthaler             rupert.westenthaler@gmail.com
>> | Bodenlehenstraße 11                             ++43-699-11108907
>> | A-5500 Bischofshofen
>



-- 
| Rupert Westenthaler             rupert.westenthaler@gmail.com
| Bodenlehenstraße 11                             ++43-699-11108907
| A-5500 Bischofshofen

Re: dot escaping in sparql queries

Posted by Johannes Goslar <jo...@dkd.de>.
Hi Rupert,
no, at the moment it is configured as a referenced site with accessUri and queryUri. (no index)
But I was wondering, if there is some kind of local bridge so we can avoid the HTTP overhead.

Kind Regards
Johannes

-- 
Johannes Goslar

dkd Internet Service GmbH 
development // kommunikation // design 
Kaiserstraße 73 
60329 Frankfurt am Main 

Kontakt: 
- email: johannes.goslar@dkd.de 
- fon: +49 69 2475218-0 
- fax: +49 69 2475218-99
- web: http://www.dkd.de
- social media: http://social.dkd.de

Aktuelle Projekte:
- http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
- http://www.ellen-wille.de - Launch Website (TYPO3)
- http://www.vgf-ffm.de - Relaunch Website (TYPO3)

Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski 
Registergericht: Amtsgericht Frankfurt am Main 
Registernummer: HRB 45590



On 17.12.2013, at 21:12, Rupert Westenthaler <ru...@gmail.com> wrote:

> Hi Johannes,
> 
> Thanks for the Report! Just to be sure, what service do you call? A
> find query on a Entityhub Site backed by ClerezzaYard? Or do you
> directly call the SPARQL endpoint of Stanbol?
> 
> best
> Rupert
> 
> On Tue, Dec 17, 2013 at 5:20 PM, Johannes Goslar <jo...@dkd.de> wrote:
>> Hello,
>> our stanbol (full-war checked out from today) has problems when I feed the
>> enhancer queries containing html tags containing points.
>> Our referenced site/sesame will complain about wrongly (not escaped) dots.
>> For example this text sent to the enhancer:
>> <b>from 9 a.m. - 4 p.m.</b>
>> Will generate this request:
>> 
>> http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT+%7B+%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.%0A++?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.%0A++?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.%0A++?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.%0A++?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7+.%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.%0A++%3Chttp://stanbol.apache.org/ontology/entityhub/query%23QueryResultSet%3E+%3Chttp://stanbol.apache.org/ontology/entityhub/query%23queryResult%3E+?id+.+%0A%7D+WHERE+%7B+%0A++%7B+%0A++++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7++%0A++++++FILTER(regex(str(?v_7),%22%5Ep%5C.m%5C.$%22,%22i%22)+%7C%7C+regex(str(?v_7),%22%5E%3C/b%3E$%22,%22i%22)+%26%26+((lang(?v_7)+%3D+%22%22)+%7C%7C+(lang(?v_7)+%3D+%22en%22)))+.+%0A++%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.+%7D+%0A%7D+%0ALIMIT+12+%0A&format=application/rdf%2Bxml
>> 
>> Or
>> 
>> http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT {
>>  ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 .
>>  ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 .
>>  ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 .
>>  ?id <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank>
>> ?v_3 .
>>  ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 .
>>  ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 .
>>  ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7 .
>>  ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 .
>>  <http://stanbol.apache.org/ontology/entityhub/query#QueryResultSet>
>> <http://stanbol.apache.org/ontology/entityhub/query#queryResult> ?id .
>> } WHERE {
>>  {
>>    ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7
>>      FILTER(regex(str(?v_7),"^p\.m\.$","i") ||
>> regex(str(?v_7),"^</b>$","i") && ((lang(?v_7) = "") || (lang(?v_7) = "en")))
>> .
>>  }
>>  OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 . }
>>  OPTIONAL { ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 . }
>>  OPTIONAL { ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 . }
>>  OPTIONAL { ?id
>> <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank> ?v_3 . }
>>  OPTIONAL { ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 . }
>>  OPTIONAL { ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 . }
>>  OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 . }
>> }
>> LIMIT 12
>> &format=application/rdf+xml
>> 
>> If I change the contained \. to \\. it will not error.
>> 
>> If I remove the <b></b> it will not error as well.
>> 
>> Kind Regards
>> Johannes
>> 
>> --
>> Johannes Goslar
>> 
>> dkd Internet Service GmbH
>> development // kommunikation // design
>> Kaiserstraße 73
>> 60329 Frankfurt am Main
>> 
>> Kontakt:
>> - email: johannes.goslar@dkd.de
>> - fon: +49 69 2475218-0
>> - fax: +49 69 2475218-99
>> - web: http://www.dkd.de
>> - social media: http://social.dkd.de
>> 
>> Aktuelle Projekte:
>> - http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
>> - http://www.ellen-wille.de - Launch Website (TYPO3)
>> - http://www.vgf-ffm.de - Relaunch Website (TYPO3)
>> 
>> Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski
>> Registergericht: Amtsgericht Frankfurt am Main
>> Registernummer: HRB 45590
>> 
>> 
>> 
> 
> 
> 
> -- 
> | Rupert Westenthaler             rupert.westenthaler@gmail.com
> | Bodenlehenstraße 11                             ++43-699-11108907
> | A-5500 Bischofshofen


Re: dot escaping in sparql queries

Posted by Rupert Westenthaler <ru...@gmail.com>.
Hi Johannes,

Thanks for the Report! Just to be sure, what service do you call? A
find query on a Entityhub Site backed by ClerezzaYard? Or do you
directly call the SPARQL endpoint of Stanbol?

best
Rupert

On Tue, Dec 17, 2013 at 5:20 PM, Johannes Goslar <jo...@dkd.de> wrote:
> Hello,
> our stanbol (full-war checked out from today) has problems when I feed the
> enhancer queries containing html tags containing points.
> Our referenced site/sesame will complain about wrongly (not escaped) dots.
> For example this text sent to the enhancer:
> <b>from 9 a.m. - 4 p.m.</b>
> Will generate this request:
>
> http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT+%7B+%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.%0A++?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.%0A++?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.%0A++?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.%0A++?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.%0A++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7+.%0A++?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.%0A++%3Chttp://stanbol.apache.org/ontology/entityhub/query%23QueryResultSet%3E+%3Chttp://stanbol.apache.org/ontology/entityhub/query%23queryResult%3E+?id+.+%0A%7D+WHERE+%7B+%0A++%7B+%0A++++?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23label%3E+?v_7++%0A++++++FILTER(regex(str(?v_7),%22%5Ep%5C.m%5C.$%22,%22i%22)+%7C%7C+regex(str(?v_7),%22%5E%3C/b%3E$%22,%22i%22)+%26%26+((lang(?v_7)+%3D+%22%22)+%7C%7C+(lang(?v_7)+%3D+%22en%22)))+.+%0A++%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23long%3E+?v_1+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://xmlns.com/foaf/0.1/depiction%3E+?v_2+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/1999/02/22-rdf-syntax-ns%23type%3E+?v_4+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://stanbol.apache.org/ontology/entityhub/entityhub%23entityRank%3E+?v_3+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://dbpedia.org/ontology/thumbnail%3E+?v_5+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2000/01/rdf-schema%23comment%3E+?v_6+.+%7D+%0A++OPTIONAL+%7B+?id+%3Chttp://www.w3.org/2003/01/geo/wgs84_pos%23lat%3E+?v_8+.+%7D+%0A%7D+%0ALIMIT+12+%0A&format=application/rdf%2Bxml
>
> Or
>
> http://localhost:8090/openrdf-sesame/repositories/stanbol?query=CONSTRUCT {
>   ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 .
>   ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 .
>   ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 .
>   ?id <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank>
> ?v_3 .
>   ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 .
>   ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 .
>   ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7 .
>   ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 .
>   <http://stanbol.apache.org/ontology/entityhub/query#QueryResultSet>
> <http://stanbol.apache.org/ontology/entityhub/query#queryResult> ?id .
> } WHERE {
>   {
>     ?id <http://www.w3.org/2000/01/rdf-schema#label> ?v_7
>       FILTER(regex(str(?v_7),"^p\.m\.$","i") ||
> regex(str(?v_7),"^</b>$","i") && ((lang(?v_7) = "") || (lang(?v_7) = "en")))
> .
>   }
>   OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#long> ?v_1 . }
>   OPTIONAL { ?id <http://xmlns.com/foaf/0.1/depiction> ?v_2 . }
>   OPTIONAL { ?id <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> ?v_4 . }
>   OPTIONAL { ?id
> <http://stanbol.apache.org/ontology/entityhub/entityhub#entityRank> ?v_3 . }
>   OPTIONAL { ?id <http://dbpedia.org/ontology/thumbnail> ?v_5 . }
>   OPTIONAL { ?id <http://www.w3.org/2000/01/rdf-schema#comment> ?v_6 . }
>   OPTIONAL { ?id <http://www.w3.org/2003/01/geo/wgs84_pos#lat> ?v_8 . }
> }
> LIMIT 12
> &format=application/rdf+xml
>
> If I change the contained \. to \\. it will not error.
>
> If I remove the <b></b> it will not error as well.
>
> Kind Regards
> Johannes
>
> --
> Johannes Goslar
>
> dkd Internet Service GmbH
> development // kommunikation // design
> Kaiserstraße 73
> 60329 Frankfurt am Main
>
> Kontakt:
> - email: johannes.goslar@dkd.de
> - fon: +49 69 2475218-0
> - fax: +49 69 2475218-99
> - web: http://www.dkd.de
> - social media: http://social.dkd.de
>
> Aktuelle Projekte:
> - http://j.mp/SehBiS-App – iPhone-App Sehbehinderungssimulator
> - http://www.ellen-wille.de - Launch Website (TYPO3)
> - http://www.vgf-ffm.de - Relaunch Website (TYPO3)
>
> Geschäftsführer: O. Dobberkau, S. Schaffstein, G. Wegenast, C. Zabanski
> Registergericht: Amtsgericht Frankfurt am Main
> Registernummer: HRB 45590
>
>
>



-- 
| Rupert Westenthaler             rupert.westenthaler@gmail.com
| Bodenlehenstraße 11                             ++43-699-11108907
| A-5500 Bischofshofen