You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sis.apache.org by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/04/01 19:37:13 UTC

Re: Implementing GeoSPARQL

Hi Andy,

Just to follow up on this, I went ahead and filed this issue:

https://issues.apache.org/jira/browse/SIS-42

I'm going to start work on implementing the Geo SPARQL spec in SIS, maybe
even integrating with Any23 (and Jena down the road, but Any23, I'm a tad
more familiar with at the moment). 

I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
just to keep them in the loop.

Cheers,
Chris

On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:

> Hi there,
> 
> I'm investigating what it would take to implement GeoSPARQL.
> There is already an Apache-licensed SPARQL engine in podling Jena.
> 
> Of the things needed are a persistent storage layer with the right 
> license.  Maybe the SIS project has something to use.
> 
> If I understand it correctly, the qtree implementation is an in-memory 
> structure, with the ability to read from a serialized form on disk, and 
> to be able to write it to disk in that form.
> 
> Is there any information on scaling for the qtree?  Memory usage?
> 
> California_Restaurants.csv is 54K points - is that typical usage size?
> 
> (yes ... there are other things needed as well such as conversion code 
> between coodinate systems, format parsers, polygon code, ... but a start 
> would be just for point data in one system :)
> 
> An open copy of the spec is available at:
> 
> http://www.w3.org/2011/02/GeoSPARQL.pdf
> 
> 	Andy


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Rob,

Thanks for your email! My replies inline below.

On Apr 4, 2012, at 12:23 PM, Rob Battle wrote:

> Chris,
> 
> I am the primary developer of the GeoSPARQL index in Parliament.  The spatial part is certainly pluggable. We have an interface for the index and have implemented it with a variety of backends including a persistent RTree from the degree project [1], PostgreSQL w/PostGIS [2], and an in-memory quad tree using JTS [3].  

Gotcha, OK. 

>  One part that is not pluggable is the query engine for for Parliament (which replaces the default ARQ query engine).  We optimize the query such that the parts of the query that can be answered more efficiently by the spatial index are executed first.  We would certainly consider using SIS if it could be easily integrated and provides better performance than what we are currently using.  

Cool. I'm not sure SIS will be more efficient than JTS, but SIS is ALv2 licensed, so it will at least carry with it less
virality and won't cause your own software to be LGPL (or GPL), in a copyleft fashion. And, there are a budding
number of people here and a positive community of many people who are interested in helping you solve 
use cases that are interesting and challenging.

> TBH I hadn't heard of it before I saw this thread.

No worries. We're trying to grow a community here at Apache and it's been slow, but extremely positive 
and a great overall experience.

> 
> Unfortunately, I'm not sure if our implementation is ALv2.  Parliament is BSD licensed and some of the libraries we link to are LGPL.

Depends on the version of LGPL, and specific instances, but what is known is that JTS brings with it LGPL dependencies and virality,
with respect to licensing. There are different ways you can link against it, and legal "syntactic sugar", but rather than worry about that, 
we're trying to be an ALv2 "clean" version of similar functionality.

And I (specifically, because we're all individuals here at Apache that have our own "itches" to scratch) am interested in a real
use case for SIS beyond Quad Tree and our simple GMaps demo...GeoSPARQL sounds like a great use case!

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Rob,

Thanks for your email! My replies inline below.

On Apr 4, 2012, at 12:23 PM, Rob Battle wrote:

> Chris,
> 
> I am the primary developer of the GeoSPARQL index in Parliament.  The spatial part is certainly pluggable. We have an interface for the index and have implemented it with a variety of backends including a persistent RTree from the degree project [1], PostgreSQL w/PostGIS [2], and an in-memory quad tree using JTS [3].  

Gotcha, OK. 

>  One part that is not pluggable is the query engine for for Parliament (which replaces the default ARQ query engine).  We optimize the query such that the parts of the query that can be answered more efficiently by the spatial index are executed first.  We would certainly consider using SIS if it could be easily integrated and provides better performance than what we are currently using.  

Cool. I'm not sure SIS will be more efficient than JTS, but SIS is ALv2 licensed, so it will at least carry with it less
virality and won't cause your own software to be LGPL (or GPL), in a copyleft fashion. And, there are a budding
number of people here and a positive community of many people who are interested in helping you solve 
use cases that are interesting and challenging.

> TBH I hadn't heard of it before I saw this thread.

No worries. We're trying to grow a community here at Apache and it's been slow, but extremely positive 
and a great overall experience.

> 
> Unfortunately, I'm not sure if our implementation is ALv2.  Parliament is BSD licensed and some of the libraries we link to are LGPL.

Depends on the version of LGPL, and specific instances, but what is known is that JTS brings with it LGPL dependencies and virality,
with respect to licensing. There are different ways you can link against it, and legal "syntactic sugar", but rather than worry about that, 
we're trying to be an ALv2 "clean" version of similar functionality.

And I (specifically, because we're all individuals here at Apache that have our own "itches" to scratch) am interested in a real
use case for SIS beyond Quad Tree and our simple GMaps demo...GeoSPARQL sounds like a great use case!

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Rob,

Thanks for your email! My replies inline below.

On Apr 4, 2012, at 12:23 PM, Rob Battle wrote:

> Chris,
> 
> I am the primary developer of the GeoSPARQL index in Parliament.  The spatial part is certainly pluggable. We have an interface for the index and have implemented it with a variety of backends including a persistent RTree from the degree project [1], PostgreSQL w/PostGIS [2], and an in-memory quad tree using JTS [3].  

Gotcha, OK. 

>  One part that is not pluggable is the query engine for for Parliament (which replaces the default ARQ query engine).  We optimize the query such that the parts of the query that can be answered more efficiently by the spatial index are executed first.  We would certainly consider using SIS if it could be easily integrated and provides better performance than what we are currently using.  

Cool. I'm not sure SIS will be more efficient than JTS, but SIS is ALv2 licensed, so it will at least carry with it less
virality and won't cause your own software to be LGPL (or GPL), in a copyleft fashion. And, there are a budding
number of people here and a positive community of many people who are interested in helping you solve 
use cases that are interesting and challenging.

> TBH I hadn't heard of it before I saw this thread.

No worries. We're trying to grow a community here at Apache and it's been slow, but extremely positive 
and a great overall experience.

> 
> Unfortunately, I'm not sure if our implementation is ALv2.  Parliament is BSD licensed and some of the libraries we link to are LGPL.

Depends on the version of LGPL, and specific instances, but what is known is that JTS brings with it LGPL dependencies and virality,
with respect to licensing. There are different ways you can link against it, and legal "syntactic sugar", but rather than worry about that, 
we're trying to be an ALv2 "clean" version of similar functionality.

And I (specifically, because we're all individuals here at Apache that have our own "itches" to scratch) am interested in a real
use case for SIS beyond Quad Tree and our simple GMaps demo...GeoSPARQL sounds like a great use case!

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Rob Battle <rb...@bbn.com>.
Chris,

I am the primary developer of the GeoSPARQL index in Parliament.  The spatial part is certainly pluggable. We have an interface for the index and have implemented it with a variety of backends including a persistent RTree from the degree project [1], PostgreSQL w/PostGIS [2], and an in-memory quad tree using JTS [3].   One part that is not pluggable is the query engine for for Parliament (which replaces the default ARQ query engine).  We optimize the query such that the parts of the query that can be answered more efficiently by the spatial index are executed first.  We would certainly consider using SIS if it could be easily integrated and provides better performance than what we are currently using.  TBH I hadn't heard of it before I saw this thread.

Unfortunately, I'm not sure if our implementation is ALv2.  Parliament is BSD licensed and some of the libraries we link to are LGPL.

-rob

[1] http://www.deegree.org
[2] http://www.postgis.org
[3] http://sourceforge.net/projects/jts-topo-suite/?_test=b

On Apr 3, 2012, at 1:59 PM, Mattmann, Chris A (388J) wrote:

> Hey Rob,
> 
> Thanks for the FYI on this. I will check it out. I would still like to figure out how to implement GeoSPARQL 
> with SIS if for nothing else than to harden SIS and to make it better in a SIS selfish way :) I am out to make
> it an awesome ALv2 licensed spatial implementation.
> 
> Regarding Parliament, what are you guys using for your spatial library? Is it something that is ALv2? If not,
> would you consider SIS as a potential alternative to whatever spatial library you guys are using? Or, should I 
> ask, are you part of the Parliament project?
> 
> Thanks for the FYI again.
> 
> Cheers,
> Chris
> 
> On Apr 3, 2012, at 10:11 AM, Rob Battle wrote:
> 
>> Chris,
>> 
>> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  
>> 
>> [1] http://parliament.semwebcentral.org
>> [2] http://geosparql.bbn.com
>> 
>> -rob
>> 
>> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>> 
>>> Thanks for the FYI on this, Lewis!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>> 
>>>> Hi,
>>>> 
>>>> Thought I would chime in here as it's a very interesting thread and I think
>>>> this is relevant.
>>>> 
>>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>>>> castagna.lists@googlemail.com> wrote:
>>>> 
>>>>> 
>>>>> I see two main use cases here:
>>>>> 
>>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>> perform queries over it.
>>>>> 
>>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>> 
>>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>>> the library is available on maven central. It will be a step in the right
>>>> direction to achieving (part) of an overall potential use case as above.
>>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>>> also something which is required outside of RDF stores, but as you mention
>>>> this is another story which we need to focus on separately.
>>>> 
>>>> I'll certainly get in touch when the above is done to update you guys.
>>>> 
>>>> Lewis
>>> 
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>> 
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Rob,

On Apr 3, 2012, at 11:23 AM, Robert Vesse wrote:

> So I think was Chris is asking is whether your implementation of GeoSPARQL is written in such a way that it is easily divorce able from the spatial index backend i.e. could you contribute the code that implements all the relevant functions in such a way that we could plug in alternative backends and/or libraries for spatial representation and calculation?

Yep this is exactly what I was asking. The goal of SIS is to provide such a library, in an ALv2
licensed fashion:

http://wiki.apache.org/incubator/SpatialProposal

I was thinking if the spatial part of Parliament was pluggable, we could use SIS to implement that part.
Right now, I will likely start looking at parts of Jena, Any23, and Tika that are relevant.

Note, I'm also doing some stuff in OODT-ville through GSoC (see OODT-402 [1]) that is connecting stores like OODT
to SIS, and looking to do something similar here.

Cheers,
Chris

[1] http://issues.apache.org/jira/browse/OODT-402


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Rob,

On Apr 3, 2012, at 11:23 AM, Robert Vesse wrote:

> So I think was Chris is asking is whether your implementation of GeoSPARQL is written in such a way that it is easily divorce able from the spatial index backend i.e. could you contribute the code that implements all the relevant functions in such a way that we could plug in alternative backends and/or libraries for spatial representation and calculation?

Yep this is exactly what I was asking. The goal of SIS is to provide such a library, in an ALv2
licensed fashion:

http://wiki.apache.org/incubator/SpatialProposal

I was thinking if the spatial part of Parliament was pluggable, we could use SIS to implement that part.
Right now, I will likely start looking at parts of Jena, Any23, and Tika that are relevant.

Note, I'm also doing some stuff in OODT-ville through GSoC (see OODT-402 [1]) that is connecting stores like OODT
to SIS, and looking to do something similar here.

Cheers,
Chris

[1] http://issues.apache.org/jira/browse/OODT-402


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Rob,

On Apr 3, 2012, at 11:23 AM, Robert Vesse wrote:

> So I think was Chris is asking is whether your implementation of GeoSPARQL is written in such a way that it is easily divorce able from the spatial index backend i.e. could you contribute the code that implements all the relevant functions in such a way that we could plug in alternative backends and/or libraries for spatial representation and calculation?

Yep this is exactly what I was asking. The goal of SIS is to provide such a library, in an ALv2
licensed fashion:

http://wiki.apache.org/incubator/SpatialProposal

I was thinking if the spatial part of Parliament was pluggable, we could use SIS to implement that part.
Right now, I will likely start looking at parts of Jena, Any23, and Tika that are relevant.

Note, I'm also doing some stuff in OODT-ville through GSoC (see OODT-402 [1]) that is connecting stores like OODT
to SIS, and looking to do something similar here.

Cheers,
Chris

[1] http://issues.apache.org/jira/browse/OODT-402


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Robert Vesse <rv...@yarcdata.com>.
So I think was Chris is asking is whether your implementation of GeoSPARQL is written in such a way that it is easily divorce able from the spatial index backend i.e. could you contribute the code that implements all the relevant functions in such a way that we could plug in alternative backends and/or libraries for spatial representation and calculation?

Rob

On Apr 3, 2012, at 10:59 AM, Mattmann, Chris A (388J) wrote:

> Hey Rob,
> 
> Thanks for the FYI on this. I will check it out. I would still like to figure out how to implement GeoSPARQL 
> with SIS if for nothing else than to harden SIS and to make it better in a SIS selfish way :) I am out to make
> it an awesome ALv2 licensed spatial implementation.
> 
> Regarding Parliament, what are you guys using for your spatial library? Is it something that is ALv2? If not,
> would you consider SIS as a potential alternative to whatever spatial library you guys are using? Or, should I 
> ask, are you part of the Parliament project?
> 
> Thanks for the FYI again.
> 
> Cheers,
> Chris
> 
> On Apr 3, 2012, at 10:11 AM, Rob Battle wrote:
> 
>> Chris,
>> 
>> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  
>> 
>> [1] http://parliament.semwebcentral.org
>> [2] http://geosparql.bbn.com
>> 
>> -rob
>> 
>> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>> 
>>> Thanks for the FYI on this, Lewis!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>> 
>>>> Hi,
>>>> 
>>>> Thought I would chime in here as it's a very interesting thread and I think
>>>> this is relevant.
>>>> 
>>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>>>> castagna.lists@googlemail.com> wrote:
>>>> 
>>>>> 
>>>>> I see two main use cases here:
>>>>> 
>>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>> perform queries over it.
>>>>> 
>>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>> 
>>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>>> the library is available on maven central. It will be a step in the right
>>>> direction to achieving (part) of an overall potential use case as above.
>>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>>> also something which is required outside of RDF stores, but as you mention
>>>> this is another story which we need to focus on separately.
>>>> 
>>>> I'll certainly get in touch when the above is done to update you guys.
>>>> 
>>>> Lewis
>>> 
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>> 
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by Robert Vesse <rv...@yarcdata.com>.
So I think was Chris is asking is whether your implementation of GeoSPARQL is written in such a way that it is easily divorce able from the spatial index backend i.e. could you contribute the code that implements all the relevant functions in such a way that we could plug in alternative backends and/or libraries for spatial representation and calculation?

Rob

On Apr 3, 2012, at 10:59 AM, Mattmann, Chris A (388J) wrote:

> Hey Rob,
> 
> Thanks for the FYI on this. I will check it out. I would still like to figure out how to implement GeoSPARQL 
> with SIS if for nothing else than to harden SIS and to make it better in a SIS selfish way :) I am out to make
> it an awesome ALv2 licensed spatial implementation.
> 
> Regarding Parliament, what are you guys using for your spatial library? Is it something that is ALv2? If not,
> would you consider SIS as a potential alternative to whatever spatial library you guys are using? Or, should I 
> ask, are you part of the Parliament project?
> 
> Thanks for the FYI again.
> 
> Cheers,
> Chris
> 
> On Apr 3, 2012, at 10:11 AM, Rob Battle wrote:
> 
>> Chris,
>> 
>> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  
>> 
>> [1] http://parliament.semwebcentral.org
>> [2] http://geosparql.bbn.com
>> 
>> -rob
>> 
>> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>> 
>>> Thanks for the FYI on this, Lewis!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>> 
>>>> Hi,
>>>> 
>>>> Thought I would chime in here as it's a very interesting thread and I think
>>>> this is relevant.
>>>> 
>>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>>>> castagna.lists@googlemail.com> wrote:
>>>> 
>>>>> 
>>>>> I see two main use cases here:
>>>>> 
>>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>> perform queries over it.
>>>>> 
>>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>> 
>>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>>> the library is available on maven central. It will be a step in the right
>>>> direction to achieving (part) of an overall potential use case as above.
>>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>>> also something which is required outside of RDF stores, but as you mention
>>>> this is another story which we need to focus on separately.
>>>> 
>>>> I'll certainly get in touch when the above is done to update you guys.
>>>> 
>>>> Lewis
>>> 
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>> 
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by Robert Vesse <rv...@yarcdata.com>.
So I think was Chris is asking is whether your implementation of GeoSPARQL is written in such a way that it is easily divorce able from the spatial index backend i.e. could you contribute the code that implements all the relevant functions in such a way that we could plug in alternative backends and/or libraries for spatial representation and calculation?

Rob

On Apr 3, 2012, at 10:59 AM, Mattmann, Chris A (388J) wrote:

> Hey Rob,
> 
> Thanks for the FYI on this. I will check it out. I would still like to figure out how to implement GeoSPARQL 
> with SIS if for nothing else than to harden SIS and to make it better in a SIS selfish way :) I am out to make
> it an awesome ALv2 licensed spatial implementation.
> 
> Regarding Parliament, what are you guys using for your spatial library? Is it something that is ALv2? If not,
> would you consider SIS as a potential alternative to whatever spatial library you guys are using? Or, should I 
> ask, are you part of the Parliament project?
> 
> Thanks for the FYI again.
> 
> Cheers,
> Chris
> 
> On Apr 3, 2012, at 10:11 AM, Rob Battle wrote:
> 
>> Chris,
>> 
>> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  
>> 
>> [1] http://parliament.semwebcentral.org
>> [2] http://geosparql.bbn.com
>> 
>> -rob
>> 
>> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>> 
>>> Thanks for the FYI on this, Lewis!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>> 
>>>> Hi,
>>>> 
>>>> Thought I would chime in here as it's a very interesting thread and I think
>>>> this is relevant.
>>>> 
>>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>>>> castagna.lists@googlemail.com> wrote:
>>>> 
>>>>> 
>>>>> I see two main use cases here:
>>>>> 
>>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>> perform queries over it.
>>>>> 
>>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>> 
>>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>>> the library is available on maven central. It will be a step in the right
>>>> direction to achieving (part) of an overall potential use case as above.
>>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>>> also something which is required outside of RDF stores, but as you mention
>>>> this is another story which we need to focus on separately.
>>>> 
>>>> I'll certainly get in touch when the above is done to update you guys.
>>>> 
>>>> Lewis
>>> 
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>> 
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by Rob Battle <rb...@bbn.com>.
Chris,

I am the primary developer of the GeoSPARQL index in Parliament.  The spatial part is certainly pluggable. We have an interface for the index and have implemented it with a variety of backends including a persistent RTree from the degree project [1], PostgreSQL w/PostGIS [2], and an in-memory quad tree using JTS [3].   One part that is not pluggable is the query engine for for Parliament (which replaces the default ARQ query engine).  We optimize the query such that the parts of the query that can be answered more efficiently by the spatial index are executed first.  We would certainly consider using SIS if it could be easily integrated and provides better performance than what we are currently using.  TBH I hadn't heard of it before I saw this thread.

Unfortunately, I'm not sure if our implementation is ALv2.  Parliament is BSD licensed and some of the libraries we link to are LGPL.

-rob

[1] http://www.deegree.org
[2] http://www.postgis.org
[3] http://sourceforge.net/projects/jts-topo-suite/?_test=b

On Apr 3, 2012, at 1:59 PM, Mattmann, Chris A (388J) wrote:

> Hey Rob,
> 
> Thanks for the FYI on this. I will check it out. I would still like to figure out how to implement GeoSPARQL 
> with SIS if for nothing else than to harden SIS and to make it better in a SIS selfish way :) I am out to make
> it an awesome ALv2 licensed spatial implementation.
> 
> Regarding Parliament, what are you guys using for your spatial library? Is it something that is ALv2? If not,
> would you consider SIS as a potential alternative to whatever spatial library you guys are using? Or, should I 
> ask, are you part of the Parliament project?
> 
> Thanks for the FYI again.
> 
> Cheers,
> Chris
> 
> On Apr 3, 2012, at 10:11 AM, Rob Battle wrote:
> 
>> Chris,
>> 
>> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  
>> 
>> [1] http://parliament.semwebcentral.org
>> [2] http://geosparql.bbn.com
>> 
>> -rob
>> 
>> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>> 
>>> Thanks for the FYI on this, Lewis!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>> 
>>>> Hi,
>>>> 
>>>> Thought I would chime in here as it's a very interesting thread and I think
>>>> this is relevant.
>>>> 
>>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>>>> castagna.lists@googlemail.com> wrote:
>>>> 
>>>>> 
>>>>> I see two main use cases here:
>>>>> 
>>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>> perform queries over it.
>>>>> 
>>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>> 
>>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>>> the library is available on maven central. It will be a step in the right
>>>> direction to achieving (part) of an overall potential use case as above.
>>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>>> also something which is required outside of RDF stores, but as you mention
>>>> this is another story which we need to focus on separately.
>>>> 
>>>> I'll certainly get in touch when the above is done to update you guys.
>>>> 
>>>> Lewis
>>> 
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>> 
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by Rob Battle <rb...@bbn.com>.
Chris,

I am the primary developer of the GeoSPARQL index in Parliament.  The spatial part is certainly pluggable. We have an interface for the index and have implemented it with a variety of backends including a persistent RTree from the degree project [1], PostgreSQL w/PostGIS [2], and an in-memory quad tree using JTS [3].   One part that is not pluggable is the query engine for for Parliament (which replaces the default ARQ query engine).  We optimize the query such that the parts of the query that can be answered more efficiently by the spatial index are executed first.  We would certainly consider using SIS if it could be easily integrated and provides better performance than what we are currently using.  TBH I hadn't heard of it before I saw this thread.

Unfortunately, I'm not sure if our implementation is ALv2.  Parliament is BSD licensed and some of the libraries we link to are LGPL.

-rob

[1] http://www.deegree.org
[2] http://www.postgis.org
[3] http://sourceforge.net/projects/jts-topo-suite/?_test=b

On Apr 3, 2012, at 1:59 PM, Mattmann, Chris A (388J) wrote:

> Hey Rob,
> 
> Thanks for the FYI on this. I will check it out. I would still like to figure out how to implement GeoSPARQL 
> with SIS if for nothing else than to harden SIS and to make it better in a SIS selfish way :) I am out to make
> it an awesome ALv2 licensed spatial implementation.
> 
> Regarding Parliament, what are you guys using for your spatial library? Is it something that is ALv2? If not,
> would you consider SIS as a potential alternative to whatever spatial library you guys are using? Or, should I 
> ask, are you part of the Parliament project?
> 
> Thanks for the FYI again.
> 
> Cheers,
> Chris
> 
> On Apr 3, 2012, at 10:11 AM, Rob Battle wrote:
> 
>> Chris,
>> 
>> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  
>> 
>> [1] http://parliament.semwebcentral.org
>> [2] http://geosparql.bbn.com
>> 
>> -rob
>> 
>> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>> 
>>> Thanks for the FYI on this, Lewis!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>> 
>>>> Hi,
>>>> 
>>>> Thought I would chime in here as it's a very interesting thread and I think
>>>> this is relevant.
>>>> 
>>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>>>> castagna.lists@googlemail.com> wrote:
>>>> 
>>>>> 
>>>>> I see two main use cases here:
>>>>> 
>>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>> perform queries over it.
>>>>> 
>>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>> 
>>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>>> the library is available on maven central. It will be a step in the right
>>>> direction to achieving (part) of an overall potential use case as above.
>>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>>> also something which is required outside of RDF stores, but as you mention
>>>> this is another story which we need to focus on separately.
>>>> 
>>>> I'll certainly get in touch when the above is done to update you guys.
>>>> 
>>>> Lewis
>>> 
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>> 
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Rob,

Thanks for the FYI on this. I will check it out. I would still like to figure out how to implement GeoSPARQL 
with SIS if for nothing else than to harden SIS and to make it better in a SIS selfish way :) I am out to make
it an awesome ALv2 licensed spatial implementation.

Regarding Parliament, what are you guys using for your spatial library? Is it something that is ALv2? If not,
would you consider SIS as a potential alternative to whatever spatial library you guys are using? Or, should I 
ask, are you part of the Parliament project?

Thanks for the FYI again.

Cheers,
Chris

On Apr 3, 2012, at 10:11 AM, Rob Battle wrote:

> Chris,
> 
> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  
> 
> [1] http://parliament.semwebcentral.org
> [2] http://geosparql.bbn.com
> 
> -rob
> 
> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
> 
>> Thanks for the FYI on this, Lewis!
>> 
>> Cheers,
>> Chris
>> 
>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>> 
>>> Hi,
>>> 
>>> Thought I would chime in here as it's a very interesting thread and I think
>>> this is relevant.
>>> 
>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>>> castagna.lists@googlemail.com> wrote:
>>> 
>>>> 
>>>> I see two main use cases here:
>>>> 
>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>> perform queries over it.
>>>> 
>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>> 
>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>> the library is available on maven central. It will be a step in the right
>>> direction to achieving (part) of an overall potential use case as above.
>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>> also something which is required outside of RDF stores, but as you mention
>>> this is another story which we need to focus on separately.
>>> 
>>> I'll certainly get in touch when the above is done to update you guys.
>>> 
>>> Lewis
>> 
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Dave,

Thanks for the FYI on this...

Cheers,
Chris

On Apr 3, 2012, at 10:21 AM, Dave Kolas wrote:

> Chris, all,
> 
> Just FYI, the GeoSPARQL specification is currently getting final 
> approval by the OGC technical committee.  The vote ends on Apr 7, and 
> the finished spec should be available shortly after that.
> 
> --Dave--
> 
> On 4/3/2012 1:11 PM, Rob Battle wrote:
>> Chris,
>> 
>> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].
>> 
>> [1] http://parliament.semwebcentral.org
>> [2] http://geosparql.bbn.com
>> 
>> -rob
>> 
>> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>> 
>>> Thanks for the FYI on this, Lewis!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>> 
>>>> Hi,
>>>> 
>>>> Thought I would chime in here as it's a very interesting thread and I think
>>>> this is relevant.
>>>> 
>>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna<
>>>> castagna.lists@googlemail.com>  wrote:
>>>> 
>>>>> I see two main use cases here:
>>>>> 
>>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>>  perform queries over it.
>>>>> 
>>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>> 
>>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>>> the library is available on maven central. It will be a step in the right
>>>> direction to achieving (part) of an overall potential use case as above.
>>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>>> also something which is required outside of RDF stores, but as you mention
>>>> this is another story which we need to focus on separately.
>>>> 
>>>> I'll certainly get in touch when the above is done to update you guys.
>>>> 
>>>> Lewis
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Dave,

Thanks for the FYI on this...

Cheers,
Chris

On Apr 3, 2012, at 10:21 AM, Dave Kolas wrote:

> Chris, all,
> 
> Just FYI, the GeoSPARQL specification is currently getting final 
> approval by the OGC technical committee.  The vote ends on Apr 7, and 
> the finished spec should be available shortly after that.
> 
> --Dave--
> 
> On 4/3/2012 1:11 PM, Rob Battle wrote:
>> Chris,
>> 
>> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].
>> 
>> [1] http://parliament.semwebcentral.org
>> [2] http://geosparql.bbn.com
>> 
>> -rob
>> 
>> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>> 
>>> Thanks for the FYI on this, Lewis!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>> 
>>>> Hi,
>>>> 
>>>> Thought I would chime in here as it's a very interesting thread and I think
>>>> this is relevant.
>>>> 
>>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna<
>>>> castagna.lists@googlemail.com>  wrote:
>>>> 
>>>>> I see two main use cases here:
>>>>> 
>>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>>  perform queries over it.
>>>>> 
>>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>> 
>>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>>> the library is available on maven central. It will be a step in the right
>>>> direction to achieving (part) of an overall potential use case as above.
>>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>>> also something which is required outside of RDF stores, but as you mention
>>>> this is another story which we need to focus on separately.
>>>> 
>>>> I'll certainly get in touch when the above is done to update you guys.
>>>> 
>>>> Lewis
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Dave,

Thanks for the FYI on this...

Cheers,
Chris

On Apr 3, 2012, at 10:21 AM, Dave Kolas wrote:

> Chris, all,
> 
> Just FYI, the GeoSPARQL specification is currently getting final 
> approval by the OGC technical committee.  The vote ends on Apr 7, and 
> the finished spec should be available shortly after that.
> 
> --Dave--
> 
> On 4/3/2012 1:11 PM, Rob Battle wrote:
>> Chris,
>> 
>> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].
>> 
>> [1] http://parliament.semwebcentral.org
>> [2] http://geosparql.bbn.com
>> 
>> -rob
>> 
>> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>> 
>>> Thanks for the FYI on this, Lewis!
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>> 
>>>> Hi,
>>>> 
>>>> Thought I would chime in here as it's a very interesting thread and I think
>>>> this is relevant.
>>>> 
>>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna<
>>>> castagna.lists@googlemail.com>  wrote:
>>>> 
>>>>> I see two main use cases here:
>>>>> 
>>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>>  perform queries over it.
>>>>> 
>>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>> 
>>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>>> the library is available on maven central. It will be a step in the right
>>>> direction to achieving (part) of an overall potential use case as above.
>>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>>> also something which is required outside of RDF stores, but as you mention
>>>> this is another story which we need to focus on separately.
>>>> 
>>>> I'll certainly get in touch when the above is done to update you guys.
>>>> 
>>>> Lewis
>>> 
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> 
>> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Dave Kolas <dk...@bbn.com>.
Chris, all,

Just FYI, the GeoSPARQL specification is currently getting final 
approval by the OGC technical committee.  The vote ends on Apr 7, and 
the finished spec should be available shortly after that.

--Dave--

On 4/3/2012 1:11 PM, Rob Battle wrote:
> Chris,
>
> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].
>
> [1] http://parliament.semwebcentral.org
> [2] http://geosparql.bbn.com
>
> -rob
>
> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>
>> Thanks for the FYI on this, Lewis!
>>
>> Cheers,
>> Chris
>>
>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>
>>> Hi,
>>>
>>> Thought I would chime in here as it's a very interesting thread and I think
>>> this is relevant.
>>>
>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna<
>>> castagna.lists@googlemail.com>  wrote:
>>>
>>>> I see two main use cases here:
>>>>
>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>   perform queries over it.
>>>>
>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>
>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>> the library is available on maven central. It will be a step in the right
>>> direction to achieving (part) of an overall potential use case as above.
>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>> also something which is required outside of RDF stores, but as you mention
>>> this is another story which we need to focus on separately.
>>>
>>> I'll certainly get in touch when the above is done to update you guys.
>>>
>>> Lewis
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>

Re: Implementing GeoSPARQL

Posted by Dave Kolas <dk...@bbn.com>.
Chris, all,

Just FYI, the GeoSPARQL specification is currently getting final 
approval by the OGC technical committee.  The vote ends on Apr 7, and 
the finished spec should be available shortly after that.

--Dave--

On 4/3/2012 1:11 PM, Rob Battle wrote:
> Chris,
>
> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].
>
> [1] http://parliament.semwebcentral.org
> [2] http://geosparql.bbn.com
>
> -rob
>
> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>
>> Thanks for the FYI on this, Lewis!
>>
>> Cheers,
>> Chris
>>
>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>
>>> Hi,
>>>
>>> Thought I would chime in here as it's a very interesting thread and I think
>>> this is relevant.
>>>
>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna<
>>> castagna.lists@googlemail.com>  wrote:
>>>
>>>> I see two main use cases here:
>>>>
>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>   perform queries over it.
>>>>
>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>
>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>> the library is available on maven central. It will be a step in the right
>>> direction to achieving (part) of an overall potential use case as above.
>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>> also something which is required outside of RDF stores, but as you mention
>>> this is another story which we need to focus on separately.
>>>
>>> I'll certainly get in touch when the above is done to update you guys.
>>>
>>> Lewis
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>

Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Rob,

Thanks for the FYI on this. I will check it out. I would still like to figure out how to implement GeoSPARQL 
with SIS if for nothing else than to harden SIS and to make it better in a SIS selfish way :) I am out to make
it an awesome ALv2 licensed spatial implementation.

Regarding Parliament, what are you guys using for your spatial library? Is it something that is ALv2? If not,
would you consider SIS as a potential alternative to whatever spatial library you guys are using? Or, should I 
ask, are you part of the Parliament project?

Thanks for the FYI again.

Cheers,
Chris

On Apr 3, 2012, at 10:11 AM, Rob Battle wrote:

> Chris,
> 
> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  
> 
> [1] http://parliament.semwebcentral.org
> [2] http://geosparql.bbn.com
> 
> -rob
> 
> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
> 
>> Thanks for the FYI on this, Lewis!
>> 
>> Cheers,
>> Chris
>> 
>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>> 
>>> Hi,
>>> 
>>> Thought I would chime in here as it's a very interesting thread and I think
>>> this is relevant.
>>> 
>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>>> castagna.lists@googlemail.com> wrote:
>>> 
>>>> 
>>>> I see two main use cases here:
>>>> 
>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>> perform queries over it.
>>>> 
>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>> 
>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>> the library is available on maven central. It will be a step in the right
>>> direction to achieving (part) of an overall potential use case as above.
>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>> also something which is required outside of RDF stores, but as you mention
>>> this is another story which we need to focus on separately.
>>> 
>>> I'll certainly get in touch when the above is done to update you guys.
>>> 
>>> Lewis
>> 
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Dave Kolas <dk...@bbn.com>.
Chris, all,

Just FYI, the GeoSPARQL specification is currently getting final 
approval by the OGC technical committee.  The vote ends on Apr 7, and 
the finished spec should be available shortly after that.

--Dave--

On 4/3/2012 1:11 PM, Rob Battle wrote:
> Chris,
>
> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].
>
> [1] http://parliament.semwebcentral.org
> [2] http://geosparql.bbn.com
>
> -rob
>
> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
>
>> Thanks for the FYI on this, Lewis!
>>
>> Cheers,
>> Chris
>>
>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>>
>>> Hi,
>>>
>>> Thought I would chime in here as it's a very interesting thread and I think
>>> this is relevant.
>>>
>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna<
>>> castagna.lists@googlemail.com>  wrote:
>>>
>>>> I see two main use cases here:
>>>>
>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>>   perform queries over it.
>>>>
>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>>
>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>> the library is available on maven central. It will be a step in the right
>>> direction to achieving (part) of an overall potential use case as above.
>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>> also something which is required outside of RDF stores, but as you mention
>>> this is another story which we need to focus on separately.
>>>
>>> I'll certainly get in touch when the above is done to update you guys.
>>>
>>> Lewis
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>

Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Rob,

Thanks for the FYI on this. I will check it out. I would still like to figure out how to implement GeoSPARQL 
with SIS if for nothing else than to harden SIS and to make it better in a SIS selfish way :) I am out to make
it an awesome ALv2 licensed spatial implementation.

Regarding Parliament, what are you guys using for your spatial library? Is it something that is ALv2? If not,
would you consider SIS as a potential alternative to whatever spatial library you guys are using? Or, should I 
ask, are you part of the Parliament project?

Thanks for the FYI again.

Cheers,
Chris

On Apr 3, 2012, at 10:11 AM, Rob Battle wrote:

> Chris,
> 
> I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  
> 
> [1] http://parliament.semwebcentral.org
> [2] http://geosparql.bbn.com
> 
> -rob
> 
> On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:
> 
>> Thanks for the FYI on this, Lewis!
>> 
>> Cheers,
>> Chris
>> 
>> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
>> 
>>> Hi,
>>> 
>>> Thought I would chime in here as it's a very interesting thread and I think
>>> this is relevant.
>>> 
>>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>>> castagna.lists@googlemail.com> wrote:
>>> 
>>>> 
>>>> I see two main use cases here:
>>>> 
>>>> 1. Crawling the web and build a dataset of statements with locations.
>>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>> perform queries over it.
>>>> 
>>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>>> 
>>>> I will begin working on this once we release Any23 0.7.0-incubating and
>>> the library is available on maven central. It will be a step in the right
>>> direction to achieving (part) of an overall potential use case as above.
>>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>>> work towards the pluggable datastores as you said Paolo, and yes this is
>>> also something which is required outside of RDF stores, but as you mention
>>> this is another story which we need to focus on separately.
>>> 
>>> I'll certainly get in touch when the above is done to update you guys.
>>> 
>>> Lewis
>> 
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Rob Battle <rb...@bbn.com>.
Chris,

I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  

[1] http://parliament.semwebcentral.org
[2] http://geosparql.bbn.com

-rob

On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:

> Thanks for the FYI on this, Lewis!
> 
> Cheers,
> Chris
> 
> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
> 
>> Hi,
>> 
>> Thought I would chime in here as it's a very interesting thread and I think
>> this is relevant.
>> 
>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>> castagna.lists@googlemail.com> wrote:
>> 
>>> 
>>> I see two main use cases here:
>>> 
>>> 1. Crawling the web and build a dataset of statements with locations.
>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>  perform queries over it.
>>> 
>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>> 
>>> I will begin working on this once we release Any23 0.7.0-incubating and
>> the library is available on maven central. It will be a step in the right
>> direction to achieving (part) of an overall potential use case as above.
>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>> work towards the pluggable datastores as you said Paolo, and yes this is
>> also something which is required outside of RDF stores, but as you mention
>> this is another story which we need to focus on separately.
>> 
>> I'll certainly get in touch when the above is done to update you guys.
>> 
>> Lewis
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by Rob Battle <rb...@bbn.com>.
Chris,

I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  

[1] http://parliament.semwebcentral.org
[2] http://geosparql.bbn.com

-rob

On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:

> Thanks for the FYI on this, Lewis!
> 
> Cheers,
> Chris
> 
> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
> 
>> Hi,
>> 
>> Thought I would chime in here as it's a very interesting thread and I think
>> this is relevant.
>> 
>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>> castagna.lists@googlemail.com> wrote:
>> 
>>> 
>>> I see two main use cases here:
>>> 
>>> 1. Crawling the web and build a dataset of statements with locations.
>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>  perform queries over it.
>>> 
>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>> 
>>> I will begin working on this once we release Any23 0.7.0-incubating and
>> the library is available on maven central. It will be a step in the right
>> direction to achieving (part) of an overall potential use case as above.
>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>> work towards the pluggable datastores as you said Paolo, and yes this is
>> also something which is required outside of RDF stores, but as you mention
>> this is another story which we need to focus on separately.
>> 
>> I'll certainly get in touch when the above is done to update you guys.
>> 
>> Lewis
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by Rob Battle <rb...@bbn.com>.
Chris,

I just wanted to let you know that the Parliament [1] triple store (a triple store that is built with a Jena interface) has a mostly complete implementation of GeoSPARQL.  We have implemented the property functions and filter functions for GeoSPARQL on top of our existing spatial index.  For more info, you can check out [2].  

[1] http://parliament.semwebcentral.org
[2] http://geosparql.bbn.com

-rob

On Apr 2, 2012, at 10:10 PM, Mattmann, Chris A (388J) wrote:

> Thanks for the FYI on this, Lewis!
> 
> Cheers,
> Chris
> 
> On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:
> 
>> Hi,
>> 
>> Thought I would chime in here as it's a very interesting thread and I think
>> this is relevant.
>> 
>> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
>> castagna.lists@googlemail.com> wrote:
>> 
>>> 
>>> I see two main use cases here:
>>> 
>>> 1. Crawling the web and build a dataset of statements with locations.
>>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>>  perform queries over it.
>>> 
>>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>>> 
>>> I will begin working on this once we release Any23 0.7.0-incubating and
>> the library is available on maven central. It will be a step in the right
>> direction to achieving (part) of an overall potential use case as above.
>> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
>> work towards the pluggable datastores as you said Paolo, and yes this is
>> also something which is required outside of RDF stores, but as you mention
>> this is another story which we need to focus on separately.
>> 
>> I'll certainly get in touch when the above is done to update you guys.
>> 
>> Lewis
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Thanks for the FYI on this, Lewis!

Cheers,
Chris

On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:

> Hi,
> 
> Thought I would chime in here as it's a very interesting thread and I think
> this is relevant.
> 
> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
> castagna.lists@googlemail.com> wrote:
> 
>> 
>> I see two main use cases here:
>> 
>> 1. Crawling the web and build a dataset of statements with locations.
>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>   perform queries over it.
>> 
>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>> 
>> I will begin working on this once we release Any23 0.7.0-incubating and
> the library is available on maven central. It will be a step in the right
> direction to achieving (part) of an overall potential use case as above.
> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
> work towards the pluggable datastores as you said Paolo, and yes this is
> also something which is required outside of RDF stores, but as you mention
> this is another story which we need to focus on separately.
> 
> I'll certainly get in touch when the above is done to update you guys.
> 
> Lewis


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Thanks for the FYI on this, Lewis!

Cheers,
Chris

On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:

> Hi,
> 
> Thought I would chime in here as it's a very interesting thread and I think
> this is relevant.
> 
> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
> castagna.lists@googlemail.com> wrote:
> 
>> 
>> I see two main use cases here:
>> 
>> 1. Crawling the web and build a dataset of statements with locations.
>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>   perform queries over it.
>> 
>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>> 
>> I will begin working on this once we release Any23 0.7.0-incubating and
> the library is available on maven central. It will be a step in the right
> direction to achieving (part) of an overall potential use case as above.
> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
> work towards the pluggable datastores as you said Paolo, and yes this is
> also something which is required outside of RDF stores, but as you mention
> this is another story which we need to focus on separately.
> 
> I'll certainly get in touch when the above is done to update you guys.
> 
> Lewis


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Thanks for the FYI on this, Lewis!

Cheers,
Chris

On Apr 2, 2012, at 4:29 AM, Lewis John Mcgibbney wrote:

> Hi,
> 
> Thought I would chime in here as it's a very interesting thread and I think
> this is relevant.
> 
> On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
> castagna.lists@googlemail.com> wrote:
> 
>> 
>> I see two main use cases here:
>> 
>> 1. Crawling the web and build a dataset of statements with locations.
>> 2. Indexing a dataset with statements with locations and extend SPARQL to
>>   perform queries over it.
>> 
>> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>> 
>> I will begin working on this once we release Any23 0.7.0-incubating and
> the library is available on maven central. It will be a step in the right
> direction to achieving (part) of an overall potential use case as above.
> Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
> work towards the pluggable datastores as you said Paolo, and yes this is
> also something which is required outside of RDF stores, but as you mention
> this is another story which we need to focus on separately.
> 
> I'll certainly get in touch when the above is done to update you guys.
> 
> Lewis


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Lewis John Mcgibbney <le...@gmail.com>.
Hi,

Thought I would chime in here as it's a very interesting thread and I think
this is relevant.

On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
castagna.lists@googlemail.com> wrote:

>
> I see two main use cases here:
>
>  1. Crawling the web and build a dataset of statements with locations.
>  2. Indexing a dataset with statements with locations and extend SPARQL to
>    perform queries over it.
>
> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>
> I will begin working on this once we release Any23 0.7.0-incubating and
the library is available on maven central. It will be a step in the right
direction to achieving (part) of an overall potential use case as above.
Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
work towards the pluggable datastores as you said Paolo, and yes this is
also something which is required outside of RDF stores, but as you mention
this is another story which we need to focus on separately.

I'll certainly get in touch when the above is done to update you guys.

Lewis

Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Paolo,

On Apr 2, 2012, at 2:01 AM, Paolo Castagna wrote:

> Hi Chris
> 
> Mattmann, Chris A (388J) wrote:
>>> In my head, SIS (which I do not know very well) is a low-level geo indexing
>>> library which could be used to provide the indexing capability for a GeoSPARQL
>>> implementation.
>> 
>> Yeah that's what I was thinking too.
> 
> Ok.
> 
> I've played with Lucene spatial capabilities for this sort of things in the
> past. My knowledge of Apache SIS is very limited. In particular, it is not
> clear to me how/when things are persisted on disk. My impression is that SIS
> load the entire index in RAM when it starts and serializes it out at the end.
> Am I right? (I hope not. :-))

Well, what's the end? Basically SIS does in fact build a Quad Tree index, and
it has the capability to store it to disk and update it when requested. SIS is
both an API and web service at the moment, and it can be improved significantly,
that's for sure, but it's functional for now.

> 
> If that is the case, it could be an issue for large indexes.

Possibly -- not sure -- I guess we'll find out. If it falls down, it can always be
made more scalable, it's just a basic spatial data structure.

BTW, SIS started out by importing the code from Lucene spatial which
Patrick O'Leary one of the creators of SIS, wrote. Patrick was the creator
of Local Lucene and Local Solr, the first appearance of these functionalities
in Lucene.

> 
>>> I know that ARQ (i.e. the SPARQL query engine available in Jena) can
>>> provide you with a SPARQL 1.1 engine and extension points to use other
>>> custom indexes (such as SIS in this case).
>>> 
>>> What exactly do you mean with "integrating with Any23"?
>>> Do you mean crawling the web and extract lat/long from web pages?
>> 
>> Yep that's what I was thinking -- maybe doing it in Any23, and/or Tika.
> 
> Doing a web crawl to extract locations out of web pages using Any23|Tika seems
> quite an useful thing for certain use cases.
> 
> In other scenarios people might already have a large dataset with locations in
> it or people might want to leverage datasets such as Geonames, Freebase,
> DBPedia, Yahoo GeoPlanet, etc. so crawling in these use cases is less important.

Yeah potentially. But representing those geo locations and coordinates in a common
way is important.

> 
>>> Where will you store those RDF statements?
>> 
>> It looks like Any23 would store to Sesame -- is that the case?
> 
> Probably.
> 
> It would be nice to have pluggable RDF stores in Any23, but this is another
> story: https://issues.apache.org/jira/browse/ANY23-19 :-)

Yep, thanks for filing that. I'm sure someone will get around to it :)

> 
>>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>>> query engine (such as ARQ)?
>> 
>> I need that too :) I just don't understand it as well (and understand the Any23/Tika
>> and SIS part better). I'll have to learn Jena it looks like though, you game to 
>> help me out?
> 
> A very old prototype which shows you how you can extend ARQ is here:
> https://github.com/castagna/GeoARQ

Awesome thanks for the link. I'll check it out.

> 
> It is just a prototype and it is using ARQ's property functions rather than
> filter functions (and it is using Lucene spatial rather than SIS). But, it
> is IMHO a good starting point to see how you could have ARQ using a custom
> index to perform spatial searches.

Great.

> 
> The reason why at the time I used Lucene spatial is because that was the only
> alternative (non (L)GPL) I found (I did not know about SIS at the time).

No worries. We created SIS b/c we didn't think this functionality was unique
to Lucene and we wanted an ALv2 licensed toolkit. It's been around since
February 2010. See here:

http://wiki.apache.org/incubator/SpatialProposal

> 
> The reason why I did not implemented GeoSPARQL is simplicity, I wanted just a
> proof of concept and the most important use case IMHO is searching things around
> a point and returning results sorted by distance.
> 
> For GeoSPARQL (which I need to go back an read properly) do we need custom
> FILTER functions or property functions (or both)?

I guess I need to read the spec more to find out :)

> 
>>> IMHO geo location (as well as free text) are two SPARQL extensions which
>>> are very useful in loads of use cases.
>> 
>> Yep I'm super excited to get this implemented. You interested in helping? I think
>> we can bring together Tika, Any23, Jena and SIS here...
> 
> I am interested in learning more about SIS, I have no idea at the moment on how
> much effort is necessary to implement GeoSPARQL and if that spec is going to be
> implemented elsewhere by other RDF stores.

Well even if it is implemented elsewhere by other RDF stores, it wouldn't stop me
from implementing something in SIS. I think it's a great use case for SIS and something
of use to the broader community. The fact that we can't name a ton of other RDF
stores that implement GeoSPARQL is an indication to me that it's not supported
or in widespread use -- so the time is ripe I guess.

> 
> At the moment, I cannot put much effort on this. But, if something similar a la
> LARQ and/or GeoARQ is useful and I can help, I'll do it.
> 
> I see two main use cases here:
> 
> 1. Crawling the web and build a dataset of statements with locations.
> 2. Indexing a dataset with statements with locations and extend SPARQL to
>    perform queries over it.
> 
> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
> For 2. you need SIS and a SPARQL query engine (which of course uses an RDF store).
> 
> If I were trying to implement GeoSPARQL, I would start with 2., SIS and ARQ.

Thanks, will do.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Lewis John Mcgibbney <le...@gmail.com>.
Hi,

Thought I would chime in here as it's a very interesting thread and I think
this is relevant.

On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
castagna.lists@googlemail.com> wrote:

>
> I see two main use cases here:
>
>  1. Crawling the web and build a dataset of statements with locations.
>  2. Indexing a dataset with statements with locations and extend SPARQL to
>    perform queries over it.
>
> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>
> I will begin working on this once we release Any23 0.7.0-incubating and
the library is available on maven central. It will be a step in the right
direction to achieving (part) of an overall potential use case as above.
Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
work towards the pluggable datastores as you said Paolo, and yes this is
also something which is required outside of RDF stores, but as you mention
this is another story which we need to focus on separately.

I'll certainly get in touch when the above is done to update you guys.

Lewis

Re: Implementing GeoSPARQL

Posted by Lewis John Mcgibbney <le...@gmail.com>.
Hi,

Thought I would chime in here as it's a very interesting thread and I think
this is relevant.

On Mon, Apr 2, 2012 at 10:01 AM, Paolo Castagna <
castagna.lists@googlemail.com> wrote:

>
> I see two main use cases here:
>
>  1. Crawling the web and build a dataset of statements with locations.
>  2. Indexing a dataset with statements with locations and extend SPARQL to
>    perform queries over it.
>
> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
>
> I will begin working on this once we release Any23 0.7.0-incubating and
the library is available on maven central. It will be a step in the right
direction to achieving (part) of an overall potential use case as above.
Any23/Tika wrapped plugin for Apache Nutch. It would be really great to
work towards the pluggable datastores as you said Paolo, and yes this is
also something which is required outside of RDF stores, but as you mention
this is another story which we need to focus on separately.

I'll certainly get in touch when the above is done to update you guys.

Lewis

Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Paolo,

On Apr 2, 2012, at 2:01 AM, Paolo Castagna wrote:

> Hi Chris
> 
> Mattmann, Chris A (388J) wrote:
>>> In my head, SIS (which I do not know very well) is a low-level geo indexing
>>> library which could be used to provide the indexing capability for a GeoSPARQL
>>> implementation.
>> 
>> Yeah that's what I was thinking too.
> 
> Ok.
> 
> I've played with Lucene spatial capabilities for this sort of things in the
> past. My knowledge of Apache SIS is very limited. In particular, it is not
> clear to me how/when things are persisted on disk. My impression is that SIS
> load the entire index in RAM when it starts and serializes it out at the end.
> Am I right? (I hope not. :-))

Well, what's the end? Basically SIS does in fact build a Quad Tree index, and
it has the capability to store it to disk and update it when requested. SIS is
both an API and web service at the moment, and it can be improved significantly,
that's for sure, but it's functional for now.

> 
> If that is the case, it could be an issue for large indexes.

Possibly -- not sure -- I guess we'll find out. If it falls down, it can always be
made more scalable, it's just a basic spatial data structure.

BTW, SIS started out by importing the code from Lucene spatial which
Patrick O'Leary one of the creators of SIS, wrote. Patrick was the creator
of Local Lucene and Local Solr, the first appearance of these functionalities
in Lucene.

> 
>>> I know that ARQ (i.e. the SPARQL query engine available in Jena) can
>>> provide you with a SPARQL 1.1 engine and extension points to use other
>>> custom indexes (such as SIS in this case).
>>> 
>>> What exactly do you mean with "integrating with Any23"?
>>> Do you mean crawling the web and extract lat/long from web pages?
>> 
>> Yep that's what I was thinking -- maybe doing it in Any23, and/or Tika.
> 
> Doing a web crawl to extract locations out of web pages using Any23|Tika seems
> quite an useful thing for certain use cases.
> 
> In other scenarios people might already have a large dataset with locations in
> it or people might want to leverage datasets such as Geonames, Freebase,
> DBPedia, Yahoo GeoPlanet, etc. so crawling in these use cases is less important.

Yeah potentially. But representing those geo locations and coordinates in a common
way is important.

> 
>>> Where will you store those RDF statements?
>> 
>> It looks like Any23 would store to Sesame -- is that the case?
> 
> Probably.
> 
> It would be nice to have pluggable RDF stores in Any23, but this is another
> story: https://issues.apache.org/jira/browse/ANY23-19 :-)

Yep, thanks for filing that. I'm sure someone will get around to it :)

> 
>>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>>> query engine (such as ARQ)?
>> 
>> I need that too :) I just don't understand it as well (and understand the Any23/Tika
>> and SIS part better). I'll have to learn Jena it looks like though, you game to 
>> help me out?
> 
> A very old prototype which shows you how you can extend ARQ is here:
> https://github.com/castagna/GeoARQ

Awesome thanks for the link. I'll check it out.

> 
> It is just a prototype and it is using ARQ's property functions rather than
> filter functions (and it is using Lucene spatial rather than SIS). But, it
> is IMHO a good starting point to see how you could have ARQ using a custom
> index to perform spatial searches.

Great.

> 
> The reason why at the time I used Lucene spatial is because that was the only
> alternative (non (L)GPL) I found (I did not know about SIS at the time).

No worries. We created SIS b/c we didn't think this functionality was unique
to Lucene and we wanted an ALv2 licensed toolkit. It's been around since
February 2010. See here:

http://wiki.apache.org/incubator/SpatialProposal

> 
> The reason why I did not implemented GeoSPARQL is simplicity, I wanted just a
> proof of concept and the most important use case IMHO is searching things around
> a point and returning results sorted by distance.
> 
> For GeoSPARQL (which I need to go back an read properly) do we need custom
> FILTER functions or property functions (or both)?

I guess I need to read the spec more to find out :)

> 
>>> IMHO geo location (as well as free text) are two SPARQL extensions which
>>> are very useful in loads of use cases.
>> 
>> Yep I'm super excited to get this implemented. You interested in helping? I think
>> we can bring together Tika, Any23, Jena and SIS here...
> 
> I am interested in learning more about SIS, I have no idea at the moment on how
> much effort is necessary to implement GeoSPARQL and if that spec is going to be
> implemented elsewhere by other RDF stores.

Well even if it is implemented elsewhere by other RDF stores, it wouldn't stop me
from implementing something in SIS. I think it's a great use case for SIS and something
of use to the broader community. The fact that we can't name a ton of other RDF
stores that implement GeoSPARQL is an indication to me that it's not supported
or in widespread use -- so the time is ripe I guess.

> 
> At the moment, I cannot put much effort on this. But, if something similar a la
> LARQ and/or GeoARQ is useful and I can help, I'll do it.
> 
> I see two main use cases here:
> 
> 1. Crawling the web and build a dataset of statements with locations.
> 2. Indexing a dataset with statements with locations and extend SPARQL to
>    perform queries over it.
> 
> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
> For 2. you need SIS and a SPARQL query engine (which of course uses an RDF store).
> 
> If I were trying to implement GeoSPARQL, I would start with 2., SIS and ARQ.

Thanks, will do.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Paolo,

On Apr 2, 2012, at 2:01 AM, Paolo Castagna wrote:

> Hi Chris
> 
> Mattmann, Chris A (388J) wrote:
>>> In my head, SIS (which I do not know very well) is a low-level geo indexing
>>> library which could be used to provide the indexing capability for a GeoSPARQL
>>> implementation.
>> 
>> Yeah that's what I was thinking too.
> 
> Ok.
> 
> I've played with Lucene spatial capabilities for this sort of things in the
> past. My knowledge of Apache SIS is very limited. In particular, it is not
> clear to me how/when things are persisted on disk. My impression is that SIS
> load the entire index in RAM when it starts and serializes it out at the end.
> Am I right? (I hope not. :-))

Well, what's the end? Basically SIS does in fact build a Quad Tree index, and
it has the capability to store it to disk and update it when requested. SIS is
both an API and web service at the moment, and it can be improved significantly,
that's for sure, but it's functional for now.

> 
> If that is the case, it could be an issue for large indexes.

Possibly -- not sure -- I guess we'll find out. If it falls down, it can always be
made more scalable, it's just a basic spatial data structure.

BTW, SIS started out by importing the code from Lucene spatial which
Patrick O'Leary one of the creators of SIS, wrote. Patrick was the creator
of Local Lucene and Local Solr, the first appearance of these functionalities
in Lucene.

> 
>>> I know that ARQ (i.e. the SPARQL query engine available in Jena) can
>>> provide you with a SPARQL 1.1 engine and extension points to use other
>>> custom indexes (such as SIS in this case).
>>> 
>>> What exactly do you mean with "integrating with Any23"?
>>> Do you mean crawling the web and extract lat/long from web pages?
>> 
>> Yep that's what I was thinking -- maybe doing it in Any23, and/or Tika.
> 
> Doing a web crawl to extract locations out of web pages using Any23|Tika seems
> quite an useful thing for certain use cases.
> 
> In other scenarios people might already have a large dataset with locations in
> it or people might want to leverage datasets such as Geonames, Freebase,
> DBPedia, Yahoo GeoPlanet, etc. so crawling in these use cases is less important.

Yeah potentially. But representing those geo locations and coordinates in a common
way is important.

> 
>>> Where will you store those RDF statements?
>> 
>> It looks like Any23 would store to Sesame -- is that the case?
> 
> Probably.
> 
> It would be nice to have pluggable RDF stores in Any23, but this is another
> story: https://issues.apache.org/jira/browse/ANY23-19 :-)

Yep, thanks for filing that. I'm sure someone will get around to it :)

> 
>>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>>> query engine (such as ARQ)?
>> 
>> I need that too :) I just don't understand it as well (and understand the Any23/Tika
>> and SIS part better). I'll have to learn Jena it looks like though, you game to 
>> help me out?
> 
> A very old prototype which shows you how you can extend ARQ is here:
> https://github.com/castagna/GeoARQ

Awesome thanks for the link. I'll check it out.

> 
> It is just a prototype and it is using ARQ's property functions rather than
> filter functions (and it is using Lucene spatial rather than SIS). But, it
> is IMHO a good starting point to see how you could have ARQ using a custom
> index to perform spatial searches.

Great.

> 
> The reason why at the time I used Lucene spatial is because that was the only
> alternative (non (L)GPL) I found (I did not know about SIS at the time).

No worries. We created SIS b/c we didn't think this functionality was unique
to Lucene and we wanted an ALv2 licensed toolkit. It's been around since
February 2010. See here:

http://wiki.apache.org/incubator/SpatialProposal

> 
> The reason why I did not implemented GeoSPARQL is simplicity, I wanted just a
> proof of concept and the most important use case IMHO is searching things around
> a point and returning results sorted by distance.
> 
> For GeoSPARQL (which I need to go back an read properly) do we need custom
> FILTER functions or property functions (or both)?

I guess I need to read the spec more to find out :)

> 
>>> IMHO geo location (as well as free text) are two SPARQL extensions which
>>> are very useful in loads of use cases.
>> 
>> Yep I'm super excited to get this implemented. You interested in helping? I think
>> we can bring together Tika, Any23, Jena and SIS here...
> 
> I am interested in learning more about SIS, I have no idea at the moment on how
> much effort is necessary to implement GeoSPARQL and if that spec is going to be
> implemented elsewhere by other RDF stores.

Well even if it is implemented elsewhere by other RDF stores, it wouldn't stop me
from implementing something in SIS. I think it's a great use case for SIS and something
of use to the broader community. The fact that we can't name a ton of other RDF
stores that implement GeoSPARQL is an indication to me that it's not supported
or in widespread use -- so the time is ripe I guess.

> 
> At the moment, I cannot put much effort on this. But, if something similar a la
> LARQ and/or GeoARQ is useful and I can help, I'll do it.
> 
> I see two main use cases here:
> 
> 1. Crawling the web and build a dataset of statements with locations.
> 2. Indexing a dataset with statements with locations and extend SPARQL to
>    perform queries over it.
> 
> For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
> For 2. you need SIS and a SPARQL query engine (which of course uses an RDF store).
> 
> If I were trying to implement GeoSPARQL, I would start with 2., SIS and ARQ.

Thanks, will do.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Paolo Castagna <ca...@googlemail.com>.
Hi Chris

Mattmann, Chris A (388J) wrote:
>> In my head, SIS (which I do not know very well) is a low-level geo indexing
>> library which could be used to provide the indexing capability for a GeoSPARQL
>> implementation.
> 
> Yeah that's what I was thinking too.

Ok.

I've played with Lucene spatial capabilities for this sort of things in the
past. My knowledge of Apache SIS is very limited. In particular, it is not
clear to me how/when things are persisted on disk. My impression is that SIS
load the entire index in RAM when it starts and serializes it out at the end.
Am I right? (I hope not. :-))

If that is the case, it could be an issue for large indexes.

>> I know that ARQ (i.e. the SPARQL query engine available in Jena) can
>> provide you with a SPARQL 1.1 engine and extension points to use other
>> custom indexes (such as SIS in this case).
>>
>> What exactly do you mean with "integrating with Any23"?
>> Do you mean crawling the web and extract lat/long from web pages?
> 
> Yep that's what I was thinking -- maybe doing it in Any23, and/or Tika.

Doing a web crawl to extract locations out of web pages using Any23|Tika seems
quite an useful thing for certain use cases.

In other scenarios people might already have a large dataset with locations in
it or people might want to leverage datasets such as Geonames, Freebase,
DBPedia, Yahoo GeoPlanet, etc. so crawling in these use cases is less important.

>> Where will you store those RDF statements?
> 
> It looks like Any23 would store to Sesame -- is that the case?

Probably.

It would be nice to have pluggable RDF stores in Any23, but this is another
story: https://issues.apache.org/jira/browse/ANY23-19 :-)

>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>> query engine (such as ARQ)?
> 
> I need that too :) I just don't understand it as well (and understand the Any23/Tika
> and SIS part better). I'll have to learn Jena it looks like though, you game to 
> help me out?

A very old prototype which shows you how you can extend ARQ is here:
https://github.com/castagna/GeoARQ

It is just a prototype and it is using ARQ's property functions rather than
filter functions (and it is using Lucene spatial rather than SIS). But, it
is IMHO a good starting point to see how you could have ARQ using a custom
index to perform spatial searches.

The reason why at the time I used Lucene spatial is because that was the only
alternative (non (L)GPL) I found (I did not know about SIS at the time).

The reason why I did not implemented GeoSPARQL is simplicity, I wanted just a
proof of concept and the most important use case IMHO is searching things around
a point and returning results sorted by distance.

For GeoSPARQL (which I need to go back an read properly) do we need custom
FILTER functions or property functions (or both)?

>> IMHO geo location (as well as free text) are two SPARQL extensions which
>> are very useful in loads of use cases.
> 
> Yep I'm super excited to get this implemented. You interested in helping? I think
> we can bring together Tika, Any23, Jena and SIS here...

I am interested in learning more about SIS, I have no idea at the moment on how
much effort is necessary to implement GeoSPARQL and if that spec is going to be
implemented elsewhere by other RDF stores.

At the moment, I cannot put much effort on this. But, if something similar a la
LARQ and/or GeoARQ is useful and I can help, I'll do it.

I see two main use cases here:

 1. Crawling the web and build a dataset of statements with locations.
 2. Indexing a dataset with statements with locations and extend SPARQL to
    perform queries over it.

For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
For 2. you need SIS and a SPARQL query engine (which of course uses an RDF store).

If I were trying to implement GeoSPARQL, I would start with 2., SIS and ARQ.

My 2 cents,
Paolo

> 
> Cheers,
> Chris
> 
>>> I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
>>> just to keep them in the loop.
>>>
>>> Cheers,
>>> Chris
>>>
>>> On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:
>>>
>>>> Hi there,
>>>>
>>>> I'm investigating what it would take to implement GeoSPARQL.
>>>> There is already an Apache-licensed SPARQL engine in podling Jena.
>>>>
>>>> Of the things needed are a persistent storage layer with the right 
>>>> license.  Maybe the SIS project has something to use.
>>>>
>>>> If I understand it correctly, the qtree implementation is an in-memory 
>>>> structure, with the ability to read from a serialized form on disk, and 
>>>> to be able to write it to disk in that form.
>>>>
>>>> Is there any information on scaling for the qtree?  Memory usage?
>>>>
>>>> California_Restaurants.csv is 54K points - is that typical usage size?
>>>>
>>>> (yes ... there are other things needed as well such as conversion code 
>>>> between coodinate systems, format parsers, polygon code, ... but a start 
>>>> would be just for point data in one system :)
>>>>
>>>> An open copy of the spec is available at:
>>>>
>>>> http://www.w3.org/2011/02/GeoSPARQL.pdf
>>>>
>>>> 	Andy
>>>
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> Phone: +1 (818) 354-8810
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Paolo,

On Apr 2, 2012, at 2:31 AM, Paolo Castagna wrote:

> Hi Chris
> 
> Mattmann, Chris A (388J) wrote:
>>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>>> query engine (such as ARQ)?
>> 
>> I need that too :) I just don't understand it as well (and understand the Any23/Tika
>> and SIS part better). I'll have to learn Jena it looks like though, you game to 
>> help me out?
> 
> ARQ has a couple of extension points which are useful here:
> 
> Filter Functions
> http://incubator.apache.org/jena/documentation/query/extension.html#filter-functions
> 
> Property Functions
> http://incubator.apache.org/jena/documentation/query/extension.html#property-functions
> 
> You can find examples of FILTER functions here:
> http://svn.apache.org/repos/asf/incubator/jena/Jena2/ARQ/tags/jena-arq-2.9.0-incubating/src/main/java/com/hp/hpl/jena/sparql/function/library/
> 
> You can find examples of property functions here:
> http://svn.apache.org/repos/asf/incubator/jena/Jena2/ARQ/tags/jena-arq-2.9.0-incubating/src/main/java/com/hp/hpl/jena/sparql/pfunction/library/
> http://svn.apache.org/repos/asf/incubator/jena/Jena2/LARQ/trunk/src/main/java/org/apache/jena/larq/pfunction/
> https://github.com/castagna/GeoARQ/blob/master/src/main/java/org/openjena/geoarq/pfunction/NearbyPropertyFunctionEval.java
> 
> The GeoSPARQL spec available at http://www.w3.org/2011/02/GeoSPARQL.pdf
> describes FILTER functions: ogcf:relate, ogcf:distance, ogc:buffer,
> ogcf:convexHull, ogcf:intersection, ogcf:union, ogcf:difference,
> ogcf:symDifference, ogcf:envelope and ogcf:boundary
> 

Thanks I will check out the above!

> Does GeoSPARQL requires FILTER functions only?

Good question -- I'm still reading through the spec for GeoSPARQL so not sure
as of yet.

> 
> At the moment, I have no idea how you would deal with WKT and GML.

SIS has a goal of dealing with WKT:

https://issues.apache.org/jira/browse/SIS-10

One thing in my mind related to this is the work that I am doing to 
integrate GDAL into Tika, here:

https://issues.apache.org/jira/browse/TIKA-605

If I can get that done up, then I can use it to plug into SIS and
load data into the quad tree.

> 
> Also, I have no idea if SIS provides all the necessary spatial capabilities
> necessary for a GeoSPARQL implementation.

Neither do I, but I know it has a goal of being able to support something like it
and that GeoSPARQL is a great use case. I know that SIS provides a Quad Tree
implementation that is storable to an index on disk, and that does point/radius
and bbox queries, based on data loaded from GeoRSS and is already released
in 0.1 format (and is currently being VOTED for the 0.2 release candidate). So 
it's starting to gain traction and I would love to use this as a case for making
SIS more relevant, so that's my intention.

> 
> It is still not clear to me, how a GeoSPARQL implementation would support
> a use case such as this: "John is in Cardif and he wants to find two or
> three restaurant close to where he is."?

I guess we'll have to find out :)

> 
> Does anyone know if there is any GeoSPARQL implementation elsewhere?

Nope, I'm not aware of one, but that doesn't say much since I'm new to GeoSPARQL :)

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Paolo,

On Apr 2, 2012, at 2:31 AM, Paolo Castagna wrote:

> Hi Chris
> 
> Mattmann, Chris A (388J) wrote:
>>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>>> query engine (such as ARQ)?
>> 
>> I need that too :) I just don't understand it as well (and understand the Any23/Tika
>> and SIS part better). I'll have to learn Jena it looks like though, you game to 
>> help me out?
> 
> ARQ has a couple of extension points which are useful here:
> 
> Filter Functions
> http://incubator.apache.org/jena/documentation/query/extension.html#filter-functions
> 
> Property Functions
> http://incubator.apache.org/jena/documentation/query/extension.html#property-functions
> 
> You can find examples of FILTER functions here:
> http://svn.apache.org/repos/asf/incubator/jena/Jena2/ARQ/tags/jena-arq-2.9.0-incubating/src/main/java/com/hp/hpl/jena/sparql/function/library/
> 
> You can find examples of property functions here:
> http://svn.apache.org/repos/asf/incubator/jena/Jena2/ARQ/tags/jena-arq-2.9.0-incubating/src/main/java/com/hp/hpl/jena/sparql/pfunction/library/
> http://svn.apache.org/repos/asf/incubator/jena/Jena2/LARQ/trunk/src/main/java/org/apache/jena/larq/pfunction/
> https://github.com/castagna/GeoARQ/blob/master/src/main/java/org/openjena/geoarq/pfunction/NearbyPropertyFunctionEval.java
> 
> The GeoSPARQL spec available at http://www.w3.org/2011/02/GeoSPARQL.pdf
> describes FILTER functions: ogcf:relate, ogcf:distance, ogc:buffer,
> ogcf:convexHull, ogcf:intersection, ogcf:union, ogcf:difference,
> ogcf:symDifference, ogcf:envelope and ogcf:boundary
> 

Thanks I will check out the above!

> Does GeoSPARQL requires FILTER functions only?

Good question -- I'm still reading through the spec for GeoSPARQL so not sure
as of yet.

> 
> At the moment, I have no idea how you would deal with WKT and GML.

SIS has a goal of dealing with WKT:

https://issues.apache.org/jira/browse/SIS-10

One thing in my mind related to this is the work that I am doing to 
integrate GDAL into Tika, here:

https://issues.apache.org/jira/browse/TIKA-605

If I can get that done up, then I can use it to plug into SIS and
load data into the quad tree.

> 
> Also, I have no idea if SIS provides all the necessary spatial capabilities
> necessary for a GeoSPARQL implementation.

Neither do I, but I know it has a goal of being able to support something like it
and that GeoSPARQL is a great use case. I know that SIS provides a Quad Tree
implementation that is storable to an index on disk, and that does point/radius
and bbox queries, based on data loaded from GeoRSS and is already released
in 0.1 format (and is currently being VOTED for the 0.2 release candidate). So 
it's starting to gain traction and I would love to use this as a case for making
SIS more relevant, so that's my intention.

> 
> It is still not clear to me, how a GeoSPARQL implementation would support
> a use case such as this: "John is in Cardif and he wants to find two or
> three restaurant close to where he is."?

I guess we'll have to find out :)

> 
> Does anyone know if there is any GeoSPARQL implementation elsewhere?

Nope, I'm not aware of one, but that doesn't say much since I'm new to GeoSPARQL :)

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Paolo Castagna <ca...@googlemail.com>.
Hi Chris

Mattmann, Chris A (388J) wrote:
>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>> query engine (such as ARQ)?
> 
> I need that too :) I just don't understand it as well (and understand the Any23/Tika
> and SIS part better). I'll have to learn Jena it looks like though, you game to 
> help me out?

ARQ has a couple of extension points which are useful here:

Filter Functions
http://incubator.apache.org/jena/documentation/query/extension.html#filter-functions

Property Functions
http://incubator.apache.org/jena/documentation/query/extension.html#property-functions

You can find examples of FILTER functions here:
http://svn.apache.org/repos/asf/incubator/jena/Jena2/ARQ/tags/jena-arq-2.9.0-incubating/src/main/java/com/hp/hpl/jena/sparql/function/library/

You can find examples of property functions here:
http://svn.apache.org/repos/asf/incubator/jena/Jena2/ARQ/tags/jena-arq-2.9.0-incubating/src/main/java/com/hp/hpl/jena/sparql/pfunction/library/
http://svn.apache.org/repos/asf/incubator/jena/Jena2/LARQ/trunk/src/main/java/org/apache/jena/larq/pfunction/
https://github.com/castagna/GeoARQ/blob/master/src/main/java/org/openjena/geoarq/pfunction/NearbyPropertyFunctionEval.java

The GeoSPARQL spec available at http://www.w3.org/2011/02/GeoSPARQL.pdf
describes FILTER functions: ogcf:relate, ogcf:distance, ogc:buffer,
ogcf:convexHull, ogcf:intersection, ogcf:union, ogcf:difference,
ogcf:symDifference, ogcf:envelope and ogcf:boundary

Does GeoSPARQL requires FILTER functions only?

At the moment, I have no idea how you would deal with WKT and GML.

Also, I have no idea if SIS provides all the necessary spatial capabilities
necessary for a GeoSPARQL implementation.

It is still not clear to me, how a GeoSPARQL implementation would support
a use case such as this: "John is in Cardif and he wants to find two or
three restaurant close to where he is."?

Does anyone know if there is any GeoSPARQL implementation elsewhere?

Paolo

Re: Implementing GeoSPARQL

Posted by Paolo Castagna <ca...@googlemail.com>.
Hi Chris

Mattmann, Chris A (388J) wrote:
>> In my head, SIS (which I do not know very well) is a low-level geo indexing
>> library which could be used to provide the indexing capability for a GeoSPARQL
>> implementation.
> 
> Yeah that's what I was thinking too.

Ok.

I've played with Lucene spatial capabilities for this sort of things in the
past. My knowledge of Apache SIS is very limited. In particular, it is not
clear to me how/when things are persisted on disk. My impression is that SIS
load the entire index in RAM when it starts and serializes it out at the end.
Am I right? (I hope not. :-))

If that is the case, it could be an issue for large indexes.

>> I know that ARQ (i.e. the SPARQL query engine available in Jena) can
>> provide you with a SPARQL 1.1 engine and extension points to use other
>> custom indexes (such as SIS in this case).
>>
>> What exactly do you mean with "integrating with Any23"?
>> Do you mean crawling the web and extract lat/long from web pages?
> 
> Yep that's what I was thinking -- maybe doing it in Any23, and/or Tika.

Doing a web crawl to extract locations out of web pages using Any23|Tika seems
quite an useful thing for certain use cases.

In other scenarios people might already have a large dataset with locations in
it or people might want to leverage datasets such as Geonames, Freebase,
DBPedia, Yahoo GeoPlanet, etc. so crawling in these use cases is less important.

>> Where will you store those RDF statements?
> 
> It looks like Any23 would store to Sesame -- is that the case?

Probably.

It would be nice to have pluggable RDF stores in Any23, but this is another
story: https://issues.apache.org/jira/browse/ANY23-19 :-)

>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>> query engine (such as ARQ)?
> 
> I need that too :) I just don't understand it as well (and understand the Any23/Tika
> and SIS part better). I'll have to learn Jena it looks like though, you game to 
> help me out?

A very old prototype which shows you how you can extend ARQ is here:
https://github.com/castagna/GeoARQ

It is just a prototype and it is using ARQ's property functions rather than
filter functions (and it is using Lucene spatial rather than SIS). But, it
is IMHO a good starting point to see how you could have ARQ using a custom
index to perform spatial searches.

The reason why at the time I used Lucene spatial is because that was the only
alternative (non (L)GPL) I found (I did not know about SIS at the time).

The reason why I did not implemented GeoSPARQL is simplicity, I wanted just a
proof of concept and the most important use case IMHO is searching things around
a point and returning results sorted by distance.

For GeoSPARQL (which I need to go back an read properly) do we need custom
FILTER functions or property functions (or both)?

>> IMHO geo location (as well as free text) are two SPARQL extensions which
>> are very useful in loads of use cases.
> 
> Yep I'm super excited to get this implemented. You interested in helping? I think
> we can bring together Tika, Any23, Jena and SIS here...

I am interested in learning more about SIS, I have no idea at the moment on how
much effort is necessary to implement GeoSPARQL and if that spec is going to be
implemented elsewhere by other RDF stores.

At the moment, I cannot put much effort on this. But, if something similar a la
LARQ and/or GeoARQ is useful and I can help, I'll do it.

I see two main use cases here:

 1. Crawling the web and build a dataset of statements with locations.
 2. Indexing a dataset with statements with locations and extend SPARQL to
    perform queries over it.

For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
For 2. you need SIS and a SPARQL query engine (which of course uses an RDF store).

If I were trying to implement GeoSPARQL, I would start with 2., SIS and ARQ.

My 2 cents,
Paolo

> 
> Cheers,
> Chris
> 
>>> I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
>>> just to keep them in the loop.
>>>
>>> Cheers,
>>> Chris
>>>
>>> On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:
>>>
>>>> Hi there,
>>>>
>>>> I'm investigating what it would take to implement GeoSPARQL.
>>>> There is already an Apache-licensed SPARQL engine in podling Jena.
>>>>
>>>> Of the things needed are a persistent storage layer with the right 
>>>> license.  Maybe the SIS project has something to use.
>>>>
>>>> If I understand it correctly, the qtree implementation is an in-memory 
>>>> structure, with the ability to read from a serialized form on disk, and 
>>>> to be able to write it to disk in that form.
>>>>
>>>> Is there any information on scaling for the qtree?  Memory usage?
>>>>
>>>> California_Restaurants.csv is 54K points - is that typical usage size?
>>>>
>>>> (yes ... there are other things needed as well such as conversion code 
>>>> between coodinate systems, format parsers, polygon code, ... but a start 
>>>> would be just for point data in one system :)
>>>>
>>>> An open copy of the spec is available at:
>>>>
>>>> http://www.w3.org/2011/02/GeoSPARQL.pdf
>>>>
>>>> 	Andy
>>>
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> Phone: +1 (818) 354-8810
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by Paolo Castagna <ca...@googlemail.com>.
Hi Chris

Mattmann, Chris A (388J) wrote:
>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>> query engine (such as ARQ)?
> 
> I need that too :) I just don't understand it as well (and understand the Any23/Tika
> and SIS part better). I'll have to learn Jena it looks like though, you game to 
> help me out?

ARQ has a couple of extension points which are useful here:

Filter Functions
http://incubator.apache.org/jena/documentation/query/extension.html#filter-functions

Property Functions
http://incubator.apache.org/jena/documentation/query/extension.html#property-functions

You can find examples of FILTER functions here:
http://svn.apache.org/repos/asf/incubator/jena/Jena2/ARQ/tags/jena-arq-2.9.0-incubating/src/main/java/com/hp/hpl/jena/sparql/function/library/

You can find examples of property functions here:
http://svn.apache.org/repos/asf/incubator/jena/Jena2/ARQ/tags/jena-arq-2.9.0-incubating/src/main/java/com/hp/hpl/jena/sparql/pfunction/library/
http://svn.apache.org/repos/asf/incubator/jena/Jena2/LARQ/trunk/src/main/java/org/apache/jena/larq/pfunction/
https://github.com/castagna/GeoARQ/blob/master/src/main/java/org/openjena/geoarq/pfunction/NearbyPropertyFunctionEval.java

The GeoSPARQL spec available at http://www.w3.org/2011/02/GeoSPARQL.pdf
describes FILTER functions: ogcf:relate, ogcf:distance, ogc:buffer,
ogcf:convexHull, ogcf:intersection, ogcf:union, ogcf:difference,
ogcf:symDifference, ogcf:envelope and ogcf:boundary

Does GeoSPARQL requires FILTER functions only?

At the moment, I have no idea how you would deal with WKT and GML.

Also, I have no idea if SIS provides all the necessary spatial capabilities
necessary for a GeoSPARQL implementation.

It is still not clear to me, how a GeoSPARQL implementation would support
a use case such as this: "John is in Cardif and he wants to find two or
three restaurant close to where he is."?

Does anyone know if there is any GeoSPARQL implementation elsewhere?

Paolo

Re: Implementing GeoSPARQL

Posted by Paolo Castagna <ca...@googlemail.com>.
Hi Chris

Mattmann, Chris A (388J) wrote:
>> In my head, SIS (which I do not know very well) is a low-level geo indexing
>> library which could be used to provide the indexing capability for a GeoSPARQL
>> implementation.
> 
> Yeah that's what I was thinking too.

Ok.

I've played with Lucene spatial capabilities for this sort of things in the
past. My knowledge of Apache SIS is very limited. In particular, it is not
clear to me how/when things are persisted on disk. My impression is that SIS
load the entire index in RAM when it starts and serializes it out at the end.
Am I right? (I hope not. :-))

If that is the case, it could be an issue for large indexes.

>> I know that ARQ (i.e. the SPARQL query engine available in Jena) can
>> provide you with a SPARQL 1.1 engine and extension points to use other
>> custom indexes (such as SIS in this case).
>>
>> What exactly do you mean with "integrating with Any23"?
>> Do you mean crawling the web and extract lat/long from web pages?
> 
> Yep that's what I was thinking -- maybe doing it in Any23, and/or Tika.

Doing a web crawl to extract locations out of web pages using Any23|Tika seems
quite an useful thing for certain use cases.

In other scenarios people might already have a large dataset with locations in
it or people might want to leverage datasets such as Geonames, Freebase,
DBPedia, Yahoo GeoPlanet, etc. so crawling in these use cases is less important.

>> Where will you store those RDF statements?
> 
> It looks like Any23 would store to Sesame -- is that the case?

Probably.

It would be nice to have pluggable RDF stores in Any23, but this is another
story: https://issues.apache.org/jira/browse/ANY23-19 :-)

>> How can you implement the GeoSPARQL spec without (re)using a SPARQL
>> query engine (such as ARQ)?
> 
> I need that too :) I just don't understand it as well (and understand the Any23/Tika
> and SIS part better). I'll have to learn Jena it looks like though, you game to 
> help me out?

A very old prototype which shows you how you can extend ARQ is here:
https://github.com/castagna/GeoARQ

It is just a prototype and it is using ARQ's property functions rather than
filter functions (and it is using Lucene spatial rather than SIS). But, it
is IMHO a good starting point to see how you could have ARQ using a custom
index to perform spatial searches.

The reason why at the time I used Lucene spatial is because that was the only
alternative (non (L)GPL) I found (I did not know about SIS at the time).

The reason why I did not implemented GeoSPARQL is simplicity, I wanted just a
proof of concept and the most important use case IMHO is searching things around
a point and returning results sorted by distance.

For GeoSPARQL (which I need to go back an read properly) do we need custom
FILTER functions or property functions (or both)?

>> IMHO geo location (as well as free text) are two SPARQL extensions which
>> are very useful in loads of use cases.
> 
> Yep I'm super excited to get this implemented. You interested in helping? I think
> we can bring together Tika, Any23, Jena and SIS here...

I am interested in learning more about SIS, I have no idea at the moment on how
much effort is necessary to implement GeoSPARQL and if that spec is going to be
implemented elsewhere by other RDF stores.

At the moment, I cannot put much effort on this. But, if something similar a la
LARQ and/or GeoARQ is useful and I can help, I'll do it.

I see two main use cases here:

 1. Crawling the web and build a dataset of statements with locations.
 2. Indexing a dataset with statements with locations and extend SPARQL to
    perform queries over it.

For 1. you need Any23|Tika (and a crawler) and, eventually, an RDF store.
For 2. you need SIS and a SPARQL query engine (which of course uses an RDF store).

If I were trying to implement GeoSPARQL, I would start with 2., SIS and ARQ.

My 2 cents,
Paolo

> 
> Cheers,
> Chris
> 
>>> I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
>>> just to keep them in the loop.
>>>
>>> Cheers,
>>> Chris
>>>
>>> On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:
>>>
>>>> Hi there,
>>>>
>>>> I'm investigating what it would take to implement GeoSPARQL.
>>>> There is already an Apache-licensed SPARQL engine in podling Jena.
>>>>
>>>> Of the things needed are a persistent storage layer with the right 
>>>> license.  Maybe the SIS project has something to use.
>>>>
>>>> If I understand it correctly, the qtree implementation is an in-memory 
>>>> structure, with the ability to read from a serialized form on disk, and 
>>>> to be able to write it to disk in that form.
>>>>
>>>> Is there any information on scaling for the qtree?  Memory usage?
>>>>
>>>> California_Restaurants.csv is 54K points - is that typical usage size?
>>>>
>>>> (yes ... there are other things needed as well such as conversion code 
>>>> between coodinate systems, format parsers, polygon code, ... but a start 
>>>> would be just for point data in one system :)
>>>>
>>>> An open copy of the spec is available at:
>>>>
>>>> http://www.w3.org/2011/02/GeoSPARQL.pdf
>>>>
>>>> 	Andy
>>>
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> Phone: +1 (818) 354-8810
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Paolo,

On Apr 1, 2012, at 1:12 PM, Paolo Castagna wrote:

> Hi Chris
> thanks for letting us know.

Anytime!

> 
> Mattmann, Chris A (388J) wrote:
>> Hi Andy,
>> 
>> Just to follow up on this, I went ahead and filed this issue:
>> 
>> https://issues.apache.org/jira/browse/SIS-42
>> 
>> I'm going to start work on implementing the Geo SPARQL spec in SIS, maybe
>> even integrating with Any23 (and Jena down the road, but Any23, I'm a tad
>> more familiar with at the moment). 
> 
> I am a bit confused here.
> 
> In my head, SIS (which I do not know very well) is a low-level geo indexing
> library which could be used to provide the indexing capability for a GeoSPARQL
> implementation.

Yeah that's what I was thinking too.

> 
> I know that ARQ (i.e. the SPARQL query engine available in Jena) can
> provide you with a SPARQL 1.1 engine and extension points to use other
> custom indexes (such as SIS in this case).
> 
> What exactly do you mean with "integrating with Any23"?
> Do you mean crawling the web and extract lat/long from web pages?

Yep that's what I was thinking -- maybe doing it in Any23, and/or Tika.

> Where will you store those RDF statements?

It looks like Any23 would store to Sesame -- is that the case?

> 
> How can you implement the GeoSPARQL spec without (re)using a SPARQL
> query engine (such as ARQ)?

I need that too :) I just don't understand it as well (and understand the Any23/Tika
and SIS part better). I'll have to learn Jena it looks like though, you game to 
help me out?

> 
> IMHO geo location (as well as free text) are two SPARQL extensions which
> are very useful in loads of use cases.

Yep I'm super excited to get this implemented. You interested in helping? I think
we can bring together Tika, Any23, Jena and SIS here...

Cheers,
Chris

> 
>> I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
>> just to keep them in the loop.
>> 
>> Cheers,
>> Chris
>> 
>> On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:
>> 
>>> Hi there,
>>> 
>>> I'm investigating what it would take to implement GeoSPARQL.
>>> There is already an Apache-licensed SPARQL engine in podling Jena.
>>> 
>>> Of the things needed are a persistent storage layer with the right 
>>> license.  Maybe the SIS project has something to use.
>>> 
>>> If I understand it correctly, the qtree implementation is an in-memory 
>>> structure, with the ability to read from a serialized form on disk, and 
>>> to be able to write it to disk in that form.
>>> 
>>> Is there any information on scaling for the qtree?  Memory usage?
>>> 
>>> California_Restaurants.csv is 54K points - is that typical usage size?
>>> 
>>> (yes ... there are other things needed as well such as conversion code 
>>> between coodinate systems, format parsers, polygon code, ... but a start 
>>> would be just for point data in one system :)
>>> 
>>> An open copy of the spec is available at:
>>> 
>>> http://www.w3.org/2011/02/GeoSPARQL.pdf
>>> 
>>> 	Andy
>> 
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
Phone: +1 (818) 354-8810
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Paolo,

On Apr 1, 2012, at 1:12 PM, Paolo Castagna wrote:

> Hi Chris
> thanks for letting us know.

Anytime!

> 
> Mattmann, Chris A (388J) wrote:
>> Hi Andy,
>> 
>> Just to follow up on this, I went ahead and filed this issue:
>> 
>> https://issues.apache.org/jira/browse/SIS-42
>> 
>> I'm going to start work on implementing the Geo SPARQL spec in SIS, maybe
>> even integrating with Any23 (and Jena down the road, but Any23, I'm a tad
>> more familiar with at the moment). 
> 
> I am a bit confused here.
> 
> In my head, SIS (which I do not know very well) is a low-level geo indexing
> library which could be used to provide the indexing capability for a GeoSPARQL
> implementation.

Yeah that's what I was thinking too.

> 
> I know that ARQ (i.e. the SPARQL query engine available in Jena) can
> provide you with a SPARQL 1.1 engine and extension points to use other
> custom indexes (such as SIS in this case).
> 
> What exactly do you mean with "integrating with Any23"?
> Do you mean crawling the web and extract lat/long from web pages?

Yep that's what I was thinking -- maybe doing it in Any23, and/or Tika.

> Where will you store those RDF statements?

It looks like Any23 would store to Sesame -- is that the case?

> 
> How can you implement the GeoSPARQL spec without (re)using a SPARQL
> query engine (such as ARQ)?

I need that too :) I just don't understand it as well (and understand the Any23/Tika
and SIS part better). I'll have to learn Jena it looks like though, you game to 
help me out?

> 
> IMHO geo location (as well as free text) are two SPARQL extensions which
> are very useful in loads of use cases.

Yep I'm super excited to get this implemented. You interested in helping? I think
we can bring together Tika, Any23, Jena and SIS here...

Cheers,
Chris

> 
>> I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
>> just to keep them in the loop.
>> 
>> Cheers,
>> Chris
>> 
>> On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:
>> 
>>> Hi there,
>>> 
>>> I'm investigating what it would take to implement GeoSPARQL.
>>> There is already an Apache-licensed SPARQL engine in podling Jena.
>>> 
>>> Of the things needed are a persistent storage layer with the right 
>>> license.  Maybe the SIS project has something to use.
>>> 
>>> If I understand it correctly, the qtree implementation is an in-memory 
>>> structure, with the ability to read from a serialized form on disk, and 
>>> to be able to write it to disk in that form.
>>> 
>>> Is there any information on scaling for the qtree?  Memory usage?
>>> 
>>> California_Restaurants.csv is 54K points - is that typical usage size?
>>> 
>>> (yes ... there are other things needed as well such as conversion code 
>>> between coodinate systems, format parsers, polygon code, ... but a start 
>>> would be just for point data in one system :)
>>> 
>>> An open copy of the spec is available at:
>>> 
>>> http://www.w3.org/2011/02/GeoSPARQL.pdf
>>> 
>>> 	Andy
>> 
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
Phone: +1 (818) 354-8810
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Paolo,

On Apr 1, 2012, at 1:12 PM, Paolo Castagna wrote:

> Hi Chris
> thanks for letting us know.

Anytime!

> 
> Mattmann, Chris A (388J) wrote:
>> Hi Andy,
>> 
>> Just to follow up on this, I went ahead and filed this issue:
>> 
>> https://issues.apache.org/jira/browse/SIS-42
>> 
>> I'm going to start work on implementing the Geo SPARQL spec in SIS, maybe
>> even integrating with Any23 (and Jena down the road, but Any23, I'm a tad
>> more familiar with at the moment). 
> 
> I am a bit confused here.
> 
> In my head, SIS (which I do not know very well) is a low-level geo indexing
> library which could be used to provide the indexing capability for a GeoSPARQL
> implementation.

Yeah that's what I was thinking too.

> 
> I know that ARQ (i.e. the SPARQL query engine available in Jena) can
> provide you with a SPARQL 1.1 engine and extension points to use other
> custom indexes (such as SIS in this case).
> 
> What exactly do you mean with "integrating with Any23"?
> Do you mean crawling the web and extract lat/long from web pages?

Yep that's what I was thinking -- maybe doing it in Any23, and/or Tika.

> Where will you store those RDF statements?

It looks like Any23 would store to Sesame -- is that the case?

> 
> How can you implement the GeoSPARQL spec without (re)using a SPARQL
> query engine (such as ARQ)?

I need that too :) I just don't understand it as well (and understand the Any23/Tika
and SIS part better). I'll have to learn Jena it looks like though, you game to 
help me out?

> 
> IMHO geo location (as well as free text) are two SPARQL extensions which
> are very useful in loads of use cases.

Yep I'm super excited to get this implemented. You interested in helping? I think
we can bring together Tika, Any23, Jena and SIS here...

Cheers,
Chris

> 
>> I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
>> just to keep them in the loop.
>> 
>> Cheers,
>> Chris
>> 
>> On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:
>> 
>>> Hi there,
>>> 
>>> I'm investigating what it would take to implement GeoSPARQL.
>>> There is already an Apache-licensed SPARQL engine in podling Jena.
>>> 
>>> Of the things needed are a persistent storage layer with the right 
>>> license.  Maybe the SIS project has something to use.
>>> 
>>> If I understand it correctly, the qtree implementation is an in-memory 
>>> structure, with the ability to read from a serialized form on disk, and 
>>> to be able to write it to disk in that form.
>>> 
>>> Is there any information on scaling for the qtree?  Memory usage?
>>> 
>>> California_Restaurants.csv is 54K points - is that typical usage size?
>>> 
>>> (yes ... there are other things needed as well such as conversion code 
>>> between coodinate systems, format parsers, polygon code, ... but a start 
>>> would be just for point data in one system :)
>>> 
>>> An open copy of the spec is available at:
>>> 
>>> http://www.w3.org/2011/02/GeoSPARQL.pdf
>>> 
>>> 	Andy
>> 
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
Phone: +1 (818) 354-8810
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: Implementing GeoSPARQL

Posted by Paolo Castagna <ca...@googlemail.com>.
Hi Chris
thanks for letting us know.

Mattmann, Chris A (388J) wrote:
> Hi Andy,
> 
> Just to follow up on this, I went ahead and filed this issue:
> 
> https://issues.apache.org/jira/browse/SIS-42
> 
> I'm going to start work on implementing the Geo SPARQL spec in SIS, maybe
> even integrating with Any23 (and Jena down the road, but Any23, I'm a tad
> more familiar with at the moment). 

I am a bit confused here.

In my head, SIS (which I do not know very well) is a low-level geo indexing
library which could be used to provide the indexing capability for a GeoSPARQL
implementation.

I know that ARQ (i.e. the SPARQL query engine available in Jena) can
provide you with a SPARQL 1.1 engine and extension points to use other
custom indexes (such as SIS in this case).

What exactly do you mean with "integrating with Any23"?
Do you mean crawling the web and extract lat/long from web pages?
Where will you store those RDF statements?

How can you implement the GeoSPARQL spec without (re)using a SPARQL
query engine (such as ARQ)?

IMHO geo location (as well as free text) are two SPARQL extensions which
are very useful in loads of use cases.

Cheers,
Paolo

> I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
> just to keep them in the loop.
> 
> Cheers,
> Chris
> 
> On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:
> 
>> Hi there,
>>
>> I'm investigating what it would take to implement GeoSPARQL.
>> There is already an Apache-licensed SPARQL engine in podling Jena.
>>
>> Of the things needed are a persistent storage layer with the right 
>> license.  Maybe the SIS project has something to use.
>>
>> If I understand it correctly, the qtree implementation is an in-memory 
>> structure, with the ability to read from a serialized form on disk, and 
>> to be able to write it to disk in that form.
>>
>> Is there any information on scaling for the qtree?  Memory usage?
>>
>> California_Restaurants.csv is 54K points - is that typical usage size?
>>
>> (yes ... there are other things needed as well such as conversion code 
>> between coodinate systems, format parsers, polygon code, ... but a start 
>> would be just for point data in one system :)
>>
>> An open copy of the spec is available at:
>>
>> http://www.w3.org/2011/02/GeoSPARQL.pdf
>>
>> 	Andy
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by Paolo Castagna <ca...@googlemail.com>.
Hi Chris
thanks for letting us know.

Mattmann, Chris A (388J) wrote:
> Hi Andy,
> 
> Just to follow up on this, I went ahead and filed this issue:
> 
> https://issues.apache.org/jira/browse/SIS-42
> 
> I'm going to start work on implementing the Geo SPARQL spec in SIS, maybe
> even integrating with Any23 (and Jena down the road, but Any23, I'm a tad
> more familiar with at the moment). 

I am a bit confused here.

In my head, SIS (which I do not know very well) is a low-level geo indexing
library which could be used to provide the indexing capability for a GeoSPARQL
implementation.

I know that ARQ (i.e. the SPARQL query engine available in Jena) can
provide you with a SPARQL 1.1 engine and extension points to use other
custom indexes (such as SIS in this case).

What exactly do you mean with "integrating with Any23"?
Do you mean crawling the web and extract lat/long from web pages?
Where will you store those RDF statements?

How can you implement the GeoSPARQL spec without (re)using a SPARQL
query engine (such as ARQ)?

IMHO geo location (as well as free text) are two SPARQL extensions which
are very useful in loads of use cases.

Cheers,
Paolo

> I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
> just to keep them in the loop.
> 
> Cheers,
> Chris
> 
> On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:
> 
>> Hi there,
>>
>> I'm investigating what it would take to implement GeoSPARQL.
>> There is already an Apache-licensed SPARQL engine in podling Jena.
>>
>> Of the things needed are a persistent storage layer with the right 
>> license.  Maybe the SIS project has something to use.
>>
>> If I understand it correctly, the qtree implementation is an in-memory 
>> structure, with the ability to read from a serialized form on disk, and 
>> to be able to write it to disk in that form.
>>
>> Is there any information on scaling for the qtree?  Memory usage?
>>
>> California_Restaurants.csv is 54K points - is that typical usage size?
>>
>> (yes ... there are other things needed as well such as conversion code 
>> between coodinate systems, format parsers, polygon code, ... but a start 
>> would be just for point data in one system :)
>>
>> An open copy of the spec is available at:
>>
>> http://www.w3.org/2011/02/GeoSPARQL.pdf
>>
>> 	Andy
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 


Re: Implementing GeoSPARQL

Posted by Paolo Castagna <ca...@googlemail.com>.
Hi Chris
thanks for letting us know.

Mattmann, Chris A (388J) wrote:
> Hi Andy,
> 
> Just to follow up on this, I went ahead and filed this issue:
> 
> https://issues.apache.org/jira/browse/SIS-42
> 
> I'm going to start work on implementing the Geo SPARQL spec in SIS, maybe
> even integrating with Any23 (and Jena down the road, but Any23, I'm a tad
> more familiar with at the moment). 

I am a bit confused here.

In my head, SIS (which I do not know very well) is a low-level geo indexing
library which could be used to provide the indexing capability for a GeoSPARQL
implementation.

I know that ARQ (i.e. the SPARQL query engine available in Jena) can
provide you with a SPARQL 1.1 engine and extension points to use other
custom indexes (such as SIS in this case).

What exactly do you mean with "integrating with Any23"?
Do you mean crawling the web and extract lat/long from web pages?
Where will you store those RDF statements?

How can you implement the GeoSPARQL spec without (re)using a SPARQL
query engine (such as ARQ)?

IMHO geo location (as well as free text) are two SPARQL extensions which
are very useful in loads of use cases.

Cheers,
Paolo

> I'm CC'ing the any23-dev and jena-dev user lists (apologies for the SPAM guys)
> just to keep them in the loop.
> 
> Cheers,
> Chris
> 
> On Jan 31, 2012, at 4:59 AM, Andy Seaborne wrote:
> 
>> Hi there,
>>
>> I'm investigating what it would take to implement GeoSPARQL.
>> There is already an Apache-licensed SPARQL engine in podling Jena.
>>
>> Of the things needed are a persistent storage layer with the right 
>> license.  Maybe the SIS project has something to use.
>>
>> If I understand it correctly, the qtree implementation is an in-memory 
>> structure, with the ability to read from a serialized form on disk, and 
>> to be able to write it to disk in that form.
>>
>> Is there any information on scaling for the qtree?  Memory usage?
>>
>> California_Restaurants.csv is 54K points - is that typical usage size?
>>
>> (yes ... there are other things needed as well such as conversion code 
>> between coodinate systems, format parsers, polygon code, ... but a start 
>> would be just for point data in one system :)
>>
>> An open copy of the spec is available at:
>>
>> http://www.w3.org/2011/02/GeoSPARQL.pdf
>>
>> 	Andy
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>