You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@rya.apache.org by Jorge Machado <jo...@me.com> on 2018/04/20 08:18:23 UTC

A way to scan data for Partitions

Hi Guys, 

is there any way to get the data partitions from Rya ? 

In my case if I have these data:

<urn:fdc:junion.com:19980923:mySubject> <urn:fdc:junion.com:country> “DE” .
<urn:fdc:junion.com:19980923:mySubject> <urn:fdc:junion.com:country> “PT” .
<urn:fdc:junion.com:19980923:mySubject> <urn:fdc:junion.com:country> “GB" .
<urn:fdc:junion.com:19980923:mySubject> <urn:fdc:junion.com:country> “US” .


I would like to read data im parallel from Accumulo using Rya but for that I need to create “groups/partitions”
Let’s say  that I have 4 servers that hit Accumulo. I need to have some way to split my query against all nodes Something like spark datasources.

The api from Accumulo has for example an groupByTablet on the Locations object or getTabletLocation
Any ideas ?
Jorge