Posted to dev@s2graph.apache.org by "DOYUNG YOON (JIRA)" <ji...@apache.org> on 2018/04/02 01:44:00 UTC

[jira] [Comment Edited] (S2GRAPH-197) Provide S2graphSink for non-streaming dataset

    [ https://issues.apache.org/jira/browse/S2GRAPH-197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421871#comment-16421871 ] 

DOYUNG YOON edited comment on S2GRAPH-197 at 4/2/18 1:44 AM:
-------------------------------------------------------------

[~chul], quick question about this issue.

As far as I understand, the current implementation of the S2GraphSink class does not support the writeBatch method; it simply throws a not-supported exception.

Are you suggesting that we change the writeBatch method, which currently just throws an exception?

My question is which approach is better for the writeBatch method: the HBase client API or LoadIncrementalHFiles.

If we are going to use LoadIncrementalHFiles, then I think it is possible to merge s2jobs.loader.GraphFileGenerator into the writeBatch method of S2GraphSink, which I can contribute.


was (Author: steamshon):
[~chul], Quick question about this issue. 

As far as I understand, current implementation of S2GraphSink class do not support writeBatch method, it just simply throw not supported exception.

Are you suggest to change writeBatch method, which just throw exception currently?

My question is which approach is better between using HBase client API and LoadIncrementalHFiles for writeBatch method.

If we are going to use LoadIncrementalHFiles, then I think it is possible to merge s2jobs.loader.GraphFileGenerator into writeBatch method of S2GraphSink, which I can contribute. 

> Provide S2graphSink for non-streaming dataset
> ---------------------------------------------
>
>                 Key: S2GRAPH-197
>                 URL: https://issues.apache.org/jira/browse/S2GRAPH-197
>             Project: S2Graph
>          Issue Type: Sub-task
>          Components: s2jobs
>            Reporter: Chul Kang
>            Assignee: Chul Kang
>            Priority: Major
>
> Currently, S2graphSink supports the sink operation only for Spark Structured Streaming, that is, only for StreamingQuery.
> If we provide the same operation for DataFrameWriter in S2graphSink, we could also use it in batch mode.


