You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@s2graph.apache.org by "DOYUNG YOON (JIRA)" <ji...@apache.org> on 2018/04/02 01:44:00 UTC

[jira] [Commented] (S2GRAPH-197) Provide S2graphSink for non-streaming dataset

    [ https://issues.apache.org/jira/browse/S2GRAPH-197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421871#comment-16421871 ] 

DOYUNG YOON commented on S2GRAPH-197:
-------------------------------------

[~chul], Quick question about this issue. 

As far as I understand, current implementation of S2GraphSink class do not support writeBatch method, it just simply throw not supported exception.

Are you suggest to change writeBatch method, which just throw exception currently?

My question is which approach is better between using HBase client API and LoadIncrementalHFiles for writeBatch method.

If we are going to use LoadIncrementalHFiles, then I think it is possible to merge s2jobs.loader.GraphFileGenerator into writeBatch method of S2GraphSink, which I can contribute. 

> Provide S2graphSink for non-streaming dataset
> ---------------------------------------------
>
>                 Key: S2GRAPH-197
>                 URL: https://issues.apache.org/jira/browse/S2GRAPH-197
>             Project: S2Graph
>          Issue Type: Sub-task
>          Components: s2jobs
>            Reporter: Chul Kang
>            Assignee: Chul Kang
>            Priority: Major
>
> Currently, S2graphSink supports sink operation for spark structured streaming that is only for StreamingQuery.
> If we provide the same operation for the DataframeWriter in S2graphSink, we could use it in batch mode.
>  
>  
>  
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)