You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tinkerpop.apache.org by "stephen mallette (JIRA)" <ji...@apache.org> on 2019/03/11 15:51:00 UTC

[jira] [Commented] (TINKERPOP-2177) Streaming response immediately after authentication stops after first partial response

    [ https://issues.apache.org/jira/browse/TINKERPOP-2177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16789719#comment-16789719 ] 

stephen mallette commented on TINKERPOP-2177:
---------------------------------------------

Thanks for doing the analysis on this and posting a fix. We are preparing for release of 3.3.6/3.4.1 right now - we will be preparing artifacts next week. This week we are just reviewing/testing as well as merging final pull requests. 

Do you want to quickly submit a pull request to correct this issue?

> Streaming response immediately after authentication stops after first partial response
> --------------------------------------------------------------------------------------
>
>                 Key: TINKERPOP-2177
>                 URL: https://issues.apache.org/jira/browse/TINKERPOP-2177
>             Project: TinkerPop
>          Issue Type: Bug
>          Components: python
>    Affects Versions: 3.4.0
>         Environment: Independent of the environment. We could repro the issue both on Windows and Linux. 
>            Reporter: Jayanta Mondal
>            Priority: Critical
>         Attachments: gremlinpythonbug.png
>
>   Original Estimate: 3h
>  Remaining Estimate: 3h
>
> +*Repro step*+: 
> Use gremlinpython 3.4.0 to issue 'g.V()' (or any gremlin query) on a sufficiently large graph, so that the response is large enough that the server needs to send multiple partial responses (status code: 206). 
> The 'g.v()' needs to be the first query immediately after establishing the connection. 
> +*Behavior:*+ 
> The query only returns the first partial response from the gremlin server to the user.  Even though the gremlinpython client ends up reading all the data from the gremlin server. 
>  
> *+Why Critical:+* 
>  # This is a correctness issue from the end-user point of view.
>  # The client is only getting partial data while paying the cost of running the entire query on the gremlin server.
>  
> +*Diagnosis:*+ 
> There was a recent change in gremlinpython to stop using a recursive call to read from a steaming response. The change made the caller (def _receive(self): in connection.py) use a while loop to read the streaming response.
>  
> While this change is fine, on the first request to a WebSocket connection, after the authentication is done, we are still making a recursive call to read data from the response stream. Ideally, after the authentication is done, we should return control to the def _receive(self) method so that it can do the read. 
> Such mixing of recursive call and while loop is causing the following behavior: 
> 1. If the first response immediately after the authentication is a streaming response, we are only reading the first chunk of the result. 
>  
>  
> [https://github.com/apache/tinkerpop/commit/4c8717dd4e94ec248a959911d8c11f3b45b2d7b3#diff-c3daf7e789724ff22bcba3d02f3dc35a]
> +*Fix:*+
>  Add a '{color:#205081}return{color}' before the recursive call to read data after auth (see attachment below)
>        --> This means that the caller now can use the while loop read the streaming response.
>        --> Otherwise the caller gets back a status code 'None' and thinks that it is not a partial response. 
>  
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)