You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "Matt Burgess (Jira)" <ji...@apache.org> on 2021/10/26 22:29:00 UTC
[jira] [Commented] (NIFI-6175) Spark Livy - Improving Livy
[ https://issues.apache.org/jira/browse/NIFI-6175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17434603#comment-17434603 ]
Matt Burgess commented on NIFI-6175:
------------------------------------
As Peter has moved to a company that doesn't use NiFi or Livy, I will pick this one up and hopefully get it across the finish line.
> Spark Livy - Improving Livy
> ---------------------------
>
> Key: NIFI-6175
> URL: https://issues.apache.org/jira/browse/NIFI-6175
> Project: Apache NiFi
> Issue Type: Improvement
> Reporter: Peter Wicks
> Assignee: Matt Burgess
> Priority: Major
> Time Spent: 5h 50m
> Remaining Estimate: 0h
>
> The Livy Session Controller is missing many of the options available, and many of them I feel are critical for this service to be useful (queue? conf? num of executors?)
> * Would like to see all available options there, with a blanket "conf" option for users to provide custom configuration.
> * When the controller service shuts down, sessions are left running, with no option to shut them down. Add in functionality to shutdown open sessions.
> * If the controller service finds no Idle Livy Sessions, it will create a new session... until the queue runs out of resources :). Need to have a Min/Max/Should be elastic or strict option
> * When Livy starts up, it searches for existing sessions, but does not verify that those sessions belong to it.
> ** The Kerberos identity should be used to verify the identity on the session matches the identity on the controller service.
> ** Also, if a Proxy user has been specified, that should also be verified. If no proxy user was specified, then the Proxy user on the Livy session should match the Kerberos identity.
> * The initialization of the SSL Context is not implemented in a thread safe way. This leads to exceptions when multiple threads are running against the same Controller Service.
> ** SSL Context init should be made thread safe.
> * There is a bug in Livy that causes running sessions to be killed if they run longer than the timeout value: https://issues.apache.org/jira/browse/LIVY-547.
> ** The processor should support the work around described in the discussion, by pinging the session to record activity on sessions to keep them alive. [https://github.com/apache/incubator-livy/pull/138#issuecomment-455352091]
> Livy should also support Batch mode.
> * Include a controller service to re-use configs, but controller service is basically just a config holder
> * Processor named `ExecuteSparkBatch`. This is harder than Session because Batch mode only supports code submission through a file path. So users will need to upload to HDFS first.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)