Posted to issues@livy.apache.org by "Akshat Bordia (Jira)" <ji...@apache.org> on 2021/09/08 04:41:00 UTC
[jira] [Created] (LIVY-866) Optimize Yarn GetApplications Query to
prevent additional load on Yarn and Livy
Akshat Bordia created LIVY-866:
----------------------------------
Summary: Optimize Yarn GetApplications Query to prevent additional load on Yarn and Livy
Key: LIVY-866
URL: https://issues.apache.org/jira/browse/LIVY-866
Project: Livy
Issue Type: Improvement
Components: Server
Reporter: Akshat Bordia
Currently, Livy queries Yarn applications by applicationType: Spark. This puts a heavy load on Yarn clusters that hold thousands or more Spark applications across all states (running, finished, failed, queued, etc.).
A better approach would be to query applications by tag in addition to application type, since Livy only needs to track applications that carry certain application tags. However, YarnClient does not expose any API to query applications by tag.
We can extend YarnClientImpl, which gives us access to its protected rmClient field, and implement a getApplications method that builds a GetApplicationsRequest including the tags.
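A minimal sketch of that idea follows. It is an illustration under stated assumptions, not the change proposed for Livy: the class name TagAwareYarnClient, the method names buildRequest and getApplicationsByTags, and the example tag value are all hypothetical. It relies only on the protected rmClient field of YarnClientImpl and on the setApplicationTypes and setApplicationTags setters of GetApplicationsRequest, and it assumes the Hadoop YARN client libraries are on the classpath.

```java
import java.io.IOException;
import java.util.List;
import java.util.Set;

import org.apache.hadoop.yarn.api.protocolrecords.GetApplicationsRequest;
import org.apache.hadoop.yarn.api.protocolrecords.GetApplicationsResponse;
import org.apache.hadoop.yarn.api.records.ApplicationReport;
import org.apache.hadoop.yarn.client.api.impl.YarnClientImpl;
import org.apache.hadoop.yarn.exceptions.YarnException;

// Hypothetical class name; it is not part of Livy or Hadoop.
public class TagAwareYarnClient extends YarnClientImpl {

    // Builds a request that the ResourceManager filters by both
    // application type and application tags.
    public static GetApplicationsRequest buildRequest(Set<String> types,
                                                      Set<String> tags) {
        GetApplicationsRequest request = GetApplicationsRequest.newInstance();
        request.setApplicationTypes(types);
        request.setApplicationTags(tags);
        return request;
    }

    // Hypothetical method name, chosen so it does not collide with any
    // getApplications overload that YarnClient already declares.
    public List<ApplicationReport> getApplicationsByTags(Set<String> types,
                                                         Set<String> tags)
            throws YarnException, IOException {
        // rmClient is the protected ApplicationClientProtocol handle that
        // YarnClientImpl sets up when the service is started.
        GetApplicationsResponse response =
                rmClient.getApplications(buildRequest(types, tags));
        return response.getApplicationList();
    }
}
```

As with any YarnClient, the instance would need init(conf) and start() before the first call. A caller could then pass something like Collections.singleton("SPARK") for the types and a set of the tags Livy assigned to its own sessions (for example a made-up value such as "livy-session-1"), so that the filtering happens in the ResourceManager and only the matching reports are returned.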
--
This message was sent by Atlassian Jira
(v8.3.4#803005)