You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by ep...@apache.org on 2023/01/11 21:26:02 UTC

[airflow] 21/27: Minor update to Scheduler documentation (#28620)

This is an automated email from the ASF dual-hosted git repository.

ephraimanierobi pushed a commit to branch v2-5-test
in repository https://gitbox.apache.org/repos/asf/airflow.git

commit e39e07710bb7b45dc7833ae4bed2dd175f1d02ed
Author: Max Ho <ma...@alphalab.capital>
AuthorDate: Wed Dec 28 23:45:22 2022 +0800

    Minor update to Scheduler documentation (#28620)
    
    (cherry picked from commit 48a051acc40469ce43e28353fffd265e5326926a)
---
 docs/apache-airflow/concepts/scheduler.rst | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/docs/apache-airflow/concepts/scheduler.rst b/docs/apache-airflow/concepts/scheduler.rst
index e6b4f14575..d9ba22be82 100644
--- a/docs/apache-airflow/concepts/scheduler.rst
+++ b/docs/apache-airflow/concepts/scheduler.rst
@@ -239,7 +239,7 @@ There are several areas of resource usage that you should pay attention to:
   but if your problems with performance come from distributed filesystem performance, they might be the
   best approach to follow.
 * Database connections and Database usage might become a problem as you want to increase performance and
-  process more things in parallel. Airflow is known from being "database-connection hungry" - the more DAGs
+  process more things in parallel. Airflow is known for being "database-connection hungry" - the more DAGs
   you have and the more you want to process in parallel, the more database connections will be opened.
   This is generally not a problem for MySQL as its model of handling connections is thread-based, but this
   might be a problem for Postgres, where connection handling is process-based. It is a general consensus
@@ -257,8 +257,8 @@ There are several areas of resource usage that you should pay attention to:
   usage. If you have more CPUs available, you can increase number of processing threads
   :ref:`config:scheduler__parsing_processes`, Also Airflow Scheduler scales almost linearly with
   several instances, so you can also add more Schedulers if your Scheduler's performance is CPU-bound.
-* Airflow might use quite significant amount of memory when you try to get more performance out of it.
-  Often more performance is achieved in Airflow by increasing number of processes handling the load,
+* Airflow might use quite a significant amount of memory when you try to get more performance out of it.
+  Often more performance is achieved in Airflow by increasing the number of processes handling the load,
   and each process requires whole interpreter of Python loaded, a lot of classes imported, temporary
   in-memory storage. A lot of it is optimized by Airflow by using forking and copy-on-write memory used
   but in case new classes are imported after forking this can lead to extra memory pressure.