Posted to commits@airflow.apache.org by GitBox <gi...@apache.org> on 2022/07/22 21:03:49 UTC

[GitHub] [airflow] josh-fell opened a new issue, #25243: Mapped `BatchOperator` tasks causes SerializationError due to `operator_extra_links` property object

josh-fell opened a new issue, #25243:
URL: https://github.com/apache/airflow/issues/25243

   ### Apache Airflow Provider(s)
   
   amazon
   
   ### Versions of Apache Airflow Providers
   
   apache-airflow-providers-amazon==4.1.0
   
   ### Apache Airflow version
   
   main (development)
   
   ### Operating System
   
   Debian GNU/Linux 11 (bullseye)
   
   ### Deployment
   
   Other Docker-based deployment
   
   ### Deployment details
   
   Using Breeze on `main` branch.
   
   ### What happened
   
   Attempting to create dynamically-mapped tasks using the `BatchOperator` fails with the following DAG import error:
   
   ```
   Broken DAG: [/files/dags/batchop_dtm.py] Traceback (most recent call last):
     File "/opt/airflow/airflow/serialization/serialized_objects.py", line 693, in _serialize_node
       op.operator_extra_links
     File "/opt/airflow/airflow/serialization/serialized_objects.py", line 999, in _serialize_operator_extra_links
       for operator_extra_link in operator_extra_links:
   TypeError: 'property' object is not iterable
   
   During handling of the above exception, another exception occurred:
   
   Traceback (most recent call last):
     File "/opt/airflow/airflow/serialization/serialized_objects.py", line 1175, in to_dict
       json_dict = {"__version": cls.SERIALIZER_VERSION, "dag": cls.serialize_dag(var)}
     File "/opt/airflow/airflow/serialization/serialized_objects.py", line 1083, in serialize_dag
       raise SerializationError(f'Failed to serialize DAG {dag.dag_id!r}: {e}')
   airflow.exceptions.SerializationError: Failed to serialize DAG 'batchop_dtm': 'property' object is not iterable
   ```
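
The `TypeError` in the traceback can be reproduced without Airflow. The sketch below assumes that, for a mapped task, the serializer reads `operator_extra_links` from the operator class rather than from an instance (a reading of the traceback, not confirmed against the Airflow source). `FakeOperator` is a hypothetical stand-in, not an Airflow class.

```python
class FakeOperator:
    """Hypothetical stand-in for an operator; not Airflow code."""

    def __init__(self, links):
        self._links = links

    @property
    def operator_extra_links(self):
        # Computed per instance, so a real value exists only on an instance.
        return tuple(self._links)


# On an instance, the property evaluates to an iterable tuple.
instance_links = FakeOperator(["link_a"]).operator_extra_links

# On the class, the same attribute is the property descriptor itself.
class_attr = FakeOperator.operator_extra_links

try:
    for _ in class_attr:
        pass
    error_message = None
except TypeError as exc:
    error_message = str(exc)

print(instance_links)             # ('link_a',)
print(type(class_attr).__name__)  # property
print(error_message)              # 'property' object is not iterable
```

If that assumption holds, any operator that defines `operator_extra_links` as a property rather than a class-level collection would fail the same way when mapped.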
   
   ### What you think should happen instead
   
   Users should be able to use Dynamic Task Mapping to generate `BatchOperator` tasks without a DAG import/serialization error.
   
   ### How to reproduce
   
   1. Create a DAG similar to the following in which `BatchOperator` tasks are dynamically-mapped. Note this is a "toy" example, but it should be applicable to more "real-world" use cases.
   
   ```python
   from pendulum import datetime
   
   from airflow.decorators import dag
   from airflow.providers.amazon.aws.operators.batch import BatchOperator
   
   
   @dag(start_date=datetime(2022, 1, 1), schedule_interval=None)
   def batchop_dtm():
       BatchOperator.partial(
           task_id='submit_batch_job',
           job_queue="batch_job_queue_name",
           job_definition="batch_job_definition_name",
           overrides={},
           # Submit the job without waiting for it to complete
           wait_for_completion=False,
       ).expand(job_name=["job_1", "job_2", "job_3"])
   
   
   _ = batchop_dtm()
   ```
   2. Start up an Airflow environment using Breeze: `breeze start-airflow`
   3. The following DAG import error is generated:
   <img width="1440" alt="image" src="https://user-images.githubusercontent.com/48934154/180567560-57db4f86-5a06-43a0-8e17-88fd473c23f0.png">
   
   
   ### Anything else
   
   A [similar issue was created previously](https://github.com/apache/airflow/issues/24653) with related fixes in #24676 and #25215.
   
   I suspect the same behavior would occur using the [`BigQueryExecuteQueryOperator`](https://github.com/apache/airflow/blob/f53bd5df2a0b370a14f811b353229ad3e9c66662/airflow/providers/google/cloud/operators/bigquery.py#L543-L548) as well.
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of Conduct](https://github.com/apache/airflow/blob/main/CODE_OF_CONDUCT.md)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [airflow] josh-fell commented on issue #25243: Mapped `BatchOperator` tasks causes SerializationError due to `operator_extra_links` property object

Posted by "josh-fell (via GitHub)" <gi...@apache.org>.
josh-fell commented on issue #25243:
URL: https://github.com/apache/airflow/issues/25243#issuecomment-1420956054

   Confirmed this works just fine on Airflow 2.5.0




[GitHub] [airflow] hsilva-evisit commented on issue #25243: Mapped tasks with `operator_extra_links` property object causes SerializationError

Posted by "hsilva-evisit (via GitHub)" <gi...@apache.org>.
hsilva-evisit commented on issue #25243:
URL: https://github.com/apache/airflow/issues/25243#issuecomment-1437361750

   Facing the same issue




[GitHub] [airflow] amenzel1986 commented on issue #25243: Mapped `BatchOperator` tasks causes SerializationError due to `operator_extra_links` property object

Posted by GitBox <gi...@apache.org>.
amenzel1986 commented on issue #25243:
URL: https://github.com/apache/airflow/issues/25243#issuecomment-1235731919

   I am seeing similar issues with the ECSOperator.




[GitHub] [airflow] Taragolis commented on issue #25243: Mapped tasks with `operator_extra_links` property object causes SerializationError

Posted by "Taragolis (via GitHub)" <gi...@apache.org>.
Taragolis commented on issue #25243:
URL: https://github.com/apache/airflow/issues/25243#issuecomment-1721796292

   https://github.com/apache/airflow/pull/31904




[GitHub] [airflow] uranusjr commented on issue #25243: Mapped `BatchOperator` tasks causes SerializationError due to `operator_extra_links` property object

Posted by GitBox <gi...@apache.org>.
uranusjr commented on issue #25243:
URL: https://github.com/apache/airflow/issues/25243#issuecomment-1196389543

   Instead of going through the trouble of trying to accommodate dynamic extra links, perhaps we should just detect them and mark them as dynamic links that can’t be shown.
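
One way to picture the detection suggested here, as a minimal sketch only: check whether the class-level attribute is a `property` and, if so, treat the links as dynamic and serialize none of them. The helper names `has_dynamic_extra_links` and `links_for_serialization`, and the two example classes, are hypothetical and are not part of Airflow.

```python
def has_dynamic_extra_links(operator_class):
    """Return True if operator_extra_links is defined as a property on the class."""
    # Looking the attribute up on the class returns the property object itself.
    attr = getattr(operator_class, "operator_extra_links", None)
    return isinstance(attr, property)


def links_for_serialization(operator_class):
    """Return the static links, or an empty tuple when the links are dynamic."""
    if has_dynamic_extra_links(operator_class):
        # Dynamic links cannot be resolved without an instance, so show none.
        return ()
    return tuple(operator_class.operator_extra_links)


class StaticLinks:
    # Hypothetical operator with class-level (static) links.
    operator_extra_links = ("link_a",)


class DynamicLinks:
    # Hypothetical operator whose links are computed per instance.
    @property
    def operator_extra_links(self):
        return ("link_b",)


print(links_for_serialization(StaticLinks))   # ('link_a',)
print(links_for_serialization(DynamicLinks))  # ()
```

The trade-off in this sketch is that dynamic links are silently dropped for mapped tasks instead of breaking DAG serialization.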




[GitHub] [airflow] josh-fell commented on issue #25243: Mapped `BatchOperator` tasks causes SerializationError due to `operator_extra_links` property object

Posted by "josh-fell (via GitHub)" <gi...@apache.org>.
josh-fell commented on issue #25243:
URL: https://github.com/apache/airflow/issues/25243#issuecomment-1420937425

   Reopening this issue. Trying to map `BatchOperator` on Airflow 2.5.1 with Amazon provider 7.1.0 yields the original import error described above. I'll try to poke around and find out why this isn't working anymore.




[GitHub] [airflow] eladkal closed issue #25243: Mapped `BatchOperator` tasks causes SerializationError due to `operator_extra_links` property object

Posted by GitBox <gi...@apache.org>.
eladkal closed issue #25243: Mapped `BatchOperator` tasks causes SerializationError due to `operator_extra_links` property object
URL: https://github.com/apache/airflow/issues/25243




[GitHub] [airflow] eladkal commented on issue #25243: Mapped `BatchOperator` tasks causes SerializationError due to `operator_extra_links` property object

Posted by GitBox <gi...@apache.org>.
eladkal commented on issue #25243:
URL: https://github.com/apache/airflow/issues/25243#issuecomment-1263986543

   Should be fixed by https://github.com/apache/airflow/pull/25500




Re: [I] Mapped tasks with `operator_extra_links` property object causes SerializationError [airflow]

Posted by "Taragolis (via GitHub)" <gi...@apache.org>.
Taragolis closed issue #25243: Mapped tasks with `operator_extra_links` property object causes SerializationError
URL: https://github.com/apache/airflow/issues/25243




[GitHub] [airflow] asherkhb commented on issue #25243: Mapped tasks with `operator_extra_links` property object causes SerializationError

Posted by "asherkhb (via GitHub)" <gi...@apache.org>.
asherkhb commented on issue #25243:
URL: https://github.com/apache/airflow/issues/25243#issuecomment-1721687717

   Does anyone by chance know a workaround for this issue (specifically with the AWS Batch operator mentioned in the original issue)?
   
   My current "workaround" is to use the Amazon provider <=v3.4.0, which isn't ideal but does allow the Batch operator to be used in dynamic task mapping...




Re: [I] Mapped tasks with `operator_extra_links` property object causes SerializationError [airflow]

Posted by "Taragolis (via GitHub)" <gi...@apache.org>.
Taragolis commented on issue #25243:
URL: https://github.com/apache/airflow/issues/25243#issuecomment-1765193291

   - Should be finally resolved in #34506

