You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by GitBox <gi...@apache.org> on 2022/07/28 22:23:30 UTC

[GitHub] [airflow] MatrixManAtYrService opened a new issue, #25389: TypeError when `func` returns `None` in `expand_kwargs(func)`

MatrixManAtYrService opened a new issue, #25389:
URL: https://github.com/apache/airflow/issues/25389

   ### Apache Airflow version
   
   main (development)
   
   ### What happened
   
   Here's a DAG:
   
   ```python3
   with DAG(
       dag_id="expand_kwargs",
       schedule_interval=None,
       start_date=datetime(1970, 1, 1),
   ) as dag:
   
       @task
       def data():
           return [
               ("hello {}", "foo"),
               ("goodbye {}", "bar"),
           ]
   
       def mapper(entry):
           if entry[1] != "bar":
               return {"template": entry[0], "user": entry[1]}
           else:
               return None
   
       @task
       def printer(template, user):
           print(template.format(user))
   
       printer.expand_kwargs(data().map(mapper))
   ```
   
   My understanding (from [this test](https://github.com/apache/airflow/blob/main/tests/models/test_xcom_arg_map.py#L110)) is that I should be able to return `None` and prevent a task from being created for that item.  But when I try to run this I get an error:
   
   ```
   {base_executor.py:91} INFO - Adding to queue: ['<TaskInstance: expand_kwargs_mod_tf.one_task backfill__2022-07-28T00:00:00+00:00 map_index=1 [queued]>']
   hello foo
   {debug_executor.py:84} ERROR - Failed to execute task: 'NoneType' object is not iterable.
   Traceback (most recent call last):
     File "/home/matt/src/airflow/airflow/executors/debug_executor.py", line 78, in _run_task
       ti.run(job_id=ti.job_id, **params)
     File "/home/matt/src/airflow/airflow/utils/session.py", line 71, in wrapper
       return func(*args, session=session, **kwargs)
     File "/home/matt/src/airflow/airflow/models/taskinstance.py", line 1782, in run
       self._run_raw_task(
     File "/home/matt/src/airflow/airflow/utils/session.py", line 68, in wrapper
       return func(*args, **kwargs)
     File "/home/matt/src/airflow/airflow/models/taskinstance.py", line 1445, in _run_raw_task
       self._execute_task_with_callbacks(context, test_mode)
     File "/home/matt/src/airflow/airflow/models/taskinstance.py", line 1580, in _execute_task_with_callbacks
       task_orig = self.render_templates(context=context)
     File "/home/matt/src/airflow/airflow/models/taskinstance.py", line 2202, in render_templates
       rendered_task = self.task.render_template_fields(context)
     File "/home/matt/src/airflow/airflow/models/mappedoperator.py", line 751, in render_template_fields
       unmapped_task = self.unmap(mapped_kwargs)
     File "/home/matt/src/airflow/airflow/models/mappedoperator.py", line 592, in unmap
       kwargs = self._get_unmap_kwargs(kwargs, strict=self._disallow_kwargs_override)
     File "/home/matt/src/airflow/airflow/decorators/base.py", line 460, in _get_unmap_kwargs
       prevent_duplicates(
     File "/home/matt/src/airflow/airflow/models/mappedoperator.py", line 127, in prevent_duplicates
       duplicated_keys = set(kwargs1).intersection(kwargs2)
   TypeError: 'NoneType' object is not iterable
   ```
   
   ### What you think should happen instead
   
   Since one of the two mapped items returned None, this task should end up with just one instance "foo" and there should be no errors while queuing it.
   
   Or, if I'm misunderstanding how this should work, then the error that I see should do more to point me in the right direction.
   
   ### How to reproduce
   
   Run the dag given above
   
   ### Operating System
   
   NixOS 21.11 (linux kernel 5.10.101)
   
   ### Versions of Apache Airflow Providers
   
   n/a
   
   ### Deployment
   
   Virtualenv installation
   
   ### Deployment details
   
   _No response_
   
   ### Anything else
   
   I thought that this might be only a problem in the `prevent_duplicates` function, so I made the following change:
   
   ```python3
   # added 'or set()' below
   duplicated_keys = set(kwargs1 or set()).intersection(kwargs2 or set())
   ```
   
   Following that I got a different error:
   ```
   [2022-07-28 16:17:59,391] {debug_executor.py:84} ERROR - Failed to execute task: 'NoneType' object is not a mapping.
   Traceback (most recent call last):
     File "/home/matt/src/airflow/airflow/executors/debug_executor.py", line 78, in _run_task
       ti.run(job_id=ti.job_id, **params)
     File "/home/matt/src/airflow/airflow/utils/session.py", line 71, in wrapper
       return func(*args, session=session, **kwargs)
     File "/home/matt/src/airflow/airflow/models/taskinstance.py", line 1782, in run
       self._run_raw_task(
     File "/home/matt/src/airflow/airflow/utils/session.py", line 68, in wrapper
       return func(*args, **kwargs)
     File "/home/matt/src/airflow/airflow/models/taskinstance.py", line 1445, in _run_raw_task
       self._execute_task_with_callbacks(context, test_mode)
     File "/home/matt/src/airflow/airflow/models/taskinstance.py", line 1580, in _execute_task_with_callbacks
       task_orig = self.render_templates(context=context)
     File "/home/matt/src/airflow/airflow/models/taskinstance.py", line 2202, in render_templates
       rendered_task = self.task.render_template_fields(context)
     File "/home/matt/src/airflow/airflow/models/mappedoperator.py", line 751, in render_template_fields
       unmapped_task = self.unmap(mapped_kwargs)
     File "/home/matt/src/airflow/airflow/models/mappedoperator.py", line 592, in unmap
       kwargs = self._get_unmap_kwargs(kwargs, strict=self._disallow_kwargs_override)
     File "/home/matt/src/airflow/airflow/decorators/base.py", line 467, in _get_unmap_kwargs
       self._combined_op_kwargs = {**self.partial_kwargs["op_kwargs"], **mapped_kwargs["op_kwargs"]}
   TypeError: 'NoneType' object is not a mapping
   ```
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of Conduct](https://github.com/apache/airflow/blob/main/CODE_OF_CONDUCT.md)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [airflow] MatrixManAtYrService commented on issue #25389: TypeError when `func` returns `None` in `expand_kwargs(func)`

Posted by GitBox <gi...@apache.org>.
MatrixManAtYrService commented on issue #25389:
URL: https://github.com/apache/airflow/issues/25389#issuecomment-1198879328

   Oh.  Sorry for the noise then.  I had jumped to the conclusion that `None` was used to indicate "no task for this one", but I guess that's what `AirflowSkipException` is for.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [airflow] uranusjr commented on issue #25389: TypeError when `func` returns `None` in `expand_kwargs(func)`

Posted by GitBox <gi...@apache.org>.
uranusjr commented on issue #25389:
URL: https://github.com/apache/airflow/issues/25389#issuecomment-1198733589

   This should be covered by #25355. New error messasge would be
   
   ```
   ValueError: expand_kwargs() expects a list[dict], not list[None]
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [airflow] uranusjr commented on issue #25389: TypeError when `func` returns `None` in `expand_kwargs(func)`

Posted by GitBox <gi...@apache.org>.
uranusjr commented on issue #25389:
URL: https://github.com/apache/airflow/issues/25389#issuecomment-1198888524

   Yeah, AirflowSkipException is the way to go. It may not be obvious for some users (you are too experienced šŸ˜‰) so Iā€™m adding it to the documentation; see #24489.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [airflow] MatrixManAtYrService closed issue #25389: TypeError when `func` returns `None` in `expand_kwargs(func)`

Posted by GitBox <gi...@apache.org>.
MatrixManAtYrService closed issue #25389: TypeError when `func` returns `None` in `expand_kwargs(func)`
URL: https://github.com/apache/airflow/issues/25389


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org