Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/05/10 20:59:27 UTC

[GitHub] [arrow-datafusion] andygrove opened a new issue, #2505: Ballista assumes all aggregate expressions are not DISTINCT

andygrove opened a new issue, #2505:
URL: https://github.com/apache/arrow-datafusion/issues/2505

   **Describe the bug**
   We have a hard-coded `distinct = false` parameter in `ballista/rust/core/src/serde/physical_plan/mod.rs`.
   
   ``` rust
   Ok(create_aggregate_expr(
       &aggr_function.into(),
       false, // <-- hard-coded "distinct"
       input_phy_expr.as_slice(),
       &physical_schema,
       name.to_string(),
   )?)
   ```
   
   **To Reproduce**
   Try running a `COUNT(DISTINCT expr)` query in Ballista.
   
   **Expected behavior**
   We need to include the distinct flag in the protobuf for aggregate queries and implement the appropriate serde code.
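   
   A minimal sketch of what carrying the flag through serde could look like. The `AggregateExprNode` and `AggregateExpr` types below, and the `to_proto` / `from_proto` helpers, are hypothetical stand-ins for illustration only; they are not Ballista's generated protobuf types or DataFusion's real API. The point is just that the flag is written on the way out and read back on the way in, instead of being hard-coded to `false`.
   
   ```rust
   // Hypothetical stand-in for the protobuf aggregate node (assumption, not the real message).
   #[derive(Debug, Clone, PartialEq)]
   struct AggregateExprNode {
       aggr_function: String,
       expr: String,
       // The field this issue asks for: whether the aggregate is DISTINCT.
       distinct: bool,
   }
   
   // Hypothetical stand-in for a physical aggregate expression (assumption).
   #[derive(Debug, Clone, PartialEq)]
   struct AggregateExpr {
       function: String,
       input: String,
       distinct: bool,
       name: String,
   }
   
   // Serialize: copy the distinct flag into the node instead of dropping it.
   fn to_proto(expr: &AggregateExpr) -> AggregateExprNode {
       AggregateExprNode {
           aggr_function: expr.function.clone(),
           expr: expr.input.clone(),
           distinct: expr.distinct,
       }
   }
   
   // Deserialize: read the flag from the node rather than passing a literal `false`.
   fn from_proto(node: &AggregateExprNode, name: &str) -> AggregateExpr {
       AggregateExpr {
           function: node.aggr_function.clone(),
           input: node.expr.clone(),
           distinct: node.distinct,
           name: name.to_string(),
       }
   }
   ```
   
   With this shape, a `COUNT(DISTINCT c1)` expression round-trips with `distinct` still set to `true`, whereas the hard-coded version would come back as a plain `COUNT(c1)`.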
   
   **Additional context**
   None
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] comphead commented on issue #2505: Ballista assumes all aggregate expressions are not DISTINCT

Posted by GitBox <gi...@apache.org>.
comphead commented on issue #2505:
URL: https://github.com/apache/arrow-datafusion/issues/2505#issuecomment-1125013122

   I checked the backtrace:
   ```
      2: datafusion_physical_expr::aggregate::build_in::create_aggregate_expr
                at ./datafusion/physical-expr/src/aggregate/build_in.rs:75:13
      3: datafusion::physical_plan::planner::create_aggregate_expr_with_name
                at ./datafusion/core/src/physical_plan/planner.rs:1347:13
      4: datafusion::physical_plan::planner::create_aggregate_expr
                at ./datafusion/core/src/physical_plan/planner.rs:1390:5
      5: datafusion::physical_plan::planner::DefaultPhysicalPlanner::create_initial_plan::{{closure}}::{{closure}}
                at ./datafusion/core/src/physical_plan/planner.rs:525:29
   ```
   
   I don't see any Ballista modules in it.




[GitHub] [arrow-datafusion] comphead commented on issue #2505: Ballista assumes all aggregate expressions are not DISTINCT

Posted by GitBox <gi...@apache.org>.
comphead commented on issue #2505:
URL: https://github.com/apache/arrow-datafusion/issues/2505#issuecomment-1124992211

   Hi @andygrove, I've run this in a local Ballista instance:
   ```
   SELECT count(distinct c1) as cnt_distinct FROM aggregate_test_100
   ```
   
   The result is as expected:
   ```
   +--------------+
   | cnt_distinct |
   +--------------+
   | 5            |
   +--------------+
   ```

