You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@superset.apache.org by GitBox <gi...@apache.org> on 2020/01/23 21:30:53 UTC

[GitHub] [incubator-superset] robdiciuccio opened a new pull request #9007: Serialize nested columns as JSON strings

robdiciuccio opened a new pull request #9007: Serialize nested columns as JSON strings
URL: https://github.com/apache/incubator-superset/pull/9007
 
 
   ### CATEGORY
   
   Choose one
   
   - [X] Bug Fix
   - [ ] Enhancement (new features, refinement)
   - [ ] Refactor
   - [ ] Add tests
   - [ ] Build / Development Environment
   - [ ] Documentation
   
   ### SUMMARY
   SQL Lab queries against databases with nested columns/results containing structs or maps are failing with the following error:
   ```
   pyarrow.lib.ArrowNotImplementedError: Not implemented type for list in DataFrameBlock: struct<...>
   ```
   
   Serializing results to JSON is not the ideal solution, mainly for performance reasons, but there are pending fixes yet to be released in Arrow 1.0 that may improve the situation:
   https://github.com/apache/arrow/pull/6199
   
   ### BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF
   <!--- Skip this if not applicable -->
   
   ### TEST PLAN
   Ensure queries against databases containing nested columns (particularly arrays containing maps) are successful. Per https://github.com/apache/incubator-superset/issues/8978, the following query should succeed and produce the proper results:
   ```
   SELECT id, json_agg(json_build_object('table_name',table_name,'database_id',database_id)) FROM (SELECT * FROM tables) AS tables GROUP BY id
   ```
   
   ### ADDITIONAL INFORMATION
   <!--- Check any relevant boxes with "x" -->
   <!--- HINT: Include "Fixes #nnn" if you are fixing an existing issue -->
   - [X] Has associated issue: #8978
   - [ ] Changes UI
   - [ ] Requires DB Migration.
   - [ ] Confirm DB Migration upgrade and downgrade tested.
   - [ ] Introduces new feature or API
   - [ ] Removes existing feature or API
   
   ### REVIEWERS
   @lxhoang97 @graceguo-supercat @john-bodley @michellethomas @villebro 

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org


[GitHub] [incubator-superset] nytai commented on a change in pull request #9007: Serialize nested columns as JSON strings

Posted by GitBox <gi...@apache.org>.
nytai commented on a change in pull request #9007: Serialize nested columns as JSON strings
URL: https://github.com/apache/incubator-superset/pull/9007#discussion_r370758282
 
 

 ##########
 File path: superset/result_set.py
 ##########
 @@ -86,7 +88,18 @@ def __init__(
         # related: https://issues.apache.org/jira/browse/ARROW-5248
         if pa_data:
             for i, column in enumerate(column_names):
-                if pa.types.is_temporal(pa_data[i].type):
+                # TODO: revisit nested column serialization once Arrow 1.0 is released with:
+                # https://github.com/apache/arrow/pull/6199
+                # Related issue: #8978
 
 Review comment:
   use url instead of issue #? 

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org


[GitHub] [incubator-superset] craig-rueda merged pull request #9007: Serialize nested columns as JSON strings

Posted by GitBox <gi...@apache.org>.
craig-rueda merged pull request #9007: Serialize nested columns as JSON strings
URL: https://github.com/apache/incubator-superset/pull/9007
 
 
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org


[GitHub] [incubator-superset] codecov-io commented on issue #9007: Serialize nested columns as JSON strings

Posted by GitBox <gi...@apache.org>.
codecov-io commented on issue #9007: Serialize nested columns as JSON strings
URL: https://github.com/apache/incubator-superset/pull/9007#issuecomment-577889658
 
 
   # [Codecov](https://codecov.io/gh/apache/incubator-superset/pull/9007?src=pr&el=h1) Report
   > Merging [#9007](https://codecov.io/gh/apache/incubator-superset/pull/9007?src=pr&el=desc) into [master](https://codecov.io/gh/apache/incubator-superset/commit/04be1d4b422648f5a7744bd753f8481d2f2f135f?src=pr&el=desc) will **not change** coverage.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree graph](https://codecov.io/gh/apache/incubator-superset/pull/9007/graphs/tree.svg?width=650&token=KsB0fHcx6l&height=150&src=pr)](https://codecov.io/gh/apache/incubator-superset/pull/9007?src=pr&el=tree)
   
   ```diff
   @@           Coverage Diff           @@
   ##           master    #9007   +/-   ##
   =======================================
     Coverage   59.16%   59.16%           
   =======================================
     Files         367      367           
     Lines       11680    11680           
     Branches     2863     2863           
   =======================================
     Hits         6910     6910           
     Misses       4591     4591           
     Partials      179      179
   ```
   
   
   
   ------
   
   [Continue to review full report at Codecov](https://codecov.io/gh/apache/incubator-superset/pull/9007?src=pr&el=continue).
   > **Legend** - [Click here to learn more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute <relative> (impact)`, `ø = not affected`, `? = missing data`
   > Powered by [Codecov](https://codecov.io/gh/apache/incubator-superset/pull/9007?src=pr&el=footer). Last update [04be1d4...bad166a](https://codecov.io/gh/apache/incubator-superset/pull/9007?src=pr&el=lastupdated). Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org