Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2021/05/20 06:40:32 UTC
[GitHub] [iceberg] maxingui opened a new issue #2619: trino iceberg planning time so long
maxingui opened a new issue #2619:
URL: https://github.com/apache/iceberg/issues/2619
We use Trino Iceberg with Huawei OBS. The planning time is more than 20 s, while the total SCHEDULE and RUNNING time is only 2 s. I do not know what is wrong with it.
The debug log looks like this:
2021-05-20T06:06:01.039Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:customer, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 8.69ms
2021-05-20T06:06:01.049Z INFO Query-20210520_060537_00323_gbd7b-4734 org.apache.iceberg.BaseTableScan Scanning table "icebergsf1_02".customer snapshot 5772262083695712847 created at 2021-05-18 07:43:19.363 with filter true
2021-05-20T06:06:01.127Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:customer, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 8.70ms
2021-05-20T06:06:01.136Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:customer, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 7.93ms
2021-05-20T06:06:01.158Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:customer, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 8.10ms
2021-05-20T06:06:01.171Z INFO Query-20210520_060537_00323_gbd7b-4734 org.apache.iceberg.BaseTableScan Scanning table "icebergsf1_02".customer snapshot 5772262083695712847 created at 2021-05-18 07:43:19.363 with filter true
2021-05-20T06:06:01.193Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.sql.planner.DistributedExecutionPlanner Dynamic filters: [Descriptor{id=df_2014, input="sr_returned_date_sk_0", operator=EQUAL, nullAllowed=false}]
2021-05-20T06:06:01.203Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:store_returns, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 8.39ms
2021-05-20T06:06:01.214Z INFO Query-20210520_060537_00323_gbd7b-4734 org.apache.iceberg.BaseTableScan Scanning table "icebergsf1_02"."store_returns" snapshot 3075814051181790716 created at 2021-05-18 07:44:47.401 with filter true
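One way to see where the planning time goes is to measure the gaps between consecutive log lines. The excerpt above covers only about 175 ms (06:06:01.039 to 06:06:01.214), so it does not show the 20 s by itself. The following is a minimal sketch, not part of Trino or Iceberg: it assumes each line starts with a timestamp, level, query id and logger name, as in the excerpt, and the helper names `parse_line` and `largest_gaps` are made up for illustration.

```python
import re
from datetime import datetime, timedelta

# Assumed line layout (taken from the excerpt above):
# <ISO timestamp>Z <LEVEL> <query id> <logger> <message>
LINE_RE = re.compile(
    r"^(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}\.\d{3})Z\s+(\w+)\s+(\S+)\s+(\S+)\s+(.*)$"
)


def parse_line(line):
    """Return (timestamp, logger) for a log line, or None if it does not match."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    ts = datetime.strptime(m.group(1), "%Y-%m-%dT%H:%M:%S.%f")
    return ts, m.group(4)


def largest_gaps(lines, top=3):
    """Return the `top` largest gaps as (milliseconds, logger before, logger after)."""
    entries = [e for e in map(parse_line, lines) if e is not None]
    gaps = []
    for prev, cur in zip(entries, entries[1:]):
        # Integer milliseconds between two consecutive log lines.
        delta_ms = (cur[0] - prev[0]) // timedelta(milliseconds=1)
        gaps.append((delta_ms, prev[1], cur[1]))
    gaps.sort(key=lambda g: g[0], reverse=True)
    return gaps[:top]
```

Passing the lines of a complete coordinator log for one query to `largest_gaps` lists the longest silent periods and the loggers on either side of them, which may hint at whether the time is spent in metastore calls, in Iceberg scan planning, or in object-store access.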
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
[GitHub] [iceberg] dixingxing0 edited a comment on issue #2619: trino iceberg planning time so long
Posted by GitBox <gi...@apache.org>.
dixingxing0 edited a comment on issue #2619:
URL: https://github.com/apache/iceberg/issues/2619#issuecomment-844838873
It seems your log is incomplete; I think it would help to post more of it.
Since you created this issue in the Iceberg project, I assume you are more inclined to think it is related to Iceberg. Do you have more clues about it?
Have you tried another engine such as Spark SQL? If Spark SQL also plans slowly, then the problem may be unrelated to Trino.
BTW, I've tried Trino 356 (with PR https://github.com/trinodb/trino/pull/7367); the analyze phase was much faster, dropping from about 2 s to 1 s in my test case.
[GitHub] [iceberg] maxingui commented on issue #2619: trino iceberg planning time so long
Posted by GitBox <gi...@apache.org>.
maxingui commented on issue #2619:
URL: https://github.com/apache/iceberg/issues/2619#issuecomment-846410643
Thanks for your reply. My Trino version is 356 too. The issue does not appear in an HDFS + Hadoop environment, but it does occur with Trino Iceberg on Huawei OBS.
The log is attached below:
[22.log](https://github.com/apache/iceberg/files/6526494/22.log)