You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@impala.apache.org by "Marcel Kornacker (Code Review)" <ge...@cloudera.org> on 2016/04/13 00:26:18 UTC
[Impala-CR](cdh5-trunk) IMPALA-2840: Don't store table location in partition location
Marcel Kornacker has posted comments on this change.
Change subject: IMPALA-2840: Don't store table location in partition location
......................................................................
Patch Set 6:
(13 comments)
http://gerrit.cloudera.org:8080/#/c/2355/6/be/src/runtime/descriptors.cc
File be/src/runtime/descriptors.cc:
Line 43: std::string DecompressLocation(const impala::THdfsTable& thrift_table,
why not just make this static? (and remove the namespace)
Line 44: const impala::THdfsPartition& thrift_partition) {
use a string* output param instead
http://gerrit.cloudera.org:8080/#/c/2355/6/be/src/runtime/descriptors.h
File be/src/runtime/descriptors.h:
Line 254: // stripped.
that would be good. do you know how large the descriptor tables get today for something like tpc-ds? (or our concurrent benchmark?)
http://gerrit.cloudera.org:8080/#/c/2355/6/fe/src/main/java/com/cloudera/impala/catalog/HdfsPartitionLocationCompressor.java
File fe/src/main/java/com/cloudera/impala/catalog/HdfsPartitionLocationCompressor.java:
Line 23: import com.cloudera.impala.util.ListMap;
there are a few minor objectionable things in ListMap, could you include it in this patch?
Line 33: *
extra blank line
Line 93: private final int prefix_index;
_
Line 135: if (0 == numPartitions) return new Pair<String,String>(s, "");
this is a small function and i can understand the code perfectly well, but this isn't the number of partitions, it's the number of partition columns
Line 142: if (s.charAt(i) == '/') {
single line
http://gerrit.cloudera.org:8080/#/c/2355/6/fe/src/main/java/com/cloudera/impala/catalog/HdfsTable.java
File fe/src/main/java/com/cloudera/impala/catalog/HdfsTable.java:
Line 467: public HdfsTable(TableId id, org.apache.hadoop.hive.metastore.api.Table msTbl,
move c'tor to top, below static classes.
Line 1265: //new HdfsPartitionLocationCompressor(numClusteringCols_);
remove
http://gerrit.cloudera.org:8080/#/c/2355/6/testdata/workloads/functional-query/queries/QueryTest/alter-table.test
File testdata/workloads/functional-query/queries/QueryTest/alter-table.test:
Line 846: ---- QUERY
add summary comment what this is testing
Line 895: ---- QUERY
same here
Line 935: select i,s from alter_table_test_db.table_one_part order by i;
isn't show partitions a more direct way of ensuring that the partition locations didn't get changed? (if that's what you want to demonstrate)
--
To view, visit http://gerrit.cloudera.org:8080/2355
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c67b6ce0f83de2f5277a528a9ce67e47d638adb
Gerrit-PatchSet: 6
Gerrit-Project: Impala
Gerrit-Branch: cdh5-trunk
Gerrit-Owner: Jim Apple <jb...@cloudera.com>
Gerrit-Reviewer: Dimitris Tsirogiannis <dt...@cloudera.com>
Gerrit-Reviewer: Jim Apple <jb...@cloudera.com>
Gerrit-Reviewer: Marcel Kornacker <ma...@cloudera.com>
Gerrit-Reviewer: Mostafa Mokhtar <mm...@cloudera.com>
Gerrit-Reviewer: Sailesh Mukil <sa...@cloudera.com>
Gerrit-HasComments: Yes