Posted to issues-all@impala.apache.org by "Tim Armstrong (Jira)" <ji...@apache.org> on 2019/09/16 22:43:00 UTC

[jira] [Resolved] (IMPALA-708) optimize hdfs-table-sink output partition hashing

     [ https://issues.apache.org/jira/browse/IMPALA-708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Armstrong resolved IMPALA-708.
----------------------------------
    Resolution: Won't Fix

See Dan's last comment.

> optimize hdfs-table-sink output partition hashing
> -------------------------------------------------
>
>                 Key: IMPALA-708
>                 URL: https://issues.apache.org/jira/browse/IMPALA-708
>             Project: IMPALA
>          Issue Type: Task
>          Components: Backend
>    Affects Versions: Impala 1.0, Impala 1.2
>            Reporter: Nong Li
>            Priority: Minor
>              Labels: poc
>
> Basic profiling of an unpartitioned insert shows some very low-hanging fruit:
>      226  16.2%  16.2%      226  16.2% boost::unordered_detail::hash_table::find_iterator  <-- Need to track down where this is (we need better cluster tools), but this seems like a big waste of time.
>      178  12.8%  29.0%      178  12.8% impala::HdfsParquetTableWriter::AppendRowBatch
>      157  11.3%  40.3%      157  11.3% impala::HdfsParquetTableWriter::ColumnWriter::EncodeValue@9ff700
>      131   9.4%  49.7%      131   9.4% __strncmp_sse42
>      129   9.3%  59.0%      133   9.6% impala::TextConverter::WriteSlot
>      109   7.8%  66.9%      109   7.8% impala::DelimitedTextParser::ParseFieldLocations
>       94   6.8%  73.6%       94   6.8% snappy::internal::CompressFragment
>       71   5.1%  78.7%       71   5.1% impala::HdfsParquetTableWriter::ColumnWriter::EncodeValue@9fca90
>       56   4.0%  82.7%       56   4.0% impala::HdfsScanner::WriteCompleteTuple
>       36   2.6%  85.3%       36   2.6% impala::HdfsParquetTableWriter::ColumnWriter::EncodeValue@9fd3f0
>       34   2.4%  87.8%       34   2.4% impala::HashUtil::Hash
>       34   2.4%  90.2%       34   2.4% impala::StringParser::StringToIntInternal@801fd0
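
For illustration only, here is a minimal C++ sketch of the kind of optimization the title suggests: skipping the per-row hash-table lookup when the target table has a single output partition. It assumes the find_iterator samples come from the sink's partition lookup, which the profile above does not confirm. Every name here (PartitionWriter, TableSinkSketch, AppendRow, and so on) is hypothetical and does not correspond to Impala's actual classes, and std::unordered_map stands in for the boost container.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>

// Hypothetical stand-in for a per-partition writer; not Impala's real class.
struct PartitionWriter {
  std::size_t rows_written = 0;
};

// Hypothetical sink. Partitioned tables look up the writer by key for every
// row; unpartitioned tables reuse one cached writer and never hash.
class TableSinkSketch {
 public:
  explicit TableSinkSketch(bool partitioned) : partitioned_(partitioned) {}

  // Appends one row. `partition_key` is ignored for unpartitioned tables.
  void AppendRow(const std::string& partition_key) {
    PartitionWriter* writer =
        partitioned_ ? Lookup(partition_key) : &single_partition_;
    assert(writer != nullptr);
    ++writer->rows_written;
  }

  // Number of hash-table lookups performed so far.
  std::size_t hash_lookups() const { return hash_lookups_; }

  // Rows written to the partition with the given key (0 if it does not exist).
  std::size_t RowsIn(const std::string& key) const {
    auto it = partitions_.find(key);
    return it == partitions_.end() ? 0 : it->second.rows_written;
  }

  // Rows written through the unpartitioned fast path.
  std::size_t RowsInSinglePartition() const {
    return single_partition_.rows_written;
  }

 private:
  // Slow path: one hash lookup per row, creating the partition on first use.
  PartitionWriter* Lookup(const std::string& key) {
    ++hash_lookups_;
    return &partitions_[key];
  }

  bool partitioned_;
  std::size_t hash_lookups_ = 0;
  PartitionWriter single_partition_;
  std::unordered_map<std::string, PartitionWriter> partitions_;
};
```

In this sketch, appending any number of rows to a sink constructed with `partitioned == false` leaves `hash_lookups()` at zero, while the partitioned path pays one lookup per row. Whether a change like this would be worthwhile in the real sink is a separate question; this issue was resolved as Won't Fix.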
