You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Pau Tallada CrespĂ (JIRA)" <ji...@apache.org> on 2016/05/18 12:13:12 UTC
[jira] [Commented] (HIVE-13312) TABLESAMPLE with PERCENT throws
FAILED: SemanticException 1:68 Percentage sampling is not supported in
org.apache.hadoop.hive.ql.io.HiveInputFormat. Error encountered near token
'20'
[ https://issues.apache.org/jira/browse/HIVE-13312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15288866#comment-15288866 ]
Pau Tallada CrespĂ commented on HIVE-13312:
-------------------------------------------
Also, even with the workaround, an aggregated query reads and processes all rows.
So, a count(*) on a tablesample returns the actual number of rows in the entire table :(
> TABLESAMPLE with PERCENT throws FAILED: SemanticException 1:68 Percentage sampling is not supported in org.apache.hadoop.hive.ql.io.HiveInputFormat. Error encountered near token '20'
> --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: HIVE-13312
> URL: https://issues.apache.org/jira/browse/HIVE-13312
> Project: Hive
> Issue Type: Bug
> Components: CLI
> Affects Versions: 1.2.1
> Reporter: Artem Ervits
> Priority: Minor
>
> FAILED: SemanticException 1:68 Percentage sampling is not supported in org.apache.hadoop.hive.ql.io.HiveInputFormat. Error encountered near token '20'
> when I execute
> SELECT * FROM tablename TABLESAMPLE(20 percent);
> tried with ORC and TEXT tables. Confirmed with Gopal, a temporary workaround is
> set hive.tez.input.format=${hive.input.format};
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)