Posted to issues@carbondata.apache.org by "charles horrell (JIRA)" <ji...@apache.org> on 2019/01/07 15:19:00 UTC

[jira] [Updated] (CARBONDATA-3234) Unable to read data from carbondata table stored in S3 with Presto running on EMR

     [ https://issues.apache.org/jira/browse/CARBONDATA-3234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

charles horrell updated CARBONDATA-3234:
----------------------------------------
    Description: 
After creating a carbondata table stored in S3, we are unable to query it with Presto and get the following error:
{code:java}
presto:default> select count(*) from test_table;
Query 20190107_135333_00026_8r2c8 failed: tried to access method org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V from class org.apache.hadoop.fs.s3a.S3AInstrumentation

presto:default> select * from test_table;
Query 20190107_135610_00028_8r2c8 failed: tried to access method org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V from class org.apache.hadoop.fs.s3a.S3AInstrumentation
{code}
The catalog appears to have been picked up correctly: show tables works as expected, as does describing the table. The error appears only when we actually try to access the data.

We configured Presto following the examples here:

[http://carbondata.apache.org/quick-start-guide.html]

Querying from Spark works fine; however, it is vital for our use case that Presto also works with S3.

Amazon EMR version 5.19
Spark 2.3.2
Hadoop 2.8.5
Presto 0.212

  was:
Once creating a carbondata table stored in S3 we are unable to query with presto and get the following error: 
{code:java}
presto:default> select count(*) from test_table;
Query 20190107_135333_00026_8r2c8 failed: tried to access method org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V from class org.apache.hadoop.fs.s3a.S3AInstrumentation

 

presto:default> select * test_table;
Query 20190107_135610_00028_8r2c8 failed: tried to access method org.apache.hadoop.metrics2.lib.MutableCounterLong.<init>(Lorg/apache/hadoop/metrics2/MetricsInfo;J)V from class org.apache.hadoop.fs.s3a.S3AInstrumentation
{code}
The catalog appears to have been picked up okay as show tables works as expected as does describing the table it is just when actually trying to access the data that we see the error.

We configured presto as per the examples here: 

[http://carbondata.apache.org/quick-start-guide.html]

Querying from Spark works okay however it is vital for our use case that presto also works and with S3.

Amazon EMR version 5.19
Spark 2.3.2
Hadoop 2.8.5
Presto 0.212

        Summary: Unable to read data from carbondata table stored in S3 with Presto running on EMR  (was: Unable to read data from carbondata table stored in S3 from Presto running on EMR)
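
Editorial note: the error quoted above is an illegal access between two Hadoop classes, S3AInstrumentation and MutableCounterLong. One plausible cause, which is an assumption and is not confirmed anywhere in this report, is that the two classes are loaded from jars built against different Hadoop versions. The sketch below is a minimal, hypothetical diagnostic (the class name ClasspathProbe and the method describe are invented for illustration) that prints which jar each class is loaded from. It would need to be run with the same jars on the classpath that the Presto carbondata plugin uses.

```java
import java.security.CodeSource;

public class ClasspathProbe {

    /**
     * Returns "className <- location" for a class found on the classpath,
     * or "className <- not on classpath (...)" when it cannot be loaded.
     */
    public static String describe(String className) {
        try {
            // initialize=false: locate the class without running static initializers
            Class<?> c = Class.forName(className, false,
                    ClasspathProbe.class.getClassLoader());
            CodeSource src = c.getProtectionDomain().getCodeSource();
            // JDK classes loaded by the bootstrap loader have no code source
            String where = (src == null || src.getLocation() == null)
                    ? "bootstrap or unknown location"
                    : src.getLocation().toString();
            return className + " <- " + where;
        } catch (ClassNotFoundException | LinkageError e) {
            return className + " <- not on classpath ("
                    + e.getClass().getSimpleName() + ")";
        }
    }

    public static void main(String[] args) {
        // The two classes named in the reported error message
        System.out.println(describe("org.apache.hadoop.metrics2.lib.MutableCounterLong"));
        System.out.println(describe("org.apache.hadoop.fs.s3a.S3AInstrumentation"));
    }
}
```

If the two lines point at jars carrying different Hadoop version numbers, that would be consistent with the assumption above. If they point at matching versions, the cause lies elsewhere.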

> Unable to read data from carbondata table stored in S3 with Presto running on EMR
> ---------------------------------------------------------------------------------
>
>                 Key: CARBONDATA-3234
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3234
>             Project: CarbonData
>          Issue Type: Bug
>          Components: presto-integration
>         Environment: Amazon EMR 5.19
>            Reporter: charles horrell
>            Priority: Major



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)