You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@pig.apache.org by shazz Ng <sh...@gmail.com> on 2011/09/08 21:27:04 UTC

HbaseStorage / OpenTSDB data queries issue

hello !

So I solved some issuesto retrieve TSD uids but now when querying the real
data, pig simply crashes...

model:
hbase(main):002:0> describe 'tsdb'
DESCRIPTION

                      ENABLED
 {NAME => 'tsdb', FAMILIES => [{NAME => 't', BLOOMFILTER => 'NONE',
REPLICATION_SCOPE => '0', COMPRESSION => 'NONE', VERSIONS => '3', TTL =>
'2147483647', BLOCKSIZE => '6553 true
 6', IN_MEMORY => 'false', BLOCKCACHE => 'true'}]}

hbase(main):001:0> scan 'tsdb'
ROW
 COLUMN+CELL
 \x00\x00\x01Ni\x10\xB0\x00\x00\x01\x00\x00\x01\x00\x00\x02\x00\x00\
column=t:C\xEB, timestamp=1315509487519, value=\x00\x00\x00\x00B\xE1\xBFa
 x02\x00\x00\x03\x00\x00\x01
 \x00\x00\x01Ni\x10\xB0\x00\x00\x01\x00\x00\x01\x00\x00\x02\x00\x00\
column=t:E\xCB, timestamp=1315509517558, value=\x00\x00\x00\x00B\xF2\xE9\xB6
 x02\x00\x00\x03\x00\x00\x01

traces :
grunt> tsd_data        =  LOAD 'hbase://tsdb' using
org.apache.pig.backend.hadoop.hbase.HBaseStorage('t',
'-caster=HBaseBinaryConverter -loadKey=true') AS (key:long);
grunt> dump tsd_data;
2011-09-08 21:20:00,521 [main] INFO
 org.apache.pig.tools.pigstats.ScriptState - Pig features used in the
script: UNKNOWN
2011-09-08 21:20:00,521 [main] INFO
 org.apache.pig.backend.hadoop.executionengine.HExecutionEngine -
pig.usenewlogicalplan is set to true. New logical plan will be used.
2011-09-08 21:20:00,542 [main] INFO
 org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - (Name:
tsd_data:
Store(file:/tmp/temp-93457683/tmp-1360380816:org.apache.pig.impl.io.InterStorage)
- scope-15 Operator Key: scope-15)
2011-09-08 21:20:00,542 [main] INFO
 org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler -
File concatenation threshold: 100 optimistic? false
2011-09-08 21:20:00,543 [main] INFO
 org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
- MR plan size before optimization: 1
2011-09-08 21:20:00,543 [main] INFO
 org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
- MR plan size after optimization: 1
2011-09-08 21:20:00,545 [main] INFO
 org.apache.pig.tools.pigstats.ScriptState - Pig script settings are added
to the job
2011-09-08 21:20:00,546 [main] INFO
 org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler
- mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2011-09-08 21:20:00,638 [main] ERROR org.apache.pig.tools.grunt.Grunt -
ERROR 2017: Internal error creating job configuration.
Details at logfile: /home/opentsdb/pig/pig_1315508236191.log
grunt> quit
opentsdb@TSDB:~/pig$ more /home/opentsdb/pig/pig_1315508236191.log
Pig Stack Trace
---------------
ERROR 2017: Internal error creating job configuration.

org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to
open iterator for alias tsd_data
        at org.apache.pig.PigServer.openIterator(PigServer.java:765)
        at
org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:615)
        at
org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:303)
        at
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:168)
        at
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:144)
        at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:76)
        at org.apache.pig.Main.run(Main.java:455)
        at org.apache.pig.Main.main(Main.java:107)
Caused by: org.apache.pig.PigException: ERROR 1002: Unable to store alias
tsd_data
        at org.apache.pig.PigServer.storeEx(PigServer.java:889)
        at org.apache.pig.PigServer.store(PigServer.java:827)
        at org.apache.pig.PigServer.openIterator(PigServer.java:739)
        ... 7 more
Caused by:
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobCreationException:
ERROR 2017: Internal error creating job configuration.
        at
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.getJob(JobControlCompiler.java:673)
        at
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.compile(JobControlCompiler.java:256)
        at
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:147)
        at
org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.execute(HExecutionEngine.java:382)
        at
org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:1209)
        at org.apache.pig.PigServer.storeEx(PigServer.java:885)
        ... 9 more
Caused by: java.lang.ArrayIndexOutOfBoundsException: 1
        at
org.apache.pig.backend.hadoop.hbase.HBaseStorage.setLocation(HBaseStorage.java:343)
        at
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler.getJob(JobControlCompiler.java:369)
        ... 14 more

any idea ????