You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@cassandra.apache.org by Miguel Angel Martin junquera <mi...@gmail.com> on 2013/08/22 12:51:33 UTC

how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

hi all:




I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1


I am using this sample data test:


http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html

And I load and dump data Righ with this script:

*rows = LOAD
'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
CqlStorage();*
*
*
*dump rows;*
*describe rows;*
*
*

*resutls:

((id,6),(age,30),(title,QA))

((id,5),(age,30),(title,QA))

rows: {id: chararray,age: int,title: chararray}


*


But i can not  get  the column values

I try to define   another schemas in Load like I used with
cassandraStorage()

http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html


example:

*rows = LOAD
'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
CqlStorage() AS (columns: bag {T: tuple(name, value)});*


and I get this error:

*2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt -
ERROR 1031: Incompatable schema: left is
"columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
"id:chararray,age:int,title:chararray"*




I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
result:

Example:


   - when I flatten , I get a set of tuples like

*(title,QA)*

*(title,QA)*

*2013-08-22 12:42:20,673 [main] INFO
 org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
input paths to process : 1*

*A: {title: chararray}*



but i can get value QA

Sustring only works with title



example:

*B = FOREACH A GENERATE SUBSTRING(title,2,5);*
*
*
*dump B;*
*describe B;*
*
*
*
*

*results:*
*
*

*(tle)*
*(tle)*
*B: {chararray}*




i try, this like ERIC LEE inthe other mail  and have the same results:


 Anyways, what I really what is the column value, not the name. Is there a
way to do that? I listed all of the failed attempts I made below.

   - colnames = FOREACH cols GENERATE $1 and was told $1 was out of bounds.
   - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but
   all I got back were empty tuples
   - values = FOREACH cols GENERATE $0.$1; but I got an error telling me
   data byte array can't be casted to tuple


Please, I will appreciate any help


Regards

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Cyril Scetbon <cy...@free.fr>.
I tried with 1.2.10 and don't meet the issue anymore.

Regards
-- 
Cyril SCETBON

On Sep 19, 2013, at 10:28 PM, Cyril Scetbon <cy...@free.fr> wrote:

> Hi,
> 
> Did you try to build 1.2.10 and to use it for your tests ? I've got the same issue and will give it a try as soon as it's released (expected at the end of the week).
> 
> Regards
> -- 
> Cyril SCETBON
> 
> On Sep 2, 2013, at 3:09 PM, Miguel Angel Martin junquera <mi...@gmail.com> wrote:
> 
>> hi all:
>> 
>> More info :
>> 
>> https://issues.apache.org/jira/browse/CASSANDRA-5941
>> 
>> 
>> 
>> I tried this (and gen. cassandra 1.2.9)  but do not work for me, 
>> 
>> git clone http://git-wip-us.apache.org/repos/asf/cassandra.git
>> cd cassandra
>> git checkout cassandra-1.2
>> patch -p1 < 5867-bug-fix-filter-push-down-1.2-branch.txt
>> ant
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> 
>> 
>> 
>> 2013/9/2 Miguel Angel Martin junquera <mi...@gmail.com>
>> hi:
>> 
>> I test this in cassandra 1.2.9 new  version and the issue still persists .
>> 
>> :-(
>> 
>> 
>> 
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> 
>> 
>> 
>> 2013/8/30 Miguel Angel Martin junquera <mi...@gmail.com>
>> I try this:
>> 
>> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();
>> dump rows;
>> ILLUSTRATE rows;
>> describe rows;
>> 
>> values2= FOREACH rows GENERATE  TOTUPLE (id) as (mycolumn:tuple(name,value));
>> dump values2;
>> describe values2;
>> 
>> But I get this results:
>> 
>> 
>> 
>> -------------------------------------------------------------
>> | rows     | id:chararray   | age:int   | title:chararray   | 
>> -------------------------------------------------------------
>> |          | (id, 6)        | (age, 30) | (title, QA)       | 
>> -------------------------------------------------------------
>> 
>> rows: {id: chararray,age: int,title: chararray}
>> 2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable field schema: left is "tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is "org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"
>> 
>> 
>> 
>> 
>> 
>> or 
>> 
>> 
>> 
>> ....
>> 
>> values2= FOREACH rows GENERATE  TOTUPLE (id) ;
>> dump values2;
>> describe values2;
>> 
>> 
>> 
>> and  the results are:
>> 
>> 
>> ...
>> (((id,6)))
>> (((id,5)))
>> values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}
>> 
>> 
>> 
>> Aggg!!!!!
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> 
>> 
>> 
>> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>> hi:
>> 
>> I can not understand why the schema is  define like "id:chararray,age:int,title:chararray"  and it does not define like tuples or bag tuples,  if we have pair key-values  columns
>> 
>> 
>> I try other time to change schema  but it does not work.
>> 
>> any ideas ...
>> 
>> perhaps, is the issue in the definition cql3 tables ?
>> 
>> regards
>> 
>> 
>> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>> hi all:
>> 
>> 
>> Regards
>> 
>> Still i can resolve this issue. .....
>> 
>> does anybody have this issue or try to test this simple example?
>> 
>> 
>> i am stumped I can not find a solution working. 
>> 
>> I appreciate any comment or help
>> 
>> 
>> 2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>
>> hi all:
>> 
>> 
>> 
>> 
>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1 
>> 
>> 
>> I am using this sample data test:
>> 
>>  http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>> 
>> And I load and dump data Righ with this script:
>> 
>> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();
>> 
>> dump rows;
>> describe rows;
>> 
>> resutls:
>> 
>> ((id,6),(age,30),(title,QA))
>> ((id,5),(age,30),(title,QA))
>> rows: {id: chararray,age: int,title: chararray}
>> 
>> 
>> But i can not  get  the column values 
>> 
>> I try to define   another schemas in Load like I used with cassandraStorage()
>> 
>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>> 
>> 
>> example:
>> 
>> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage() AS (columns: bag {T: tuple(name, value)});
>> 
>> 
>> and I get this error:
>> 
>> 2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable schema: left is "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is "id:chararray,age:int,title:chararray"
>> 
>> 
>> 
>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good result:
>> 
>> Example:
>> 
>> when I flatten , I get a set of tuples like
>> (title,QA)
>> (title,QA)
>> 2013-08-22 12:42:20,673 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
>> A: {title: chararray}
>> 
>> 
>> but i can get value QA 
>> 
>> Sustring only works with title
>> 
>> 
>> 
>> example:
>> 
>> B = FOREACH A GENERATE SUBSTRING(title,2,5);
>> 
>> dump B;
>> describe B;
>> 
>> 
>> results:
>> 
>> (tle)
>> (tle)
>> B: {chararray}
>> 
>> 
>> 
>> i try, this like ERIC LEE inthe other mail  and have the same results:
>> 
>> 
>>  Anyways, what I really what is the column value, not the name. Is there a way to do that? I listed all of the failed attempts I made below.
>> colnames = FOREACH cols GENERATE $1 and was told $1 was out of bounds. 
>> casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but all I got back were empty tuples
>> values = FOREACH cols GENERATE $0.$1; but I got an error telling me data byte array can't be casted to tuple
>> 
>> Please, I will appreciate any help 
>> 
>> 
>> Regards
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> -- 
>> 
>> 
>> 
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> Tel. / Fax: (+34) 91 485 56 66
>> http://www.brainsins.com
>> Smart eCommerce
>>         Madrid: http://goo.gl/4B5kv
>>         London: http://goo.gl/uIXdv
>>         Barcelona: http://goo.gl/NZslW
>> 
>> Antes de imprimir este e-mail, piense si es necesario.
>> La legislación española ampara el secreto de las comunicaciones. Este correo electrónico es estrictamente confidencial y va dirigido exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda ni copie la transmisión y nos lo notifique cuanto antes.
>> 
>> 
>> 
>> -- 
>> 
>> 
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> 
>> 
>> 
>> 
> 


Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Cyril Scetbon <cy...@free.fr>.
I tried with 1.2.10 and don't meet the issue anymore.

Regards
-- 
Cyril SCETBON

On Sep 19, 2013, at 10:28 PM, Cyril Scetbon <cy...@free.fr> wrote:

> Hi,
> 
> Did you try to build 1.2.10 and to use it for your tests ? I've got the same issue and will give it a try as soon as it's released (expected at the end of the week).
> 
> Regards
> -- 
> Cyril SCETBON
> 
> On Sep 2, 2013, at 3:09 PM, Miguel Angel Martin junquera <mi...@gmail.com> wrote:
> 
>> hi all:
>> 
>> More info :
>> 
>> https://issues.apache.org/jira/browse/CASSANDRA-5941
>> 
>> 
>> 
>> I tried this (and gen. cassandra 1.2.9)  but do not work for me, 
>> 
>> git clone http://git-wip-us.apache.org/repos/asf/cassandra.git
>> cd cassandra
>> git checkout cassandra-1.2
>> patch -p1 < 5867-bug-fix-filter-push-down-1.2-branch.txt
>> ant
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> 
>> 
>> 
>> 2013/9/2 Miguel Angel Martin junquera <mi...@gmail.com>
>> hi:
>> 
>> I test this in cassandra 1.2.9 new  version and the issue still persists .
>> 
>> :-(
>> 
>> 
>> 
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> 
>> 
>> 
>> 2013/8/30 Miguel Angel Martin junquera <mi...@gmail.com>
>> I try this:
>> 
>> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();
>> dump rows;
>> ILLUSTRATE rows;
>> describe rows;
>> 
>> values2= FOREACH rows GENERATE  TOTUPLE (id) as (mycolumn:tuple(name,value));
>> dump values2;
>> describe values2;
>> 
>> But I get this results:
>> 
>> 
>> 
>> -------------------------------------------------------------
>> | rows     | id:chararray   | age:int   | title:chararray   | 
>> -------------------------------------------------------------
>> |          | (id, 6)        | (age, 30) | (title, QA)       | 
>> -------------------------------------------------------------
>> 
>> rows: {id: chararray,age: int,title: chararray}
>> 2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable field schema: left is "tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is "org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"
>> 
>> 
>> 
>> 
>> 
>> or 
>> 
>> 
>> 
>> ....
>> 
>> values2= FOREACH rows GENERATE  TOTUPLE (id) ;
>> dump values2;
>> describe values2;
>> 
>> 
>> 
>> and  the results are:
>> 
>> 
>> ...
>> (((id,6)))
>> (((id,5)))
>> values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}
>> 
>> 
>> 
>> Aggg!!!!!
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> 
>> 
>> 
>> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>> hi:
>> 
>> I can not understand why the schema is  define like "id:chararray,age:int,title:chararray"  and it does not define like tuples or bag tuples,  if we have pair key-values  columns
>> 
>> 
>> I try other time to change schema  but it does not work.
>> 
>> any ideas ...
>> 
>> perhaps, is the issue in the definition cql3 tables ?
>> 
>> regards
>> 
>> 
>> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>> hi all:
>> 
>> 
>> Regards
>> 
>> Still i can resolve this issue. .....
>> 
>> does anybody have this issue or try to test this simple example?
>> 
>> 
>> i am stumped I can not find a solution working. 
>> 
>> I appreciate any comment or help
>> 
>> 
>> 2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>
>> hi all:
>> 
>> 
>> 
>> 
>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1 
>> 
>> 
>> I am using this sample data test:
>> 
>>  http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>> 
>> And I load and dump data Righ with this script:
>> 
>> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();
>> 
>> dump rows;
>> describe rows;
>> 
>> resutls:
>> 
>> ((id,6),(age,30),(title,QA))
>> ((id,5),(age,30),(title,QA))
>> rows: {id: chararray,age: int,title: chararray}
>> 
>> 
>> But i can not  get  the column values 
>> 
>> I try to define   another schemas in Load like I used with cassandraStorage()
>> 
>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>> 
>> 
>> example:
>> 
>> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage() AS (columns: bag {T: tuple(name, value)});
>> 
>> 
>> and I get this error:
>> 
>> 2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable schema: left is "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is "id:chararray,age:int,title:chararray"
>> 
>> 
>> 
>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good result:
>> 
>> Example:
>> 
>> when I flatten , I get a set of tuples like
>> (title,QA)
>> (title,QA)
>> 2013-08-22 12:42:20,673 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
>> A: {title: chararray}
>> 
>> 
>> but i can get value QA 
>> 
>> Sustring only works with title
>> 
>> 
>> 
>> example:
>> 
>> B = FOREACH A GENERATE SUBSTRING(title,2,5);
>> 
>> dump B;
>> describe B;
>> 
>> 
>> results:
>> 
>> (tle)
>> (tle)
>> B: {chararray}
>> 
>> 
>> 
>> i try, this like ERIC LEE inthe other mail  and have the same results:
>> 
>> 
>>  Anyways, what I really what is the column value, not the name. Is there a way to do that? I listed all of the failed attempts I made below.
>> colnames = FOREACH cols GENERATE $1 and was told $1 was out of bounds. 
>> casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but all I got back were empty tuples
>> values = FOREACH cols GENERATE $0.$1; but I got an error telling me data byte array can't be casted to tuple
>> 
>> Please, I will appreciate any help 
>> 
>> 
>> Regards
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> -- 
>> 
>> 
>> 
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> Tel. / Fax: (+34) 91 485 56 66
>> http://www.brainsins.com
>> Smart eCommerce
>>         Madrid: http://goo.gl/4B5kv
>>         London: http://goo.gl/uIXdv
>>         Barcelona: http://goo.gl/NZslW
>> 
>> Antes de imprimir este e-mail, piense si es necesario.
>> La legislación española ampara el secreto de las comunicaciones. Este correo electrónico es estrictamente confidencial y va dirigido exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda ni copie la transmisión y nos lo notifique cuanto antes.
>> 
>> 
>> 
>> -- 
>> 
>> 
>> 
>> 
>> 
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> 
>> 
>> 
>> 
> 


Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Cyril Scetbon <cy...@free.fr>.
Hi,

Did you try to build 1.2.10 and to use it for your tests ? I've got the same issue and will give it a try as soon as it's released (expected at the end of the week).

Regards
-- 
Cyril SCETBON

On Sep 2, 2013, at 3:09 PM, Miguel Angel Martin junquera <mi...@gmail.com> wrote:

> hi all:
> 
> More info :
> 
> https://issues.apache.org/jira/browse/CASSANDRA-5941
> 
> 
> 
> I tried this (and gen. cassandra 1.2.9)  but do not work for me, 
> 
> git clone http://git-wip-us.apache.org/repos/asf/cassandra.git
> cd cassandra
> git checkout cassandra-1.2
> patch -p1 < 5867-bug-fix-filter-push-down-1.2-branch.txt
> ant
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> 
> 
> 
> 2013/9/2 Miguel Angel Martin junquera <mi...@gmail.com>
> hi:
> 
> I test this in cassandra 1.2.9 new  version and the issue still persists .
> 
> :-(
> 
> 
> 
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> 
> 
> 
> 2013/8/30 Miguel Angel Martin junquera <mi...@gmail.com>
> I try this:
> 
> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();
> dump rows;
> ILLUSTRATE rows;
> describe rows;
> 
> values2= FOREACH rows GENERATE  TOTUPLE (id) as (mycolumn:tuple(name,value));
> dump values2;
> describe values2;
> 
> But I get this results:
> 
> 
> 
> -------------------------------------------------------------
> | rows     | id:chararray   | age:int   | title:chararray   | 
> -------------------------------------------------------------
> |          | (id, 6)        | (age, 30) | (title, QA)       | 
> -------------------------------------------------------------
> 
> rows: {id: chararray,age: int,title: chararray}
> 2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable field schema: left is "tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is "org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"
> 
> 
> 
> 
> 
> or 
> 
> 
> 
> ....
> 
> values2= FOREACH rows GENERATE  TOTUPLE (id) ;
> dump values2;
> describe values2;
> 
> 
> 
> and  the results are:
> 
> 
> ...
> (((id,6)))
> (((id,5)))
> values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}
> 
> 
> 
> Aggg!!!!!
> 
> 
> 
> 
> 
> 
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> 
> 
> 
> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
> hi:
> 
> I can not understand why the schema is  define like "id:chararray,age:int,title:chararray"  and it does not define like tuples or bag tuples,  if we have pair key-values  columns
> 
> 
> I try other time to change schema  but it does not work.
> 
> any ideas ...
> 
> perhaps, is the issue in the definition cql3 tables ?
> 
> regards
> 
> 
> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
> hi all:
> 
> 
> Regards
> 
> Still i can resolve this issue. .....
> 
> does anybody have this issue or try to test this simple example?
> 
> 
> i am stumped I can not find a solution working. 
> 
> I appreciate any comment or help
> 
> 
> 2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>
> hi all:
> 
> 
> 
> 
> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1 
> 
> 
> I am using this sample data test:
> 
>  http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
> 
> And I load and dump data Righ with this script:
> 
> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();
> 
> dump rows;
> describe rows;
> 
> resutls:
> 
> ((id,6),(age,30),(title,QA))
> ((id,5),(age,30),(title,QA))
> rows: {id: chararray,age: int,title: chararray}
> 
> 
> But i can not  get  the column values 
> 
> I try to define   another schemas in Load like I used with cassandraStorage()
> 
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
> 
> 
> example:
> 
> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage() AS (columns: bag {T: tuple(name, value)});
> 
> 
> and I get this error:
> 
> 2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable schema: left is "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is "id:chararray,age:int,title:chararray"
> 
> 
> 
> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good result:
> 
> Example:
> 
> when I flatten , I get a set of tuples like
> (title,QA)
> (title,QA)
> 2013-08-22 12:42:20,673 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
> A: {title: chararray}
> 
> 
> but i can get value QA 
> 
> Sustring only works with title
> 
> 
> 
> example:
> 
> B = FOREACH A GENERATE SUBSTRING(title,2,5);
> 
> dump B;
> describe B;
> 
> 
> results:
> 
> (tle)
> (tle)
> B: {chararray}
> 
> 
> 
> i try, this like ERIC LEE inthe other mail  and have the same results:
> 
> 
>  Anyways, what I really what is the column value, not the name. Is there a way to do that? I listed all of the failed attempts I made below.
> colnames = FOREACH cols GENERATE $1 and was told $1 was out of bounds. 
> casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but all I got back were empty tuples
> values = FOREACH cols GENERATE $0.$1; but I got an error telling me data byte array can't be casted to tuple
> 
> Please, I will appreciate any help 
> 
> 
> Regards
> 
> 
> 
> 
> 
> 
> 
> 
> 
> -- 
> 
> 
> 
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> Tel. / Fax: (+34) 91 485 56 66
> http://www.brainsins.com
> Smart eCommerce
>         Madrid: http://goo.gl/4B5kv
>         London: http://goo.gl/uIXdv
>         Barcelona: http://goo.gl/NZslW
> 
> Antes de imprimir este e-mail, piense si es necesario.
> La legislación española ampara el secreto de las comunicaciones. Este correo electrónico es estrictamente confidencial y va dirigido exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda ni copie la transmisión y nos lo notifique cuanto antes.
> 
> 
> 
> -- 
> 
> 
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> 
> 
> 
> 


Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Cyril Scetbon <cy...@free.fr>.
Hi,

Did you try to build 1.2.10 and to use it for your tests ? I've got the same issue and will give it a try as soon as it's released (expected at the end of the week).

Regards
-- 
Cyril SCETBON

On Sep 2, 2013, at 3:09 PM, Miguel Angel Martin junquera <mi...@gmail.com> wrote:

> hi all:
> 
> More info :
> 
> https://issues.apache.org/jira/browse/CASSANDRA-5941
> 
> 
> 
> I tried this (and gen. cassandra 1.2.9)  but do not work for me, 
> 
> git clone http://git-wip-us.apache.org/repos/asf/cassandra.git
> cd cassandra
> git checkout cassandra-1.2
> patch -p1 < 5867-bug-fix-filter-push-down-1.2-branch.txt
> ant
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> 
> 
> 
> 2013/9/2 Miguel Angel Martin junquera <mi...@gmail.com>
> hi:
> 
> I test this in cassandra 1.2.9 new  version and the issue still persists .
> 
> :-(
> 
> 
> 
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> 
> 
> 
> 2013/8/30 Miguel Angel Martin junquera <mi...@gmail.com>
> I try this:
> 
> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();
> dump rows;
> ILLUSTRATE rows;
> describe rows;
> 
> values2= FOREACH rows GENERATE  TOTUPLE (id) as (mycolumn:tuple(name,value));
> dump values2;
> describe values2;
> 
> But I get this results:
> 
> 
> 
> -------------------------------------------------------------
> | rows     | id:chararray   | age:int   | title:chararray   | 
> -------------------------------------------------------------
> |          | (id, 6)        | (age, 30) | (title, QA)       | 
> -------------------------------------------------------------
> 
> rows: {id: chararray,age: int,title: chararray}
> 2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable field schema: left is "tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is "org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"
> 
> 
> 
> 
> 
> or 
> 
> 
> 
> ....
> 
> values2= FOREACH rows GENERATE  TOTUPLE (id) ;
> dump values2;
> describe values2;
> 
> 
> 
> and  the results are:
> 
> 
> ...
> (((id,6)))
> (((id,5)))
> values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}
> 
> 
> 
> Aggg!!!!!
> 
> 
> 
> 
> 
> 
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> 
> 
> 
> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
> hi:
> 
> I can not understand why the schema is  define like "id:chararray,age:int,title:chararray"  and it does not define like tuples or bag tuples,  if we have pair key-values  columns
> 
> 
> I try other time to change schema  but it does not work.
> 
> any ideas ...
> 
> perhaps, is the issue in the definition cql3 tables ?
> 
> regards
> 
> 
> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
> hi all:
> 
> 
> Regards
> 
> Still i can resolve this issue. .....
> 
> does anybody have this issue or try to test this simple example?
> 
> 
> i am stumped I can not find a solution working. 
> 
> I appreciate any comment or help
> 
> 
> 2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>
> hi all:
> 
> 
> 
> 
> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1 
> 
> 
> I am using this sample data test:
> 
>  http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
> 
> And I load and dump data Righ with this script:
> 
> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage();
> 
> dump rows;
> describe rows;
> 
> resutls:
> 
> ((id,6),(age,30),(title,QA))
> ((id,5),(age,30),(title,QA))
> rows: {id: chararray,age: int,title: chararray}
> 
> 
> But i can not  get  the column values 
> 
> I try to define   another schemas in Load like I used with cassandraStorage()
> 
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
> 
> 
> example:
> 
> rows = LOAD 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING CqlStorage() AS (columns: bag {T: tuple(name, value)});
> 
> 
> and I get this error:
> 
> 2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable schema: left is "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is "id:chararray,age:int,title:chararray"
> 
> 
> 
> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good result:
> 
> Example:
> 
> when I flatten , I get a set of tuples like
> (title,QA)
> (title,QA)
> 2013-08-22 12:42:20,673 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
> A: {title: chararray}
> 
> 
> but i can get value QA 
> 
> Sustring only works with title
> 
> 
> 
> example:
> 
> B = FOREACH A GENERATE SUBSTRING(title,2,5);
> 
> dump B;
> describe B;
> 
> 
> results:
> 
> (tle)
> (tle)
> B: {chararray}
> 
> 
> 
> i try, this like ERIC LEE inthe other mail  and have the same results:
> 
> 
>  Anyways, what I really what is the column value, not the name. Is there a way to do that? I listed all of the failed attempts I made below.
> colnames = FOREACH cols GENERATE $1 and was told $1 was out of bounds. 
> casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but all I got back were empty tuples
> values = FOREACH cols GENERATE $0.$1; but I got an error telling me data byte array can't be casted to tuple
> 
> Please, I will appreciate any help 
> 
> 
> Regards
> 
> 
> 
> 
> 
> 
> 
> 
> 
> -- 
> 
> 
> 
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> Tel. / Fax: (+34) 91 485 56 66
> http://www.brainsins.com
> Smart eCommerce
>         Madrid: http://goo.gl/4B5kv
>         London: http://goo.gl/uIXdv
>         Barcelona: http://goo.gl/NZslW
> 
> Antes de imprimir este e-mail, piense si es necesario.
> La legislación española ampara el secreto de las comunicaciones. Este correo electrónico es estrictamente confidencial y va dirigido exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda ni copie la transmisión y nos lo notifique cuanto antes.
> 
> 
> 
> -- 
> 
> 
> 
> 
> 
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> 
> 
> 
> 


Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
hi all:

More info :

https://issues.apache.org/jira/browse/CASSANDRA-5941



I tried this (and gen. cassandra 1.2.9)  but do not work for me,

git clone http://git-wip-us.apache.org/repos/asf/cassandra.git
cd cassandra
git checkout cassandra-1.2
patch -p1 < 5867-bug-fix-filter-push-down-1.2-branch.txt
ant



Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com



2013/9/2 Miguel Angel Martin junquera <mi...@gmail.com>

> hi:
>
> I test this in cassandra 1.2.9 new  version and the issue still persists .
>
> :-(
>
>
>
>
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
>
>
>
> 2013/8/30 Miguel Angel Martin junquera <mi...@gmail.com>
>
>> I try this:
>>
>> *rows = LOAD
>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>> CqlStorage();*
>>
>> *dump rows;*
>>
>> *ILLUSTRATE rows;*
>>
>> *describe rows;*
>>
>> *
>> *
>>
>> *values2= FOREACH rows GENERATE  TOTUPLE (id) as
>> (mycolumn:tuple(name,value));*
>>
>> *dump values2;*
>>
>> *describe values2;*
>> *
>> *
>>
>> But I get this results:
>>
>>
>>
>> -------------------------------------------------------------
>> | rows     | id:chararray   | age:int   | title:chararray   |
>> -------------------------------------------------------------
>> |          | (id, 6)        | (age, 30) | (title, QA)       |
>> -------------------------------------------------------------
>>
>> rows: {id: chararray,age: int,title: chararray}
>> 2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt -
>> ERROR 1031: Incompatable field schema: left is
>> "tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is
>> "org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"
>>
>>
>>
>>
>>
>> or
>>
>>
>>
>> ....
>>
>> *values2= FOREACH rows GENERATE  TOTUPLE (id) ;*
>> *dump values2;*
>> *describe values2;*
>>
>>
>>
>>
>> and  the results are:
>>
>>
>> ...
>> (((id,6)))
>> (((id,5)))
>> values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}
>>
>>
>>
>> Aggg!!!!!
>>
>>
>> *
>> *
>>
>>
>>
>>
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>>
>>
>>
>> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>>
>>> hi:
>>>
>>> I can not understand why the schema is  define like *"id:chararray,age:int,title:chararray"
>>>  and it does not define like tuples or bag tuples,  if we have pair
>>> key-values  columns*
>>> *
>>> *
>>> *
>>> *
>>> *I try other time to change schema  but it does not work.*
>>> *
>>> *
>>> *any ideas ...*
>>> *
>>> *
>>> *perhaps, is the issue in the definition cql3 tables ?*
>>> *
>>> *
>>> *regards*
>>>
>>>
>>> 2013/8/28 Miguel Angel Martin junquera <mianmarjun.mailinglist@gmail.com
>>> >
>>>
>>>> hi all:
>>>>
>>>>
>>>> Regards
>>>>
>>>> Still i can resolve this issue. .....
>>>>
>>>> does anybody have this issue or try to test this simple example?
>>>>
>>>>
>>>> i am stumped I can not find a solution working.
>>>>
>>>> I appreciate any comment or help
>>>>
>>>>
>>>> 2013/8/22 Miguel Angel Martin junquera <
>>>> mianmarjun.mailinglist@gmail.com>
>>>>
>>>>> hi all:
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>>>>>
>>>>>
>>>>> I am using this sample data test:
>>>>>
>>>>>
>>>>> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>>>>>
>>>>> And I load and dump data Righ with this script:
>>>>>
>>>>> *rows = LOAD
>>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>>> CqlStorage();*
>>>>> *
>>>>> *
>>>>> *dump rows;*
>>>>> *describe rows;*
>>>>> *
>>>>> *
>>>>>
>>>>> *resutls:
>>>>>
>>>>> ((id,6),(age,30),(title,QA))
>>>>>
>>>>> ((id,5),(age,30),(title,QA))
>>>>>
>>>>> rows: {id: chararray,age: int,title: chararray}
>>>>>
>>>>>
>>>>> *
>>>>>
>>>>>
>>>>> But i can not  get  the column values
>>>>>
>>>>> I try to define   another schemas in Load like I used with
>>>>> cassandraStorage()
>>>>>
>>>>>
>>>>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>>>>>
>>>>>
>>>>> example:
>>>>>
>>>>> *rows = LOAD
>>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>>> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>>>>>
>>>>>
>>>>> and I get this error:
>>>>>
>>>>> *2013-08-22 12:24:45,426 [main] ERROR
>>>>> org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable schema: left is
>>>>> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
>>>>> "id:chararray,age:int,title:chararray"*
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
>>>>> result:
>>>>>
>>>>> Example:
>>>>>
>>>>>
>>>>>    - when I flatten , I get a set of tuples like
>>>>>
>>>>> *(title,QA)*
>>>>>
>>>>> *(title,QA)*
>>>>>
>>>>> *2013-08-22 12:42:20,673 [main] INFO
>>>>>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
>>>>> input paths to process : 1*
>>>>>
>>>>> *A: {title: chararray}*
>>>>>
>>>>>
>>>>>
>>>>> but i can get value QA
>>>>>
>>>>> Sustring only works with title
>>>>>
>>>>>
>>>>>
>>>>> example:
>>>>>
>>>>> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
>>>>> *
>>>>> *
>>>>> *dump B;*
>>>>> *describe B;*
>>>>> *
>>>>> *
>>>>> *
>>>>> *
>>>>>
>>>>> *results:*
>>>>> *
>>>>> *
>>>>>
>>>>> *(tle)*
>>>>> *(tle)*
>>>>> *B: {chararray}*
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> i try, this like ERIC LEE inthe other mail  and have the same results:
>>>>>
>>>>>
>>>>>  Anyways, what I really what is the column value, not the name. Is
>>>>> there a way to do that? I listed all of the failed attempts I made below.
>>>>>
>>>>>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>>>>>    bounds.
>>>>>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0;
>>>>>    but all I got back were empty tuples
>>>>>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling
>>>>>    me data byte array can't be casted to tuple
>>>>>
>>>>>
>>>>> Please, I will appreciate any help
>>>>>
>>>>>
>>>>> Regards
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>>
>>>> Miguel Angel Martín Junquera
>>>> Analyst Engineer.
>>>> miguelangel.martin@brainsins.com
>>>> Tel. / Fax: (+34) 91 485 56 66
>>>> *http://www.brainsins.com*
>>>> Smart eCommerce
>>>> *Madrid*: http://goo.gl/4B5kv
>>>>  *London*: http://goo.gl/uIXdv
>>>>  *Barcelona*: http://goo.gl/NZslW
>>>>
>>>> Antes de imprimir este e-mail, piense si es necesario.
>>>> La legislación española ampara el secreto de las comunicaciones. Este
>>>> correo electrónico es estrictamente confidencial y va dirigido
>>>> exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
>>>> ni copie la transmisión y nos lo notifique cuanto antes.
>>>>
>>>
>>>
>>>
>>> --
>>>
>>> Miguel Angel Martín Junquera
>>> Analyst Engineer.
>>> miguelangel.martin@brainsins.com
>>>
>>>
>>
>

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
hi all:

More info :

https://issues.apache.org/jira/browse/CASSANDRA-5941



I tried this (and gen. cassandra 1.2.9)  but do not work for me,

git clone http://git-wip-us.apache.org/repos/asf/cassandra.git
cd cassandra
git checkout cassandra-1.2
patch -p1 < 5867-bug-fix-filter-push-down-1.2-branch.txt
ant



Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com



2013/9/2 Miguel Angel Martin junquera <mi...@gmail.com>

> hi:
>
> I test this in cassandra 1.2.9 new  version and the issue still persists .
>
> :-(
>
>
>
>
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
>
>
>
> 2013/8/30 Miguel Angel Martin junquera <mi...@gmail.com>
>
>> I try this:
>>
>> *rows = LOAD
>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>> CqlStorage();*
>>
>> *dump rows;*
>>
>> *ILLUSTRATE rows;*
>>
>> *describe rows;*
>>
>> *
>> *
>>
>> *values2= FOREACH rows GENERATE  TOTUPLE (id) as
>> (mycolumn:tuple(name,value));*
>>
>> *dump values2;*
>>
>> *describe values2;*
>> *
>> *
>>
>> But I get this results:
>>
>>
>>
>> -------------------------------------------------------------
>> | rows     | id:chararray   | age:int   | title:chararray   |
>> -------------------------------------------------------------
>> |          | (id, 6)        | (age, 30) | (title, QA)       |
>> -------------------------------------------------------------
>>
>> rows: {id: chararray,age: int,title: chararray}
>> 2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt -
>> ERROR 1031: Incompatable field schema: left is
>> "tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is
>> "org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"
>>
>>
>>
>>
>>
>> or
>>
>>
>>
>> ....
>>
>> *values2= FOREACH rows GENERATE  TOTUPLE (id) ;*
>> *dump values2;*
>> *describe values2;*
>>
>>
>>
>>
>> and  the results are:
>>
>>
>> ...
>> (((id,6)))
>> (((id,5)))
>> values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}
>>
>>
>>
>> Aggg!!!!!
>>
>>
>> *
>> *
>>
>>
>>
>>
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>>
>>
>>
>> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>>
>>> hi:
>>>
>>> I can not understand why the schema is  define like *"id:chararray,age:int,title:chararray"
>>>  and it does not define like tuples or bag tuples,  if we have pair
>>> key-values  columns*
>>> *
>>> *
>>> *
>>> *
>>> *I try other time to change schema  but it does not work.*
>>> *
>>> *
>>> *any ideas ...*
>>> *
>>> *
>>> *perhaps, is the issue in the definition cql3 tables ?*
>>> *
>>> *
>>> *regards*
>>>
>>>
>>> 2013/8/28 Miguel Angel Martin junquera <mianmarjun.mailinglist@gmail.com
>>> >
>>>
>>>> hi all:
>>>>
>>>>
>>>> Regards
>>>>
>>>> Still i can resolve this issue. .....
>>>>
>>>> does anybody have this issue or try to test this simple example?
>>>>
>>>>
>>>> i am stumped I can not find a solution working.
>>>>
>>>> I appreciate any comment or help
>>>>
>>>>
>>>> 2013/8/22 Miguel Angel Martin junquera <
>>>> mianmarjun.mailinglist@gmail.com>
>>>>
>>>>> hi all:
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>>>>>
>>>>>
>>>>> I am using this sample data test:
>>>>>
>>>>>
>>>>> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>>>>>
>>>>> And I load and dump data Righ with this script:
>>>>>
>>>>> *rows = LOAD
>>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>>> CqlStorage();*
>>>>> *
>>>>> *
>>>>> *dump rows;*
>>>>> *describe rows;*
>>>>> *
>>>>> *
>>>>>
>>>>> *resutls:
>>>>>
>>>>> ((id,6),(age,30),(title,QA))
>>>>>
>>>>> ((id,5),(age,30),(title,QA))
>>>>>
>>>>> rows: {id: chararray,age: int,title: chararray}
>>>>>
>>>>>
>>>>> *
>>>>>
>>>>>
>>>>> But i can not  get  the column values
>>>>>
>>>>> I try to define   another schemas in Load like I used with
>>>>> cassandraStorage()
>>>>>
>>>>>
>>>>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>>>>>
>>>>>
>>>>> example:
>>>>>
>>>>> *rows = LOAD
>>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>>> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>>>>>
>>>>>
>>>>> and I get this error:
>>>>>
>>>>> *2013-08-22 12:24:45,426 [main] ERROR
>>>>> org.apache.pig.tools.grunt.Grunt - ERROR 1031: Incompatable schema: left is
>>>>> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
>>>>> "id:chararray,age:int,title:chararray"*
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
>>>>> result:
>>>>>
>>>>> Example:
>>>>>
>>>>>
>>>>>    - when I flatten , I get a set of tuples like
>>>>>
>>>>> *(title,QA)*
>>>>>
>>>>> *(title,QA)*
>>>>>
>>>>> *2013-08-22 12:42:20,673 [main] INFO
>>>>>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
>>>>> input paths to process : 1*
>>>>>
>>>>> *A: {title: chararray}*
>>>>>
>>>>>
>>>>>
>>>>> but i can get value QA
>>>>>
>>>>> Sustring only works with title
>>>>>
>>>>>
>>>>>
>>>>> example:
>>>>>
>>>>> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
>>>>> *
>>>>> *
>>>>> *dump B;*
>>>>> *describe B;*
>>>>> *
>>>>> *
>>>>> *
>>>>> *
>>>>>
>>>>> *results:*
>>>>> *
>>>>> *
>>>>>
>>>>> *(tle)*
>>>>> *(tle)*
>>>>> *B: {chararray}*
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> i try, this like ERIC LEE inthe other mail  and have the same results:
>>>>>
>>>>>
>>>>>  Anyways, what I really what is the column value, not the name. Is
>>>>> there a way to do that? I listed all of the failed attempts I made below.
>>>>>
>>>>>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>>>>>    bounds.
>>>>>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0;
>>>>>    but all I got back were empty tuples
>>>>>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling
>>>>>    me data byte array can't be casted to tuple
>>>>>
>>>>>
>>>>> Please, I will appreciate any help
>>>>>
>>>>>
>>>>> Regards
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>>
>>>> Miguel Angel Martín Junquera
>>>> Analyst Engineer.
>>>> miguelangel.martin@brainsins.com
>>>> Tel. / Fax: (+34) 91 485 56 66
>>>> *http://www.brainsins.com*
>>>> Smart eCommerce
>>>> *Madrid*: http://goo.gl/4B5kv
>>>>  *London*: http://goo.gl/uIXdv
>>>>  *Barcelona*: http://goo.gl/NZslW
>>>>
>>>> Antes de imprimir este e-mail, piense si es necesario.
>>>> La legislación española ampara el secreto de las comunicaciones. Este
>>>> correo electrónico es estrictamente confidencial y va dirigido
>>>> exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
>>>> ni copie la transmisión y nos lo notifique cuanto antes.
>>>>
>>>
>>>
>>>
>>> --
>>>
>>> Miguel Angel Martín Junquera
>>> Analyst Engineer.
>>> miguelangel.martin@brainsins.com
>>>
>>>
>>
>

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
hi:

I test this in cassandra 1.2.9 new  version and the issue still persists .

:-(




Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com



2013/8/30 Miguel Angel Martin junquera <mi...@gmail.com>

> I try this:
>
> *rows = LOAD
> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
> CqlStorage();*
>
> *dump rows;*
>
> *ILLUSTRATE rows;*
>
> *describe rows;*
>
> *
> *
>
> *values2= FOREACH rows GENERATE  TOTUPLE (id) as
> (mycolumn:tuple(name,value));*
>
> *dump values2;*
>
> *describe values2;*
> *
> *
>
> But I get this results:
>
>
>
> -------------------------------------------------------------
> | rows     | id:chararray   | age:int   | title:chararray   |
> -------------------------------------------------------------
> |          | (id, 6)        | (age, 30) | (title, QA)       |
> -------------------------------------------------------------
>
> rows: {id: chararray,age: int,title: chararray}
> 2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt -
> ERROR 1031: Incompatable field schema: left is
> "tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is
> "org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"
>
>
>
>
>
> or
>
>
>
> ....
>
> *values2= FOREACH rows GENERATE  TOTUPLE (id) ;*
> *dump values2;*
> *describe values2;*
>
>
>
>
> and  the results are:
>
>
> ...
> (((id,6)))
> (((id,5)))
> values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}
>
>
>
> Aggg!!!!!
>
>
> *
> *
>
>
>
>
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
>
>
>
> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>
>> hi:
>>
>> I can not understand why the schema is  define like *"id:chararray,age:int,title:chararray"
>>  and it does not define like tuples or bag tuples,  if we have pair
>> key-values  columns*
>> *
>> *
>> *
>> *
>> *I try other time to change schema  but it does not work.*
>> *
>> *
>> *any ideas ...*
>> *
>> *
>> *perhaps, is the issue in the definition cql3 tables ?*
>> *
>> *
>> *regards*
>>
>>
>> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>>
>>> hi all:
>>>
>>>
>>> Regards
>>>
>>> Still i can resolve this issue. .....
>>>
>>> does anybody have this issue or try to test this simple example?
>>>
>>>
>>> i am stumped I can not find a solution working.
>>>
>>> I appreciate any comment or help
>>>
>>>
>>> 2013/8/22 Miguel Angel Martin junquera <mianmarjun.mailinglist@gmail.com
>>> >
>>>
>>>> hi all:
>>>>
>>>>
>>>>
>>>>
>>>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>>>>
>>>>
>>>> I am using this sample data test:
>>>>
>>>>
>>>> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>>>>
>>>> And I load and dump data Righ with this script:
>>>>
>>>> *rows = LOAD
>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>> CqlStorage();*
>>>> *
>>>> *
>>>> *dump rows;*
>>>> *describe rows;*
>>>> *
>>>> *
>>>>
>>>> *resutls:
>>>>
>>>> ((id,6),(age,30),(title,QA))
>>>>
>>>> ((id,5),(age,30),(title,QA))
>>>>
>>>> rows: {id: chararray,age: int,title: chararray}
>>>>
>>>>
>>>> *
>>>>
>>>>
>>>> But i can not  get  the column values
>>>>
>>>> I try to define   another schemas in Load like I used with
>>>> cassandraStorage()
>>>>
>>>>
>>>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>>>>
>>>>
>>>> example:
>>>>
>>>> *rows = LOAD
>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>>>>
>>>>
>>>> and I get this error:
>>>>
>>>> *2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt
>>>> - ERROR 1031: Incompatable schema: left is
>>>> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
>>>> "id:chararray,age:int,title:chararray"*
>>>>
>>>>
>>>>
>>>>
>>>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
>>>> result:
>>>>
>>>> Example:
>>>>
>>>>
>>>>    - when I flatten , I get a set of tuples like
>>>>
>>>> *(title,QA)*
>>>>
>>>> *(title,QA)*
>>>>
>>>> *2013-08-22 12:42:20,673 [main] INFO
>>>>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
>>>> input paths to process : 1*
>>>>
>>>> *A: {title: chararray}*
>>>>
>>>>
>>>>
>>>> but i can get value QA
>>>>
>>>> Sustring only works with title
>>>>
>>>>
>>>>
>>>> example:
>>>>
>>>> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
>>>> *
>>>> *
>>>> *dump B;*
>>>> *describe B;*
>>>> *
>>>> *
>>>> *
>>>> *
>>>>
>>>> *results:*
>>>> *
>>>> *
>>>>
>>>> *(tle)*
>>>> *(tle)*
>>>> *B: {chararray}*
>>>>
>>>>
>>>>
>>>>
>>>> i try, this like ERIC LEE inthe other mail  and have the same results:
>>>>
>>>>
>>>>  Anyways, what I really what is the column value, not the name. Is
>>>> there a way to do that? I listed all of the failed attempts I made below.
>>>>
>>>>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>>>>    bounds.
>>>>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0;
>>>>    but all I got back were empty tuples
>>>>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling
>>>>    me data byte array can't be casted to tuple
>>>>
>>>>
>>>> Please, I will appreciate any help
>>>>
>>>>
>>>> Regards
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>> --
>>>
>>> Miguel Angel Martín Junquera
>>> Analyst Engineer.
>>> miguelangel.martin@brainsins.com
>>> Tel. / Fax: (+34) 91 485 56 66
>>> *http://www.brainsins.com*
>>> Smart eCommerce
>>> *Madrid*: http://goo.gl/4B5kv
>>>  *London*: http://goo.gl/uIXdv
>>>  *Barcelona*: http://goo.gl/NZslW
>>>
>>> Antes de imprimir este e-mail, piense si es necesario.
>>> La legislación española ampara el secreto de las comunicaciones. Este
>>> correo electrónico es estrictamente confidencial y va dirigido
>>> exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
>>> ni copie la transmisión y nos lo notifique cuanto antes.
>>>
>>
>>
>>
>> --
>>
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>>
>>
>

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
hi:

I test this in cassandra 1.2.9 new  version and the issue still persists .

:-(




Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com



2013/8/30 Miguel Angel Martin junquera <mi...@gmail.com>

> I try this:
>
> *rows = LOAD
> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
> CqlStorage();*
>
> *dump rows;*
>
> *ILLUSTRATE rows;*
>
> *describe rows;*
>
> *
> *
>
> *values2= FOREACH rows GENERATE  TOTUPLE (id) as
> (mycolumn:tuple(name,value));*
>
> *dump values2;*
>
> *describe values2;*
> *
> *
>
> But I get this results:
>
>
>
> -------------------------------------------------------------
> | rows     | id:chararray   | age:int   | title:chararray   |
> -------------------------------------------------------------
> |          | (id, 6)        | (age, 30) | (title, QA)       |
> -------------------------------------------------------------
>
> rows: {id: chararray,age: int,title: chararray}
> 2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt -
> ERROR 1031: Incompatable field schema: left is
> "tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is
> "org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"
>
>
>
>
>
> or
>
>
>
> ....
>
> *values2= FOREACH rows GENERATE  TOTUPLE (id) ;*
> *dump values2;*
> *describe values2;*
>
>
>
>
> and  the results are:
>
>
> ...
> (((id,6)))
> (((id,5)))
> values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}
>
>
>
> Aggg!!!!!
>
>
> *
> *
>
>
>
>
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
>
>
>
> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>
>> hi:
>>
>> I can not understand why the schema is  define like *"id:chararray,age:int,title:chararray"
>>  and it does not define like tuples or bag tuples,  if we have pair
>> key-values  columns*
>> *
>> *
>> *
>> *
>> *I try other time to change schema  but it does not work.*
>> *
>> *
>> *any ideas ...*
>> *
>> *
>> *perhaps, is the issue in the definition cql3 tables ?*
>> *
>> *
>> *regards*
>>
>>
>> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>>
>>> hi all:
>>>
>>>
>>> Regards
>>>
>>> Still i can resolve this issue. .....
>>>
>>> does anybody have this issue or try to test this simple example?
>>>
>>>
>>> i am stumped I can not find a solution working.
>>>
>>> I appreciate any comment or help
>>>
>>>
>>> 2013/8/22 Miguel Angel Martin junquera <mianmarjun.mailinglist@gmail.com
>>> >
>>>
>>>> hi all:
>>>>
>>>>
>>>>
>>>>
>>>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>>>>
>>>>
>>>> I am using this sample data test:
>>>>
>>>>
>>>> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>>>>
>>>> And I load and dump data Righ with this script:
>>>>
>>>> *rows = LOAD
>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>> CqlStorage();*
>>>> *
>>>> *
>>>> *dump rows;*
>>>> *describe rows;*
>>>> *
>>>> *
>>>>
>>>> *resutls:
>>>>
>>>> ((id,6),(age,30),(title,QA))
>>>>
>>>> ((id,5),(age,30),(title,QA))
>>>>
>>>> rows: {id: chararray,age: int,title: chararray}
>>>>
>>>>
>>>> *
>>>>
>>>>
>>>> But i can not  get  the column values
>>>>
>>>> I try to define   another schemas in Load like I used with
>>>> cassandraStorage()
>>>>
>>>>
>>>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>>>>
>>>>
>>>> example:
>>>>
>>>> *rows = LOAD
>>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>>> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>>>>
>>>>
>>>> and I get this error:
>>>>
>>>> *2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt
>>>> - ERROR 1031: Incompatable schema: left is
>>>> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
>>>> "id:chararray,age:int,title:chararray"*
>>>>
>>>>
>>>>
>>>>
>>>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
>>>> result:
>>>>
>>>> Example:
>>>>
>>>>
>>>>    - when I flatten , I get a set of tuples like
>>>>
>>>> *(title,QA)*
>>>>
>>>> *(title,QA)*
>>>>
>>>> *2013-08-22 12:42:20,673 [main] INFO
>>>>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
>>>> input paths to process : 1*
>>>>
>>>> *A: {title: chararray}*
>>>>
>>>>
>>>>
>>>> but i can get value QA
>>>>
>>>> Sustring only works with title
>>>>
>>>>
>>>>
>>>> example:
>>>>
>>>> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
>>>> *
>>>> *
>>>> *dump B;*
>>>> *describe B;*
>>>> *
>>>> *
>>>> *
>>>> *
>>>>
>>>> *results:*
>>>> *
>>>> *
>>>>
>>>> *(tle)*
>>>> *(tle)*
>>>> *B: {chararray}*
>>>>
>>>>
>>>>
>>>>
>>>> i try, this like ERIC LEE inthe other mail  and have the same results:
>>>>
>>>>
>>>>  Anyways, what I really what is the column value, not the name. Is
>>>> there a way to do that? I listed all of the failed attempts I made below.
>>>>
>>>>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>>>>    bounds.
>>>>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0;
>>>>    but all I got back were empty tuples
>>>>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling
>>>>    me data byte array can't be casted to tuple
>>>>
>>>>
>>>> Please, I will appreciate any help
>>>>
>>>>
>>>> Regards
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>> --
>>>
>>> Miguel Angel Martín Junquera
>>> Analyst Engineer.
>>> miguelangel.martin@brainsins.com
>>> Tel. / Fax: (+34) 91 485 56 66
>>> *http://www.brainsins.com*
>>> Smart eCommerce
>>> *Madrid*: http://goo.gl/4B5kv
>>>  *London*: http://goo.gl/uIXdv
>>>  *Barcelona*: http://goo.gl/NZslW
>>>
>>> Antes de imprimir este e-mail, piense si es necesario.
>>> La legislación española ampara el secreto de las comunicaciones. Este
>>> correo electrónico es estrictamente confidencial y va dirigido
>>> exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
>>> ni copie la transmisión y nos lo notifique cuanto antes.
>>>
>>
>>
>>
>> --
>>
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>>
>>
>

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
I try this:

*rows = LOAD
'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
CqlStorage();*

*dump rows;*

*ILLUSTRATE rows;*

*describe rows;*

*
*

*values2= FOREACH rows GENERATE  TOTUPLE (id) as
(mycolumn:tuple(name,value));*

*dump values2;*

*describe values2;*
*
*

But I get this results:



-------------------------------------------------------------
| rows     | id:chararray   | age:int   | title:chararray   |
-------------------------------------------------------------
|          | (id, 6)        | (age, 30) | (title, QA)       |
-------------------------------------------------------------

rows: {id: chararray,age: int,title: chararray}
2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt -
ERROR 1031: Incompatable field schema: left is
"tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is
"org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"





or



....

*values2= FOREACH rows GENERATE  TOTUPLE (id) ;*
*dump values2;*
*describe values2;*




and  the results are:


...
(((id,6)))
(((id,5)))
values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}



Aggg!!!!!


*
*




Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com



2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>

> hi:
>
> I can not understand why the schema is  define like *"id:chararray,age:int,title:chararray"
>  and it does not define like tuples or bag tuples,  if we have pair
> key-values  columns*
> *
> *
> *
> *
> *I try other time to change schema  but it does not work.*
> *
> *
> *any ideas ...*
> *
> *
> *perhaps, is the issue in the definition cql3 tables ?*
> *
> *
> *regards*
>
>
> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>
>> hi all:
>>
>>
>> Regards
>>
>> Still i can resolve this issue. .....
>>
>> does anybody have this issue or try to test this simple example?
>>
>>
>> i am stumped I can not find a solution working.
>>
>> I appreciate any comment or help
>>
>>
>> 2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>
>>
>>> hi all:
>>>
>>>
>>>
>>>
>>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>>>
>>>
>>> I am using this sample data test:
>>>
>>>
>>> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>>>
>>> And I load and dump data Righ with this script:
>>>
>>> *rows = LOAD
>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>> CqlStorage();*
>>> *
>>> *
>>> *dump rows;*
>>> *describe rows;*
>>> *
>>> *
>>>
>>> *resutls:
>>>
>>> ((id,6),(age,30),(title,QA))
>>>
>>> ((id,5),(age,30),(title,QA))
>>>
>>> rows: {id: chararray,age: int,title: chararray}
>>>
>>>
>>> *
>>>
>>>
>>> But i can not  get  the column values
>>>
>>> I try to define   another schemas in Load like I used with
>>> cassandraStorage()
>>>
>>>
>>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>>>
>>>
>>> example:
>>>
>>> *rows = LOAD
>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>>>
>>>
>>> and I get this error:
>>>
>>> *2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt
>>> - ERROR 1031: Incompatable schema: left is
>>> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
>>> "id:chararray,age:int,title:chararray"*
>>>
>>>
>>>
>>>
>>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
>>> result:
>>>
>>> Example:
>>>
>>>
>>>    - when I flatten , I get a set of tuples like
>>>
>>> *(title,QA)*
>>>
>>> *(title,QA)*
>>>
>>> *2013-08-22 12:42:20,673 [main] INFO
>>>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
>>> input paths to process : 1*
>>>
>>> *A: {title: chararray}*
>>>
>>>
>>>
>>> but i can get value QA
>>>
>>> Sustring only works with title
>>>
>>>
>>>
>>> example:
>>>
>>> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
>>> *
>>> *
>>> *dump B;*
>>> *describe B;*
>>> *
>>> *
>>> *
>>> *
>>>
>>> *results:*
>>> *
>>> *
>>>
>>> *(tle)*
>>> *(tle)*
>>> *B: {chararray}*
>>>
>>>
>>>
>>>
>>> i try, this like ERIC LEE inthe other mail  and have the same results:
>>>
>>>
>>>  Anyways, what I really what is the column value, not the name. Is there
>>> a way to do that? I listed all of the failed attempts I made below.
>>>
>>>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>>>    bounds.
>>>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0;
>>>    but all I got back were empty tuples
>>>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling
>>>    me data byte array can't be casted to tuple
>>>
>>>
>>> Please, I will appreciate any help
>>>
>>>
>>> Regards
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>
>>
>> --
>>
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> Tel. / Fax: (+34) 91 485 56 66
>> *http://www.brainsins.com*
>> Smart eCommerce
>> *Madrid*: http://goo.gl/4B5kv
>>  *London*: http://goo.gl/uIXdv
>>  *Barcelona*: http://goo.gl/NZslW
>>
>> Antes de imprimir este e-mail, piense si es necesario.
>> La legislación española ampara el secreto de las comunicaciones. Este
>> correo electrónico es estrictamente confidencial y va dirigido
>> exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
>> ni copie la transmisión y nos lo notifique cuanto antes.
>>
>
>
>
> --
>
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
>
>

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
I try this:

*rows = LOAD
'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
CqlStorage();*

*dump rows;*

*ILLUSTRATE rows;*

*describe rows;*

*
*

*values2= FOREACH rows GENERATE  TOTUPLE (id) as
(mycolumn:tuple(name,value));*

*dump values2;*

*describe values2;*
*
*

But I get this results:



-------------------------------------------------------------
| rows     | id:chararray   | age:int   | title:chararray   |
-------------------------------------------------------------
|          | (id, 6)        | (age, 30) | (title, QA)       |
-------------------------------------------------------------

rows: {id: chararray,age: int,title: chararray}
2013-08-30 09:54:37,831 [main] ERROR org.apache.pig.tools.grunt.Grunt -
ERROR 1031: Incompatable field schema: left is
"tuple_0:tuple(mycolumn:tuple(name:bytearray,value:bytearray))", right is
"org.apache.pig.builtin.totuple_id_1:tuple(id:chararray)"





or



....

*values2= FOREACH rows GENERATE  TOTUPLE (id) ;*
*dump values2;*
*describe values2;*




and  the results are:


...
(((id,6)))
(((id,5)))
values2: {org.apache.pig.builtin.totuple_id_8: (id: chararray)}



Aggg!!!!!


*
*




Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com



2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>

> hi:
>
> I can not understand why the schema is  define like *"id:chararray,age:int,title:chararray"
>  and it does not define like tuples or bag tuples,  if we have pair
> key-values  columns*
> *
> *
> *
> *
> *I try other time to change schema  but it does not work.*
> *
> *
> *any ideas ...*
> *
> *
> *perhaps, is the issue in the definition cql3 tables ?*
> *
> *
> *regards*
>
>
> 2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>
>
>> hi all:
>>
>>
>> Regards
>>
>> Still i can resolve this issue. .....
>>
>> does anybody have this issue or try to test this simple example?
>>
>>
>> i am stumped I can not find a solution working.
>>
>> I appreciate any comment or help
>>
>>
>> 2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>
>>
>>> hi all:
>>>
>>>
>>>
>>>
>>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>>>
>>>
>>> I am using this sample data test:
>>>
>>>
>>> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>>>
>>> And I load and dump data Righ with this script:
>>>
>>> *rows = LOAD
>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>> CqlStorage();*
>>> *
>>> *
>>> *dump rows;*
>>> *describe rows;*
>>> *
>>> *
>>>
>>> *resutls:
>>>
>>> ((id,6),(age,30),(title,QA))
>>>
>>> ((id,5),(age,30),(title,QA))
>>>
>>> rows: {id: chararray,age: int,title: chararray}
>>>
>>>
>>> *
>>>
>>>
>>> But i can not  get  the column values
>>>
>>> I try to define   another schemas in Load like I used with
>>> cassandraStorage()
>>>
>>>
>>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>>>
>>>
>>> example:
>>>
>>> *rows = LOAD
>>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>>> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>>>
>>>
>>> and I get this error:
>>>
>>> *2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt
>>> - ERROR 1031: Incompatable schema: left is
>>> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
>>> "id:chararray,age:int,title:chararray"*
>>>
>>>
>>>
>>>
>>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
>>> result:
>>>
>>> Example:
>>>
>>>
>>>    - when I flatten , I get a set of tuples like
>>>
>>> *(title,QA)*
>>>
>>> *(title,QA)*
>>>
>>> *2013-08-22 12:42:20,673 [main] INFO
>>>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
>>> input paths to process : 1*
>>>
>>> *A: {title: chararray}*
>>>
>>>
>>>
>>> but i can get value QA
>>>
>>> Sustring only works with title
>>>
>>>
>>>
>>> example:
>>>
>>> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
>>> *
>>> *
>>> *dump B;*
>>> *describe B;*
>>> *
>>> *
>>> *
>>> *
>>>
>>> *results:*
>>> *
>>> *
>>>
>>> *(tle)*
>>> *(tle)*
>>> *B: {chararray}*
>>>
>>>
>>>
>>>
>>> i try, this like ERIC LEE inthe other mail  and have the same results:
>>>
>>>
>>>  Anyways, what I really what is the column value, not the name. Is there
>>> a way to do that? I listed all of the failed attempts I made below.
>>>
>>>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>>>    bounds.
>>>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0;
>>>    but all I got back were empty tuples
>>>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling
>>>    me data byte array can't be casted to tuple
>>>
>>>
>>> Please, I will appreciate any help
>>>
>>>
>>> Regards
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>
>>
>> --
>>
>> Miguel Angel Martín Junquera
>> Analyst Engineer.
>> miguelangel.martin@brainsins.com
>> Tel. / Fax: (+34) 91 485 56 66
>> *http://www.brainsins.com*
>> Smart eCommerce
>> *Madrid*: http://goo.gl/4B5kv
>>  *London*: http://goo.gl/uIXdv
>>  *Barcelona*: http://goo.gl/NZslW
>>
>> Antes de imprimir este e-mail, piense si es necesario.
>> La legislación española ampara el secreto de las comunicaciones. Este
>> correo electrónico es estrictamente confidencial y va dirigido
>> exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
>> ni copie la transmisión y nos lo notifique cuanto antes.
>>
>
>
>
> --
>
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
>
>

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
hi:

I can not understand why the schema is  define like
*"id:chararray,age:int,title:chararray"
 and it does not define like tuples or bag tuples,  if we have pair
key-values  columns*
*
*
*
*
*I try other time to change schema  but it does not work.*
*
*
*any ideas ...*
*
*
*perhaps, is the issue in the definition cql3 tables ?*
*
*
*regards*


2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>

> hi all:
>
>
> Regards
>
> Still i can resolve this issue. .....
>
> does anybody have this issue or try to test this simple example?
>
>
> i am stumped I can not find a solution working.
>
> I appreciate any comment or help
>
>
> 2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>
>
>> hi all:
>>
>>
>>
>>
>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>>
>>
>> I am using this sample data test:
>>
>>
>> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>>
>> And I load and dump data Righ with this script:
>>
>> *rows = LOAD
>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>> CqlStorage();*
>> *
>> *
>> *dump rows;*
>> *describe rows;*
>> *
>> *
>>
>> *resutls:
>>
>> ((id,6),(age,30),(title,QA))
>>
>> ((id,5),(age,30),(title,QA))
>>
>> rows: {id: chararray,age: int,title: chararray}
>>
>>
>> *
>>
>>
>> But i can not  get  the column values
>>
>> I try to define   another schemas in Load like I used with
>> cassandraStorage()
>>
>>
>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>>
>>
>> example:
>>
>> *rows = LOAD
>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>>
>>
>> and I get this error:
>>
>> *2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt -
>> ERROR 1031: Incompatable schema: left is
>> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
>> "id:chararray,age:int,title:chararray"*
>>
>>
>>
>>
>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
>> result:
>>
>> Example:
>>
>>
>>    - when I flatten , I get a set of tuples like
>>
>> *(title,QA)*
>>
>> *(title,QA)*
>>
>> *2013-08-22 12:42:20,673 [main] INFO
>>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
>> input paths to process : 1*
>>
>> *A: {title: chararray}*
>>
>>
>>
>> but i can get value QA
>>
>> Sustring only works with title
>>
>>
>>
>> example:
>>
>> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
>> *
>> *
>> *dump B;*
>> *describe B;*
>> *
>> *
>> *
>> *
>>
>> *results:*
>> *
>> *
>>
>> *(tle)*
>> *(tle)*
>> *B: {chararray}*
>>
>>
>>
>>
>> i try, this like ERIC LEE inthe other mail  and have the same results:
>>
>>
>>  Anyways, what I really what is the column value, not the name. Is there
>> a way to do that? I listed all of the failed attempts I made below.
>>
>>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>>    bounds.
>>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but
>>    all I got back were empty tuples
>>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling me
>>    data byte array can't be casted to tuple
>>
>>
>> Please, I will appreciate any help
>>
>>
>> Regards
>>
>>
>>
>>
>>
>>
>>
>
>
> --
>
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> Tel. / Fax: (+34) 91 485 56 66
> *http://www.brainsins.com*
> Smart eCommerce
> *Madrid*: http://goo.gl/4B5kv
>  *London*: http://goo.gl/uIXdv
>  *Barcelona*: http://goo.gl/NZslW
>
> Antes de imprimir este e-mail, piense si es necesario.
> La legislación española ampara el secreto de las comunicaciones. Este
> correo electrónico es estrictamente confidencial y va dirigido
> exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
> ni copie la transmisión y nos lo notifique cuanto antes.
>



-- 

Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
hi:

I can not understand why the schema is  define like
*"id:chararray,age:int,title:chararray"
 and it does not define like tuples or bag tuples,  if we have pair
key-values  columns*
*
*
*
*
*I try other time to change schema  but it does not work.*
*
*
*any ideas ...*
*
*
*perhaps, is the issue in the definition cql3 tables ?*
*
*
*regards*


2013/8/28 Miguel Angel Martin junquera <mi...@gmail.com>

> hi all:
>
>
> Regards
>
> Still i can resolve this issue. .....
>
> does anybody have this issue or try to test this simple example?
>
>
> i am stumped I can not find a solution working.
>
> I appreciate any comment or help
>
>
> 2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>
>
>> hi all:
>>
>>
>>
>>
>> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>>
>>
>> I am using this sample data test:
>>
>>
>> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>>
>> And I load and dump data Righ with this script:
>>
>> *rows = LOAD
>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>> CqlStorage();*
>> *
>> *
>> *dump rows;*
>> *describe rows;*
>> *
>> *
>>
>> *resutls:
>>
>> ((id,6),(age,30),(title,QA))
>>
>> ((id,5),(age,30),(title,QA))
>>
>> rows: {id: chararray,age: int,title: chararray}
>>
>>
>> *
>>
>>
>> But i can not  get  the column values
>>
>> I try to define   another schemas in Load like I used with
>> cassandraStorage()
>>
>>
>> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>>
>>
>> example:
>>
>> *rows = LOAD
>> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
>> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>>
>>
>> and I get this error:
>>
>> *2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt -
>> ERROR 1031: Incompatable schema: left is
>> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
>> "id:chararray,age:int,title:chararray"*
>>
>>
>>
>>
>> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
>> result:
>>
>> Example:
>>
>>
>>    - when I flatten , I get a set of tuples like
>>
>> *(title,QA)*
>>
>> *(title,QA)*
>>
>> *2013-08-22 12:42:20,673 [main] INFO
>>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
>> input paths to process : 1*
>>
>> *A: {title: chararray}*
>>
>>
>>
>> but i can get value QA
>>
>> Sustring only works with title
>>
>>
>>
>> example:
>>
>> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
>> *
>> *
>> *dump B;*
>> *describe B;*
>> *
>> *
>> *
>> *
>>
>> *results:*
>> *
>> *
>>
>> *(tle)*
>> *(tle)*
>> *B: {chararray}*
>>
>>
>>
>>
>> i try, this like ERIC LEE inthe other mail  and have the same results:
>>
>>
>>  Anyways, what I really what is the column value, not the name. Is there
>> a way to do that? I listed all of the failed attempts I made below.
>>
>>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>>    bounds.
>>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but
>>    all I got back were empty tuples
>>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling me
>>    data byte array can't be casted to tuple
>>
>>
>> Please, I will appreciate any help
>>
>>
>> Regards
>>
>>
>>
>>
>>
>>
>>
>
>
> --
>
> Miguel Angel Martín Junquera
> Analyst Engineer.
> miguelangel.martin@brainsins.com
> Tel. / Fax: (+34) 91 485 56 66
> *http://www.brainsins.com*
> Smart eCommerce
> *Madrid*: http://goo.gl/4B5kv
>  *London*: http://goo.gl/uIXdv
>  *Barcelona*: http://goo.gl/NZslW
>
> Antes de imprimir este e-mail, piense si es necesario.
> La legislación española ampara el secreto de las comunicaciones. Este
> correo electrónico es estrictamente confidencial y va dirigido
> exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
> ni copie la transmisión y nos lo notifique cuanto antes.
>



-- 

Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
hi all:


Regards

Still i can resolve this issue. .....

does anybody have this issue or try to test this simple example?


i am stumped I can not find a solution working.

I appreciate any comment or help


2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>

> hi all:
>
>
>
>
> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>
>
> I am using this sample data test:
>
>
> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>
> And I load and dump data Righ with this script:
>
> *rows = LOAD
> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
> CqlStorage();*
> *
> *
> *dump rows;*
> *describe rows;*
> *
> *
>
> *resutls:
>
> ((id,6),(age,30),(title,QA))
>
> ((id,5),(age,30),(title,QA))
>
> rows: {id: chararray,age: int,title: chararray}
>
>
> *
>
>
> But i can not  get  the column values
>
> I try to define   another schemas in Load like I used with
> cassandraStorage()
>
>
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>
>
> example:
>
> *rows = LOAD
> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>
>
> and I get this error:
>
> *2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt -
> ERROR 1031: Incompatable schema: left is
> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
> "id:chararray,age:int,title:chararray"*
>
>
>
>
> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
> result:
>
> Example:
>
>
>    - when I flatten , I get a set of tuples like
>
> *(title,QA)*
>
> *(title,QA)*
>
> *2013-08-22 12:42:20,673 [main] INFO
>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
> input paths to process : 1*
>
> *A: {title: chararray}*
>
>
>
> but i can get value QA
>
> Sustring only works with title
>
>
>
> example:
>
> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
> *
> *
> *dump B;*
> *describe B;*
> *
> *
> *
> *
>
> *results:*
> *
> *
>
> *(tle)*
> *(tle)*
> *B: {chararray}*
>
>
>
>
> i try, this like ERIC LEE inthe other mail  and have the same results:
>
>
>  Anyways, what I really what is the column value, not the name. Is there a
> way to do that? I listed all of the failed attempts I made below.
>
>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>    bounds.
>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but
>    all I got back were empty tuples
>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling me
>    data byte array can't be casted to tuple
>
>
> Please, I will appreciate any help
>
>
> Regards
>
>
>
>
>
>
>


-- 

Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com
Tel. / Fax: (+34) 91 485 56 66
*http://www.brainsins.com*
Smart eCommerce
*Madrid*: http://goo.gl/4B5kv
*London*: http://goo.gl/uIXdv
*Barcelona*: http://goo.gl/NZslW

Antes de imprimir este e-mail, piense si es necesario.
La legislación española ampara el secreto de las comunicaciones. Este
correo electrónico es estrictamente confidencial y va dirigido
exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
ni copie la transmisión y nos lo notifique cuanto antes.

Re: how can i get the column value? Need help!.. cassandra 1.28 and pig 0.11.1

Posted by Miguel Angel Martin junquera <mi...@gmail.com>.
hi all:


Regards

Still i can resolve this issue. .....

does anybody have this issue or try to test this simple example?


i am stumped I can not find a solution working.

I appreciate any comment or help


2013/8/22 Miguel Angel Martin junquera <mi...@gmail.com>

> hi all:
>
>
>
>
> I,m testing the new CqlStorage() with cassandra 1.28 and pig 0.11.1
>
>
> I am using this sample data test:
>
>
> http://frommyworkshop.blogspot.com.es/2013/07/hadoop-map-reduce-with-cassandra.html
>
> And I load and dump data Righ with this script:
>
> *rows = LOAD
> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
> CqlStorage();*
> *
> *
> *dump rows;*
> *describe rows;*
> *
> *
>
> *resutls:
>
> ((id,6),(age,30),(title,QA))
>
> ((id,5),(age,30),(title,QA))
>
> rows: {id: chararray,age: int,title: chararray}
>
>
> *
>
>
> But i can not  get  the column values
>
> I try to define   another schemas in Load like I used with
> cassandraStorage()
>
>
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-and-Pig-how-to-get-column-values-td5641158.html
>
>
> example:
>
> *rows = LOAD
> 'cql://keyspace1/test?page_size=1&split_size=4&where_clause=age%3D30' USING
> CqlStorage() AS (columns: bag {T: tuple(name, value)});*
>
>
> and I get this error:
>
> *2013-08-22 12:24:45,426 [main] ERROR org.apache.pig.tools.grunt.Grunt -
> ERROR 1031: Incompatable schema: left is
> "columns:bag{T:tuple(name:bytearray,value:bytearray)}", right is
> "id:chararray,age:int,title:chararray"*
>
>
>
>
> I try to use, FLATTEN, SUBSTRING, SPLIT UDF`s but i have not get good
> result:
>
> Example:
>
>
>    - when I flatten , I get a set of tuples like
>
> *(title,QA)*
>
> *(title,QA)*
>
> *2013-08-22 12:42:20,673 [main] INFO
>  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total
> input paths to process : 1*
>
> *A: {title: chararray}*
>
>
>
> but i can get value QA
>
> Sustring only works with title
>
>
>
> example:
>
> *B = FOREACH A GENERATE SUBSTRING(title,2,5);*
> *
> *
> *dump B;*
> *describe B;*
> *
> *
> *
> *
>
> *results:*
> *
> *
>
> *(tle)*
> *(tle)*
> *B: {chararray}*
>
>
>
>
> i try, this like ERIC LEE inthe other mail  and have the same results:
>
>
>  Anyways, what I really what is the column value, not the name. Is there a
> way to do that? I listed all of the failed attempts I made below.
>
>    - colnames = FOREACH cols GENERATE $1 and was told $1 was out of
>    bounds.
>    - casted = FOREACH cols GENERATE (tuple(chararray, chararray))$0; but
>    all I got back were empty tuples
>    - values = FOREACH cols GENERATE $0.$1; but I got an error telling me
>    data byte array can't be casted to tuple
>
>
> Please, I will appreciate any help
>
>
> Regards
>
>
>
>
>
>
>


-- 

Miguel Angel Martín Junquera
Analyst Engineer.
miguelangel.martin@brainsins.com
Tel. / Fax: (+34) 91 485 56 66
*http://www.brainsins.com*
Smart eCommerce
*Madrid*: http://goo.gl/4B5kv
*London*: http://goo.gl/uIXdv
*Barcelona*: http://goo.gl/NZslW

Antes de imprimir este e-mail, piense si es necesario.
La legislación española ampara el secreto de las comunicaciones. Este
correo electrónico es estrictamente confidencial y va dirigido
exclusivamente a su destinatario/a. Si no es Ud., le rogamos que no difunda
ni copie la transmisión y nos lo notifique cuanto antes.