Posted to user@pig.apache.org by Robert Yerex <ro...@civitaslearning.com> on 2012/09/18 01:00:12 UTC
Removing unnecessary disambiguation marks
Probably an easy one but...
After processing a file through a series of groupings, aggregations and
projections using flatten, I end up with long concatenated names for each
field, as shown in this snippet from the JsonStorage-generated schema:
{
  "name": "enrollments_instructor_1::enrollments_student_3::enrollments_student_2::enrollments_student_1::enrollments_section::enrollments::term::term_id",
  "type": 55,
  "description": "autogenerated from Pig Field Schema",
  "schema": null
},
How do I get rid of all the concatenated naming?
--
Robert Yerex
Data Scientist
Civitas Learning
www.civitaslearning.com
Re: Removing unnecessary disambiguation marks
Posted by Robert Yerex <ro...@civitaslearning.com>.
Thanks! That worked!
On Tue, Sep 18, 2012 at 6:58 AM, Alan Gates <ga...@hortonworks.com> wrote:
> The added foreach will not generate another MR job.
>
> Alan.
>
> On Sep 18, 2012, at 8:54 AM, Ruslan Al-Fakikh wrote:
>
> > Hey,
> >
> > You can try cleaning in a separate FOREACH. I don't think it'll
> > trigger another MR job, but you'd better check it.
> > Example:
> > resultCleaned = FOREACH result GENERATE
> >     name::group::fileldName AS fileldName;
> >
> > Ruslan
> >
--
Robert Yerex
Data Scientist
Civitas Learning
www.civitaslearning.com
Re: Removing unnecessary disambiguation marks
Posted by Alan Gates <ga...@hortonworks.com>.
The added foreach will not generate another MR job.
Alan.
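One way to confirm this for a particular script is Pig's EXPLAIN operator, which prints the logical, physical and MapReduce plans for an alias. The sketch below is only an illustration: the file name, relation names and field names are hypothetical and not taken from the original script.

```pig
-- Hypothetical input and field names, for illustration only.
raw     = LOAD 'enrollments.tsv' AS (student_id:chararray, term_id:chararray);
grouped = GROUP raw BY term_id;

-- Flattening the grouped bag yields prefixed names: raw::student_id, raw::term_id.
result  = FOREACH grouped GENERATE FLATTEN(raw);

-- The extra FOREACH only renames fields.
resultCleaned = FOREACH result GENERATE
    raw::student_id AS student_id,
    raw::term_id    AS term_id;

-- Prints the plans; compare the MapReduce plan with and without the renaming step.
EXPLAIN resultCleaned;
```

Running EXPLAIN once on result and once on resultCleaned and comparing the number of MapReduce jobs listed should show whether the renaming step adds a job.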
On Sep 18, 2012, at 8:54 AM, Ruslan Al-Fakikh wrote:
> Hey,
>
> You can try cleaning in a separate FOREACH. I don't think it'll
> trigger another MR job, but you'd better check it.
> Example:
> resultCleaned = FOREACH result GENERATE
> name::group::fileldName AS fileldName;
>
> Ruslan
>
Re: Removing unnecessary disambiguation marks
Posted by Ruslan Al-Fakikh <me...@gmail.com>.
Hey,
You can try cleaning in a separate FOREACH. I don't think it'll
trigger another MR job, but you'd better check it.
Example:
resultCleaned = FOREACH result GENERATE
name::group::fileldName AS fileldName;
Ruslan
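A slightly fuller sketch of the same idea, assuming two hypothetical relations (enrollments and terms) with made-up file names and fields: each prefixed field is projected once in a final FOREACH and given a short name with AS.

```pig
-- Hypothetical relations and field names, for illustration only.
enrollments = LOAD 'enrollments.tsv' AS (student_id:chararray, term_id:chararray, credits:int);
terms       = LOAD 'terms.tsv'       AS (term_id:chararray, term_name:chararray);

-- After a JOIN, every field carries its source relation as a prefix:
-- enrollments::student_id, enrollments::term_id, terms::term_id, ...
joined = JOIN enrollments BY term_id, terms BY term_id;

-- A final projection renames each field with AS, dropping the prefixes.
cleaned = FOREACH joined GENERATE
    enrollments::student_id AS student_id,
    enrollments::term_id    AS term_id,
    terms::term_name        AS term_name,
    enrollments::credits    AS credits;

-- Shows the schema of the cleaned relation, which should now use the short names.
DESCRIBE cleaned;
```

Doing the rename once, just before the STORE, keeps the disambiguated names available in the intermediate steps, where they may still be needed to tell fields apart.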
On Tue, Sep 18, 2012 at 3:01 AM, Robert Yerex
<ro...@civitaslearning.com> wrote:
> Probably an easy one but...
>
> After processing a file through a series of groupings, aggregations and
> projections using flatten, I end up with long concatenated names for each
> field, as shown in this snippet from the JsonStorage-generated schema:
>
> {
>   "name": "enrollments_instructor_1::enrollments_student_3::enrollments_student_2::enrollments_student_1::enrollments_section::enrollments::term::term_id",
>   "type": 55,
>   "description": "autogenerated from Pig Field Schema",
>   "schema": null
> },
> How do I get rid of all the concatenated naming?
>
> --
> Robert Yerex
> Data Scientist
> Civitas Learning
> www.civitaslearning.com
Fwd: Removing unnecessary disambiguation marks
Posted by Robert Yerex <ro...@civitaslearning.com>.
Probably an easy one but...
After processing a file through a series of groupings, aggregations and
projections using flatten, I end up with long concatenated names for each
field, as shown in this snippet from the JsonStorage-generated schema:
{
  "name": "enrollments_instructor_1::enrollments_student_3::enrollments_student_2::enrollments_student_1::enrollments_section::enrollments::term::term_id",
  "type": 55,
  "description": "autogenerated from Pig Field Schema",
  "schema": null
},
How do I get rid of all the concatenated naming?
--
Robert Yerex
Data Scientist
Civitas Learning
www.civitaslearning.com