Posted to dev@pig.apache.org by Raghavendra Pai <rv...@rediffmail.com.INVALID> on 2020/07/19 16:17:27 UTC

PIG: LOAD not loading the data properly for XLSX file

Hi Sir,

I am stuck and cannot load the attached file Country-Code.xlsx using the following commands:

country_data_load = LOAD '/user/rvpai427385/Pig_Labs/Zomato_Analysis/Country-Code.xlsx' using org.apache.pig.piggybank.storage.CSVExcelStorage(',', 'NO_MULTILINE', 'NOCHANGE', 'SKIP_INPUT_HEADER');
dump country_data_load;

It should load only 12 rows, consisting of 12 country codes and 12 country names, but that is not happening. It loads junk data instead of the 12 country rows we want.

Can you please check the above commands and advise?

Re: PIG: LOAD not loading the data properly for XLSX file

Posted by JB Data31 <jb...@gmail.com>.
You are trying to load an .xlsx file with a CSV loader.
Convert *Country-Code.xlsx* to *Country-Code.csv* and then load it.
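
The "junk data" is what one would expect here: an .xlsx file is a ZIP archive of XML parts, not delimited text, so a CSV loader ends up splitting compressed bytes into fields. As a rough illustration, the sketch below checks whether a file is such a container before it is handed to a CSV loader. It assumes Python 3 is available and that the file has been copied to the local filesystem; the helper name is_xlsx_container is hypothetical and not part of Pig or piggybank.

```python
import sys
import zipfile


def is_xlsx_container(path):
    """Return True if `path` is a ZIP archive holding an xl/workbook.xml part.

    Hypothetical helper for illustration: .xlsx workbooks are ZIP archives,
    whereas a real CSV file is plain text and fails the ZIP check.
    """
    if not zipfile.is_zipfile(path):
        return False
    with zipfile.ZipFile(path) as archive:
        return "xl/workbook.xml" in archive.namelist()


if __name__ == "__main__":
    # Usage: python check_xlsx.py Country-Code.xlsx
    for path in sys.argv[1:]:
        if is_xlsx_container(path):
            print(f"{path}: xlsx (ZIP) container, convert it to CSV first")
        else:
            print(f"{path}: not an xlsx container")
```

Once the workbook has been exported to CSV (for example with "Save As" in a spreadsheet application), the same LOAD statement can point at Country-Code.csv instead.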

@*JB*Δ <http://jbigdata.fr/jbigdata/index.html>



On Mon, Jul 20, 2020 at 13:44, Raghavendra Pai
<rv...@rediffmail.com.invalid> wrote:

> Hi Sir,
> I am stuck and not able to proceed in loading attached file
> country-code.xlsx file using following command:
>
>
> *country_data_load = LOAD
> '/user/rvpai427385/Pig_Labs/Zomato_Analysis/Country-Code.xlsx' using
> org.apache.pig.piggybank.storage.CSVExcelStorage(',', 'NO_MULTILINE',
> 'NOCHANGE', 'SKIP_INPUT_HEADER');dump country_data_load;*
>
> It should load only 12 rows consisting of 12 country codes and 12 country
> names but that is not happening.
> It is loading junk data and not the 12 country data which we want.
> Can you please check below command and advise ?