You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hive.apache.org by "kaducangica ." <ka...@gmail.com> on 2017/12/07 11:43:04 UTC

Hive - Json Serde - ORC

Hi all,

i have a very complex json that i need to insert in a hive table. A json
example follws attached.

First of all i read a json file with Spark to make some data processing and
then i write to a stage table with no Serde and with no any kind of
compression and format (Parquet or ORC).

Then i do an INSERT/SELECT into the "jsonTable" (create table attached)
with no problems. This table use a json Serde
(org.openx.data.jsonserde.JsonSerDe)
and a ORC format and is also particioned by date and timezone.

The problem is that after all this process every time a try to make a
simple "select * from jsonTable" query i got this error message:

"Failed with exception java.io.IOException:java.io.IOException: Error
reading file: hdfs://ip-xxx-xxx-xxx-xxx.sa-east-1.compute.int
ernal:8020/user/hive/warehouse/jsonTable/data_posicao_short=2017-12-02/
veitimezone=America-Sao_Paulo/000000_0"

Actually i do not know if it is possible to use Serde, ORC and partition in
the same table.

Someone could help me?

Thanks in advance.
Best regards

Carlos.