You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Dumitru Husleag (Jira)" <ji...@apache.org> on 2020/12/16 17:37:00 UTC

[jira] [Updated] (PARQUET-1956) "parquet-tools cat -j" should not output metadata

     [ https://issues.apache.org/jira/browse/PARQUET-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dumitru Husleag updated PARQUET-1956:
-------------------------------------
    Description: 
Parquet-tools cat -help says:

{noformat}
parquet-tools cat:
Prints the content of a Parquet file. The output contains only the data, no
metadata is displayed
{noformat}


Let's say I have a structure like this in a Parquet file:
{code:json}
{
   "account": 
   [
		{
			"accountDepositType": "NA",
			"accountNumber": "NA",
			"accountHolderType": "NA"
		}
	]
}
{code}

it is extracted like this

{code:json}
{
   "account": {
		"list": [
			{
				"element": {
					"accountDepositType": "NA",
					"accountNumber": "NA",
					"accountHolderType": "NA"
				}
			}
		]
	}
}
{code}

*"list"* and *"element"* is metadata or type information if you like and it should not appear in output.

Can you please fix that ?


  was:

Parquet-tools cat -help says:


Let's say I have a structure like this in a Parquet file:
{code:json}
{
   "account": 
   [
		{
			"accountDepositType": "NA",
			"accountNumber": "NA",
			"accountHolderType": "NA"
		}
	]
}
{code}

it is extracted like this

{code:json}
{
   "account": {
		"list": [
			{
				"element": {
					"accountDepositType": "NA",
					"accountNumber": "NA",
					"accountHolderType": "NA"
				}
			}
		]
	}
}
{code}

*"list"* and *"element"* is metadata or type information if you like and it should not appear in output.

Can you please fix that ?



> "parquet-tools cat -j" should not output metadata
> -------------------------------------------------
>
>                 Key: PARQUET-1956
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1956
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-mr
>    Affects Versions: 1.11.1
>            Reporter: Dumitru Husleag
>            Priority: Major
>
> Parquet-tools cat -help says:
> {noformat}
> parquet-tools cat:
> Prints the content of a Parquet file. The output contains only the data, no
> metadata is displayed
> {noformat}
> Let's say I have a structure like this in a Parquet file:
> {code:json}
> {
>    "account": 
>    [
> 		{
> 			"accountDepositType": "NA",
> 			"accountNumber": "NA",
> 			"accountHolderType": "NA"
> 		}
> 	]
> }
> {code}
> it is extracted like this
> {code:json}
> {
>    "account": {
> 		"list": [
> 			{
> 				"element": {
> 					"accountDepositType": "NA",
> 					"accountNumber": "NA",
> 					"accountHolderType": "NA"
> 				}
> 			}
> 		]
> 	}
> }
> {code}
> *"list"* and *"element"* is metadata or type information if you like and it should not appear in output.
> Can you please fix that ?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)