Posted to dev@parquet.apache.org by "Dumitru Husleag (Jira)" <ji...@apache.org> on 2020/12/16 17:37:00 UTC
[jira] [Updated] (PARQUET-1956) "parquet-tools cat -j" should not output metadata
[ https://issues.apache.org/jira/browse/PARQUET-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dumitru Husleag updated PARQUET-1956:
-------------------------------------
Description:
Parquet-tools cat -help says:
{noformat}
parquet-tools cat:
Prints the content of a Parquet file. The output contains only the data, no
metadata is displayed
{noformat}
Let's say I have a structure like this in a Parquet file:
{code:json}
{
  "account": [
    {
      "accountDepositType": "NA",
      "accountNumber": "NA",
      "accountHolderType": "NA"
    }
  ]
}
{code}
With parquet-tools cat -j, it is extracted like this:
{code:json}
{
  "account": {
    "list": [
      {
        "element": {
          "accountDepositType": "NA",
          "accountNumber": "NA",
          "accountHolderType": "NA"
        }
      }
    ]
  }
}
{code}
*"list"* and *"element"* are metadata (or type information, if you like) and should not appear in the output.
Can you please fix that?
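To make the expected result concrete, here is a minimal post-processing sketch in Python. It assumes every list in the output is emitted in the "list"/"element" shape shown above. The helper name unwrap_lists is hypothetical and not part of parquet-tools. It is only a heuristic: a real field that happens to be named "list" and holds "element" objects would be collapsed too.

```python
import json


def unwrap_lists(node):
    """Recursively collapse {"list": [{"element": x}, ...]} into [x, ...].

    Hypothetical helper for illustration; not part of parquet-tools.
    """
    if isinstance(node, dict):
        # Assumption: a list group appears as a dict whose only key is "list",
        # holding an array of single-key {"element": ...} wrappers.
        if (
            set(node) == {"list"}
            and isinstance(node["list"], list)
            and all(
                isinstance(item, dict) and set(item) == {"element"}
                for item in node["list"]
            )
        ):
            return [unwrap_lists(item["element"]) for item in node["list"]]
        return {key: unwrap_lists(value) for key, value in node.items()}
    if isinstance(node, list):
        return [unwrap_lists(item) for item in node]
    return node


# The record as extracted in the example above.
extracted = json.loads(
    '{"account": {"list": [{"element": {'
    '"accountDepositType": "NA", "accountNumber": "NA", "accountHolderType": "NA"'
    "}}]}}"
)
cleaned = unwrap_lists(extracted)
print(json.dumps(cleaned, indent=2))
```

Applied to the extracted record, this gives back the first structure, with "account" holding the array directly.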
was:
Parquet-tools cat -help says:
Let's say I have a structure like this in a Parquet file:
{code:json}
{
  "account": [
    {
      "accountDepositType": "NA",
      "accountNumber": "NA",
      "accountHolderType": "NA"
    }
  ]
}
{code}
With parquet-tools cat -j, it is extracted like this:
{code:json}
{
  "account": {
    "list": [
      {
        "element": {
          "accountDepositType": "NA",
          "accountNumber": "NA",
          "accountHolderType": "NA"
        }
      }
    ]
  }
}
{code}
*"list"* and *"element"* are metadata (or type information, if you like) and should not appear in the output.
Can you please fix that?
> "parquet-tools cat -j" should not output metadata
> -------------------------------------------------
>
> Key: PARQUET-1956
> URL: https://issues.apache.org/jira/browse/PARQUET-1956
> Project: Parquet
> Issue Type: Bug
> Components: parquet-mr
> Affects Versions: 1.11.1
> Reporter: Dumitru Husleag
> Priority: Major
>
> Parquet-tools cat -help says:
> {noformat}
> parquet-tools cat:
> Prints the content of a Parquet file. The output contains only the data, no
> metadata is displayed
> {noformat}
> Let's say I have a structure like this in a Parquet file:
> {code:json}
> {
>   "account": [
>     {
>       "accountDepositType": "NA",
>       "accountNumber": "NA",
>       "accountHolderType": "NA"
>     }
>   ]
> }
> {code}
> With parquet-tools cat -j, it is extracted like this:
> {code:json}
> {
>   "account": {
>     "list": [
>       {
>         "element": {
>           "accountDepositType": "NA",
>           "accountNumber": "NA",
>           "accountHolderType": "NA"
>         }
>       }
>     ]
>   }
> }
> {code}
> *"list"* and *"element"* are metadata (or type information, if you like) and should not appear in the output.
> Can you please fix that?