You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Christophe Préaud (Jira)" <ji...@apache.org> on 2022/05/23 13:09:00 UTC

[jira] [Created] (TEZ-4415) Hadoop archives created with Tez miss index files

Christophe Préaud created TEZ-4415:
--------------------------------------

             Summary: Hadoop archives created with Tez miss index files
                 Key: TEZ-4415
                 URL: https://issues.apache.org/jira/browse/TEZ-4415
             Project: Apache Tez
          Issue Type: Bug
    Affects Versions: 0.9.2
            Reporter: Christophe Préaud


When a hadoop archive is created with Tez, the _index and _masterindex files are not created:
{code:java}
# create hadoop archive with Tez
hadoop archive -D mapreduce.framework.name=yarn-tez -archiveName data.har -p /user/preaudc/data /user/preaudc 
(...)
22/05/23 13:04:39 INFO client.TezClient: Tez Client Version: [ component=tez-api, version=0.9.2, revision=10cb3519bd34389210e6511a2ba291b52dcda081, SCM-URL=scm:git:https://gitbox.apache.org/repos/asf/tez.git, buildTime=2019-03-19T20:44:07Z ]
(...)

# _index and _masterindex files are not created
hdfs dfs -ls /user/preaudc/data.har
Found 2 items
-rw-r--r--   3 preaudc preaudc          0 2022-05-23 13:06 /user/preaudc/data.har/_SUCCESS
-rw-r--r--   3 preaudc preaudc 2537147461 2022-05-23 13:06 /user/preaudc/data.har/part-0

# the hadoop archive is thus unreadable
hdfs dfs -ls har:/user/preaudc/data.har
ls: Invalid path for the Har Filesystem. No index file in har:/user/preaudc/data.har{code}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)