You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "Bikas Saha (JIRA)" <ji...@apache.org> on 2012/09/26 03:24:07 UTC

[jira] [Commented] (HADOOP-8847) Change untar to use Java API instead of spawning tar process

    [ https://issues.apache.org/jira/browse/HADOOP-8847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13463446#comment-13463446 ] 

Bikas Saha commented on HADOOP-8847:
------------------------------------

Attaching patch that uses JAVA apache commons api to do untar. Adds a test that does a sanity check. 
I have written a manual test that untars a tar file 10 times using old and new methods and both have similar perf. Not adding that test because it takes a long time to run.
Attaching 2 test resource files. The patch file has the locations where these resources need to be committed.
                
> Change untar to use Java API instead of spawning tar process
> ------------------------------------------------------------
>
>                 Key: HADOOP-8847
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8847
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Bikas Saha
>            Assignee: Bikas Saha
>         Attachments: HADOOP-8847.branch-1-win.1.patch, test-untar.tar, test-untar.tgz
>
>
> Currently FileUtil.unTar() spawns tar utility to do the work. Tar may not be present on all platforms by default eg. Windows. So changing this to use JAVA API's would help make it more cross-platform. FileUtil.unZip() uses the same approach.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira