Posted to dev@hive.apache.org by Jason Dere <jd...@hortonworks.com> on 2017/11/03 19:36:17 UTC
Review Request 63544: HIVE-17963: Fix for HIVE-17113 can be improved for non-blobstore filesystems
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/63544/
-----------------------------------------------------------
Review request for hive, Ashutosh Chauhan and Owen O'Malley.
Bugs: HIVE-17963
https://issues.apache.org/jira/browse/HIVE-17963
Repository: hive-git
Description
-------
For non-blobstore FS:
1) Move the temp directory to a new directory name, to prevent additional files from being added by any runaway processes.
2) Run removeTempOrDuplicateFiles() on this renamed temp directory.
3) Run renameOrMoveFiles() to move the renamed temp directory to the final location.
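
The three non-blobstore steps above can be sketched roughly as follows. This is only an illustration on a local filesystem using java.nio.file, not the code in Utilities.java: the class name, the ".moved" suffix, and the de-duplication rule (keep the largest file per task-id prefix before the first underscore) are assumptions made for the example, and the final location is assumed not to exist yet.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class NonBlobstoreCommitSketch {

    // Hypothetical stand-in for removeTempOrDuplicateFiles():
    // for files named <taskId>_<attempt>, keep only the largest per taskId.
    static void removeDuplicates(Path dir) throws IOException {
        List<Path> files;
        try (Stream<Path> s = Files.list(dir)) {
            files = s.filter(Files::isRegularFile).collect(Collectors.toList());
        }
        Map<String, Path> keep = new HashMap<>();
        for (Path f : files) {
            String name = f.getFileName().toString();
            int idx = name.indexOf('_');
            String taskId = idx < 0 ? name : name.substring(0, idx);
            Path prev = keep.get(taskId);
            if (prev == null) {
                keep.put(taskId, f);
            } else if (Files.size(f) > Files.size(prev)) {
                Files.delete(prev);      // newer candidate is larger, drop the old one
                keep.put(taskId, f);
            } else {
                Files.delete(f);         // existing candidate wins
            }
        }
    }

    static void commit(Path tmpPath, Path finalPath) throws IOException {
        // 1) Rename the temp directory so runaway writers no longer target it.
        Path renamed = tmpPath.resolveSibling(tmpPath.getFileName() + ".moved");
        Files.move(tmpPath, renamed, StandardCopyOption.ATOMIC_MOVE);
        // 2) De-duplicate inside the renamed directory.
        removeDuplicates(renamed);
        // 3) Move the renamed directory to the final location (must not exist).
        Files.move(renamed, finalPath);
    }
}
```

The point of renaming first is that any process still writing to the old temp path can no longer add files to the directory being committed, so the whole directory can be moved in one rename.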
For blobstore FS:
1) Remove duplicates from tmpPath.
2) Use moveSpecifiedFiles() to perform a file-by-file move of the de-duped files to specPath.
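
The blobstore variant can be sketched in the same spirit. Again this is a local-filesystem illustration with java.nio.file rather than the Hive implementation: the class and method names and the keep-the-largest-attempt rule are assumptions for the example. It shows why moving an explicit file list helps: a file that appears in tmpPath after de-duplication is simply not part of the move.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class BlobstoreCommitSketch {

    // Hypothetical de-dup step: delete losing attempts in place and
    // return exactly the set of files that survived.
    static Set<Path> dedupe(Path tmpPath) throws IOException {
        List<Path> files;
        try (Stream<Path> s = Files.list(tmpPath)) {
            files = s.filter(Files::isRegularFile).sorted().collect(Collectors.toList());
        }
        Map<String, Path> keep = new TreeMap<>();
        for (Path f : files) {
            String name = f.getFileName().toString();
            int idx = name.indexOf('_');
            String taskId = idx < 0 ? name : name.substring(0, idx);
            Path prev = keep.get(taskId);
            if (prev == null) {
                keep.put(taskId, f);
            } else if (Files.size(f) > Files.size(prev)) {
                Files.delete(prev);
                keep.put(taskId, f);
            } else {
                Files.delete(f);
            }
        }
        return new LinkedHashSet<>(keep.values());
    }

    // Hypothetical analogue of moveSpecifiedFiles(): move only the listed
    // files, one by one, into specPath.
    static void moveSpecified(Set<Path> files, Path specPath) throws IOException {
        Files.createDirectories(specPath);
        for (Path f : files) {
            Files.move(f, specPath.resolve(f.getFileName()),
                       StandardCopyOption.REPLACE_EXISTING);
        }
    }

    static void commit(Path tmpPath, Path specPath) throws IOException {
        Set<Path> survivors = dedupe(tmpPath);   // 1) remove duplicates from tmpPath
        moveSpecified(survivors, specPath);      // 2) file-by-file move to specPath
    }
}
```

No directory rename is involved here, which matters on blobstores where a directory rename is typically a copy of every object rather than a cheap metadata operation.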
Also remove the hive.exec.move.files.from.source.dir setting; the above changes will always be enabled.
Diffs
-----
common/src/java/org/apache/hadoop/hive/conf/HiveConf.java cbe4de5
ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java 1be7eab
ql/src/test/queries/clientpositive/insert_with_move_files_from_source_dir.q 0117755
ql/src/test/queries/clientpositive/skewjoin.q 9ad4b77
Diff: https://reviews.apache.org/r/63544/diff/1/
Testing
-------
Thanks,
Jason Dere