You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by GitBox <gi...@apache.org> on 2022/08/10 21:25:08 UTC

[GitHub] [accumulo] EdColeman opened a new issue, #2872: Make exportTable command volume aware

EdColeman opened a new issue, #2872:
URL: https://github.com/apache/accumulo/issues/2872

   **Is your feature request related to a problem? Please describe.**
   When using `exportTable` command, it does not take into consideration of multiple volumes and creates a single distcp.txt file. The file does not differentiate between volumes and the file must be post-processed to separate the files by volume(s) for feeding to individual distcp commands for each volume when exporting a table that uses more than one volume.
   
   **Describe the solution you'd like**
   Have the command generate a single file (distcp) list per volume.  The solution should play nice with `importTable` command.  It may also be useful if the exported metadata and the distcp file lists are preserved after the import table process to allow for post-processing validation.
   
   **Additional context**
   This is a follow on to #2869, #2871 and #2849 (offline option) which implement work-arounds for `exportTable` and `importTable` not being volume aware.  Inspired by changes submitted by @drewfarris for `importTable` to handle multiple directories.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [accumulo] ctubbsii commented on issue #2872: Make exportTable command volume aware

Posted by "ctubbsii (via GitHub)" <gi...@apache.org>.
ctubbsii commented on issue #2872:
URL: https://github.com/apache/accumulo/issues/2872#issuecomment-1661718870

   In discussions on #3228, I've become aware that the term "Volume aware" isn't quite the right terminology for what is needed. What it needs to be is "FileSystem aware". Files need to be grouped by Hadoop FileSystem instances (not just FileSystem types), regardless of how they map to Accumulo Volume instances. The limitation on distcp is copying across Hadoop FileSystems. DistCp has no knowledge of, or concept of, Accumulo Volumes.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [accumulo] AlbertWhitlock commented on issue #2872: Make exportTable command volume aware

Posted by "AlbertWhitlock (via GitHub)" <gi...@apache.org>.
AlbertWhitlock commented on issue #2872:
URL: https://github.com/apache/accumulo/issues/2872#issuecomment-1434861879

   I have logic in place and starting to test the changes now. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org