You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by GitBox <gi...@apache.org> on 2021/07/29 21:58:31 UTC

[GitHub] [accumulo] Manno15 opened a new issue #2217: Pretty-print PrintInfo's 'histogram' ouput

Manno15 opened a new issue #2217:
URL: https://github.com/apache/accumulo/issues/2217


   Original JIRA ticket : https://issues.apache.org/jira/browse/ACCUMULO-1660
   Description from the ticket:
   <pre>
   The output from o.a.a.core.file.rfile.PrintInfo --histogram is hard to read. 
   Specifically, the number of zeroes in each row in the histogram is difficult to tell at a glance.
   We have code elsewhere to pretty-print large sizes using MB, GB, etc...
    It would be nice to see that here as well.
   </pre>
   
   **Describe the solution you'd like**
   As shown in the photo below, the size is hard to determine at a glance. Using units like MB or GB would be easier to read. This could be done as the default or as another option on top of `--histogram`. We utilize `NumUtil` in other areas of the code such as `DUCommand` and that could possibly be used here as well..
   
   
   **Additional context**
   Currently, the output of the histogram command looks like the following:
   
   ![image](https://user-images.githubusercontent.com/29436247/127570710-c6bff9b2-56de-4b5f-a921-901fc2bc1c43.png)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [accumulo] Shivakumar-SG commented on issue #2217: Pretty-print PrintInfo's 'histogram' ouput

Posted by GitBox <gi...@apache.org>.
Shivakumar-SG commented on issue #2217:
URL: https://github.com/apache/accumulo/issues/2217#issuecomment-890491909


   Hey @Manno15 , I would like to take up this issue.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [accumulo] Shivakumar-SG edited a comment on issue #2217: Pretty-print PrintInfo's 'histogram' ouput

Posted by GitBox <gi...@apache.org>.
Shivakumar-SG edited a comment on issue #2217:
URL: https://github.com/apache/accumulo/issues/2217#issuecomment-890491909


   Hey, I would like to take up this issue.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [accumulo] Manno15 closed issue #2217: Pretty-print PrintInfo's 'histogram' ouput

Posted by GitBox <gi...@apache.org>.
Manno15 closed issue #2217:
URL: https://github.com/apache/accumulo/issues/2217


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [accumulo] EdColeman commented on issue #2217: Pretty-print PrintInfo's 'histogram' ouput

Posted by GitBox <gi...@apache.org>.
EdColeman commented on issue #2217:
URL: https://github.com/apache/accumulo/issues/2217#issuecomment-890511823


   Maybe @Manno15 has additional suggestions, but the coding for this issue is probably the easiest part, but to develop and test might be more complicated.
   
   More information on running the command is available in the documentation [Troubleshooting Tools](https://accumulo.apache.org/docs/2.x/troubleshooting/tools)
   
   RFiles are the  storage format used by Accumulo.  
   
   To generate sample inputs there are some options.
   
   - Writing tests that use LogHistogram class (internal to PrintInfo) directly
   - Using / adopting org.apache.accumulo.core.file.rfile.RFileTest
   
   Options to generate "real" files:
   
   - Running an Accumulo instance. See  [Setup](https://accumulo.apache.org/docs/2.x/getting-started
   - /quickstart) 
   - Running one or more of the integration tests. The output would end up in ./test/target/mini-tests/[TEST-NAME]/accumulo/tables/[TABLE_ID]  Running integration tests using maven is shown in [Running a Build](https://accumulo.apache.org/contributor/building)
   
   The general structure used by Accumulo is described in the [Advanced Troubleshooting](https://accumulo.apache.org/docs/2.x/troubleshooting/advanced) and generate rfiles will be placed into directories according that structure and can then be displayed using the print-info shell command.
   
   Successful development and testing would likely require one or more of the above to demonstrate larger prefixes (MB, GB...) are being correctly formatted.
   
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [accumulo] Shivakumar-SG commented on issue #2217: Pretty-print PrintInfo's 'histogram' ouput

Posted by GitBox <gi...@apache.org>.
Shivakumar-SG commented on issue #2217:
URL: https://github.com/apache/accumulo/issues/2217#issuecomment-890499877


   I'm a bit confused about how to reproduce the above mentioned output as I'm a newbie to open source😌. 
   @EdColeman Would you mind giving me a lil start?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [accumulo] Manno15 closed issue #2217: Pretty-print PrintInfo's 'histogram' ouput

Posted by GitBox <gi...@apache.org>.
Manno15 closed issue #2217:
URL: https://github.com/apache/accumulo/issues/2217


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [accumulo] EdColeman edited a comment on issue #2217: Pretty-print PrintInfo's 'histogram' ouput

Posted by GitBox <gi...@apache.org>.
EdColeman edited a comment on issue #2217:
URL: https://github.com/apache/accumulo/issues/2217#issuecomment-890511823


   Maybe @Manno15 has additional suggestions, but the coding for this issue is probably the easiest part, but to develop and test might be more complicated.
   
   More information on running the command is available in the documentation [Troubleshooting Tools](https://accumulo.apache.org/docs/2.x/troubleshooting/tools)
   
   RFiles are the  storage format used by Accumulo.  
   
   To generate sample inputs there are some options.
   
   - Writing tests that use LogHistogram class (internal to PrintInfo) directly
   - Using / adopting org.apache.accumulo.core.file.rfile.RFileTest
   
   Options to generate "real" files:
   
   - Running an Accumulo instance. See  [Setup](https://accumulo.apache.org/docs/2.x/getting-started/quickstart) 
   - Running one or more of the integration tests. The output would end up in ./test/target/mini-tests/[TEST-NAME]/accumulo/tables/[TABLE_ID]  Running integration tests using maven is shown in [Running a Build](https://accumulo.apache.org/contributor/building)
   
   The general structure used by Accumulo is described in the [Advanced Troubleshooting](https://accumulo.apache.org/docs/2.x/troubleshooting/advanced) and generate rfiles will be placed into directories according that structure and can then be displayed using the print-info shell command.
   
   Successful development and testing would likely require one or more of the above to demonstrate larger prefixes (MB, GB...) are being correctly formatted.
   
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org