You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by "Adam Fuchs (JIRA)" <ji...@apache.org> on 2012/10/10 17:03:03 UTC
[jira] [Updated] (ACCUMULO-121) document detailed bulk ingest best
practices
[ https://issues.apache.org/jira/browse/ACCUMULO-121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Adam Fuchs updated ACCUMULO-121:
--------------------------------
Assignee: David Medinets (was: Adam Fuchs)
> document detailed bulk ingest best practices
> --------------------------------------------
>
> Key: ACCUMULO-121
> URL: https://issues.apache.org/jira/browse/ACCUMULO-121
> Project: Accumulo
> Issue Type: Improvement
> Components: docs
> Reporter: Eric Newton
> Assignee: David Medinets
> Fix For: 1.5.0
>
>
> The users' manual should advise best practices for generating and importing bulk files, as well as the effects of bulk imported files:
> # Recommended file sizes
> # Side effects of bypassing constraints
> # Proper use of range partitioners
> # Preemptive splitting of tables
> # How major compaction/garbage collection affects bulk files
> # Setting the timestamp
> # Invalid visibility fields makes accumulo fail in strange ways
> # Bulk import doesn't increase entry counts on the monitor page until the files are compacted
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira