You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by "Adam Fuchs (JIRA)" <ji...@apache.org> on 2012/10/10 17:03:03 UTC

[jira] [Updated] (ACCUMULO-121) document detailed bulk ingest best practices

     [ https://issues.apache.org/jira/browse/ACCUMULO-121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Adam Fuchs updated ACCUMULO-121:
--------------------------------

    Assignee: David Medinets  (was: Adam Fuchs)
    
> document detailed bulk ingest best practices
> --------------------------------------------
>
>                 Key: ACCUMULO-121
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-121
>             Project: Accumulo
>          Issue Type: Improvement
>          Components: docs
>            Reporter: Eric Newton
>            Assignee: David Medinets
>             Fix For: 1.5.0
>
>
> The users' manual should advise best practices for generating and importing bulk files, as well as the effects of bulk imported files:  
>  # Recommended file sizes
>  # Side effects of bypassing constraints
>  # Proper use of range partitioners
>  # Preemptive splitting of tables
>  # How major compaction/garbage collection affects bulk files
>  # Setting the timestamp
>  # Invalid visibility fields makes accumulo fail in strange ways
>  # Bulk import doesn't increase entry counts on the monitor page until the files are compacted

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira