You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Brandon Williams (JIRA)" <ji...@apache.org> on 2011/02/18 23:30:38 UTC

[jira] Created: (CASSANDRA-2200) stress.java doesn't insert the correct amount of rows

stress.java doesn't insert the correct amount of rows
-----------------------------------------------------

                 Key: CASSANDRA-2200
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2200
             Project: Cassandra
          Issue Type: Bug
          Components: Contrib
    Affects Versions: 0.7.1
            Reporter: Brandon Williams
            Assignee: Pavel Yaskevich
            Priority: Minor
             Fix For: 0.7.3


For example, if you pass -n 2000000 you only get 1999800 (with 300 threads at least, didn't check if it was related)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Commented: (CASSANDRA-2200) stress.java doesn't insert the correct amount of rows

Posted by "Pavel Yaskevich (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12996624#comment-12996624 ] 

Pavel Yaskevich commented on CASSANDRA-2200:
--------------------------------------------

this is related to gaussian function and (when -r is used) pseudo-random generator which generate the same keys in some circumstances. We either should acknowledge this is an known issue or retry key generation (which could take infinite time).

> stress.java doesn't insert the correct amount of rows
> -----------------------------------------------------
>
>                 Key: CASSANDRA-2200
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2200
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Contrib
>    Affects Versions: 0.7.1
>            Reporter: Brandon Williams
>            Assignee: Pavel Yaskevich
>            Priority: Minor
>             Fix For: 0.7.3
>
>
> For example, if you pass -n 2000000 you only get 1999800 (with 300 threads at least, didn't check if it was related)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Commented: (CASSANDRA-2200) stress.java doesn't insert the correct amount of rows

Posted by "Brandon Williams (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12996642#comment-12996642 ] 

Brandon Williams commented on CASSANDRA-2200:
---------------------------------------------

Maybe we should just move forward with CASSANDRA-2020 then.

> stress.java doesn't insert the correct amount of rows
> -----------------------------------------------------
>
>                 Key: CASSANDRA-2200
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2200
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Contrib
>    Affects Versions: 0.7.1
>            Reporter: Brandon Williams
>            Assignee: Pavel Yaskevich
>            Priority: Minor
>             Fix For: 0.7.3
>
>
> For example, if you pass -n 2000000 you only get 1999800 (with 300 threads at least, didn't check if it was related)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Commented: (CASSANDRA-2200) stress.java doesn't insert the correct amount of rows

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12996626#comment-12996626 ] 

Jonathan Ellis commented on CASSANDRA-2200:
-------------------------------------------

_generation_ should be deterministic.  key distribution functions should only be applied to reads.

> stress.java doesn't insert the correct amount of rows
> -----------------------------------------------------
>
>                 Key: CASSANDRA-2200
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2200
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Contrib
>    Affects Versions: 0.7.1
>            Reporter: Brandon Williams
>            Assignee: Pavel Yaskevich
>            Priority: Minor
>             Fix For: 0.7.3
>
>
> For example, if you pass -n 2000000 you only get 1999800 (with 300 threads at least, didn't check if it was related)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Updated: (CASSANDRA-2200) stress.java doesn't insert the correct amount of rows

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Ellis updated CASSANDRA-2200:
--------------------------------------

    Remaining Estimate: 8h
     Original Estimate: 8h

> stress.java doesn't insert the correct amount of rows
> -----------------------------------------------------
>
>                 Key: CASSANDRA-2200
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2200
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Contrib
>    Affects Versions: 0.7.1
>            Reporter: Brandon Williams
>            Assignee: Pavel Yaskevich
>            Priority: Minor
>             Fix For: 0.7.3
>
>   Original Estimate: 8h
>  Remaining Estimate: 8h
>
> For example, if you pass -n 2000000 you only get 1999800 (with 300 threads at least, didn't check if it was related)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Commented: (CASSANDRA-2200) stress.java doesn't insert the correct amount of rows

Posted by "Pavel Yaskevich (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12996634#comment-12996634 ] 

Pavel Yaskevich commented on CASSANDRA-2200:
--------------------------------------------

oh sorry for misleading comment, of course this is about reads. for write however - each thread was a portion of range of keys to generate and that range is calculated in the following way:

(keysPerThread * (idx + keysToSkip), keysPerThread * (idx + 1)), where _idx_ - index of the current thread, _keysPerThread_ - totalKeys / threadCount, _keysToSkip_ - determined by user in params.

This is ported from python code (line 203). For some numbers of the total keys and threadCount it won't generate precise ranges. Seems like we'll need to range one more thread at the end to generate those missing rows sometimes.

> stress.java doesn't insert the correct amount of rows
> -----------------------------------------------------
>
>                 Key: CASSANDRA-2200
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2200
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Contrib
>    Affects Versions: 0.7.1
>            Reporter: Brandon Williams
>            Assignee: Pavel Yaskevich
>            Priority: Minor
>             Fix For: 0.7.3
>
>
> For example, if you pass -n 2000000 you only get 1999800 (with 300 threads at least, didn't check if it was related)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Resolved: (CASSANDRA-2200) stress.java doesn't insert the correct amount of rows

Posted by "Brandon Williams (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Brandon Williams resolved CASSANDRA-2200.
-----------------------------------------

    Resolution: Fixed

Solved by CASSANDRA-2020

> stress.java doesn't insert the correct amount of rows
> -----------------------------------------------------
>
>                 Key: CASSANDRA-2200
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2200
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Contrib
>    Affects Versions: 0.7.1
>            Reporter: Brandon Williams
>            Assignee: Pavel Yaskevich
>            Priority: Minor
>             Fix For: 0.7.3
>
>   Original Estimate: 8h
>  Remaining Estimate: 8h
>
> For example, if you pass -n 2000000 you only get 1999800 (with 300 threads at least, didn't check if it was related)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Commented: (CASSANDRA-2200) stress.java doesn't insert the correct amount of rows

Posted by "Pavel Yaskevich (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-2200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12996648#comment-12996648 ] 

Pavel Yaskevich commented on CASSANDRA-2200:
--------------------------------------------

I concur. It seems to me that porting from py_stress wasn't such a good idea, stress.java needs re-design.

> stress.java doesn't insert the correct amount of rows
> -----------------------------------------------------
>
>                 Key: CASSANDRA-2200
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2200
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Contrib
>    Affects Versions: 0.7.1
>            Reporter: Brandon Williams
>            Assignee: Pavel Yaskevich
>            Priority: Minor
>             Fix For: 0.7.3
>
>
> For example, if you pass -n 2000000 you only get 1999800 (with 300 threads at least, didn't check if it was related)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira