You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by suyanNone <gi...@git.apache.org> on 2014/12/03 12:22:03 UTC

[GitHub] spark pull request: [SPARK-4721][CORE] Improve logic while first t...

GitHub user suyanNone opened a pull request:

    https://github.com/apache/spark/pull/3582

    [SPARK-4721][CORE] Improve logic while first thread put block failed

    1. make thread which wait old block info try one by one while the first thread which created that block failed.
    2. use reentrantLock instead of this.syn{} in block info.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/suyanNone/spark refine-block-put

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/3582.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #3582
    
----
commit 740e479da941d9f213e3d65825e60d73ff70d8aa
Author: hushan[胡珊] <hu...@xiaomi.com>
Date:   2014-12-03T11:18:23Z

    Make wait thread try to put one by one if first thread failed

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-4721][CORE] Improve logic while first t...

Posted by suyanNone <gi...@git.apache.org>.

Github user suyanNone commented on the pull request:

https://github.com/apache/spark/pull/3582#issuecomment-65891778

Sorry for my poor comments and English.

In all,
1. we do put one thread by one thread until there have 1 thread succeed.
2. multiple doGetLocal threads and only 1 dropFromMemory thread will wait 1 time whenever put is succeed or failed. doGetLocal get failed, the return none. dropFromMemory get failed, return none.

There are 3 places call info.waitForReady()
1. doGetLocal
2. dropFromMemory
3. doPut

and if there are many thread try to put the same block.
for 1, do doGetLocal, I think just wait for one time(Wait1Condition, now renamed as OtherCondition), succeed or failed.
for 2, actually it will never have the situation if we call dropFromMemory but the block is not ready. but in current code there are have a info.waitForReady method call in dropFromMemory, just for compatibility, let's wait only one time(Wait1Condition) for block put succeed or failed. and also think, if we found one thread do the dropFromMemory, we should cancel all put threads.
for 3, do all put threads one by one untill there have a success or have a thread want drop it from memory as we described in 2. it may can fails many times, so WaitNCondition(now named as PutCondition)

All I want to do for WaitType(now I rename BlockWaitCondition), just reuse enum convenience to call method and have a variable can record number of thread wait for that block finish put. and Each Block object have its own wait count, so I use extends Enumration.

---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org

[GitHub] spark pull request: [SPARK-4721][CORE] Improve logic while first t...

Posted by suyanNone <gi...@git.apache.org>.

Github user suyanNone closed the pull request at:

    https://github.com/apache/spark/pull/3582


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org