Posted to issues@spark.apache.org by "Mohamed Mehdi BEN AISSA (JIRA)" <ji...@apache.org> on 2019/03/03 14:41:00 UTC
[jira] [Commented] (SPARK-26795) Retry remote fileSegmentManagedBuffer when creating inputStream failed during shuffle read phase
[ https://issues.apache.org/jira/browse/SPARK-26795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16782748#comment-16782748 ]
Mohamed Mehdi BEN AISSA commented on SPARK-26795:
-------------------------------------------------
I have the same issue with Spark 2.3.0:
{code:java}
org.apache.spark.shuffle.FetchFailedException
{code}
and
{code:java}
Error opening block StreamChunkId {streamId=1377556883266, chunkIndex=9} for request from /ip.adress:39050
java.io.EOFException..
{code}
How did you fix this issue? Or should we wait for the patch?
> Retry remote fileSegmentManagedBuffer when creating inputStream failed during shuffle read phase
> ------------------------------------------------------------------------------------------------
>
> Key: SPARK-26795
> URL: https://issues.apache.org/jira/browse/SPARK-26795
> Project: Spark
> Issue Type: Bug
> Components: Shuffle
> Affects Versions: 2.3.0, 2.3.1, 2.3.2, 2.4.0
> Reporter: feiwang
> Priority: Major
>
> There is a parameter, spark.maxRemoteBlockSizeFetchToMem: a remote block is fetched to disk when its size in bytes is above this threshold.
> So during the shuffle read phase, a managedBuffer that throws an IOException may be a remotely downloaded FileSegment, and the fetch should be retried instead of calling throwFetchFailed directly.
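To make the proposal in the description concrete, here is a rough sketch of the idea in Python. It is not Spark's code and not the patch for this issue: `FetchFailed`, `fetched_to_disk`, `open_with_retry`, and `max_retries` are hypothetical names chosen for illustration, and `open_stream` stands in for whatever re-fetches the block and opens its input stream.

```python
import io


class FetchFailed(Exception):
    """Hypothetical stand-in for a fetch failure that fails the task."""


def fetched_to_disk(block_size, max_remote_block_size_fetch_to_mem):
    # Per the description: a remote block goes to disk when its size
    # (in bytes) is above the threshold.
    return block_size > max_remote_block_size_fetch_to_mem


def open_with_retry(open_stream, is_remote_file_segment, max_retries=3):
    """Open a block's input stream, retrying only for remote blocks
    that were downloaded to disk; otherwise fail on the first error."""
    attempts = 0
    while True:
        try:
            return open_stream()
        except OSError as err:
            attempts += 1
            # Local blocks, or remote blocks that exhausted their retries,
            # are reported as a fetch failure.
            if not is_remote_file_segment or attempts > max_retries:
                raise FetchFailed(str(err)) from err


if __name__ == "__main__":
    calls = {"n": 0}

    def flaky_open():
        # Simulated opener: fails twice, then succeeds.
        calls["n"] += 1
        if calls["n"] < 3:
            raise OSError("simulated truncated file")
        return io.BytesIO(b"block data")

    remote = fetched_to_disk(block_size=300, max_remote_block_size_fetch_to_mem=200)
    stream = open_with_retry(flaky_open, is_remote_file_segment=remote)
    print(stream.read(), "after", calls["n"], "attempts")
```

The point of the sketch is only the branching: an I/O error on a remotely downloaded file segment triggers another attempt, while any other case is surfaced as a fetch failure right away.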