You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2021/09/16 17:08:04 UTC

[GitHub] [hudi] stym06 opened a new issue #3677: DeltaStreamer exits with error while writing to Azure Blob storage[SUPPORT]

stym06 opened a new issue #3677:
URL: https://github.com/apache/hudi/issues/3677


   
   
   **Describe the problem you faced**
   
   I'm running the DeltaStreamer application in continuous mode writing data into Azure blob storage. It runs fine for a few days, but exits with the following error.
   
   `Caused by: java.net.UnknownHostException: xyz.blob.core.windows.net`
   
   **To Reproduce**
   
   Steps to reproduce the behavior:
   
   1. Run DeltaStreamer with below configs:
   `arguments:
           - "--table-type"
           - "COPY_ON_WRITE"
           - "--props"
           - "/opt/spark/hudi/config-azure/source.properties"
           - "--schemaprovider-class"
           - "org.apache.hudi.utilities.schema.SchemaRegistryProvider"
           - "--source-class"
           - "org.apache.hudi.utilities.sources.AvroKafkaSource"
           - "--target-base-path"
           - "wasb://<container>@<account>.blob.core.windows.net/stage/db.table/"
           - "--target-table"
           - "db.table"
           - "--op"
           - "UPSERT"
           - "--source-ordering-field"
           - "__ts_ms"
           - "--continuous"
           - "--min-sync-interval-seconds"
           - "10"`
   
   **Expected behavior**
   
   The application should continue writing after every 10 seconds gap
   
   A clear and concise description of what you expected to happen.
   
   **Environment Description**
   
   * Hudi version : 0.8
   
   * Spark version : 2.4.4
   
   * Hive version : 3.0
   
   * Hadoop version :  2.7.3
   
   * Storage (HDFS/S3/GCS..) : WASB
   
   * Running on Docker? (yes/no) : K8s
   
   
   **Additional context**
   
   Add any other context about the problem here.
   
   **Stacktrace**
   
   ```21/09/15 13:16:45 ERROR HoodieDeltaStreamer: Shutting down delta-sync due to exception
   org.apache.hadoop.fs.azure.AzureException: com.microsoft.azure.storage.StorageException: The server encountered an unknown failure:
   	at org.apache.hadoop.fs.azure.AzureNativeFileSystemStore.retrieveMetadata(AzureNativeFileSystemStore.java:1930)
   	at org.apache.hadoop.fs.azure.NativeAzureFileSystem.getFileStatus(NativeAzureFileSystem.java:1592)
   	at org.apache.hadoop.fs.FileSystem.exists(FileSystem.java:1426)
   	at org.apache.hudi.utilities.deltastreamer.DeltaSync.refreshTimeline(DeltaSync.java:224)
   	at org.apache.hudi.utilities.deltastreamer.DeltaSync.syncOnce(DeltaSync.java:257)
   	at org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer$DeltaSyncService.lambda$startService$0(HoodieDeltaStreamer.java:587)
   	at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
   	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
   	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
   	at java.lang.Thread.run(Thread.java:748)
   Caused by: com.microsoft.azure.storage.StorageException: The server encountered an unknown failure:
   	at com.microsoft.azure.storage.StorageException.translateException(StorageException.java:178)
   	at com.microsoft.azure.storage.core.ExecutionEngine.executeWithRetry(ExecutionEngine.java:214)
   	at com.microsoft.azure.storage.blob.CloudBlob.exists(CloudBlob.java:1963)
   	at com.microsoft.azure.storage.blob.CloudBlob.exists(CloudBlob.java:1950)
   	at org.apache.hadoop.fs.azure.StorageInterfaceImpl$CloudBlobWrapperImpl.exists(StorageInterfaceImpl.java:313)
   	at org.apache.hadoop.fs.azure.AzureNativeFileSystemStore.retrieveMetadata(AzureNativeFileSystemStore.java:1865)
   	... 9 more
   Caused by: java.net.UnknownHostException: xyz.blob.core.windows.net
   	at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:184)
   	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
   	at java.net.Socket.connect(Socket.java:589)
   	at java.net.Socket.connect(Socket.java:538)
   	at sun.net.NetworkClient.doConnect(NetworkClient.java:180)
   	at sun.net.www.http.HttpClient.openServer(HttpClient.java:463)
   	at sun.net.www.http.HttpClient.openServer(HttpClient.java:558)
   	at sun.net.www.http.HttpClient.<init>(HttpClient.java:242)
   	at sun.net.www.http.HttpClient.New(HttpClient.java:339)
   	at sun.net.www.http.HttpClient.New(HttpClient.java:357)
   	at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:1220)
   	at sun.net.www.protocol.http.HttpURLConnection.plainConnect0(HttpURLConnection.java:1156)
   	at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:1050)
   	at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:984)
   	at sun.net.www.protocol.http.HttpURLConnection.getInputStream0(HttpURLConnection.java:1564)
   	at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1492)
   	at java.net.HttpURLConnection.getResponseCode(HttpURLConnection.java:480)
   	at com.microsoft.azure.storage.core.ExecutionEngine.executeWithRetry(ExecutionEngine.java:124)
   	... 13 more```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] xushiyan commented on issue #3677: DeltaStreamer exits with error while writing to Azure Blob storage[SUPPORT]

Posted by GitBox <gi...@apache.org>.
xushiyan commented on issue #3677:
URL: https://github.com/apache/hudi/issues/3677#issuecomment-923743879


   @stym06 this does not look like an issue with Hudi. It's a network issue with Azure storage. Please check with Azure support to investigate. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] xushiyan closed issue #3677: DeltaStreamer exits with error while writing to Azure Blob storage[SUPPORT]

Posted by GitBox <gi...@apache.org>.
xushiyan closed issue #3677:
URL: https://github.com/apache/hudi/issues/3677


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org