You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by "hellochueng (via GitHub)" <gi...@apache.org> on 2023/04/28 05:58:26 UTC

[GitHub] [iceberg] hellochueng opened a new issue, #7455: spark insert partition table Data skew

hellochueng opened a new issue, #7455:
URL: https://github.com/apache/iceberg/issues/7455

   ### Query engine
   
   spark
   
   ### Question
   
   use spark3.3 iceberg 1.2
   insert into partition table,Large amount of data in one partition
   doc
   ![image](https://user-images.githubusercontent.com/29030883/235065890-053b3519-a38b-4db2-b4e7-631756e26d23.png)
   Iceberg requires the data to be sorted according to the partition spec per task (Spark partition) in prior to write against partitioned table. This applies both Writing with SQL and Writing with DataFrames.
   
   ![image](https://user-images.githubusercontent.com/29030883/235066104-fc44557d-3cd0-452d-93cd-50fac743697b.png)
   The problem of slanting data writing occurred in my job
   
   how can i solve this problem


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


[GitHub] [iceberg] BsoBird commented on issue #7455: spark insert partition table Data skew

Posted by "BsoBird (via GitHub)" <gi...@apache.org>.
BsoBird commented on issue #7455:
URL: https://github.com/apache/iceberg/issues/7455#issuecomment-1527185278

   Hey, dude. Looks like you and I are having the same problem. Why don't you come over here and we'll discuss it?
   https://github.com/apache/iceberg/issues/7406


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


Re: [I] spark insert partition table Data skew [iceberg]

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on issue #7455:
URL: https://github.com/apache/iceberg/issues/7455#issuecomment-1818027657

   This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale'


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


Re: [I] spark insert partition table Data skew [iceberg]

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on issue #7455:
URL: https://github.com/apache/iceberg/issues/7455#issuecomment-1780223493

   This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


Re: [I] spark insert partition table Data skew [iceberg]

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] closed issue #7455: spark insert partition table Data skew
URL: https://github.com/apache/iceberg/issues/7455


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org