Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/01/10 12:54:29 UTC

[GitHub] [hudi] nochimow commented on issue #4456: [SUPPORT] MultiWriter w/ DynamoDB - Unable to acquire lock, lock object null

nochimow commented on issue #4456:
URL: https://github.com/apache/hudi/issues/4456#issuecomment-1008846694


   Hi there, my code reads some Avro files into a DataFrame and then writes that DataFrame into a Hudi table.
   I'm using the following Hudi configs during the write. (It's a Python job on AWS Glue 3.0.)
   
   "hoodie.datasource.write.keygenerator.class": "org.apache.hudi.keygen.ComplexKeyGenerator",
   "hoodie.datasource.write.payload.class": "org.apache.hudi.common.model.DefaultHoodieRecordPayload",
   "hoodie.datasource.hive_sync.partition_extractor_class": "org.apache.hudi.hive.MultiPartKeysValueExtractor",
   "hoodie.table.name": table_name,
   "hoodie.datasource.write.recordkey.field": IDX_COL,
   "hoodie.datasource.write.partitionpath.field": pks,
   "hoodie.datasource.write.hive_style_partitioning": "true",
   "hoodie.datasource.write.precombine.field": tiebreaker,
   "hoodie.datasource.write.operation": operation,
   hoodie.write.lock.provider=org.apache.hudi.aws.transaction.lock.DynamoDBBasedLockProvider
   hoodie.write.lock.dynamodb.table
   hoodie.write.lock.dynamodb.partition_key
   hoodie.write.lock.dynamodb.region
   hoodie.write.lock.dynamodb.billing_mode=PAY_PER_REQUEST
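
A minimal sketch of how the settings above could be assembled into a single options dict for a PySpark write. The function name `build_hudi_options` and its parameters are hypothetical, and the values for the DynamoDB table, partition key, and region are left to the caller because they are not given above. The two concurrency settings are not in the list above; they are included as an assumption, since Hudi's multi-writer documentation lists them alongside the lock provider.

```python
def build_hudi_options(table_name, record_key, partition_fields,
                       precombine_field, operation,
                       lock_table, lock_partition_key, lock_region):
    """Collect the Hudi write options into one dict (hypothetical helper)."""
    options = {
        "hoodie.datasource.write.keygenerator.class":
            "org.apache.hudi.keygen.ComplexKeyGenerator",
        "hoodie.datasource.write.payload.class":
            "org.apache.hudi.common.model.DefaultHoodieRecordPayload",
        "hoodie.datasource.hive_sync.partition_extractor_class":
            "org.apache.hudi.hive.MultiPartKeysValueExtractor",
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.partitionpath.field": partition_fields,
        "hoodie.datasource.write.hive_style_partitioning": "true",
        "hoodie.datasource.write.precombine.field": precombine_field,
        "hoodie.datasource.write.operation": operation,
        # Assumption: not in the original list, taken from Hudi's
        # multi-writer (optimistic concurrency control) documentation.
        "hoodie.write.concurrency.mode": "optimistic_concurrency_control",
        "hoodie.cleaner.policy.failed.writes": "LAZY",
        # DynamoDB lock provider settings from the list above.
        "hoodie.write.lock.provider":
            "org.apache.hudi.aws.transaction.lock.DynamoDBBasedLockProvider",
        "hoodie.write.lock.dynamodb.table": lock_table,
        "hoodie.write.lock.dynamodb.partition_key": lock_partition_key,
        "hoodie.write.lock.dynamodb.region": lock_region,
        "hoodie.write.lock.dynamodb.billing_mode": "PAY_PER_REQUEST",
    }
    # Fail early if any option was left empty, e.g. a lock setting with no value.
    missing = sorted(key for key, value in options.items() if not value)
    if missing:
        raise ValueError("missing values for: " + ", ".join(missing))
    return options


# Usage inside the Glue job (df and base_path are assumed to exist there):
#   opts = build_hudi_options(table_name, IDX_COL, pks, tiebreaker, operation,
#                             lock_table, lock_partition_key, lock_region)
#   df.write.format("hudi").options(**opts).mode("append").save(base_path)
```

Keeping every option in one dict avoids mixing `"key": value` pairs with `key=value` lines, and the empty-value check surfaces a lock setting that was declared without a value before the write starts.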
   
   
   My DynamoDB table is a simple one with just the partition_key field as a string. Is there any recommendation on how the DynamoDB table should be structured?
   

