Posted to dev@bluemarlin.apache.org by GitBox <gi...@apache.org> on 2022/01/31 15:57:13 UTC

[GitHub] [incubator-bluemarlin] sanjaynirmal opened a new issue #39: Lookalike: spark data pipeline issue

sanjaynirmal opened a new issue #39:
URL: https://github.com/apache/incubator-bluemarlin/issues/39


   With the latest lookalike code that you have released, we are facing a problem in the last step of the data pipeline, i.e. TFRecord generation.
   Script: https://github.com/apache/incubator-bluemarlin/blob/main/Model/lookalike-model/lookalike_model/pipeline/main_tfrecord_generator.py
   <img width="687" alt="image" src="https://user-images.githubusercontent.com/40193781/151826834-42ee6106-4725-4628-a037-b01f8bd69c31.png">
   
   We are currently running the pipeline for 120 million AIDs, and we get the following error on the highlighted line.
   
   <img width="944" alt="image" src="https://user-images.githubusercontent.com/40193781/151827090-ac1f7c53-1e18-4dcf-afe1-53aa5a800867.png">
   
   Kindly help us run the pipeline on data of this size.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@bluemarlin.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [incubator-bluemarlin] radibnia77 commented on issue #39: Lookalike: spark data pipeline issue

Posted by GitBox <gi...@apache.org>.
radibnia77 commented on issue #39:
URL: https://github.com/apache/incubator-bluemarlin/issues/39#issuecomment-1028238340


   #40 
   Pandas has been replaced with Spark.





[GitHub] [incubator-bluemarlin] radibnia77 closed issue #39: Lookalike: spark data pipeline issue

Posted by GitBox <gi...@apache.org>.
radibnia77 closed issue #39:
URL: https://github.com/apache/incubator-bluemarlin/issues/39


   

