You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@bluemarlin.apache.org by GitBox <gi...@apache.org> on 2022/01/31 07:53:38 UTC

[GitHub] [incubator-bluemarlin] SatyamSwarup opened a new issue #38: Number of Distinct Users in Trainready Table

SatyamSwarup opened a new issue #38:
URL: https://github.com/apache/incubator-bluemarlin/issues/38


   Hello,
   
   As discussed in previous meetings, the total number of records in Trainready table is same as the total number of distinct users(aids). We have confirmed it on our side. 
   It would be helpful if you once check your data and verify if you have same number of records as the number of distinct users or not in the final Trainready table.
   
   Thanks and Regards
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@bluemarlin.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [incubator-bluemarlin] jimmylao edited a comment on issue #38: Number of Distinct Users in Trainready Table

Posted by GitBox <gi...@apache.org>.
jimmylao edited a comment on issue #38:
URL: https://github.com/apache/incubator-bluemarlin/issues/38#issuecomment-1026413094


   In Trainready table, each record corresponds to a distinct user (aid), so the total number of records in Trainready table equals to the total number of distinct users (aid) that survive to the stage.
   Due to some filtering operation in preprocessing steps which remove quite a lot of users (aids), the total number of distinct user (aid) in Trainready table most likely not equals to the total number of users (aid) in raw data.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@bluemarlin.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [incubator-bluemarlin] jimmylao commented on issue #38: Number of Distinct Users in Trainready Table

Posted by GitBox <gi...@apache.org>.
jimmylao commented on issue #38:
URL: https://github.com/apache/incubator-bluemarlin/issues/38#issuecomment-1026413094


   In Trainready table, each record corresponds to a distinct user (aid), so the total number of records in Trainready table equals to the total number of distinct users (aid) that survive to the stage.
   Due to some filtering operation is preprocessing steps which remove quite a lot of users (aids), the total number of distinct user (aid) in Trainready table most likely not equals to the total number of users (aid) in raw data.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@bluemarlin.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [incubator-bluemarlin] radibnia77 closed issue #38: Number of Distinct Users in Trainready Table

Posted by GitBox <gi...@apache.org>.
radibnia77 closed issue #38:
URL: https://github.com/apache/incubator-bluemarlin/issues/38


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@bluemarlin.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org