Posted to dev@airflow.apache.org by Teresa Martyny <te...@omadahealth.com> on 2018/08/24 18:29:39 UTC
Late arriving data from external apis
Hey there,
We are moving our ETL over to Airflow and rewriting our scripts in
Python. Due to client-side queueing, offline iOS and Android data may take
up to 5 days to enter Mixpanel's raw data store. So far, the fastest and
easiest approach we have found is to drop and replace the most recent
5 days of Mixpanel data every day.
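A minimal sketch of that drop-and-replace pattern, for illustration only. The
table `mixpanel_events`, its columns, and the `fetch_events` callable are
hypothetical stand-ins, not our actual code, and SQLite is used just to keep
the example self-contained. It also assumes the 5-day window includes the run
date itself.

```python
import sqlite3
from datetime import date, timedelta

LOOKBACK_DAYS = 5  # late mobile events can take up to 5 days to land


def lookback_window(run_date, days=LOOKBACK_DAYS):
    """Return (start, end) for the `days` dates ending at run_date, inclusive."""
    return run_date - timedelta(days=days - 1), run_date


def drop_and_replace(conn, run_date, fetch_events):
    """Delete the window's rows and reload them in a single transaction.

    `fetch_events(start, end)` is a hypothetical callable returning a list of
    (event_date, event_id, payload) tuples, with event_date as an ISO string.
    """
    start, end = lookback_window(run_date)
    rows = fetch_events(start, end)
    with conn:  # commits on success, rolls back if anything raises
        conn.execute(
            "DELETE FROM mixpanel_events WHERE event_date BETWEEN ? AND ?",
            (start.isoformat(), end.isoformat()),
        )
        conn.executemany(
            "INSERT INTO mixpanel_events (event_date, event_id, payload) "
            "VALUES (?, ?, ?)",
            rows,
        )
    return len(rows)
```

Because the delete and the insert share one transaction, re-running the job
for the same date leaves the table in the same state, and rows older than the
window are never touched.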
Moving into the Airflow world, I thought it would be nice to have Airflow's
features handle some of this for us.
How are other folks handling late-arriving data from external APIs?
Thanks!
*Teresa Martyny*
pronouns: she, her, hers
Software Engineer | Data Team | Omada Health <https://www.omadahealth.com/>
500 Sansome St #200, SF, CA 94111
*What is Omada?* <https://vimeo.com/203386025>