Hi, I have a question about Spark stage optimization. I have asked it in more detail on Stack Overflow; please see the following link: http://stackoverflow.com/questions/40192302/why-is-that-two-stages-in-apache-spark-are-computing-same-thing