You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by Esa Heikkinen <es...@student.tut.fi> on 2017/12/28 12:33:43 UTC

Pyspark and searching items from data structures

Hi

I would want to build pyspark-application, which searches sequential items or events of time series from csv-files.

What are the best data structures for this purpose ? Dataframe of pyspark or pandas, or RDD or SQL or something else ?

---
Esa