You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@zeppelin.apache.org by anish singh <an...@gmail.com> on 2016/08/09 05:51:31 UTC

[GSoC - 2016][Zeppelin Notebooks] Stanford Datasets Collection

Hello,

The Stanford Large Network Dataset Collection [0] mentions a list of
datasets that are available for use and analysis from their site. The
datasets are mainly graph datasets on various Internet activities such as
on-line community interaction and reddit posts, amazon product and customer
data among others. These can be analyzed using GraphX.

From the large list of datasets, a few can be chosen and used for analysis
in Zeppelin. Additionally, some of the pages mention links to papers that
provide new ways of analyzing the data.

[0]. http://snap.stanford.edu/data/#communities

Thanks,
Anish.