Big Data, 2018-19 edition Minutes for lecture V ----- Plan for the day ----- - Iterables, more iterables and files. -- follow up on Snakify Iterables are there to manipulate SETS. Spreadsheets have ROWS for the same reason. Example: Katzaros' index: =($B2+$J2)*(4/(2018-$C2+1)) Importance = (4 x # of citations) / # of years elapsed - Practicum: installing and running Python *locally* -- a per-host installation of Python 3 and some editor -Exercise: estimate the number of cancelations (churn) in Facebook -- download the dataset -- write own Python code; debug; repeat -- create own mini-instance; run; validate -- run stat on actual instance