Description
Hadoop a Spark allow comfortable processing of large amounts of data. I will demonstrate what it means to process data (personalized emails in this case) and how a subsequent use of processed data can look like.
I will also show advantages of using Python to manage the entire process and the use of libraries and tools such as Pandas, SciPy, and IPython notebook.