2.1 Overview of data types 2.2 Review of Data pre-processing tools and platforms 2.3 Clean, storage and management of raw data 2.4 Collections of data analysis and data mining
Parallel DBMS technologies Proposed in the late eighties Matured over the last two decades Multi-billion dollar industry: Proprietary DBMS Engines intended as Data Warehousing solutions for very large enterprises Hadoop Spark UC Berkeley