Motivation
– Why do we need big data integration?
– How has “small” data integration been done?
– Challenges in big data integration
Schema alignment
Record linkage
Data fusion
Emerging topics
"Mass Data Processing: Cloud Computing" course teaching resources (reading material): Efficient Clustering of High-Dimensional Data Sets with Application to Reference Matching