正在加载图片...
Application Scenarios Complex object identification Data quality Real-life data is often dirty 1%0-5% of business data contains errors Dirty costs us businesses 600 billion dollars each year gartner Wrong price data in retail databases alone costs US consumers $2.5 billion annually Data cleaning tools deliver an overall business value of more than 600 million GBP each year at BT Data cleaning FORRESTER Data repairing Record matching(aka object identification, entity resolution, data deduplication) Complex object identification Modeling complex objects as graphs• Data quality – Real-life data is often dirty: 1%–5% of business data contains errors – Dirty costs us businesses 600 billion dollars each year – Wrong price data in retail databases alone costs US consumers $2.5 billion annually – Data cleaning tools deliver an overall business value of more than ‘‘600 million GBP’’ each year at BT. • Data cleaning – Data repairing – Record matching (aka. object identification, entity resolution, data deduplication) • Complex object identification – Modeling complex objects as graphs Application Scenarios 7 Complex object identification
<<向上翻页向下翻页>>
©2008-现在 cucdc.com 高等教育资讯网 版权所有