Noisy data Noise: random error or variance in a measured variable a Incorrect attribute values may be due to faulty data collection instruments ◆ data entry problems data transmission problems ◆ technology limitation inconsistency in naming convention a Other data problems which require data cleaning ◆ duplicate records ◆ incomplete data ◆ inconsistent data 同济大学软件学院 ool of Software Engineering. Tongpi Unversity9 Noisy Data ◼ Noise: random error or variance in a measured variable ◼ Incorrect attribute values may be due to ◆ faulty data collection instruments ◆ data entry problems ◆ data transmission problems ◆ technology limitation ◆ inconsistency in naming convention ◼ Other data problems which require data cleaning ◆ duplicate records ◆ incomplete data ◆ inconsistent data