Most data sources are notoriously unreliable: sensors can be faulty,
humans may provide biased opinions,
remote websites might be stale, and so on.
Understanding and modeling these sources of error is a first step toward developing data cleaning techniques. Unfortunately, much of this is data source and application dependent.