Dirty data

Dirty data is inaccurate, incomplete or erroneous data, especially in a computer system or database. In reference to databases, this is data that… (More)
Wikipedia

Topic mentions per year

Topic mentions per year

1986-2017
051019862017

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2015
2015
An important obstacle to accurate data analytics is dirty data in the form of missing, duplicate, incorrect, or inconsistent… (More)
  • figure 1
  • table 1
  • figure 2
  • figure 3
  • figure 4
Is this relevant?
2014
2014
It is widely accepted that proper data publishing is difficult. The majority of Linked Open Data (LOD) does not meet even a core… (More)
  • figure 1
  • figure 2
Is this relevant?
2011
2011
There is a growing awareness that high quality of data is a key to today’s business success and that dirty data existing within… (More)
  • table I
  • table II
  • table III
  • table IV
  • table V
Is this relevant?
2011
2011
Dirty data refers to the inconsistent information that has no meaning in the system. It is collecting almost by mistakes and from… (More)
Is this relevant?
2010
2010
We investigate the problem of creating and analyzing samples of relational databases to find relationships between string-valued… (More)
  • table 1
  • figure 1
  • table 3
  • table 2
  • figure 2
Is this relevant?
Highly Cited
2008
Highly Cited
2008
Dirty data is a serious problem for businesses leading to incorrect decision making, inefficient daily operations, and ultimately… (More)
  • table 1
  • figure 1
  • figure 2
  • table 3
  • figure 3
Is this relevant?
2006
2006
There are two either explicitly or implicitly and widely accepted ideas about the distribution of land in Ethiopia after the… (More)
  • table 1
  • table 2
  • table 3
  • figure 1
  • figure 2
Is this relevant?
Highly Cited
2003
Highly Cited
2003
Today large corporations are constructing enterprise data warehouses from disparate data sources in order to run enterprise-wide… (More)
Is this relevant?
2003
2003
Abstract: Information quality assessment is the process of inspecting business information to ensure that it meets the needs of… (More)
  • figure 1
  • figure 2
Is this relevant?
2001
2001
Distributional dominance criteria are commonly applied to draw welfare inferences about comparisons, but conclusions drawn from… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?