Synthetic data

Known as: Synthetic, Synthetic datum 
The creation of synthetic data is an involved process of data anonymization; that is to say that synthetic data is a subset of anonymized data… (More)
Wikipedia

Topic mentions per year

Topic mentions per year

1937-2018
01000200019372017

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2008
Highly Cited
2008
Despite the intense interest towards realizing the Semantic Web vision, most existing RDF data management schemes are constrained… (More)
  • figure 1
  • figure 2
  • figure 5
  • figure 3
  • figure 6
Is this relevant?
Highly Cited
2008
Highly Cited
2008
This paper presents a novel adaptive synthetic (ADASYN) sampling approach for learning from imbalanced data sets. The essential… (More)
  • figure 1
  • table I
  • figure 2
  • table II
Is this relevant?
Highly Cited
2007
Highly Cited
2007
Uncertain data are inherent in some important applications. Although a considerable amount of research has been dedicated to… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 7
Is this relevant?
Highly Cited
2006
Highly Cited
2006
Clustering is an important task in mining evolving data streams. Beside the limited memory and one-pass constraints, the nature… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 6
  • figure 7
Is this relevant?
Highly Cited
2003
Highly Cited
2003
We consider an environment where distributed data sources continuously stream updates to a centralized processor that monitors… (More)
  • figure 1
  • table 1
  • figure 2
  • figure 3
  • figure 4
Is this relevant?
Highly Cited
2003
Highly Cited
2003
The clustering problem is a difficult problem for the data stream domain. This is because the large volumes of data arriving in a… (More)
  • table 1
  • figure 1
  • figure 2
  • figure 3
  • figure 6
Is this relevant?
Highly Cited
2001
Highly Cited
2001
MOTIVATION Clustering is a useful exploratory technique for the analysis of gene expression data. Many different heuristic… (More)
  • table 1
  • table 2
  • table 3
  • table 4
  • table 5
Is this relevant?
Highly Cited
2000
Highly Cited
2000
In this paper, we propose a novel formulation for distance-based outliers that is based on the distance of a point from its kth… (More)
  • table 1
  • figure 1
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2000
Highly Cited
2000
Many organizations today have more than very large databases; they have databases that grow without limit at a rate of several… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
1997
Highly Cited
1997
We consider the problem of analyzing market-basket data and present several important contributions. First, we present a new… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 5
  • figure 7
Is this relevant?