#### Filter Results:

- Full text PDF available (61)

#### Publication Year

1988

2017

- This year (3)
- Last 5 years (15)
- Last 10 years (18)

#### Publication Type

#### Co-author

#### Journals and Conferences

#### Data Set Used

#### Key Phrases

Learn More

- Usama M. Fayyad, Keki B. Irani
- IJCAI
- 1993

Since most real-world applications of classification learning involve continuous-valued attributes, properly addressing the discretization process is an important problem. This paper addresses the use of the entropy minimization heuristic for discretizing the range of a continuous-valued attribute into multiple intervals. We briefly present theoretical… (More)

- Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth
- AI Magazine
- 1996

databases have been attracting a significant amount of research, industry, and media attention of late. What is all the excitement about? This article provides an overview of this emerging field, clarifying how data mining and knowledge discovery in databases are related both to each other and to related fields, such as machine learning, statistics, and… (More)

- Usama M. Fayyad, Padhraic Smyth, +22 authors Martin L. Kersten
- 1996

- Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth
- Advances in Knowledge Discovery and Data Mining
- 1996

- Usama M. Fayyad
- ILP
- 1991

It has been estimated that the amount of information in the world doubles every 20 months. The size and number of databases probably increases even faster. In 1989, the total number of databases in the world was estimated at five million, although most of them are small DBASE II I databases. The automation of business activities produces an ever-increasing… (More)

- Paul S. Bradley, Usama M. Fayyad
- ICML
- 1998

Practical approaches to clustering use an iterative procedure (e.g. K-Means, EM) which converges to one of numerous local minima. It is known that these iterative techniques are especially sensitive to initial starting conditions. We present a procedure for computing a refined starting condition from a given initial one that is based on an efficient… (More)

- Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth
- Commun. ACM
- 1996

AS WE MARCH INTO THE AGE of digital information, the problem of data overload looms ominously ahead. Our ability to analyze and understand massive datasets lags far behind our ability to gather and store the data. A new generation of computational techniques and tools is required to support the extraction of useful knowledge from the rapidly growing volumes… (More)

- Paul S. Bradley, Usama M. Fayyad, Cory Reina
- KDD
- 1998

Practical clustering algorithms require multiple data scans to achieve convergence. For large databases, these scans become prohibitively expensive. We present a scalable clustering framework applicable to a wide class of iterative clustering. We require at most one scan of the database. In this work, the framework is instantiated and numerically justified… (More)

- Ying Zhao, George Karypis, Usama M. Fayyad
- Data Mining and Knowledge Discovery
- 2005

Fast and high-quality document clustering algorithms play an important role in providing intuitive navigation and browsing mechanisms by organizing large amounts of information into a small number of meaningful clusters. In particular, clustering algorithms that build meaningful hierarchies out of large document collections are ideal tools for their… (More)

- Usama M. Fayyad, Keki B. Irani
- Machine Learning
- 1992

We present a result applicable to classification learning algorithms that generate decision trees or rules using the information entropy minimization heuristic for discretizing continuous-valued attributes. The result serves to give a better understanding of the entropy measure, to point out that the behavior of the information entropy heuristic possesses… (More)