John W. Tukey

Learn More
Document clustering has not been well received as an information retrieval tool. Objections to its use fall into two main categories: first, that clustering is too slow for large corpora (with running time often quadratic in the number of documents); and second, that clustering does not appreciably improve retrieval. We argue that these problems arise only(More)
An algorithm for the analysis of multivariate data is presented, and discussed in terms of specific examples. The algorithm seeks to find oneand two-dimensional linear projections of multivariate data that are relatively highly revealing. *Supported by the U.S. Atomic Energy Commission under Contract AT(@+3)515. **Prepared in part in connection with(More)
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you(More)
Experiments in which the treatments are composed of a series of doses of a compound and a zero dose control are often used in animal toxicity studies. A test procedure is proposed to assess trends in the response variable. The notion of a no-statistical-significance-of-trend (NOSTASOT) dose is introduced, and questions of multiplicity of statistical tests(More)
We present numerical methods for studying the relationship between the shape of the vocal tract and its acoustic output. For a stationary vocal tract, the articulatory-acoustic relationship can be represented as a multidimensional function of a multidimensional argument: y=f(x), where x, y are vectors describing the vocal-tract shape and the resulting(More)
(Submitted to A.E.C. Scientific Computer Information Exchange Meeting, May 2-3, 1974) PRIM-9 is an interactive data display and analysis system for the examination and dissection of multidimensional data. It allows the user to manipulate and view point sets in up to nine dimensions. This is accomplished by providing all 36 two-dimensional projections along(More)
The conventional procedure for null hypothesis significance testing has long been the target of appropriate criticism. A more reasonable alternative is proposed, one that not only avoids the unrealistic postulation of a null hypothesis but also, for a given parametric difference and a given error probability, is more likely to report the detection of that(More)
Problems of statistical and conceptual design of experiments are exacerberated by ethical issues in many, if not most, clinical trials. Statutory requirements of demonstrated effectiveness are far from being clearly resolved--either qualitatively or quantitatively. Ethics, bolstered by informed consent, are likely to keep us from ever learning the answer to(More)