Corpus ID: 1689067

CSM-349 - Benford's Law: An Empirical Investigation and a Novel Explanation

@inproceedings{Scott2001CSM349B,
  title={CSM-349 - Benford's Law: An Empirical Investigation and a Novel Explanation},
  author={Pd Scott and Maria Fasli},
  year={2001}
}
This report describes an investigation into Benford?s Law for the distribution of leading digits in real data sets. A large number of such data sets have been examined and it was found that only a small fraction of them conform to the law. Three classes of mathematical model of processes that might account for such a leading digit distribution have also been investigated. We found that based on the notion of taking the product of many random factors the most credible. This led to the… Expand
An Empirical Non-Parametric Likelihood Family of Data-Based Benford-Like Distributions
A mathematical expression known as Benford's law provides an example of an unexpected relationship among randomly selected sequences of first significant digits (FSDs). Newcomb [Note on the frequencyExpand
Reality Checks for a Distributional Assumption: The Case of “Benford’s Law”
In recent years, many articles have promoted uses for “Benford’s Law,” claimed to identify a nearly ubiquitous distribution pattern for the frequencies of first digits of numbers in many data sets.Expand
Generalized Benford’s Law as a Lie Detector
TLDR
In two studies, some empirical support for the generalized Benford analysis is provided and it is concluded that familiarity with the numerical domain involved as well as cognitive effort only have a mild effect on the method’s accuracy. Expand
Benford's Law, Families of Distributions and a Test Basis
Benford's Law is used to test for data irregularities. While novel, there are two weaknesses in the current methodology. First, test values used in practice are too conservative and the test valuesExpand
Remarks on the Use of Benford's Law
  • James C. Alexander
  • Mathematics
  • 2009
Benford's law is a method of digital analysis that is used to probe sets of data. It is based on the frequency of occurrence of the digits '1' through '9' of the first significant digit of dataExpand
An Empirical Non-Parametric Likelihood Family of Data-Based Benford-Like Distributions
A mathematical expression known as Benford's law provides an example of an unexpected relationship among randomly selected first significant digits (FSD). Newcomb (1881), and later Benford (1938),Expand
Benford's Law as an Instrument for Fraud Detection in Surveys Using the Data of the Socio-Economic Panel (SOEP)
TLDR
This paper focuses on fraud detection in surveys using Socio-Economic Panel (SOEP) data as an example for testing newly methods proposed here, and develops a measure that reflects the plausibility of the digit distribution in interviewer clusters and shows that in several SOEP subsamples, Benford's Law holds for the available continuous data. Expand
Benford’s Law as an Instrument for Fraud Detection in Surveys Using the Data of the Socio-Economic Panel (SOEP)
Summary This paper focuses on fraud detection in surveys using Socio-Economic Panel (SOEP) data as an example for testing newly methods proposed here. A statistical theorem referred to as Benford’sExpand
Scatter and regularity imply Benford's law... and more
A random variable (r.v.) X is said to follow Benford's law if log(X) is uniform mod 1. Many experimental data sets prove to follow an approximate version of it, and so do many mathematical series andExpand
The value of the last digit: statistical fraud detection with digit analysis
TLDR
It is proved that last digits are approximately uniform for distributions with an absolutely continuous distribution function, and a result for ‘certain’ sums of lattice-variables as well. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 22 REFERENCES
A Statistical Derivation of the Significant-Digit Law
The history, empirical evidence and classical explanations of the significant-digit (or Benford's) law are reviewed, followed by a sum- mary of recent invariant-measure characterizations. Then a newExpand
Benford's Law
TLDR
It is shown that a number of different statistics associated with computation like space and runtime often follow Benford's Law, and that search cost on input data that follows Benford’s Law is often very different to that on more uniform data. Expand
On the Distribution of First Significant Digits
Introduction. It has been noticed by astute observers that well used tables of logarithms are invariably dirtier at the front than at the back. Upon reflection one is led to inquire whether there areExpand
An Application of Fourier Series to the Most Significant Digit Problem
It is an old observation [N] that in tables of logarithms, the first few pages are more smudged and worn than the later pages. It follows that the distribution of the first significant digit of theExpand
Base-Invariance Implies Benford's Law
A derivation of Benford's Law or the First-Digit Phenomenon is given assuming only base-invariance of the underlying law. The only baseinvariant distributions are shown to be convex combinations ofExpand
Evaluating data mining procedures: techniques for generating artificial data sets
TLDR
It is argued that tests done with real data sets cannot provide all the information needed for a thorough assessment of their performance characteristics and that artificial data sets are therefore essential. Expand
Statistical distributions
TLDR
The probability integrals are evaluated by series or continued fractions, identified by their equation numbers in Abramowitz and Stegun, and the inverses are obtained by Newtonian iteration. Expand
On the distribution of numbers
  • R. Hamming
  • Mathematics, Computer Science
  • Bell Syst. Tech. J.
  • 1970
TLDR
It is shown how the arithmetic operations of a computer transform various distributions toward the limiting distribution of the mantissas of floating point numbers, and a number of applications to hardware, software, and general computing show that this distribution is not merely an amusing curiosity. Expand
The Peculiar Distribution of First Digits
...
1
2
3
...