Copyright © 2009 Werner Hürlimann. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract
Many distributions for first digits of integer sequences are not Benford. A simple method to derive parametric analytical extensions of Benford's law for first digits of numerical data is proposed. Two generalized Benford distributions are considered, namely, the two-sided power Benford (TSPB) distribution, which has been introduced in Hürlimann (2003), and the new Pareto Benford (PB) distribution. Based on the minimum chi-square estimators, the fitting capabilities of these generalized Benford distributions are illustrated and compared at some interesting and important integer sequences. In particular, it is significant that much of the analyzed integer sequences follow with a high P-value the generalized Benford distributions. While the sequences of prime numbers less than 1000, respectively, 10 000 are not at all Benford or TSPB distributed, they are approximately PB distributed with high P-values of 93.3% and 99.9% and reveal after a further deeper analysis of longer sequences a new interesting property. On the other side, Benford's law of a mixing of data sets is rejected at the 5% significance level while the PB law is accepted with a 93.6% P-value, which improves the P-value of 25.2%, which has been obtained previously for the TSPB law.