]>
The zeta distribution is used to model the size or ranks of certain types of objects randomly chosen from certain types of populations. Typical examples include the frequency of occurrence of a word randomly chosen from a text, or the population rank of a city randomly chosen from a country. The zeta distribution is also known as the Zipf distribution, in honor of the American linguist George Zipf.
The Riemann zeta function , named after Bernhard Riemann, is defined as follows:
(You might recall from calculus that the series in the zeta function converges for and diverges for . A graph of the zeta function on the interval is given below:
Try to verify the main properties of the graph analytically. In particular, show that
The zeta function is transcendental, and most of its values must be approximated. However, can be given explicitly for even integer values of ; in particular, and .
Show that the function given below is probability density function for any .
The discrete distribution defined by the density function in Exercise 2 is called the zeta distribution with parameter . In an algebraic sense, the zeta distribution is a discrete version of the Pareto distribution.
Let denote the frequency of occurrence of a word chosen at random from a certain text, and suppose that has the zeta distribution with parameter . Find .
Suppose that has the zeta distribution with parameter . Show that the distribution is a one-parameter exponential family with natural parameter and natural statistic .
The moments of the zeta distribution can be expressed easily in terms of the zeta function.
Suppose that has the zeta distribution with parameter and that . Show that
In particular, show that