This paper contributes to the study of the random number K_n of blocks in the random partition of {1,...,n} induced by random sampling from the celebrated two-parameter Poisson–Dirichlet process. For any α ∈ (0, 1) and θ > −α, Pitman (Combinatorial Stochastic Processes (2006) Springer, Berlin) showed that (K_n)/(n^α) → S as n → +∞, where the limiting random variable, referred to as Pitman’s α-diversity, is distributed according to a polynomially scaled Mittag–Leffler distribution function. Our main result is a Berry–Esseen theorem for Pitman’s α-diversity S. The proof relies on three intermediate novel results which are of independent interest: (i) a probabilistic representation of the distribution of K_n in terms of a compound distribution; (ii) a quantitative version of the Laplace’s approximation method for integrals; (iii) a refined quantitative bound for Poisson approximation. An application of our Berry–Esseen theorem is presented in the context of Bayesian nonparametric inference for species sampling problems, quantifying the error of a posterior approximation that has been extensively applied to infer the number of unseen species in a population.

A BERRY–ESSEEN THEOREM FOR PITMAN’S α-DIVERSITY

Dolera Emanuele;
2020-01-01

Abstract

This paper contributes to the study of the random number K_n of blocks in the random partition of {1,...,n} induced by random sampling from the celebrated two-parameter Poisson–Dirichlet process. For any α ∈ (0, 1) and θ > −α, Pitman (Combinatorial Stochastic Processes (2006) Springer, Berlin) showed that (K_n)/(n^α) → S as n → +∞, where the limiting random variable, referred to as Pitman’s α-diversity, is distributed according to a polynomially scaled Mittag–Leffler distribution function. Our main result is a Berry–Esseen theorem for Pitman’s α-diversity S. The proof relies on three intermediate novel results which are of independent interest: (i) a probabilistic representation of the distribution of K_n in terms of a compound distribution; (ii) a quantitative version of the Laplace’s approximation method for integrals; (iii) a refined quantitative bound for Poisson approximation. An application of our Berry–Esseen theorem is presented in the context of Bayesian nonparametric inference for species sampling problems, quantifying the error of a posterior approximation that has been extensively applied to infer the number of unseen species in a population.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11571/1342865
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 3
social impact