Beta distribution and Dirichlet distribution

Article directory

0. Supplementary knowledge

0.1 Beta function $\Beta(P, Q)$

The beta function is also called Euler's first integral and is defined as:
$\begin{aligned } \Beta(P,Q) = \int_0^1x^{P-1}(1-x)^{Q-1}dx \quad (P>0,Q>0) \end{aligned}$
If the beta function is changed into an indefinite integral, there is an incomplete beta function $\Beta_x(P,Q)$
$\begin{aligned} \Beta_x(P,Q) = \int_0^xu^{P-1}(1-u)^{Q-1}du \quad (0\le x \le 1,P>0,Q>0) \end{aligned}$

0.2 Gamma function $\Gamma(x)$

The gamma function is also called Euler's second integral and is defined as:
$\begin{aligned} \Gamma(x) &= \int_0^{+\infin}t^{x-1}e^{-t}dt \quad (x>0)\\ &= 2\int_0^ {+\infin} t^{2x-1}e^{-t^2}dt \end{aligned}$
Some properties of the gamma function:

$\Gamma(x+1) = x\Gamma(x)$
$\Gamma(n) = (n-1)!$
$\Gamma(\frac{1}{2})=\sqrt{\pi}$
given $\beta$ function relation: $\Beta(m,n)=\frac{\Gamma(m)\Gamma(n)}{\ gamma(m+n)}$

1. Beta Distribution

Beta distribution , also known as $B\Beta$ distribution, defined at $(0, 1)$ On the interval, there are two parameters $\alpha,\beta \gt 0$ , the random variable obeys the beta distribution and is generally written as $X\sim \text{Be}(\alpha,\beta)$

1.1 Probability density function PDF

$\begin{aligned} f(x;\alpha,\beta) & = \frac{x^{\alpha-1}(1-x)^{\beta-1}}{\int_0^1u^{\alpha-1}(1-u)^{\beta-1}du }\\ &= \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}x^{\alpha-1}(1-x)^{\beta-1 }\\ &=\frac{1}{B(\alpha,\beta)}x^{\alpha-1}(1-x)^{\beta-1} \end{aligned}$

1.2 Cumulative distribution function CDF

$\begin{aligned} F(x;\alpha,\beta) = \frac{\Beta_x(\alpha,\beta) }{\Beta(\alpha,\beta)} \end{aligned}$
Among them, $\Beta_x(\alpha,\beta)$ is an incomplete beta function, defined as:

1.3 Digital features

Definition: $\mu=E(X)=\frac{\alpha}{\alpha+\beta}$
方差： $Var(X)=E((X-\mu)^2)= \frac{\alpha\beta}{(\alpha+\beta)^2(\alpha+\beta+1)}$

2. Dirichlet Distribution

Dirichlet distribution is a multivariate generalization of beta distribution. For $Dirichlet distribution in d$ dimensions, with a total of $d$ parameters.
The Dirichlet distribution is about a set of $d$ continuous variables $\mu_i\in[0,1]$ probability distribution; or a $Probability distribution of d-$ dimensional vectors, where vector elements $\mu_i\in[0,1]$ , and have $\sum_{i=1}^d\mu_i=1$ 。

2.1 Probability density function PDF

记 $\boldsymbol{\mu} = (\mu_1;\mu_2;\cdots;\mu_d)$ 。
令parameters $\boldsymbol{\alpha}=(\alpha_1;\alpha_2;\cdots;\alpha_d)$ ， $\hat{\alpha} = \sum_{i=1}^d\alpha_i$ , and $\alpha_i > 0$

Given the dependent variable:
$\begin{aligned} p(\mu_1,\mu_2,\dots,\mu_d|\alpha_1,\alpha_2,\dots,\alpha_d) &= p(\ballsymbol{\mu}|\ballsymbol{; \alpha}) = \text{Dir}(\bold symbol{\mu}|\bold symbol{\alpha})\\ &= \frac{\Gamma(\hat{\alpha})}{\Gamma(\alpha_1) \Gamma(\alpha_2)\cdots\Gamma(\alpha_d)}\prod_{i=1}^d\mu_i^{\alpha_i-1}\\ &= \frac{\Gamma(\hat{\alpha}); }{\prod_{i=1}^d\Gamma(\alpha_i)}\prod_{i=1}^d\mu_i^{\alpha_i-1} \end{aligned}$
Obviously, when $d = At 2$ , the Dirichlet distribution degenerates into the Beta distribution.

2.2 Digital features

Definition: $\mathbb{E}[\mu_i] = \frac{\alpha_i}{\hat{\alpha}}$
方差： $Var[\mu_i] = \frac{\alpha_i(\hat{\alpha}-\alpha_i)}{ \hat{\alpha}(\hat{\alpha}+1)}$