【统计学习笔记】习题四

在这里插入图片描述

4.1 朴素贝叶斯概率估计公式

a. P ( Y = c k ) P(Y=c_k) P(Y=ck)

P ( Y = c k ) = θ P(Y=c_k)=\theta P(Y=ck)=θ,进行N次实验,n次Y=ck
则有:
L ( θ ) = θ n ( 1 − θ ) N − n L(\theta)=\theta^n(1-\theta)^{N-n} L(θ)=θn(1θ)Nn
取对数:
log ⁡ L ( θ ) = n log ⁡ θ + ( N − n ) log ⁡ ( 1 − θ ) \log L(\theta)=n\log\theta+(N-n)\log(1-\theta) logL(θ)=nlogθ+(Nn)log(1θ)
求导:
d log ⁡ L ( θ ) d θ = n θ − N − n 1 − θ \frac{d\log L(\theta)}{d\theta}=\frac{n}{\theta}-\frac{N-n}{1-\theta} dθdlogL(θ)=θn1θNn
θ = n / N \theta=n/N θ=n/N时,似然函数取极大值。
故先验概率的极大似然估计为:
P ( Y = c k ) = n N = ∑ i = 1 N I ( y i = c k ) N P(Y=c_k)=\frac{n}{N}=\frac{\sum\limits_{i=1}^NI(y_i=c_k)}{N} P(Y=ck)=Nn=Ni=1NI(yi=ck)

b. P ( X ( j ) = a j l ∣ Y = c k ) P(X^{(j)}=a_{jl}|Y=c_k) P(X(j)=ajlY=ck)

P ( X ( j ) = a j l ∣ Y = c k ) = θ P(X^{(j)}=a_{jl}|Y=c_k)=\theta P(X(j)=ajlY=ck)=θ,进行N次试验,有n次Y=ck,有m次Y=ck且X(j)=ajl
则有:
L ( θ ) = ( n N θ ) m ( 1 − n θ N ) N − m L(\theta)=(\frac{n}{N}\theta)^m(1-\frac{n\theta}{N})^{N-m} L(θ)=(Nnθ)m(1Nnθ)Nm
取对数:
log ⁡ L ( θ ) = m log ⁡ n N + m log ⁡ θ + ( N − m ) log ⁡ ( 1 − n θ N ) \log L(\theta)=m\log\frac{n}{N}+m\log\theta+(N-m)\log{(1-\frac{n\theta}{N})} logL(θ)=mlogNn+mlogθ+(Nm)log(1Nnθ)
求导:
d log ⁡ L ( θ ) d θ = m θ − n ( N − m ) N ( 1 − n θ / N ) \frac{d\log L(\theta)}{d\theta}=\frac{m}{\theta}-\frac{n(N-m)}{N(1-n\theta/N)} dθdlogL(θ)=θmN(1nθ/N)n(Nm)
则:
P ( X ( j ) = a j l ∣ Y = c k ) = m n P(X^{(j)}=a_{jl}|Y=c_k)=\frac{m}{n} P(X(j)=ajlY=ck)=nm

猜你喜欢

转载自blog.csdn.net/qq_39573785/article/details/107695032
今日推荐