[模式识别] [讲义] 马尔科夫链与隐式马尔科夫模型

随机过程:

每个随机过程是关于随机变量 t 的函数:
连续情况下 ζ ( t ) , t [ α , β ]
离散情况下 ζ ( t 1 ) . . . ζ ( t T ) , t = 1 , . . . , T

离散时间随机过程,状态序列 S 1 , . . . S T ,记 V = 1 , 2 , . . . , N ,即 S t 取值为 V 中某个 i S t = i

P ( S 1 , . . . , S T ) = P ( S 1 ) P ( S 2 | S 1 ) P ( S 3 | S 1 , S 2 ) . . . P ( S T | S 1 , . . . , S T 1 )

马尔科夫性:前一个状态确定,后一个状态就只跟前一个状态有关,即

P ( S T | S 1 , . . . , S T 1 ) = P ( S T | S T 1 )

所以:

P ( S 1 , . . . , S T ) = P ( S 1 ) P ( S 2 | S 1 ) P ( S 3 | S 2 ) . . . P ( S T | S T 1 ) = P ( S 1 ) t = 2 T P ( S t | S t 1 )

上式称为 马尔科夫过程/马尔科夫链

齐次马尔科夫过程 P ( S 2 = j | S 1 = i ) = P ( S 3 = j | S 2 = i )

即(与 t 无关):

P ( S t = j | S t 1 = i ) = a i j


隐式马尔科夫模型(HMM, Hidden Markov Model)

S = { 1 , 2 , . . . , N } , N 个状态(股市的牛、熊、 B o r i n g );

初始状态概率分布 π = [ π 1 , . . . , π N ]

状态转移矩阵 A = [ a i j ] N M , a i j = P ( S t | S t 1 )

A = B o r i n g [ 0.6 0.2 0.2 0.5 0.3 0.2 0.4 0.1 0.5 ]

V = V 1 , . . . , V m ,每个状态下的观察符号为 M (股票的升、降、平);

观察符号概率分布矩阵 B = [ b j k ] N M , b j k = P ( O t = V k | S t = j ) ;

B = 1 2 3 B o r [ 0.6 0.2 0.2 0.5 0.3 0.2 0.4 0.1 0.5 ]



记观察序列为 O = { O 1 , O 2 , . . . , O T } , S = { S 1 , . . . , S T }

P ( O | A , B , π ) = P ( O | λ ) = S P ( O , S | λ ) = S P ( O | S λ ) P ( S | λ )

其中:

P ( S | λ ) = P ( S 1 , . . . , S T | λ ) = P ( S 1 ) P ( S 2 | S 1 ) . . . P ( S T | S T 1 ) = π S 1 a S 1 S 2 . . . a S T 1 S T

P ( O | S , λ ) = P ( O 1 , . . . , O T | S 1 , . . . , S T , λ ) = P ( O 1 | S 1 ) P ( O 2 | S 2 ) . . . P ( O T | S T ) = b S 1 O 1 . . . b S T O T

所以:

P ( O | λ ) = S π S 1 b S 1 O 1 a S 1 S 2 b S 2 O 2 . . . a S T 1 S T b S T O T

由于其中序列S有N^T种可能,计算复杂度过高,引入

α t ( i ) = P ( O 1 O 2 . . . O t , S t = i | λ )


α 1 ( i ) = P ( O 1 , S t = i | λ ) = P ( S 1 = i | λ ) P ( O 1 | S 1 = i , λ ) = π i b i O 1

下证:

α t + 1 ( j ) = P ( O 1 O 2 . . . O t + 1 , S t + 1 = j | λ ) = ( i = 1 N α t ( i ) a i j ) b j O t + 1

证明:
( i = 1 N α t ( i ) a i j ) b j O t + 1 = [ i = 1 N P ( O 1 . . . O t , S t = i | λ ) P ( S t + 1 = j | S t = i ) ] P ( O t + 1 | S t + 1 = j ) = ( i = 1 N P ( O 1 . . . O t , S t = i , S t + 1 = j | λ ) ) P ( O t + 1 | S t + 1 = j ) ) = P ( O 1 . . . O t , S t + 1 = j | λ ) P ( O t + 1 | S t + 1 = j ) ) = P ( O 1 . . . O t + 1 , S t + 1 = j | λ )

所以:

α T ( i ) = P ( O 1 O 2 . . . O T , S T = i | λ )

P ( O | λ ) = i = 1 N P ( O 1 . . . O T , S T = i | λ ) = i = 1 N α T ( i )

复杂度为 O ( N 2 T )

猜你喜欢

转载自blog.csdn.net/Blanchedingding/article/details/80874945