[Stereo Vision (2)] Epipolar Geometry and Key Matrix

This is a personal study note. I have borrowed a lot of good pictures and articles (references are at the end of the article), mainly to sort out relevant knowledge in order to form my own system. The text expression is pieced together, and the symbolic formula is manually entered. Please point out if there are any mistakes.

1. Epipolar geometry

Now I want to find two frames of images I 1 , I 2 I_1, I_2I1I2Movement between ( R , t R , tR , t ), if a pair of points with the same name p 1 , p 2 p_1, p_2are detected in two frames of imagesp1p2as the picture shows.
Forget about symbolic meaning in imaging principles for a moment.
insert image description here

  • Baseline: O 1 O 2 O_1O_2O1O2connection
  • Epipoles: e 1 , e 2 e_1, e_2e1e2
  • 极线(Epipolar line):l 1 、 l 2 l_1、l_2l1l2
  • Epipolar plane: O 1 , O 2 , P O_1, O_2, PO1O2, the plane formed by P

Let the space point P of the left camera coordinate system coordinates = [ X , Y , Z ] TP=[X,Y,Z]^TP=[X,Y,Z]T , thenthe two pixel positions known fromthe pinhole camera model

s 1 p 1 = KP , s 2 p 2 = K ( RP + t ) s_1 p_1=KP , s_2 p_2=K(RP+t)s1p1=KPs2p2=K ( RP+t)

Among them, KKK is the internal reference matrix,R, t R, tR , t is the motion relationship between two cameras (specifically,R 21 , t 21 R_{21}, t_{21}R21t21 )。 s 1 , s 2 s_1,s_2 s1s2is PPThe zz of point P in the two coordinate systemsz -axis coordinates, when using homogeneous coordinates, a vector will be equal to itself multiplied by any non-zero constant, which is usually used to express a projection relationship, called equal up to a scale. Writesp ≃ p sp ≃ pspp . So there are
p 1 ≃ KP , p 2 ≃ K ( RP + t ) p_1≃KP , p_2≃K(RP+t)p1KPp2K ( RP+t)

Next, start deriving:

  1. KK _K is moved to the left side of the equation:
    K 1 1 p 1 ≃ P , K 2 1 p 2 ≃ RP + t K^1_1p_1≃P, K^1_2p_2≃RP+tK11p1PK21p2RP+t
  2. On the normalized plane, x 1 = K 1 1 p 1 , x 2 = K 2 1 p 2 x_1=K^1_1p_1, x_2=K^1_2p_2x1=K11p1x2=K21p2,则x 1 = P , x 2 = RP + t x_1=P,x_2=RP+tx1=Px2=RP+t , then
    x 2 = R x 1 + t x_2=Rx_1+tx2=Rx1+t
  3. Use tt on both sides of the equationt cross product (vector and self cross product is 0) hast × x 2 = t × R x 1 + t × tt \times x_2=t \times Rx_1+t \times tt×x2=t×Rx1+t×t,即
    t × x 2 = t × R x 1 t \times x_2=t \times Rx_1 t×x2=t×Rx1
  4. Multiply both sides of the equation by x 2 T x_2^Tx2T,得 x 2 T t × x 2 = x 2 T t × R x 1 x_2^Tt \times x_2=x_2^T t \times Rx_1 x2Tt×x2=x2Tt×Rx1, the left side of the equation is obviously 0, then
    x 2 T t × R x 1 = 0 x_2^T t \times Rx_1=0x2Tt×Rx1=0
  5. Since the cross product is equivalent to the point product of antisymmetric matrices, ttThe antisymmetric matrix of t is t ∧ t^{\land}t (equivalent to both sides simultaneously withttt as outer product), then
    x 2 T t ∧ R x 1 = 0 x_2^T t^{\land} Rx_1=0x2TtRx1=0

This formula describes two pixel coordinates x 1 , x 2 x_1, x_2x1x2The connection between the spatial point at the imaging point of the two cameras through ttThe t external parameter matrix establishes an equality relationship, or a constraint relationship, and this constraint is calledan epipolar constraint.
Note that abovex 1 , x 2 x_1, x_2x1x2are normalized coordinates, if p 1 , p 2 p_1, p_2 are usedp1p2Substitute x 1 , x 2 x_1, x_2x1x2, there is
p 2 TK 2 − T t ∧ RK 1 1 p 1 = 0 p_2^TK^{-T}_2 t^{\land} RK^1_1p_1=0p2TK2TtRK11p1=0
These two formulas are equivalent, both are called epipolar constraints, and the geometric meaning isO 1 , O 2 , P , p 1 , p 2 O_1, O_2, P, p_1, p_2O1O2Pp1p2Coplanar.


2. Key Matrix

Key matrices: essential matrix, fundamental matrix, homography matrix.
The key matrix can establish the coordinate connection between the common points of two views, or complete the coordinate transformation between the common points. They are like a bridge between the dual views of stereo vision, so that they are closely connected to each other to form a whole, and then there is the concept of stereo vision system.

1) Essential Matrix and Fundamental Matrix

Remember E = t ∧ RE=t^{\land} RE=tR F = K 2 − T t ∧ R K 1 1 F=K^{-T}_2 t^{\land} RK^1_1 F=K2TtRK11, are called Essential Matrix and Fundamental Matrix respectively , then the epipolar constraint can be expressed as:
x 2 TE x 1 = 0 p 2 TF p 1 = 0 x_2^T Ex_1=0\\ p_2^ TFp_1=0x2TEx1=0p2TFp1=0

The epipolar constraint concisely gives the spatial position relationship of the two matching points, so the camera pose estimation problem becomes the following two steps:

  1. Calculate EE from the pixel position of the paired pointE orFFF
  2. According to EEE orFFF CalculateR, t R, tRt

It can be seen that the essential matrix EEE and Fundamental MatrixFFThe difference of F : the essential matrix directly and the normalized coordinatexxx establishes a relationship, butxxx needs to be determined by the internal parameter matrixKKK and pixel coordinatesppp is calculated, so the premise of using the essential matrix is ​​that the internal reference matrixKKK ; while the fundamental matrix is ​​directly related to the pixel coordinatesppp establishes a relationship without knowing the internal reference matrix. Or:

  • Essence matrix: reflects the relationship between the representations of [the image point of a point P in space] in the [camera coordinate system] under [cameras with different viewing angles].
  • Fundamental matrix: reflects the relationship between the representations of [image points of a point P in space] in the [image coordinate system] under [cameras with different viewing angles].

Due to EEE andFFF only differs from the internal parameters of the camera (the solution method of the two is the same), and the internal parameters are usually known in problems such as SLAM, so in practice, the simpler form of EE is oftenusedE

2) Solving the essential matrix

Since translation and rotation each have 3 degrees of freedom, E = t ∧ RE=t^{\land} RE=t Rhas 6 degrees of freedom in total. Since the epipolar constraint is a constraint that the equality is zero, after multiplying E by any non-zero constant, the antipolar constraint is still satisfied, that is,EEE is equivalent at different scales, soEEE actually has 5 degrees of freedom. This shows that we can use at least 5 pairs of points to solveEEE. _ However,EEThe inherent nature of E is a nonlinear property, which will cause trouble when estimating. Therefore, it is also possible to only consider its scale equivalence and use 8 pairs of points to estimate E—this is the classic eight-point method (Eight -point-algorithm). The eight-point method only utilizesEEThe linear nature of E , so it can be solved under the framework of linear algebra.
Consider a pair of matching points whose normalized coordinates arex 1 = [ u 1 , v 1 , 1 ] T x_1=[u_1,v_1,1]^Tx1=[u1,v1,1]T x 2 = [ u 2 , v 2 , 1 ] T x_2=[u_2,v_2,1]^T x2=[u2,v2,1]T ,wavelength range,wave
[ u 2 v 2 1 ] [ e 1 e 2 e 3 e 4 e 5 e 6 e 7 e 8 e 9 ] [ u 1 v 1 1 ] = 0 \left[ \begin{ matrix} u_2 & v_2 & 1 \end{matrix}\right]\left[\begin{matrix}e_1&e_2&e_3\\e_4&e_5&e_6\\e_7&e_8&e_9\\\end{matrix}\ right ] \ left [ \ begin { matrix } u_1 \\ v_1 \\ 1 \ end { matrix } \ right ]=0[u2v21] e1e4e7e2e5e8e3e6e9 u1v11 =0
will matrixEEE expansion, mapping form:
e = [ e 1 , e 2 , e 3 , e 4 , e 5 , e 6 , e 7 , e 8 , e 9 ] T e=[e_1,e_2,e_3,e_4,e_5 ,e_6,e_7,e_8,e_9]^Te=[e1,e2,e3,e4,e5,e6,e7,e8,e9]TThen
, the epipolar constraint can be written as the same aseee Correlated linear form:
[ u 2 u 1 , u 2 v 1 , u 2 , v 2 u 1 , v 2 v 1 v 2 , u 1 , v 1 , 1 ] ⋅ e = 0 [u_2u_1,u_2v_1,u_2 ,v_2u_1,v_2v_1v_2,u_1,v_1,1] \cdot e=0[u2u1,u2v1,u2,v2u1,v2v1v2,u1,v1,1]e=In the same way as 0
, it has the same representation for other point pairs. Then a system of linear equations (ui , viu^i, v^iui,vi displayiii feature points, and so on):
[ u 2 1 u 1 1 , u 2 1 v 1 1 , u 2 1 , v 2 1 u 1 1 , v 2 1 v 1 1 v 2 1 , u 1 1 , v 1 1 , 1 u 2 2 u 1 2 , u 2 2 v 1 2 , u 2 2 , v 2 2 u 1 2 , v 2 2 v 1 2 v 2 2 , u 1 2 , v 1 2 , 1 u 2 3 u 1 3 , u 2 3 v 1 3 , u 2 3 , v 2 3 u 1 3 , v 2 3 v 1 3 v 2 3 , u 1 3 , v 1 3 , 1 . . . u 2 8 u 1 8 , u 2 8 v 1 8 , u 2 8 , v 2 8 u 1 8 , v 2 8 v 1 8 v 2 8 , u 1 8 , v 1 8 , 1 ] [ e 1 e 2 e 3 e 4 e 5 e 6 e 7   e 8 e 9 ] = 0 \left[ \begin{matrix} u^1_2u^1_1,u^1_2v^1_1,u^1_2,v^1_2u^1_1,v^1_2v^1_1v^1_2,u^1_1,v^1_1,1\\ u^2_2u^2_1,u^2_2v^2_1,u^2_2,v^2_2u^2_1,v^2_2v^2_1v^2_2,u^2_1,v^2_1,1\\ u^3_2u^3_1,u^3_2v^3_1,u^3_2,v^3_2u^3_1,v^3_2v^3_1v^3_2,u^3_1,v^3_1,1\\ ...\\ u^8_2u^8_1,u^8_2v^8_1,u^8_2,v^8_2u^8_1,v^8_2v^8_1v^8_2,u^8_1,v^8_1,1 \end{matrix} \right] \left[ \begin{matrix} e_1 \\ e_2 \\ e_3\\ e_4 \\ e_5 \\ e_6\\ e_7 \\\ e_8 \\ e_9\\ \end{matrix} \right] =0 u21u11,u21v11,u21,v21u11,v21v11v21,u11,v11,1u22u12,u22v12,u22,v22u12,v22v12v22,u12,v12,1u23u13,u23v13,u23,v23u13,v23v13v23,u13,v13,1...u28u18,u28v18,u28,v28u18,v28v18v28,u18,v18,1 e1e2e3e4e5e6e7 e8e9 =0
Solving the linear equationA e = 0 Ae=0Ae=0 to get the vectoreee , the essential matrixEEE. _ arrowratee is the coefficient matrixAAThe null space of A , from the knowledge of linear algebra, we can know that the number of special solutions of the equation is9 − rank ( A ) 9-rank(A)9rank(A),当 r a n k ( A ) = 8 rank(A)=8 rank(A)=8 , there is a unique special solution, and all the solutions of the equation are constant times of the special solution, corresponding toEEScale invariance of E.

3) Decomposition of the essential matrix

The essential matrix E = t ∧ RE=t^{\land} RE=t Ris a3 × 3 3\times 33×3 matrix, its singular value must be[ σ , σ , 0 ] T [\sigma,\sigma,0]^T[ p ,s ,0]The form of T is called the intrinsic property of the essential matrix (inherited from the antisymmetric matrixt ∧ t^{\land}t properties). RRcan be obtained by SVD decompositionR andttt , ifEEE -like SVD decomposition isE = U Σ VTE=U\Sigma V^TE=UΣVT , thenRRR andttt 可能的结果有:
t 1 ∧ = U Z U T , t 2 ∧ = U Z T U T , R 1 = U W U T , R 2 = U W T U T t_1^{\land}=UZU^T, t_2^{\land}=UZ^T U^T,\\R_1=UW U^T, R_2=UW^T U^T t1=U Z UT,t2=U ZTUT,R1=U W UT,R2=UWTUThen
,W = [ 0 − 1 0 1 0 0 0 0 1 ] W=\left[ \begin{matrix} 0 & -1 & 0\\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{ matrix}\right]W= 010100001 (Specific), Z = [ 0 1 0 − 1 0 0 0 0 1 ] Z=\left[ \begin{matrix} 0 & 1 & 0\\ -1 & 0 & 0 \\ 0 & 0 & \end{matrix}\right]Z= 010100001 (antisymmetric matrix, if the vector z= W = ( 0 − 1 0 ) TW=\left( \begin{matrix} 0 & -1 & 0 \end{matrix} \right)^TW=(010)T,则 z ∧ = Z z^{\land}=Z z=Z)。

solve ttWhen t , it is not necessary to solve fort ∧ t^{\land}t ,以 t 1 ∧ = U Z U T t_1^{\land}=UZU^T t1=U Z UTake T as an example, multiply both sides byt 1 t_1t1,有 t 1 ∧ t 1 = U Z U T t 1 t_1^{\land}t_1=UZU^Tt_1 t1t1=U Z UTt1,即 U Z U T t 1 = 0 UZU^Tt_1=0 U Z UTt1=0 , multiplyUTU^TUT,有ZUT t 1 = 0 ZU^Tt_1=0to uTt1=0,因为z ∧ = Z z^{\land}=Zz=Z,且 z ∧ z = 0 z^{\land}z=0 zz=0 , soUT t 1 = z U^Tt_1=zUTt1=z,得t 1 = U z = U ( 0 , 0 , 1 ) T t_1=Uz=U(0,0,1)^Tt1=Uz _=U(0,0,1)T , that is,t 1 t_1t1is the matrix UUThe third column of U , ie
t 1 = U . col ( 3 ), t 2 = − t 1 t_1=U.col(3), t_2=-t_1t1=U.col(3)t2=t1
There are four possible sets of RRsR andttCombinations of t :
insert image description here

However, only in the first combination above PPP has positive depth in both cameras. Therefore, as long as any observation point is brought into the verification, the correct set of solutions can be obtained.

EE solved from the linear equationE , may not satisfyEEIntrinsic properties of E : the singular value is( σ , σ , 0 ) (σ, σ, 0)( p ,s ,0 ) . The usual practice is that the EEobtained by the eight-point methodAfter the SVD decomposition of E , the singular value matrix Σ = diag ( σ 1 , σ 2 , σ 3 ) Σ = diag(σ1, σ2, σ3)S=diag(σ1,p 2 ,σ 3 ),set1 ⩾ σ 2 ⩾ σ 3 σ1 ⩾ σ2 ⩾ σ3p 1p 2σ 3,取:
E = U diag ( σ 1 + σ 1 2 , σ 1 + σ 1 2 , 0 ) E=Udiag(\frac{\sigma_1+\sigma_1}{2},\frac{\sigma_1+\sigma_1} {2},0)E=U d ia g (2p1+p1,2p1+p1,0 )
This is equivalent to projecting the obtained matrix toEEOn the manifold where E is located. A simpler approach is to take the singular value matrix asdiag ( 1 , 1 , 0 ) diag(1, 1, 0)diag(1,1,0 ) , becauseEEE has scale equivalence, so it is also reasonable to do so.

The premise of applying the essential matrix is ​​that the internal parameter matrix KK is knownK. _ The basic matrix does not require internal parameters, but there are many unknowns, and it is difficult to decompose to obtain accurate internal and external parameters. We need to make some assumptions, such as having an initial focal length value, such as the principal point at the center of the image, etc. Regardless of estimating the essential matrix or estimating the fundamental matrix, the camera parameters obtained by decomposition can only be used as initial values. To obtain accurate parameters, it is necessary to solve the exact values ​​through nonlinear iterative optimization.

3) Homography matrix and its solution

The essential matrix and fundamental matrix are not the mutual conversion relationship between two pixels, but an inherent constraint between coordinates. So is there an expression that directly converts the pixel coordinates of the left view to the pixel coordinates of the corresponding point in the right view? Yes, that is, the homography matrix (Homography matrix) H , when the scene in space is the same plane, their projection points in the left and right views can be converted one-to-one through the reversible homography matrix.
Homography is a concept in projective geometry, also known as projective transformation. It maps points (three-dimensional homogeneous vectors) on one projective plane to another projective plane, and maps lines to lines. In other words, a homography is a linear transformation of a three-dimensional homogeneous vector.

Consider a plane in space that satisfies the equation: n TP + d = 0 n^TP+d=0nTP+d=0 , combined with the above, we have

p 2 ≃ K ( R P + t ) p 2 ≃ ( R P + t ( − n T P d ) ) p 2 ≃ K ( R − n T P d ) P p 2 ≃ K ( R − n T P d ) K − 1 p 1 p_2≃K(RP+t)\\p_2≃(RP+t(-\frac{n^TP}{d}))\\p_2≃K(R-\frac{n^TP}{d})P\\p_2≃K(R-\frac{n^TP}{d})K^{-1}p_1 p2K ( RP+t)p2(RP+t(dnTP))p2K(RdnTP)Pp2K(RdnTP)K1p1
H = K ( R − n T P d ) K − 1 H=K(R-\frac{n^TP}{d})K^{-1} H=K(RdnTP)K1 , there is
p 2 ≃ H p 1 p 1 ≃ H − 1 p 2 p_2≃Hp_1\\p_1≃H^{-1}p_2p2Hp1p1H1p2

The homography matrix describes the mapping relationship between two planes. If the feature points in the scene all fall on the same plane (such as walls, ground, etc.), motion estimation can be performed through homography. Its definition is related to the parameters of rotation, translation and plane, which is a 3 × 3 3 \times 33×3 matrix. Similarly,HHH , decomposed to calculateR , t R, tRt

[ u 2 v 2 1 ] ≃ [ h 1 h 2 h 3 h 4 h 5 h 6 h 7 h 8 h 9 ] [ u 1 v 1 1 ] \left[ \begin{matrix} u_2\\ v_2 \\ 1 \end{matrix} \right]≃\left[ \begin{matrix} h_1 & h_2 & h_3\\ h_4 & h_5 & h_6\\ h_7 & h_8 & h_9\\ \end{matrix} \right] \left[ \begin{matrix} u_1\\ v_1 \\ 1 \end{matrix} \right] u2v21 h1h4h7h2h5h8h3h6h9 u1v11

Let h 9 = 1 h_9=1h9=1 (when it takes a non-zero value), according to the normal processing method of the homogeneous formula, we will divide the first and second lines by the third line to remove the scale factor, which can be obtained (note the uu inthe
formulau givevvv difference)

u 2 = h 1 u 1 + h 2 v 1 + h 3 h 7 u 1 + h 8 v 1 + h 9 v 1 = h 4 u 1 + h 5 v 1 + h 6 h 7 u 1 + h 8 v 1 + h 9 u_2=\frac{h_1u_1+h_2v_1+h_3}{h_7u_1+h_8v_1+h_9}\\ v_1=\frac{h_4u_1+h_5v_1+h_6}{h_7u_1+h_8v_1+h_9} u2=h7u1+h8v1+h9h1u1+h2v1+h3v1=h7u1+h8v1+h9h4u1+h5v1+h6

整理得
h 7 u 1 + h 8 v 1 + h 9 − h 7 u 1 u 2 − h 8 v 1 u 2 = u 2 h 4 u 1 + h 5 v 1 + h 6 − h 7 u 1 v 2 − h 8 v 1 v 2 = v 2 h_7u_1+h_8v_1+h_9-h_7u_1u_2-h_8v_1u_2=u_2\\ h_4u_1+h_5v_1+h_6-h_7u_1v_2-h_8v_1v_2=v_2 h7u1+h8v1+h9h7u1u2h8v1u2=u2h4u1+h5v1+h6h7u1v2h8v1v2=v2

Such a set of matching point pairs can construct two constraints (in fact, there are three constraints, but because of linear correlation, only the first two are taken), so the homography matrix with 8 degrees of freedom can be calculated by 4 pairs of matching feature points ( Note that these feature points cannot have three points collinear), that is, solve the following linear equations (when h9 = 0, the right side is zero).
[ u 1 1 v 1 1 1 0 0 0 − u 1 1 u 2 1 − v 1 1 u 2 1 0 0 0 u 1 1 v 1 1 1 − u 1 1 v 2 1 − v 1 1 v 2 1 u 1 2 v 1 2 1 0 0 0 − u 1 2 u 2 2 − v 1 2 u 2 2 0 0 0 u 1 2 v 1 2 1 − u 1 2 v 2 2 − v 1 2 v 2 2 u 1 3 v 1 3 1 0 0 0 − u 3 2 u 2 3 − v 1 3 u 2 3 0 0 0 u 1 3 v 1 3 1 − u 1 3 v 2 3 − v 1 3 v 2 3 u 1 4 v 1 4 1 0 0 0 − u 3 4 u 2 4 − v 1 4 u 2 4 0 0 0 u 1 4 v 1 4 1 − u 1 4 v 2 4 − v 1 4 v 2 4 ] [ h 1 h 2 h 3 h 4 h 5 h 6 h 7 h 8 ] = [ u 2 1 v 2 1 u 2 2 v 2 2 u 2 3 v 2 3 u 2 4 v 2 4 ] \left[\begin{matrix} u_1^1 & v_1^1 & 1 & 0 & 0 & 0 & -u_1^1u_2^1& -v_1^1u_2^1\\ 0 & 0 & 0 &u_1^1 & v_1^1 & 1 & -u_1^1v_2^1& -v_1^1v_2^1\\ u_1^2 & v_1^2 & 1 & 0 & 0 & 0 & -u_1^2u_2^2& -v_1^2u_2^2\\ 0 & 0 & 0 & u_1^2 & v_1^2 & 1 & -u_1^2v_2^2& -v_1^2v_2^2\\ u_1^3 & v_1^3 & 1 & 0 & 0 & 0 & -u_3^2u_2^3& -v_1^3u_2^3\\ 0 & 0 & 0 & u_1^3 & v_1^3 & 1 & -u_1^3v_2^3& -v_1^3v_2^3\\ u_1^4 & v_1^4 & 1 & 0 & 0 & 0 & -u_3^4u_2^4& -v_1^4u_2^4\\ 0 & 0 & 0 & u_1^4 & v_1^4 & 1 & -u_1^4v_2^4& -v_1^4v_2^4\\ \end{matrix}\right] \left[\begin{matrix}h_1\\h_2\\h_3\\h_4\\h_5\\h_6\\h_7\\h_8\end{matrix}\right] = \left[\begin{matrix}u_2^1\\v_2^1\\u_2^2\\v_2^2\\u_2^3\\v_2^3\\u_2^4\\v_2^4\end{matrix}\right] u110u120u130u140v110v120v130v140101010100u110u120u130u140v110v120v130v1401010101u11u21u11v21u12u22u12v22u32u23u13v23u34u24u14v24v11u21v11v21v12u22v12v22v13u23v13v23v14u24v14v24 h1h2h3h4h5h6h7h8 = u21v21u22v22u23v23u24v24
This approach puts HHThe H matrix is ​​regarded as a vector, and HHis recovered by solving the linear equation of the vectorH , also known as **Direct Linear Transform (DLT)*. Similar to the essential matrix, after obtaining the homography matrix, it needs to be decomposed to obtain the corresponding rotation matrixRRR and the translation vectorttt . Decomposition methods include numerical methods and analytical methods. The decomposition of the homography matrix will also return 4 sets of rotation matrices and translation vectors, and at the same time, the normal vectors of the planes corresponding to the scene points can be calculated. If the depths of the map points known to be imaged are all positive (i.e. in front of the camera), then two sets of solutions can be ruled out. In the end, there are only two sets of solutions left, and more prior information is needed for judgment.

Homography is of great significance in SLAM. When the feature points are coplanar or the camera undergoes pure rotation, the degrees of freedom of the fundamental matrix decrease, which is called degenerate. The data in reality always contains some noise. At this time, if you continue to use the eight-point method to solve the fundamental matrix, the extra degrees of freedom of the fundamental matrix will be mainly determined by the noise. In order to avoid the impact of degradation phenomena, we usually estimate the fundamental matrix FF at the same timeF and the homography matrixHHH , choose the one with smaller reprojection error as the final motion estimation matrix.

In addition to DLT, RANSAC can also be used.


references:

[1] Gao Xiang, Zhang Tao, Visual SLAM Lecture Fourteen: From Theory to Practice (2nd Edition)
[2] Stereo Vision Introductory Guide (2): Key Matrix (Essential Matrix, Fundamental Matrix, Homography Matrix)
[3] Yes Polar Geometry and Triangulation
[4] Computer Vision 6 - Epipolar Geometry and Fundamental Matrix
[5] Homography Matrix and Epipolar Geometry

Guess you like

Origin blog.csdn.net/m0_50910915/article/details/130978856