Transcription of Introduction - UCONN
1 THE MINIMAL POLYNOMIAL AND SOME APPLICATIONSKEITH easiest matrices to compute with are the diagonal ones. The sum and product ofdiagonal matrices can be computed componentwise along the main diagonal, and takingpowers of a diagonal matrix is simple too. All the complications of matrix operations aregone when working only with diagonal matrices. If a matrixAis not diagonal but can beconjugated to a diagonal matrix, sayD:=PAP 1is diagonal, thenA=P 1 DPsoAk=P 1 DkPfor all integersk, which reduces us to computations with a diagonal matrix. Inmany applications of linear algebra ( , dynamical systems, differential equations, Markovchains, recursive sequences) powers of a matrix are crucial to understanding the situation,so the relevance of knowing when we can conjugate a nondiagonal matrix into a diagonalmatrix is want look at the coordinate-free formulation of the idea of a diagonal matrix, whichwill be called a diagonalizable operator.
2 There is a special polynomial, the minimal polyno-mial (generally not equal to the characteristic polynomial), which will tell us exactly whena linear operator is diagonalizable. The minimal polynomial will also give us informationabout nilpotent operators (those having a power equal toO).All linear operators under discussion are understood to be acting on nonzero finite-dimensional vector spaces over a given OperatorsDefinition say the linear operatorA:V Visdiagonalizablewhen it admitsa diagonal matrix representation with respect to some basis ofV: there is a basisBofVsuch that the matrix [A]Bis s translate diagoinalizability into the language of eigenvectors rather than linear operatorA:V Vis diagonalizable if and only if there is abasis of eigenvectors there is a basisB={e1.}
3 ,en}ofVin which [A]Bis diagonal:[A]B= a10 00a2 an .ThenAei=aieifor alli, so eacheiis an eigenvector forA. Conversely, ifVhas a basis{v1,..,vn}of eigenvectors ofA, withAvi= ivifor i F, then in this basis the matrixrepresentation ofAis diag( 1,.., n). A basis of eigenvectors for an operator is called example of a linear operator that isnotdiagonalizable over any fieldFis (1 10 1) actingonF2. Its only eigenvectors are the vectors(x0). There are not enough eigenvectors to forma basis forF2, so (1 10 1) onF2does not diagonalize. Remember this example! Since (1 10 1)12 KEITH CONRADand (1 00 1) have the same characteristic polynomial, and the second matrix is diagonalizable,the characteristic polynomialdoesn tdetermine (in general) if an operator is are the main results we will obtain about diagonalizability:(1) There are ways of determining if an operator is diagonalizable without having tolook explicitly for a basis of eigenvectors.
4 (2) WhenFisalgebraically closed, most operators on a finite-dimensionalF-vectorspace are diagonalizable.(3) There is a polynomial, the minimal polynomial of the operator, which can be usedto detect diagonalizability.(4) If two operators are each diagonalizable, they can be simultaneously diagonalized( , there is a common eigenbasis) precisely when s look at three examples related to diagonalizability (0 11 0), the 90-degree rotation matrix acting onR2. It is notdiagonalizable onR2since there are no eigenvectors: a rotation inR2sends no nonzerovector to a scalar multiple of itself. This geometric reason is complemented by an algebraicreason: the characteristic polynomialT2+ 1 ofRhas no roots inR, so there are no realeigenvalues and thus no eigenvectors inR2. However, there are roots iofT2+ 1 inC,and there are eigenvectors ofRas an operator onC2rather thanR2.
5 Eigenvectors ofRinC2for the eigenvaluesiand iare(i1)and( i1), respectively. In the basisB={(i1),( i1)},the matrix ofRis [R]B= (i00 i), where the first diagonal entry is the eigenvalue of the firstbasis vector inBand the second diagonal entry is the eigenvalue of the second basis vectorinB. (Review the proof of Theorem to see why this relation between the ordering ofvectors in an eigenbasis and the ordering of entries in a diagonal matrix always holds.)Put more concretely, since passing to a new matrix representation of an operator froman old one amounts to conjugating the old matrix representation by the change-of-basismatrix expressing the old basis in terms of the new basis, we must have (i00 i) =PRP 1whereP= ([(10)]B[(01)]B) = ( i/2 1/2i/2 1/2). Verify thisPreally conjugatesRto (i00 i).
6 NoteP 1= (i i1 1) has as its columns the eigenvectors ofRin the standard basis(10),(01) Mn(R) satisfyingA=A>can be diagonalized overR. Thisis a significant result, called the real spectral theorem. (Any theorem that gives sufficientconditions under which an operator can be diagonalized is called a spectral theorem, becausethe eigenvalues of an operator is called its spectrum.) The essential step in the proof of thereal spectral theorem is to show that any real symmetric matrix has a real Mn(C) satisfyingAA>=A>Ais diagonalizable in Mn(C).1 WhenAis real, soA>=A>, sayingAA>=A>Ais weaker than sayingA=A>. In particular,the real matrix (0 11 0) commutes with its transpose and thus is diagonalizable overC,but the real spectral theorem does not apply to this matrix and in fact this matrix isn tdiagonalizable overR(it has no real eigenvalues).
7 Eigenvalues and DiagonalizabilityIf a linear operator on a finite-dimensionalF-vector space is diagonalizable, its eigenvaluesall lie inF, since a diagonal matrix representation has the eigenvalues along the complex square matrixAsatisfyingAA>=A>Ais callednormal, and normal matrices are unitarilydiagonalizable:A=U DU 1whereDis diagonal andUis unitary, meaningUU>=In. While the conditionofAbeing unitarily diagonalizable is equivalent toAA>=A>A, the condition of being diagonalizable aloneis not equivalent to any algebraic identity on complex MINIMAL POLYNOMIAL AND SOME APPLICATIONS3(See the proof of Theorem ) The converse is false: if all the eigenvalues of an operatorare inFthis doesnotnecessarily mean the operator is diagonalizable. Just think aboutour basic example (1 10 1), whose only eigenvalue is 1.
8 It is a repeated eigenvalue , in thesense that the characteristic polynomial (T 1)2has 1 as a repeated root. Imposing anadditional condition, that the eigenvalues lie inFand aresimpleroots of the characteristicpolynomial, does force diagonalizability. To prove this, we start with a general lemma oneigenvalues and linear for distinct eigenvalues are linearly independent. More pre-cisely, ifA:V Vis linear andv1,..,vrare eigenvectors ofAwith distinct eigenvalues 1,.., r, thevi s are linearly will be an induction onr. The caser= 1 is easy. Ifr >1, suppose there is alinear relation( )c1v1+ +cr 1vr 1+crvr= 0withci F. ApplyAto both sides:vibecomesAvi= ivi, so( )c1 1v1+ +cr 1 r 1vr 1+cr rvr= the linear relation in ( ) by r:( )c1 rv1+ +cr 1 rvr 1+cr rvr= ( ) from ( ), the last terms on the left cancel:c1( 1 r)v1+ +cr 1( r 1 r)vr 1= we have a linear relation withr 1 eigenvectors having distinct eigenvalues.
9 Byinduction, all the coefficients are 0:ci( i r) = 0 fori= 1,..,r 1. Since 1,.., r 1, rare distinct, i r6= 0 fori= 1,..,r 1. Thusci= 0 fori= 1,..,r 1. Now ouroriginal linear relation ( ) becomescrvr= 0. The vectorvris not 0 (eigenvectors arealways nonzero by definition), socr= 0. Theorem linear operator onVwhose characteristic polynomial is a product oflinear factors inF[T]with distinct roots is assumption is that there arendifferent eigenvalues inF, wheren= dimV. Callthem 1,.., n. Leteibe an eigenvector with eigenvalue i. The eigenvalues are distinct,so by Lemma theei s are linearly independent. Since there arenof these vectors, theei s are a basis ofV, so the operator admits an eigenbasis and is diagonalizable. Example matrix (1 10 1) has characteristic polynomial (T 1)2, which has linearfactors inR[T] but the roots are not distinct, so Theorem does not say the matrix isdiagonalizable in M2(R), and in fact it isn matrix (1 11 0) has characteristic polynomialT2 T 1, which has 2different real roots, so the matrix is diagonalizable in M2(R).
10 There are many diagonal matrices with repeated diagonal entries (take the simplestexample,In!), and their characteristic polynomials have repeated roots. The criterion inTheorem will never detect a diagonalizable operator with a repeated eigenvalue , so thatcriterion is a sufficient but not necessary condition for diagonalizability. In Section 4 we willsee a way to detect diagonalizability using a different polynomial than the characteristicpolynomial that is both necessary and how common is it for a characteristic polynomial to have distinct roots (whetheror not they lie inF)? Consider 2 2 matrices: the characteristic polynomialT2+bT+chasrepeated roots if and only ifb2 4c= 0. A random quadratic will usually satisfyb2 4c6= 04 KEITH CONRAD(you have to becarefulto arrange things so thatb2 4cis 0), so most 2 2 matrices havea characteristic polynomial with distinct roots.