mp1: waiting for Edoardo's benedition for 1.1

2020-09-29 12:56:05 +02:00 · 2020-09-29 12:56:05 +02:00 · 18e275d120
commit 18e275d120
parent b5cc7065de
2 changed files with 59 additions and 3 deletions
--- a/mp1/template.pdf
+++ b/mp1/template.pdf
--- a/mp1/template.tex
+++ b/mp1/template.tex
@ -22,11 +22,67 @@ The purpose of this assignment\footnote{This document is originally based on a S
 \subsection{Theory [20 points]}
 \subsubsection{Show that the order of convergence of the power method is linear,
 and state what the asymptotic error constant is.}
 First of all, we show the the sequence of vectors computed by power iteration
 indeed converges to $\lambda_1$ or the biggest eigenvector (we assume we name
 eigenvectors in decreasing order of magnitude, with $|\lambda_1| > |\lambda_i|$
 for $i \in 2..n$).
 We can express the seed for the eigenvector (i.e. the initial value of $v$ of
 the power iteration) as a linear combination of eigenvalues:
 \[v_0 = \sum_{i=1}^n a_i x_i\]
 We can then express the result of the n-th power method as
 \[v_n = \gamma A v_{n-1} = A^n v_0 = \sum_{i=1}^n \gamma a_i \lambda_i^n x_i =
 \lambda_1^n  \sum_{i=1}^n \gamma a_i \left( \frac{\lambda_i}{\lambda_1} \right)^n x_i  =
 \gamma a_1 \lambda_1^n x_1 + \lambda_1^n \sum_{i=2}^n \gamma a_i
 \left(\frac{\lambda_i}{\lambda_1}\right)^n x_i \]
 Here, $\gamma$ is just a normalization term to make $||v_n|| = 1$. $v_n$ clearly
 converges to $x_1$ since all the terms in the $\sum_{i=2}^n$ contain
 $\frac{\lambda_i}{\lambda_1}$, which is always less than 0 if $i > 1$ for the
 sorting of eigenvalues we did before. Therefore, these terms to the power of n
 converge to 0, and $\gamma$ will cancel out $a_1 \lambda_1^k$ due to the
 normalization, thus making the sequence converge to $\lambda_1$.
 To see if the sequence converges linearly we use the definitions of rate of
 convergence:
 \[\lim_{n \to \infty}\frac{|x_{n+1} - \lambda_1|}{|x_n - \lambda_1|^1} = \mu\]
 If this limit has a finite solution then the sequence converges linearly with
 rate $\mu$.
 \[\lim_{n \to \infty}\frac{\left| a_1 \lambda_1^{n+1} x_1 + \lambda_1^{n+1}
 \sum_{i=2}^n  a_i \left(\frac{\lambda_i}{\lambda_1}\right)^{n+1}
 x_i - \beta_{n+1} x_1\right|}
 {\left| a_1 \lambda_1^n x_1 + \lambda_1^n \sum_{i=2}^n  a_i
 \left(\frac{\lambda_i}{\lambda_1}\right)^n x_i - \beta_n x_1\right|^1} = \mu\]
 To simplify calculations, we consider the sequence without the normalization
 factor $\gamma$ that will converge to a denormalized version of $x_1$, named
 $\beta x_1$. We can then simplify the $a_1\lambda_1^{i}x_1$ terms in the
 sequences with $\beta_{i} x_1$ since $\beta_i$ can be set freely.
 Now we consider that if $|\lambda_2| > |\lambda_i| \forall i \in 3..n$ (since we
 sorted the eigenvalues), then
 $\left(\frac{\lambda_i}{\lambda_1}\right)^n$ for $i > 2$ will always converge faster to
 0 than $\left(\frac{\lambda_2}{\lambda_1}\right)^n$ thus all terms other than
 $i=2$ can be ignored in the limit computation. Therefore, the limit has finite
 solution and the convergence rate
 is
 \[\mu = \frac{\lambda_2}{\lambda_1}\]
 \subsubsection{What assumptions should be made to guarantee convergence of the power method?}
 The first assumption to make is that the biggest eigenvalue in terms of absolute values should (let's name it $\lambda_1$)
 be strictly greater than all other eigenvectors, so:
-$$|\lambda_1| < |\Lambda_i| \forall i \in \{2..n\}$$
+\[|\lambda_1| < |\lambda_i| \forall i \in \{2..n\}\]
 Also, the eigenvector \textit{guess} from which the power iteration starts must have a component in the direction of $x_i$, the eigenvector for the eigenvalue $\lambda_1$ from before.
@ -38,11 +94,11 @@ The shift and invert approach is a variant of the power method that may signific
 where $\alpha$ is an arbitrary constant that must be chosen wisely in order to increase the rate of convergence. Since the eigenvalues $u_i$ of B can be derived from the eigenvalues $\lambda_i$ of A, namely:
-$$u_i = \frac{1}{\lambda_i - \alpha}$$
+\[u_i = \frac{1}{\lambda_i - \alpha}\]
 the rate of convergence of the power method on B is:
-$$\left|\frac{u_2}{u_1}\right| = \left|\frac{\frac1{\lambda_2 - \alpha}}{\frac1{\lambda_1 - \alpha}}\right| = \left|\frac{\lambda_1 - \alpha}{\lambda_2 - \alpha}\right|$$
+\[\left|\frac{u_2}{u_1}\right| = \left|\frac{\frac1{\lambda_2 - \alpha}}{\frac1{\lambda_1 - \alpha}}\right| = \left|\frac{\lambda_1 - \alpha}{\lambda_2 - \alpha}\right|\]
 By choosing $\alpha$ close to $\lambda_1$, the convergence is sped up. To further increase the rate of convergence (up to a cubic rate), a new $\alpha$, and thus a new $B$, may be chosen for every iteration.