midterm: done w report 1, 2.2-2.5

2021-05-08 10:23:29 +02:00 · 2021-05-08 10:23:29 +02:00 · 92dac9b00f
commit 92dac9b00f
parent 11ae8556f2
5 changed files with 160 additions and 2 deletions
--- a/Claudio_Maggioni_midterm/2-4-energy.png
+++ b/Claudio_Maggioni_midterm/2-4-energy.png
--- a/Claudio_Maggioni_midterm/2-5-energy.png
+++ b/Claudio_Maggioni_midterm/2-5-energy.png
--- a/Claudio_Maggioni_midterm/2-5-gnorms.png
+++ b/Claudio_Maggioni_midterm/2-5-gnorms.png
--- a/Claudio_Maggioni_midterm/Claudio_Maggioni_midterm.md
+++ b/Claudio_Maggioni_midterm/Claudio_Maggioni_midterm.md
@ -1,14 +1,18 @@
 <!-- vim: set ts=2 sw=2 et tw=80: -->

 ---
+title: Midterm -- Optimization Methods
+author: Claudio Maggioni
 header-includes:
 - \usepackage{amsmath}
 - \usepackage{hyperref}
 - \usepackage[utf8]{inputenc}
 - \usepackage[margin=2.5cm]{geometry}
+- \usepackage[ruled,vlined]{algorithm2e}
+- \usepackage{float}
+- \floatplacement{figure}{H}
+
 ---
-\title{Midterm -- Optimization Methods}
-\author{Claudio Maggioni}
 \maketitle

 # Exercise 1
@ -139,3 +143,157 @@ https://en.wikipedia.org/wiki/Definite_matrix#Multiplication)
 Thanks to this we have indeed proven that the delta $\|e_k\|_A - \|e_{k+1}\|_A$
 is indeed positive and thus as $i$ increases the energy norm of the error
 monotonically decreases.
+
+# Question 2
+
+## Point 1
+
+TBD
+
+## Point 2
+
+The trust region algorithm is the following:
+
+\begin{algorithm}[H]
+\SetAlgoLined
+Given $\hat{\Delta} > 0, \Delta_0 \in (0,\hat{\Delta})$,
+and $\eta \in [0, \frac14)$\;
+
+  \For{$k = 0, 1, 2, \ldots$}{%
+    Obtain $p_k$ by using Cauchy or Dogleg method\;
+    $\rho_k \gets \frac{f(x_k) - f(x_k + p_k)}{m_k(0) - m_k(p_k)}$\;
+    \uIf{$\rho_k < \frac14$}{%
+      $\Delta_{k+1} \gets \frac14 \Delta_k$\;
+    }\Else{%
+      \uIf{$\rho_k > \frac34$ and $\|\rho_k\| = \Delta_k$}{%
+        $\Delta_{k+1} \gets \min(2\Delta_k, \hat{\Delta})$\;
+      }
+      \Else{%
+        $\Delta_{k+1} \gets \Delta_k$\;
+    }}
+    \uIf{$\rho_k > \eta$}{%
+      $x_{k+1} \gets x_k + p_k$\;
+    }
+    \Else{
+      $x_{k+1} \gets x_k$\;
+    }
+  }
+	\caption{Trust region method}
+\end{algorithm}
+
+The Cauchy point algorithm is the following:
+
+\begin{algorithm}[H]
+\SetAlgoLined
+Input $B$ (quadratic term), $g$ (linear term), $\Delta_k$\;
+   \uIf{$g^T B g \geq 0$}{%
+       $\tau \gets 1$\;
+   }\Else{%
+       $\tau \gets \min(\frac{\|g\|^3}{\Delta_k \cdot g^T B g}, 1)$\;
+   }
+
+   $p_k \gets -\tau \cdot \frac{\Delta_k}{\|g\|^2 \cdot g}$\;
+   \Return{$p_k$}
+	\caption{Cauchy point}
+\end{algorithm}
+
+Finally, the Dogleg method algorithm is the following:
+
+\begin{algorithm}[H]
+\SetAlgoLined
+Input $B$ (quadratic term), $g$ (linear term), $\Delta_k$\;
+    $p_N \gets - B^{-1} g$\;
+
+    \uIf{$\|p_N\| < \Delta_k$}{%
+      $p_k \gets p_N$\;
+    }\Else{%
+        $p_u = - \frac{g^T g}{g^T B g} g$\;
+
+        \uIf{$\|p_u\| > \Delta_k$}{%
+          compute $p_k$ with Cauchy point algorithm\;
+        }\Else{%
+            solve for $\tau$ the equality $\|p_u + \tau * (p_N - p_u)\|^2 =
+            \Delta_k^2$\;
+            $p_k \gets p_u + \tau \cdot (p_N - p_u)$\;
+        }
+    }
+  \caption{Dogleg method}
+\end{algorithm}
+
+## Point 3
+
+The trust region, dogleg and Cauchy point algorithms were implemented
+respectively in the files `trust_region.m`, `dogleg.m`, and `cauchy.m`.
+
+## Point 4
+
+### Taylor expansion
+
+The Taylor expansion up the second order of the function is the following:
+
+$$f(x_0, w) = f(x_0) + \langle\begin{bmatrix}48x^3 - 16xy + 2x - 2\\2y - 8x^2
+\end{bmatrix}, w\rangle + \frac12 \langle\begin{bmatrix}144x^2 -16y + 2 - 16 &
+-16 \\ -16 & 2 \end{bmatrix}w, w\rangle$$
+
+### Minimization
+
+The code used to minimize the function can be found in the MATLAB script
+`main.m` under section 2.4. The resulting minimizer (found in 10 iterations) is:
+
+$$x_m = \begin{bmatrix}1\\4\end{bmatrix}$$
+
+### Energy landscape
+
+The following figure shows a `surf` plot of the objective function overlayed
+with the iterates used to reach the minimizer:
+
+![Energy landscape of the function overlayed with iterates and steps (the white
+dot is $x_0$ while the black dot is $x_m$)](./2-4-energy.png)
+
+The code used to generate such plot can be found in the MATLAB script `main.m`
+under section 2.4c.
+
+## Point 5
+
+### Minimization
+
+The code used to minimize the function can be found in the MATLAB script
+`main.m` under section 2.5. The resulting minimizer (found in 25 iterations) is:
+
+$$x_m = \begin{bmatrix}1\\5\end{bmatrix}$$
+
+### Energy landscape
+
+The following figure shows a `surf` plot of the objective function overlayed
+with the iterates used to reach the minimizer:
+
+![Energy landscape of the Rosenbrock function overlayed with iterates and steps
+(the white dot is $x_0$ while the black dot is $x_m$)](./2-5-energy.png)
+
+The code used to generate such plot can be found in the MATLAB script `main.m`
+under section 2.5b.
+
+### Gradient norms
+
+The following figure shows the logarithm of the norm of the gradient w.r.t.
+iterations:
+
+![Gradient norms (y-axis, log-scale) w.r.t. iteration number
+(x-axis)](./2-5-gnorms.png)
+
+The code used to generate such plot can be found in the MATLAB script `main.m`
+under section 2.5c.
+
+Comparing the behaviour shown above with the figures obtained in the previous
+assignment for the Newton method with backtracking and the gradient descent with
+backtracking, we notice that the trust-region method really behaves like a
+compromise between the two methods. First of all, we notice that TR converges in
+25 iterations, almost double of the number of iterations of regular NM +
+backtracking. The actual behaviour of the curve is somewhat similar to the
+Netwon gradient norms curve w.r.t. to the presence of spikes, which however are
+less evident in the Trust region curve (probably due to Trust region method
+alternating quadratic steps with linear or almost linear steps while iterating).
+Finally, we notice that TR is the only method to have neighbouring iterations
+having the exact same norm: this is probably due to some proposed iterations
+steps not being validated by the acceptance criteria, which makes the method mot
+move for some iterations.
--- a/Claudio_Maggioni_midterm/Claudio_Maggioni_midterm.pdf
+++ b/Claudio_Maggioni_midterm/Claudio_Maggioni_midterm.pdf