The interior point solver can often be sped up dramatically by exploiting the joint structure of $Q$, $A$, and $G$. Here we consider the simple problem:
$$\mbox{minimize}\quad\frac{1}{2}x^{T}Qx-b^{T}x,\qquad\mbox{s.t.}\quad x\geq0.$$The solver can be called with no special parameters,
In [17]:
using ConicIP
n = 1000
Q = sparse(randn(n,n)); Q = Q'*Q;
c = ones(n,1);
A = speye(n);
b = zeros(n,1);
𝐾 = [("R",n)];
@time conicIP( Q , c , A , b , 𝐾 , verbose = true);
The speed of the solver is reasonable, as the default solver exploits the sparsity of the constraint matrix. We can do better, however.
To speed up the solver, we supply a function kktsolver which returns a function solve3x3gen which, in turn, returns a function that solves the KKT system
$$\left(\begin{array}{ccc}
Q & G^{T} & A^{T}\\
G\\
A & & -F^{T}F
\end{array}\right)
\left(\begin{array}{c}
a\\
c\\
b
\end{array}\right) = \left(\begin{array}{c}
x\\
z\\
y
\end{array}\right)$$
where $F$ is a block matrix with blocks corresponding to the cones specified in 𝐾,
$$F[i]=\begin{cases}
\mbox{diag}(u_i) & \mbox{if }K_{i}\mbox{ is "R"}\\
\alpha_i J+u_i u_i^T & \mbox{if }K_{i}\mbox{ is "Q"},\mbox{ for some }u_i,\ \alpha_i\mbox{, where $J$ is the hyperbolic identity}\\
A_{U_i} & \mbox{if }K_i\mbox{ is "S"},\mbox{ where }A_{U_i}x=\mbox{vec}(U_i^{T}\mbox{mat}(x)U_i)\mbox{ for all $x$}.
\end{cases}
$$
The operation $\mbox{vec}$ is the vectorization operator on symmetric matrices $$ \mbox{vec}(U)=(U_{11},\sqrt{2}U_{21},\dots,\sqrt{2}U_{p1},U_{22},\sqrt{2}U_{32},\dots,\sqrt{2}U_{p2},\dots,U_{p-1,p-1},\sqrt{2}U_{p,p-1},U_{pp}) $$
and $\mbox{mat}$ is the inverse transformation back to a symmetric matrix. The matrix $\mbox{diag}(u)$ is represented by the type Diag, $\alpha J+uu^{T}$ by the type SymWoodbury, and $A_{U_i}$ by the type VecCongurance.
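As a standalone sketch of the scaled vectorization defined above (the helper names `svec` and `smat` are illustrative, not part of ConicIP's API), the transformation and its inverse can be written as:

```julia
using LinearAlgebra

# Scaled vectorization of a symmetric p×p matrix: stacks the lower
# triangle column by column, scaling off-diagonal entries by √2.
function svec(U::AbstractMatrix)
    p = size(U, 1)
    v = Float64[]
    for j in 1:p, i in j:p
        push!(v, i == j ? U[i, j] : sqrt(2) * U[i, j])
    end
    v
end

# Inverse transformation: rebuild the symmetric matrix from the
# scaled lower-triangle vector.
function smat(v::AbstractVector)
    p = Int((sqrt(8length(v) + 1) - 1) / 2)
    U = zeros(p, p)
    k = 1
    for j in 1:p, i in j:p
        U[i, j] = i == j ? v[k] : v[k] / sqrt(2)
        U[j, i] = U[i, j]
        k += 1
    end
    U
end
```

The $\sqrt{2}$ scaling makes the map an isometry: $\mbox{vec}(U)^T\mbox{vec}(V)=\mbox{tr}(UV)$ for symmetric $U$, $V$, which is why it is the natural vectorization for the "S" cone.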
In this example, since we have no linear constraints, $G$ is empty, and our KKT system is $$ \left(\begin{array}{cc} Q & I\\ I & -F^{T}F \end{array}\right)\left(\begin{array}{c} a\\ b \end{array}\right)=\left(\begin{array}{c} x\\ y \end{array}\right) $$
The system can be solved by pivoting on the second block, as follows: $$ \big(Q+(F^TF)^{-1} \big)\,a=x+(F^TF)^{-1}y,\qquad b=(F^TF)^{-1}(a-y) $$
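The elimination identities can be checked numerically. The sketch below is independent of ConicIP: it uses a generic positive diagonal $D$ in place of $F^{T}F$ and dense factorizations in place of the sparse ones used later.

```julia
using LinearAlgebra

# Check the pivoting identities on a small random instance.
n = 5
M = randn(n, n); Q = M'M + I       # positive definite Q
D = Diagonal(rand(n) .+ 1.0)       # stand-in for FᵀF (diagonal)
x = randn(n); y = randn(n)

# Direct solve of the 2x2 block system [Q I; I -D][a; b] = [x; y]
K  = [Q Matrix(1.0I, n, n); Matrix(1.0I, n, n) -Matrix(D)]
ab = K \ [x; y]

# Solve by pivoting on the second block:
#   (Q + D⁻¹) a = x + D⁻¹ y,   b = D⁻¹ (a - y)
a = (Q + inv(D)) \ (x + D \ y)
b = D \ (a - y)

maximum(abs.([a; b] - ab))   # ≈ 0 up to roundoff
```

The payoff is that the pivoted solve only factors the $n \times n$ matrix $Q + D^{-1}$ instead of the full $2n \times 2n$ block system.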
Because we only have polyhedral constraints, $(F^{T}F)^{-1}$ is a diagonal matrix; thus the first equation is a diagonal perturbation of $Q$, which can be solved via a Cholesky factorization. Pivoting allows us to solve a $1000 \times 1000$ system rather than a $2000 \times 2000$ system (albeit one with some sparsity structure).
In [11]:
function kktsolver(Q, A, G, cone_dims)
  function solve3x3gen(F, F⁻¹)
    invFᵀF = inv(F'F)
    QpD⁻¹ = cholfact(Q + spdiagm( (F[1].diag).^(-2) ))
    function solve3x3(x, z, y)
      a = QpD⁻¹\(x + A'*(invFᵀF*y))
      b = invFᵀF*(y - A*a)
      c = zeros(0,1)
      return (a, c, b)
    end
  end
end
@time sol = conicIP( Q , c , A , b , 𝐾 , kktsolver = kktsolver; verbose = true);
This results in a 5-fold improvement in speed, and a dramatic drop in memory usage!
This pattern of pivoting on the third block occurs often enough that we have encapsulated it in the convenience function pivot, which transforms a solver for the $2 \times 2$ pivoted system above into a $3 \times 3$ solver. This is illustrated below.
In [14]:
function kktsolver2x2(Q, A, G, cone_dims)
  function solve2x2gen(F, F⁻¹)
    QpD⁻¹ = cholfact(Q + spdiagm( (F[1].diag).^(-2) ))
    return (y, x) -> (QpD⁻¹\y, zeros(0,1))
  end
end
@time sol = conicIP( Q , c , A , b , 𝐾 , kktsolver = pivot(kktsolver2x2); verbose = true);
And as a bonus we even get an extra boost in speed!