Deep Learning & Optimal Control: what's the connection?


  • One famous interpretation of ResNet(Residual network) is to view the layer-to-layer transformation
    $$u_{n+1} = u_n + h \sigma (W_n u_n + b_n)$$
    as the discretization of a dynamical system(see Neural ODE). As a result, the process of training can be viewed as a numerical optimal control problem where weights and biases are control variables. This also shows the possibility of using numerical schemes as instructions for developing/improving the structures of deep neural networks.
  • On the other hand, physics informed neural networks(PINN) offers an approach to solve high dimensional PDEs without a computational grid. Some control problems need to solve a high dimensional Hamilton-Jacobi-Bellman equation, which may take advantage of this.
    For example, the following figure is a slice of a 4D distance function trained with data generated from Eikonal solutions with black dots as sources. This slice shows the shortest distance from each point in a domain(with an obstacle) towards the source (0.3,0.1).