The structure of atmospheric profile data suits both convolutional and recurrent neural network architectures. Convolutional networks generally give smoother results and are chosen here given their previous success on this kind of problem. By using physical knowledge to customize the architectures, it is possible to emulate the RRTMG model with very little loss in accuracy. The U-Net architecture, which captures both low- and high-frequency patterns, gives the best performance among the candidate architectures considered.
To assess whether a candidate neural network would produce the correct tangent linear model (TLM) and adjoint, the Jacobian of the original scheme was compared with the Jacobian of the neural network. In both cases the Jacobian is obtained using a finite difference approach with very small perturbations. These ‘nonlinear’ Jacobians are compared with the actual TLM of the neural network, obtained directly through Keras/TensorFlow. While the nonlinear neural network Jacobian agreed very well with the gradient of the neural network, neither matched well with the Jacobian of the original scheme on which the network was trained, despite rigorous checks for overfitting. These findings suggest that it may not be possible to obtain a neural network that produces a useful linearized model using standard methods.
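The comparison between a finite difference Jacobian and an exact gradient can be illustrated with a minimal sketch. It is not the setup used in this work: the one-layer `toy_network`, its random weights, the central-difference scheme, and the step size `eps` are all assumptions chosen for illustration, and the analytic Jacobian stands in for the gradient that Keras/TensorFlow would supply.

```python
import numpy as np


def finite_difference_jacobian(f, x, eps=1e-6):
    """Approximate the Jacobian of f at x, one input column at a time,
    using central differences with a small perturbation eps."""
    x = np.asarray(x, dtype=float)
    n_out = np.asarray(f(x)).size
    jac = np.zeros((n_out, x.size))
    for j in range(x.size):
        dx = np.zeros_like(x)
        dx[j] = eps
        jac[:, j] = (f(x + dx) - f(x - dx)) / (2.0 * eps)
    return jac


# Hypothetical stand-in for an emulator: a single tanh layer.
rng = np.random.default_rng(0)
W = rng.normal(size=(3, 4))
b = rng.normal(size=3)


def toy_network(x):
    return np.tanh(W @ x + b)


def toy_network_jacobian(x):
    """Exact Jacobian of the toy layer: diag(1 - tanh(z)^2) @ W."""
    z = W @ x + b
    return (1.0 - np.tanh(z) ** 2)[:, None] * W


x0 = rng.normal(size=4)
jac_fd = finite_difference_jacobian(toy_network, x0)
jac_exact = toy_network_jacobian(x0)

# Largest elementwise disagreement between the two Jacobians.
max_abs_diff = np.max(np.abs(jac_fd - jac_exact))
print(f"max |J_fd - J_exact| = {max_abs_diff:.2e}")
```

The same `finite_difference_jacobian` helper could, in principle, be applied to the original scheme and to the emulator alike, so that the two ‘nonlinear’ Jacobians are computed in the same way before being compared.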
Rather than obtaining the linearized model implicitly, the U-Net is trained to produce the nonlinear Jacobian directly. Producing the Jacobian is equivalent to having both the TLM and the adjoint. The problem becomes similar to image generation, which is well suited to U-Nets. In this case a 2D U-Net is used to produce the output.
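The equivalence between the Jacobian and the TLM/adjoint pair can be made concrete with a short sketch. The matrix `jac` below is a random placeholder for a predicted Jacobian, and its shape, the function names, and the perturbation vectors are assumptions for illustration only. Applying the Jacobian to an input perturbation gives the tangent linear response, applying its transpose to an output sensitivity gives the adjoint, and the standard dot-product test confirms the two are consistent.

```python
import numpy as np


def tangent_linear(jac, dx):
    """TLM: propagate an input perturbation dx forward, dy = J dx."""
    return jac @ dx


def adjoint(jac, dy):
    """Adjoint: propagate an output sensitivity dy backward, J^T dy."""
    return jac.T @ dy


rng = np.random.default_rng(1)
jac = rng.normal(size=(5, 8))  # hypothetical Jacobian: 5 outputs, 8 inputs
dx = rng.normal(size=8)        # input perturbation
dy = rng.normal(size=5)        # output sensitivity

# Dot-product test: <J dx, dy> should equal <dx, J^T dy>.
lhs = np.dot(tangent_linear(jac, dx), dy)
rhs = np.dot(dx, adjoint(jac, dy))
print(f"<J dx, dy> = {lhs:.12f}, <dx, J^T dy> = {rhs:.12f}")
```

Because both operators are read off the same matrix, they pass the dot-product test by construction, which is one reason predicting the Jacobian directly is attractive.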