Explaining Deep Residual Networks Predictions with Symplectic Adjoint Method
- Department of Automation, China University of Petroleum Bejing
102249 Beijing, China
luoxl@cup.edu.cn, leixia2008530059@163.com, lawliet3@qq.com - State Key Laboratory of Media Convergence and Communication, Communication University of China
100024 Beijing, China
fanyongkai@gmail.com
Abstract
Understanding deep residual networks (ResNets) decisions are receiving much attention as a way to ensure their security and reliability. Recent research, however, lacks theoretical analysis to guarantee the faithfulness of explanations and could produce an unreliable explanation. In order to explain ResNets predictions, we suggest a provably faithful explanation for ResNet using a surrogate explainable model, a neural ordinary differential equation network (Neural ODE). First, ResNets are proved to converge to a Neural ODE and the Neural ODE is regarded as a surrogate model to explain the decision-making attribution of the ResNets. And then the decision feature and the explanation map of inputs belonging to the target class for Neural ODE are generated via the symplectic adjoint method. Finally, we prove that the explanations of Neural ODE can be sufficiently approximate to ResNet. Experiments show that the proposed explanation method has higher faithfulness with lower computational cost than other explanation approaches and it is effective for troubleshooting and optimizing a model by the explanation.
Key words
Deep residual networks, Explanation, Neural ODE, Symplectic adjoint method
Digital Object Identifier (DOI)
https://doi.org/10.2298/CSIS230310047L
Publication information
Volume 20, Issue 4 (September 2023)
Year of Publication: 2023
ISSN: 2406-1018 (Online)
Publisher: ComSIS Consortium
Full text
Available in PDF
Portable Document Format
How to cite
Lei, X., Lin, J., Luo, X., Fan, Y.: Explaining Deep Residual Networks Predictions with Symplectic Adjoint Method. Computer Science and Information Systems, Vol. 20, No. 4. (2023), https://doi.org/10.2298/CSIS230310047L