OS Numerical Optimization: Adaptive Deep Neural Network Architectures: Time-adaptive pruning and sensitivity-based layer insertion

When
Thursday, 13 June 2024
15:15 to 16:45

Where
F423

Organized by
B. Azmi & S. Volkwein

Speaker(s):
Dr. Evelyn Herberg

On 13 June 2024 at 15:15, Dr. Evelyn Herberg of Scientific Computing and Optimization (University of Heidelberg) will give a talk.


Abstract: We propose adaptive neural network architecture approaches for Feedforward Neural Networks (FNNs) and Residual Neural Networks (ResNets). On the one hand, we propose an adaptive pruning approach; on the other, we provide a sensitivity-based layer insertion technique. Both approaches simplify the hyperparameter search and can reduce network complexity without compromising expressiveness, while also decreasing training time. The results are illustrated by numerical examples.