Paper ID: 2312.06633

Examining the Effect of Implementation Factors on Deep Learning Reproducibility

Kevin Coakley, Christine R. Kirkpatrick, Odd Erik Gundersen

Reproducing published deep learning papers to validate their conclusions can be difficult due to sources of irreproducibility. We investigate the impact that implementation factors have on the results and how they affect reproducibility of deep learning studies. Three deep learning experiments were ran five times each on 13 different hardware environments and four different software environments. The analysis of the 780 combined results showed that there was a greater than 6% accuracy range on the same deterministic examples introduced from hardware or software environment variations alone. To account for these implementation factors, researchers should run their experiments multiple times in different hardware and software environments to verify their conclusions are not affected.

Submitted: Dec 11, 2023

Topics

Deep Learning
Mixed Effect
Research Reproducibility
Reproducible Deep Learning

Links

arXiv PDF