Paper ID: 2205.10955

Investigating classification learning curves for automatically generated and labelled plant images

Michael A. Beck, Christopher P. Bidinosti, Christopher J. Henry, Manisha Ajmani

In the context of supervised machine learning a learning curve describes how a model's performance on unseen data relates to the amount of samples used to train the model. In this paper we present a dataset of plant images with representatives of crops and weeds common to the Manitoba prairies at different growth stages. We determine the learning curve for a classification task on this data with the ResNet architecture. Our results are in accordance with previous studies and add to the evidence that learning curves are governed by power-law relationships over large scales, applications, and models. We further investigate how label noise and the reduction of trainable parameters impacts the learning curve on this dataset. Both effects lead to the model requiring disproportionally larger training sets to achieve the same classification performance as observed without these effects.

Submitted: May 22, 2022

Topics

Links

arXiv PDF