Paper ID: 2404.12407

TV100: A TV Series Dataset that Pre-Trained CLIP Has Not Seen

Da-Wei Zhou, Zhi-Hong Qi, Han-Jia Ye, De-Chuan Zhan

The era of pre-trained models has ushered in a wealth of new insights for the machine learning community. Among the many questions that arise, one of paramount importance is: 'Do pre-trained models possess comprehensive knowledge?' This paper seeks to address this question. To that end, we publicly release a novel dataset composed of images from TV series released after 2021. The dataset has significant potential for use in various research areas, including the evaluation of incremental learning, novel class discovery, and long-tailed learning. Project page: https://tv-100.github.io/

Submitted: Apr 16, 2024