Vision Task
Vision tasks, encompassing image and video analysis for diverse applications, are a central focus in computer vision research. Current efforts concentrate on improving model efficiency and robustness, particularly through multi-task learning, the development of novel architectures like Vision Transformers and state-space models, and the incorporation of human feedback for improved alignment with user preferences. These advancements are driving progress in areas such as image compression for machine learning pipelines, multi-image understanding, and the creation of more robust and fair models for real-world deployment.
Papers
Energy Consumption Analysis of pruned Semantic Segmentation Networks on an Embedded GPU
Hugo Tessier, Vincent Gripon, Mathieu Léonardon, Matthieu Arzel, David Bertrand, Thomas Hannagan
Leveraging Structured Pruning of Convolutional Neural Networks
Hugo Tessier, Vincent Gripon, Mathieu Léonardon, Matthieu Arzel, David Bertrand, Thomas Hannagan