Fine Grained
Fine-grained analysis focuses on achieving high precision and detail in various domains, moving beyond coarse-grained classifications. Current research emphasizes developing models capable of handling nuanced distinctions, often employing techniques like multi-modal learning, transformer architectures, and diffusion models to achieve this fine-grained understanding in tasks ranging from image captioning and object detection to legal analysis and speech processing. This detailed level of analysis is crucial for advancing fields like medical diagnosis, legal technology, and scientific discovery, enabling more accurate and insightful interpretations of complex data. The development of robust and efficient fine-grained models is driving progress across numerous scientific and practical applications.
Papers
Evolving Interpretable Visual Classifiers with Large Language Models
Mia Chiquier, Utkarsh Mall, Carl Vondrick
AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception
Yipo Huang, Xiangfei Sheng, Zhichao Yang, Quan Yuan, Zhichao Duan, Pengfei Chen, Leida Li, Weisi Lin, Guangming Shi
MyGO: Discrete Modality Information as Fine-Grained Tokens for Multi-modal Knowledge Graph Completion
Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Binbin Hu, Ziqi Liu, Huajun Chen, Wen Zhang
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
Haotian Zhang, Haoxuan You, Philipp Dufter, Bowen Zhang, Chen Chen, Hong-You Chen, Tsu-Jui Fu, William Yang Wang, Shih-Fu Chang, Zhe Gan, Yinfei Yang
Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton Action Recognition
Yang Chen, Jingcai Guo, Tian He, Ling Wang
Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer
Yanqi Ge, Jiaqi Liu, Qingnan Fan, Xi Jiang, Ye Huang, Shuai Qin, Hong Gu, Wen Li, Lixin Duan
CrimeAlarm: Towards Intensive Intent Dynamics in Fine-grained Crime Prediction
Kaixi Hu, Lin Li, Qing Xie, Xiaohui Tao, Guandong Xu
360$^\circ$REA: Towards A Reusable Experience Accumulation with 360{\deg} Assessment for Multi-Agent System
Shen Gao, Hao Li, Chengrui Huang, Quan Tu, Zhiliang Tian, Minlie Huang, Shuo Shang
EFSA: Towards Event-Level Financial Sentiment Analysis
Tianyu Chen, Yiming Zhang, Guoxin Yu, Dapeng Zhang, Li Zeng, Qing He, Xiang Ao
MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification
Kai Sun, Yushi Bai, Ji Qi, Lei Hou, Juanzi Li
FGAIF: Aligning Large Vision-Language Models with Fine-grained AI Feedback
Liqiang Jing, Xinya Du
FRACTAL: Fine-Grained Scoring from Aggregate Text Labels
Yukti Makhija, Priyanka Agrawal, Rishi Saket, Aravindan Raghuveer
DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior
Yiming Zhang, Zhe Wang, Xinjie Li, Yunchen Yuan, Chengsong Zhang, Xiao Sun, Zhihang Zhong, Jian Wang
Is CLIP the main roadblock for fine-grained open-world perception?
Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Fabrizio Falchi
Performance of computer vision algorithms for fine-grained classification using crowdsourced insect images
Rita Pucci, Vincent J. Kalkman, Dan Stowell
iSeg: Interactive 3D Segmentation via Interactive Attention
Itai Lang, Fei Xu, Dale Decatur, Sudarshan Babu, Rana Hanocka