Fine Grained
Fine-grained analysis focuses on achieving high precision and detail in various domains, moving beyond coarse-grained classifications. Current research emphasizes developing models capable of handling nuanced distinctions, often employing techniques like multi-modal learning, transformer architectures, and diffusion models to achieve this fine-grained understanding in tasks ranging from image captioning and object detection to legal analysis and speech processing. This detailed level of analysis is crucial for advancing fields like medical diagnosis, legal technology, and scientific discovery, enabling more accurate and insightful interpretations of complex data. The development of robust and efficient fine-grained models is driving progress across numerous scientific and practical applications.
Papers - Page 13
Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration
Improving Fine-grained Visual Understanding in VLMs through Text-Only Training
Learning Coarse-to-Fine Pruning of Graph Convolutional Networks for Skeleton-based Recognition
Boosting Fine-Grained Visual Anomaly Detection with Coarse-Knowledge-Aware Adversarial Learning
Bidirectional Logits Tree: Pursuing Granularity Reconcilement in Fine-Grained Classification
ITP: Instance-Aware Test Pruning for Out-of-Distribution Detection
DocVLM: Make Your VLM an Efficient Reader
Learning Flow Fields in Attention for Controllable Person Image Generation
TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning
ProGDF: Progressive Gaussian Differential Field for Controllable and Flexible 3D Editing
Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models
Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses