Fine Grained
Fine-grained analysis focuses on achieving high precision and detail in various domains, moving beyond coarse-grained classifications. Current research emphasizes developing models capable of handling nuanced distinctions, often employing techniques like multi-modal learning, transformer architectures, and diffusion models to achieve this fine-grained understanding in tasks ranging from image captioning and object detection to legal analysis and speech processing. This detailed level of analysis is crucial for advancing fields like medical diagnosis, legal technology, and scientific discovery, enabling more accurate and insightful interpretations of complex data. The development of robust and efficient fine-grained models is driving progress across numerous scientific and practical applications.
Papers
TROPE: TRaining-Free Object-Part Enhancement for Seamlessly Improving Fine-Grained Zero-Shot Image Captioning
Joshua Feinglass, Yezhou Yang
UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs
Yuho Lee, Taewon Yun, Jason Cai, Hang Su, Hwanjun Song
TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm
Bingqing Zhang, Zhuo Cao, Heming Du, Xin Yu, Xue Li, Jiajun Liu, Sen Wang
EyeTrAES: Fine-grained, Low-Latency Eye Tracking via Adaptive Event Slicing
Argha Sen, Nuwan Bandara, Ila Gokarn, Thivya Kandappu, Archan Misra
Incorporating Precedents for Legal Judgement Prediction on European Court of Human Rights Cases
T.Y.S.S. Santosh, Mohamed Hesham Elganayni, Stanisław Sójka, Matthias Grabmair
Navigating the Nuances: A Fine-grained Evaluation of Vision-Language Navigation
Zehao Wang, Minye Wu, Yixin Cao, Yubo Ma, Meiqi Chen, Tinne Tuytelaars
Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction
Jinchuan Zhang, Yan Zhou, Yaxin Liu, Ziming Li, Songlin Hu
TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans
Aggelina Chatziagapi, Bindita Chaudhuri, Amit Kumar, Rakesh Ranjan, Dimitris Samaras, Nikolaos Sarafianos
Detect, Describe, Discriminate: Moving Beyond VQA for MLLM Evaluation
Manu Gaur, Darshan Singh S, Makarand Tapaswi
Scientific Cross-Document Coreference and Hierarchy with Definition-Augmented Relational Reasoning
Lior Forer, Tom Hope
DiSPo: Diffusion-SSM based Policy Learning for Coarse-to-Fine Action Discretization
Nayoung Oh, Moonkyeong Jung, Daehyung Park
Video-to-Audio Generation with Fine-grained Temporal Semantics
Yuchen Hu, Yu Gu, Chenxing Li, Rilin Chen, Dong Yu