Discover cutting-edge AI research papers, automatically curated and categorized daily.
Latest Papers
Distilling Multi-modal Large Language Models for Autonomous Driving
Deepti Hegde, Rajeev Yasarla, Hong Cai, Shizhong Han, Apratim Bhattacharyya, Shweta Mahajan, Litian Liu, Risheek Garrepalli, Vishal M. Patel, Fatih Porikli
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces
Sumit Chaturvedi, Mengwei Ren, Yannick Hold-Geoffroy, Jingyuan Liu, Julie Dorsey, Zhixin Shu
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
Philippe Hansen-Estruch, David Yan, Ching-Yao Chung, Orr Zohar, Jialiang Wang, Tingbo Hou, Tao Xu, Sriram Vishwanath, Peter Vajda, Xinlei Chen
Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues
Youngjoon Jang, Haran Raajesh, Liliane Momeni, Gül Varol, Andrew Zisserman
SRE-Conv: Symmetric Rotation Equivariant Convolution for Biomedical Image Classification
Yuexi Du, Jiazhen Zhang, Tal Zeevi, Nicha C. Dvornek, John A. Onofrey
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Zekun Xi, Wenbiao Yin, Jizhan Fang, Jialong Wu, Runnan Fang, Ningyu Zhang, Jiang Yong, Pengjun Xie, Fei Huang, Huajun Chen
Enhancing Lexicon-Based Text Embeddings with Large Language Models
Yibin Lei, Tao Shen, Yu Cao, Andrew Yates
FAST: Efficient Action Tokenization for Vision-Language-Action Models
Karl Pertsch, Kyle Stachowicz, Brian Ichter, Danny Driess, Suraj Nair, Quan Vuong, Oier Mees, Chelsea Finn, Sergey Levine
Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models
Bihui Jin, Jiayue Wang, Pengyu Nie
KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports
Hajung Kim, Chanhwi Kim, Jiwoong Sohn, Tim Beck, Marek Rei, Sunkyu Kim, T Ian Simpson, Joram M Posma, Antoine Lain, Mujeen Sung, Jaewoo Kang
Random Subspace Cubic-Regularization Methods, with Applications to Low-Rank Functions
Coralia Cartis, Zhen Shao, Edward Tansley
ComplexVAD: Detecting Interaction Anomalies in Video
Furkan Mumcu, Michael J. Jones, Yasin Yilmaz, Anoop Cherian
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps
Nanye Ma, Shangyuan Tong, Haolin Jia, Hexiang Hu, Yu-Chuan Su, Mingda Zhang, Xuan Yang, Yandong Li, Tommi Jaakkola, Xuhui Jia, Saining Xie
Predictions as Surrogates: Revisiting Surrogate Outcomes in the Age of AI
Wenlong Ji, Lihua Lei, Tijana Zrnic
Generating particle physics Lagrangians with transformers
Yong Sheng Koay, Rikard Enberg, Stefano Moretti, Eliel Camargo-Molina
Parallel multi-objective metaheuristics for smart communications in vehicular networks
Jamal Toutouh, Enrique Alba
Attention based Bidirectional GRU hybrid model for inappropriate content detection in Urdu language
Ezzah Shoukat, Rabia Irfan, Iqra Basharat, Muhammad Ali Tahir, Sameen Shaukat
A Simple Aerial Detection Baseline of Multimodal Language Models
Qingyun Li, Yushi Chen, Xinya Shu, Dong Chen, Xin He, Yi Yu, Xue Yang
Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text
Jihed Ncib
FLOL: Fast Baselines for Real-World Low-Light Enhancement
Juan C. Benito, Daniel Feijoo, Alvaro Garcia, Marcos V. Conde