Full Model
"Full Model" research encompasses the development and improvement of large-scale machine learning models across diverse applications, aiming to enhance performance, efficiency, and robustness. Current research focuses on addressing model vulnerabilities (e.g., adversarial attacks, hallucinations), improving efficiency for resource-constrained devices, and developing specialized models for specific domains (e.g., finance, astronomy, medical imaging). This work is significant for advancing AI capabilities in various fields and for mitigating potential risks associated with deploying complex models in real-world settings.
Papers
MARS: Unleashing the Power of Variance Reduction for Training Large Models
Huizhuo Yuan, Yifeng Liu, Shuang Wu, Xun Zhou, Quanquan Gu
OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models
Mathis Koroglu, Hugo Caselles-Dupré, Guillaume Jeanneret Sanmiguel, Matthieu Cord
Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving
Shota Yamazaki, Chenyu Zhang, Takuya Nanri, Akio Shigekane, Siyuan Wang, Jo Nishiyama, Tao Chu, Kohei Yokosawa
DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines
Junqi Liu, Yun Zhang, Xiaoqi Wang, Xu Long, Sam Kwong
Enhancing Financial Domain Adaptation of Language Models via Model Augmentation
Kota Tanabe, Masanori Hirano, Kazuki Matoya, Kentaro Imajo, Hiroki Sakaji, Itsuki Noda
Model agnostic local variable importance for locally dependent relationships
Kelvyn K. Bladen, Adele Cutler, D. Richard Cutler, Kevin R. Moon
Towards Optimizing a Retrieval Augmented Generation using Large Language Model on Academic Data
Anum Afzal, Juraj Vladika, Gentrit Fazlija, Andrei Staradubets, Florian Matthes
Deceiving Question-Answering Models: A Hybrid Word-Level Adversarial Approach
Jiyao Li, Mingze Ni, Yongshun Gong, Wei Liu
PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model
Yilun Liu, Yunpu Ma, Shuo Chen, Zifeng Ding, Bailan He, Zhen Han, Volker Tresp
LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models
Anoop Cherian, Radu Corcodel, Siddarth Jain, Diego Romeres
Controlled Evaluation of Syntactic Knowledge in Multilingual Language Models
Daria Kryvosheieva, Roger Levy
DecoPrompt: Decoding Prompts Reduces Hallucinations when Large Language Models Meet False Premises
Nan Xu, Xuezhe Ma
Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models
Yancheng He, Shilong Li, Jiaheng Liu, Yingshui Tan, Hui Huang, Weixun Wang, Xingyuan Bu, Hangyu Guo, Chengwei Hu, Boren Zheng, Xuepeng Liu, Dekai Sun, Wenbo Su, Bo Zheng
Cancer-Answer: Empowering Cancer Care with Advanced Large Language Models
Aniket Deroy, Subhankar Maity
UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models
Jiachen Liang, Ruibing Hou, Minyang Hu, Hong Chang, Shiguang Shan, Xilin Chen
LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models
Runming Yang, Taiqiang Wu, Jiahao Wang, Pengfei Hu, Ngai Wong, Yujiu Yang
Autonomous Droplet Microfluidic Design Framework with Large Language Models
Dinh-Nguyen Nguyen, Raymond Kai-Yu Tong, Ngoc-Duy Dinh