Full Model

"Full Model" research covers the development and improvement of large-scale machine learning models across diverse applications, with the aim of improving performance, efficiency, and robustness. Current work focuses on addressing model vulnerabilities (e.g., adversarial attacks, hallucinations), improving efficiency on resource-constrained devices, and building specialized models for particular domains (e.g., finance, astronomy, medical imaging). This work matters both for advancing AI capabilities across fields and for mitigating the risks of deploying complex models in real-world settings.

Papers

November 4, 2024

November 2, 2024

Enhancing Diabetic Retinopathy Detection with CNN-Based Models: A Comparative Study of UNET and Stacked UNET Architectures
Ameya Uppina, S Navaneetha Krishnan, Talluri Krishna Sai Teja, Nikhil N Iyer, Joe Dhanith P R
Deep Learning Convolutional Neural Network Full Model Comparative Study Diabetic Retinopathy Retinal Image UNet Based UNet Architecture
Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks
Gagan Bhatia, El Moatez Billah Nagoudi, Abdellah El Mekki, Fakhraddin Alwajih, Muhammad Abdul-Mageed
Language Model Natural Language Processing New Benchmark Full Model Arabic Dialect
Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models
Wonguk Cho, Seokeon Choi, Debasmit Das, Matthias Reisser, Taesup Kim, Sungrack Yun, Fatih Porikli
Full Model Text to Image Diffusion Model Pre Trained Diffusion Model Text to Image Diffusion Personalization Performance Subject Driven Generation
CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research
Sian-Yao Huang, Cheng-Lin Yang, Che-Yu Lin, Chun-Ying Huang
Large Language Model Data Set Full Model Security Related High Level Cybersecurity Domain

November 1, 2024

Identify Backdoored Model in Federated Learning via Individual Unlearning
Jiahao Xu, Zikai Zhang, Rui Hu
Full Model Backdoor Attack Person Identification Harmful Unlearning Attack Strategy Learning Based Fusion Approach Malicious Model
LLM-KT: A Versatile Framework for Knowledge Transfer from Large Language Models to Collaborative Filtering
Nikita Severin, Aleksei Ziablitsev, Yulia Savelyeva, Valeriy Tashchilin, Ivan Bulychev, Mikhail Yushkov, Artem Kushneruk, Amaliya Zaryvnykh, Dmitrii Kiselev, Andrey Savchenko, Ilya Makarov
Full Model Jina Embeddings Knowledge Transfer Large Language Collaborative Filtering Flexible Framework Context Awareness Recommendation Scenario Knowledge Management

October 31, 2024

Evaluating the Evolution of YOLO (You Only Look Once) Models: A Comprehensive Benchmark Study of YOLO11 and Its Predecessors
Nidhal Jegham, Chan Young Koh, Marwan Abdelatti, Abdeltawab Hendawi
Full Model Benchmark Study YOLO Shake Hand YOLO Series Look Once Algorithm Novel Predecessor and Successor
A Geometric Framework for Understanding Memorization in Generative Models
Brendan Leigh Ross, Hamidreza Kamkari, Tongzi Wu, Rasa Hosseinzadeh, Zhaoyan Liu, George Stein, Jesse C. Cresswell, Gabriel Loaiza-Ganem
Generative Model Full Model Generative Question Limited Memorization Memorization Effect Manifold Hypothesis Geometric Framework
Hamiltonian Monte Carlo Inference of Marginalized Linear Mixed-Effects Models
Jinlin Lai, Justin Domke, Daniel Sheldon
Full Model Scientific Inference Markov Chain Monte Carlo Hamiltonian Monte Carlo Mixed Effect Model
Approximate attention with MLP: a pruning strategy for attention-based model in multivariate time series forecasting
Suhan Guo, Jiahong Deng, Yi Wei, Hui Dou, Furao Shen, Jian Zhao
Full Model Multivariate Time Series Temporal Network Single Scene Specific MLP Self Attention Network Pruning Strategy Attention Based Network Approximate Attention
Metamorphic Malware Evolution: The Potential and Peril of Large Language Models
Pooria Madani
Full Model Full Potential Large Language Malware Analysis Code Transformation Metamorphic Malware Mutation Testing
Transferable Ensemble Black-box Jailbreak Attacks on Large Language Models
Yiqi Yang, Hongye Fu
Full Model Black Box Jailbreak Attack Large Language LLM Attack

October 30, 2024

Can Models Help Us Create Better Models? Evaluating LLMs as Data Scientists
Michał Pietruszka, Łukasz Borchmann, Aleksander Jędrosz, Paweł Morawiecki
Large Language Model Full Model Medical LLM Raw Data Domain Knowledge Science Journalism Deep Understanding Feature Engineering Knowledge Intensive Task Base Model
Partial Channel Dependence with Channel Masks for Time Series Foundation Models
Seunghan Lee, Taeyoung Park, Kibok Lee
Anomaly Detection Full Model Single Task Channel Wise Time Series Foundation Model Channel Masking

Full Model

Papers

Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models

Discrete the solving model of time-variant standard Sylvester-conjugate matrix equations using Euler-forward formula

Evaluating Creative Short Story Generation in Humans and Large Language Models

An Exploration of Higher Education Course Evaluation by Large Language Models

Know Where You're Uncertain When Planning with Multimodal Foundation Models: A Formal Framework

Adaptive Conformal Inference by Particle Filtering under Hidden Markov Models
