Inferior Attribution Performance

Inferior attribution performance in machine learning models refers to the difficulty in accurately tracing model outputs back to their contributing input features or training data. Current research focuses on improving attribution methods for various model architectures, including deep neural networks and diffusion models, often employing gradient-based techniques or novel feature representations like trajectory-based approaches. This research is crucial for enhancing model reliability, ensuring fairness in data valuation, improving model robustness through techniques like attribution-driven dropout, and advancing explainable AI, particularly in anomaly detection and authorship attribution. Ultimately, overcoming these attribution challenges is vital for building more trustworthy and interpretable AI systems.

Papers