System Performance
System performance research focuses on optimizing the efficiency and accuracy of various computational systems, from machine learning models to robotic controllers and even quantum computers. Current research emphasizes improving model architectures (e.g., graph-oriented databases for language models, retention-based networks for multi-agent reinforcement learning) and training techniques (e.g., hard sample mining, co-optimization of design and control), while also addressing issues like fairness, robustness, and explainability. These advancements have significant implications for diverse fields, impacting the development of more efficient and reliable AI systems, improved medical diagnostics, and enhanced manufacturing processes.
Papers
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism
Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia
BitNet b1.58 Reloaded: State-of-the-art Performance Also on Smaller Networks
Jacob Nielsen, Peter Schneider-Kamp
Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images
Qinfeng Zhu, Yuanzhi Cai, Lei Fan
How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics
Nidhir Bhavsar, Jonathan Jordan, Sherzod Hakimov, David Schlangen
Modeling & Evaluating the Performance of Convolutional Neural Networks for Classifying Steel Surface Defects
Nadeem Jabbar Chaudhry, M. Bilal Khan, M. Javaid Iqbal, Siddiqui Muhammad Yasir
A New Approach for Evaluating and Improving the Performance of Segmentation Algorithms on Hard-to-Detect Blood Vessels
João Pedro Parella, Matheus Viana da Silva, Cesar Henrique Comin
Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As
Eden Avnat, Michal Levy, Daniel Herstain, Elia Yanko, Daniel Ben Joya, Michal Tzuchman Katz, Dafna Eshel, Sahar Laros, Yael Dagan, Shahar Barami, Joseph Mermelstein, Shahar Ovadia, Noam Shomron, Varda Shalev, Raja-Elie E. Abdulnour
Advancing The Robotics Software Development Experience: Bridging Julia's Performance and Python's Ecosystem
Gustavo Nunes Goretkin, Joseph Carpinelli, Andy Park