Software Cache

Software caching speeds up data access by keeping frequently used data in faster, smaller memory levels closer to the processor. Current research targets cache designs for applications such as large language models, recommendation systems, and scientific workflows, using semantic analysis, machine-learning-based prediction, and combinatorial optimization to raise hit rates and reduce latency. These advances matter for the performance and scalability of applications ranging from high-performance computing to personalized online services.
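
To make the notions of hit rate and eviction concrete, here is a minimal sketch of a least-recently-used (LRU) cache in Python. It is only an illustration of the basic idea, not a method from any of the work surveyed here; the names `LRUCache`, `load`, and `hit_rate` are hypothetical, and LRU is assumed as a simple baseline policy in place of the semantic or learned policies mentioned above.

```python
from collections import OrderedDict


class LRUCache:
    """Illustrative LRU cache that also counts hits and misses."""

    def __init__(self, capacity, load):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self.load = load            # hypothetical slow fetch: key -> value
        self.store = OrderedDict()  # ordered from least to most recently used
        self.hits = 0
        self.misses = 0

    def get(self, key):
        if key in self.store:
            # Hit: serve from the cache and mark the key as most recent.
            self.hits += 1
            self.store.move_to_end(key)
            return self.store[key]
        # Miss: fetch from the slow source, then cache the result.
        self.misses += 1
        value = self.load(key)
        self.store[key] = value
        if len(self.store) > self.capacity:
            # Over capacity: evict the least recently used entry.
            self.store.popitem(last=False)
        return value

    def hit_rate(self):
        total = self.hits + self.misses
        return self.hits / total if total else 0.0
```

With a capacity of 2 and the access sequence 1, 2, 1, 3, 2, only the third access hits, so the hit rate is 1 in 5. A smarter replacement or prefetching policy aims to push that ratio up for the same capacity.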

Papers