[TMLR 2024] Efficient Large Language Models: A Survey
Curated collection of papers in machine learning systems
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Personal paper-reading notes covering cloud computing, resource management, systems, machine learning, deep learning, and other interesting topics.
Course Material for the UG Course COMP4901Y
Projects and summaries for the Machine Learning [PPGEEC2318] course at UFRN, taught by Professor Ivanovitch Silva.
[Actively Maintained] [SIGCOMM 2023] Lightning: A Reconfigurable Photonic-Electronic SmartNIC for Fast and Energy-Efficient Inference
Price Incentive Model Efficiency System: 1.5-2x higher accuracy per training round than random/Oort, up to 30% decrease in convergence time.
CSCE 585 - Machine Learning Systems
Machine Learning Compiler Road Map
Learn how to design Machine Learning systems and prepare for an interview.
Assignments for Data Intensive Systems for Machine Learning Coursework
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
[ICML 2022] Rethinking Image-Scaling Attacks: The Interplay Between Vulnerabilities in Machine Learning Systems
A tool to predict the efficacy of DNN optimizations
Oort: Efficient Federated Learning via Guided Participant Selection
This is the course project for CSCE585: ML Systems. Students will build their machine learning systems based on the provided infrastructure, Athena.
A curated list of resources to deep dive into the intersection of applied machine learning and threat detection.
A C++ implementation of the scalar-valued autograd engine micrograd