Infrastructures™ for Machine Learning Training/Inference in Production.
-
Updated
May 24, 2019
Infrastructures™ for Machine Learning Training/Inference in Production.
[TMLR 2024] Efficient Large Language Models: A Survey
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Dive into machine learning system, start from reinventing the wheel.
Oort: Efficient Federated Learning via Guided Participant Selection
This is the course project for CSCE585: ML Systems. Students will build their machine learning systems based on the provided infrastructure --- Athena.
Learn how to design Machine Learning systems and prepare for an interview.
CSCE 585 - Machine Learning Systems
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
Assignments for Data Intensive Systems for Machine Learning Coursework
Machine Learning Compiler Road Map
Curated collection of papers in machine learning systems
A curated list of resources to deep dive into the intersection of applied machine learning and threat detection.
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).
[ICML 2022] Rethinking Image-Scaling Attacks: The Interplay Between Vulnerabilities in Machine Learning Systems
A C++ implementation of the scalar-valued autograd engine micrograd
Course Material for the UG Course COMP4901Y
A tool to predict the efficacy of DNN optimizations
[Actively Maintained] [SIGCOMM 2023] Lightning: A Reconfigurable Photonic-Electronic SmartNIC for Fast and Energy-Efficient Inference
Add a description, image, and links to the machine-learning-systems topic page so that developers can more easily learn about it.
To associate your repository with the machine-learning-systems topic, visit your repo's landing page and select "manage topics."