Skip to content
@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned Loading

  1. streaming-llm streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python 6.3k 356

  2. smoothquant smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python 1.1k 126

  3. llm-awq llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 2k 146

  4. bevfusion bevfusion Public

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python 2.1k 377

  5. once-for-all once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python 1.8k 333

  6. temporal-shift-module temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python 2k 416

Repositories

Showing 10 of 51 repositories
  • TinyChatEngine Public

    TinyChatEngine: On-Device LLM Inference Library

    mit-han-lab/TinyChatEngine’s past year of commit activity
    C++ 610 MIT 57 28 3 Updated Jun 25, 2024
  • distrifuser Public

    [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

    mit-han-lab/distrifuser’s past year of commit activity
    Python 476 MIT 12 6 0 Updated Jun 24, 2024
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    mit-han-lab/torchquantum’s past year of commit activity
    Jupyter Notebook 1,228 MIT 181 56 (4 issues need help) 7 Updated Jun 19, 2024
  • Quest Public

    [ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

    mit-han-lab/Quest’s past year of commit activity
    Cuda 63 3 1 0 Updated Jun 18, 2024
  • llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    mit-han-lab/llm-awq’s past year of commit activity
    Python 2,046 MIT 146 110 7 Updated Jun 12, 2024
  • lmquant Public
    mit-han-lab/lmquant’s past year of commit activity
    Python 70 Apache-2.0 1 3 0 Updated Jun 12, 2024
  • litepose Public

    [CVPR'22] Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation

    mit-han-lab/litepose’s past year of commit activity
    Python 299 MIT 35 18 1 Updated Jun 5, 2024
  • gan-compression Public

    [CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs

    mit-han-lab/gan-compression’s past year of commit activity
    Python 1,098 147 3 6 Updated Jun 5, 2024
  • efficientvit Public

    EfficientViT is a new family of vision models for efficient high-resolution vision.

    mit-han-lab/efficientvit’s past year of commit activity
    Python 1,573 Apache-2.0 141 75 0 Updated Jun 4, 2024
  • torchsparse Public

    [MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.

    mit-han-lab/torchsparse’s past year of commit activity
    Cuda 1,147 MIT 130 23 0 Updated May 31, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.