Skip to content
@Dao-AILab

Dao AI Lab

We are an AI research group led by Prof. Tri Dao

Popular repositories Loading

  1. flash-attention flash-attention Public

    Fast and memory-efficient exact attention

    Python 24.2k 2.8k

  2. quack quack Public

    A Quirky Assortment of CuTe Kernels

    Python 1k 137

  3. causal-conv1d causal-conv1d Public

    Causal depthwise conv1d in CUDA, with a PyTorch interface

    Cuda 905 192

  4. sonic-moe sonic-moe Public

    Accelerating MoE with IO and Tile-aware Optimizations

    Python 714 90

  5. fast-hadamard-transform fast-hadamard-transform Public

    Fast Hadamard transform in CUDA, with a PyTorch interface

    C 329 63

  6. gram-newton-schulz gram-newton-schulz Public

    Fast Polar Decomposition for Muon

    Python 155 13

Repositories

Showing 10 of 11 repositories

Top languages

Loading…

Most used topics

Loading…