Skip to content
View shawntan's full-sized avatar

Organizations

@nushackers @basement-gang

Block or report shawntan

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. neural-turing-machines neural-turing-machines Public

    Attempt at implementing system described in "Neural Turing Machines." by Graves, Alex, Greg Wayne, and Ivo Danihelka. (http://arxiv.org/abs/1410.5401)

    Jupyter Notebook 460 94

  2. scattermoe scattermoe Public

    Triton-based implementation of Sparse Mixture of Experts.

    Python 278 29

  3. SUT SUT Public

    Repository for Sparse Universal Transformers

    Python 20 1

  4. stickbreaking-attention stickbreaking-attention Public

    Stick-breaking attention

    Python 63 4

  5. open-lm-engine/lm-engine open-lm-engine/lm-engine Public

    LM engine is a library for pretraining/finetuning LLMs

    Python 182 30