Skip to content
View JenWei0312's full-sized avatar
:octocat:
Working from home
:octocat:
Working from home

Block or report JenWei0312

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. All_things_attention All_things_attention Public

    Comparison of different kinds of attentions

    Jupyter Notebook 1

  2. deepseek-moe deepseek-moe Public

    Python

  3. OLMo OLMo Public

    Forked from allenai/OLMo

    Modeling, training, eval, and inference code for OLMo

    Python

  4. huggingface/trl huggingface/trl Public

    Train transformer language models with reinforcement learning.

    Python 16.8k 2.4k

  5. allenai/OLMo allenai/OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.3k 692

  6. deepseek-mla deepseek-mla Public

    Implementation of DeepSeek's Multihead Latent Attention architecture

    Python