whyNLP

Haoyi Wu's NLP research projects.

Popular repositories

  1. LCKV (Public)

    Layer-Condensed KV cache with a 10 times larger batch size, fewer parameters, and less computation. Dramatic speedup with better task performance. Accepted to ACL 2024.

    Python 157 14

  2. Conic10K (Public)

    Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP 2023 Findings.

    Python 30 1

  3. Probabilistic-Transformer (Public)

    A probabilistic model for contextual word representation. Accepted to ACL 2023 Findings.

    Python 25 2

  4. tinyllama (Public)

    A side project that applies all the acceleration tricks from TinyLlama with minimal modifications to the Hugging Face Transformers code.

    Python 13 4

  5. PCCoT (Public)

    Parallel Continuous Chain-of-Thought with Jacobi Iteration. Accepted to EMNLP 2025.

    Python 12 3

  6. nni-slurm (Public)

    Forked from microsoft/nni

    A patch for NNI with Slurm and W&B.

    Python 8

Repositories

Showing 8 of 8 repositories