sunyiyou

Pinned

  1. deeplearning-wisc/react Public

    Code for NeurIPS 2021 paper "ReAct: Out-of-distribution Detection With Rectified Activations"

    Python 55 10

  2. deeplearning-wisc/knn-ood Public

    Code for ICML 2022 paper "Out-of-distribution Detection with Deep Nearest Neighbors"

    Python 195 18

  3. sunblaze-ucb/reasoning_ladder Public

    Python 35 4

  4. sunblaze-ucb/omega Public

    Python 46 4

  5. rdi-berkeley/awesome-RLVR-boundary Public

    A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Language Models (LLMs).

    86 7

  6. sunblaze-ucb/rl-grok-recipe Public

    Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs"

    Python 23