Skip to content
View billhhh's full-sized avatar
:octocat:
Coding
:octocat:
Coding

Block or report billhhh

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. KRPO_LLMs_RL KRPO_LLMs_RL Public

    The code repository for paper "Kalman Filter Enhanced Group Relative Policy Optimization for Language Model Reasoning"

    Python 13 1

  2. ShaSpec ShaSpec Public

    The official code repository of ShaSpec model from CVPR 2023 [paper](https://arxiv.org/pdf/2307.14126) "Multi-modal Learning with Missing Modality via Shared-Specific Feature Modelling"

    Python 96 10

  3. TrafficOptim_RL TrafficOptim_RL Public

    The code repo for paper "Multi-intersection Traffic Optimisation: ABenchmark Dataset and a Strong Baseline"

    Python 11

  4. MetaKD MetaKD Public

    The code repository of MetaKD model from paper (https://arxiv.org/pdf/2405.07155) "Meta-Learned Modality-Weighted Knowledge Distillation for Robust Multi-Modal Learning with Missing Data".

    Python 9 3

  5. Rethink-Merge Rethink-Merge Public

    The code repository of from [paper](https://arxiv.org/abs/2411.09263) "Rethinking Weight-Averaged Model-merging".

    Python 4 2

  6. RDP RDP Public

    Codes for IJCAI2020 paper "Unsupervised Representation Learning by Predicting Random Distances” https://arxiv.org/abs/1912.12186

    Python 29 8