Skip to content
View langfengQ's full-sized avatar
  • Singapore
  • 15:11 (UTC +08:00)

Highlights

  • Pro

Block or report langfengQ

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. verl-agent verl-agent Public

    verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

    Python 1.3k 117

  2. TimeMaster TimeMaster Public

    Official code for paper "TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning"

    Python 57 4

  3. MLF-DSResNet MLF-DSResNet Public

    This is the implementation of MLF & spiking DS-ResNet

    Python 17 2

  4. CoSo CoSo Public

    Official code for paper "Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning"

    Python 13 1

  5. tree-diffusion-planner tree-diffusion-planner Public

    Code for the paper "Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree"

    7 1