Skip to content
View mansicer's full-sized avatar
🎆
coding
🎆
coding

Highlights

  • Pro

Organizations

@LAMDA-RL

Block or report mansicer

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. LAMDA-RL/ODIS LAMDA-RL/ODIS Public

    The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    Python 46 6

  2. MAIC MAIC Public

    The implementation of AAAI 2022 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".

    Python 59 10

  3. Q-Adapter Q-Adapter Public

    Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"

    Python 18 1

  4. LAMDA-RL/ReDA LAMDA-RL/ReDA Public

    The implementation of the AAMAS 2024 paper "Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation"

    Python 3 1