ZhengyaoJiang

Organizations

@uclnlp @ucl-dark

Pinned

  1. WecoAI/aideml Public

    AIDE: AI-Driven Exploration in the Space of Code. The machine learning engineering agent that automates AI R&D.

    Python · 1.1k stars · 173 forks

  2. latentplan Public

    Code release for "Efficient Planning in a Compact Latent Action Space" (ICLR 2023): https://arxiv.org/abs/2208.10291

    Python · 113 stars · 12 forks

  3. Farama-Foundation/chatarena Public

    ChatArena (or Chat Arena) provides multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.

    Python · 1.5k stars · 147 forks

  4. PGPortfolio Public

    PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" (https://arxiv.org/pdf/1706.10059.pdf).

    Python · 1.9k stars · 762 forks

  5. NLRL Public

    Source code of "Neural Logic Reinforcement Learning" (https://arxiv.org/abs/1904.10729).

    Python · 77 stars · 28 forks

  6. GTG Public

    Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).

    Python · 29 stars · 8 forks