romannekrasovaillm

Follow

Roman Nekrasov romannekrasovaillm

Follow

Agentic mid-training, reinforcement learning with reward verification (RLVR), scaling agent environments, interleaved agent reasoning with tools

3 followers · 4 following

Achievements

Achievements

Popular repositories Loading

qqr qqr Public

Forked from Alibaba-NLP/qqr

qqr is an RL training framework for open-ended agents.

Python 1
test test Public

Jupyter Notebook
suna suna Public

Forked from kortix-ai/suna

Suna - Open Source Generalist AI Agent

TypeScript
SWE-agent SWE-agent Public

Forked from SWE-agent/SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python
acp acp Public

Forked from i-am-bee/acp

Agent Communication Protocol

Python
One-Shot-RLVR One-Shot-RLVR Public

Forked from ypwang61/One-Shot-RLVR

official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”

Python