RobertKirk

Follow

Robert Kirk RobertKirk

Follow

PhD student at @ucl-dark. Interested in understanding LLM fine-tuning, AI safety and (super)alignment.

50 followers · 9 following

Achievements

Achievements

Highlights

Pro

Pinned Loading

facebookresearch/rlfh-gen-div facebookresearch/rlfh-gen-div Public archive

This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity

Python 50 7
tinystories-wrappers tinystories-wrappers Public

Code for the TinyStories experiments from "Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks".

Jupyter Notebook 9 1
facebookresearch/minihack facebookresearch/minihack Public archive

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Python 519 68
stanford_alpaca stanford_alpaca Public

Forked from tatsu-lab/stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python