Skip to content
View aerosta's full-sized avatar

Block or report aerosta

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. rewardhackwatch rewardhackwatch Public

    Runtime detector for reward hacking and misalignment in LLM agents (89.7% F1 on 5,391 trajectories).

    Python 12