Skip to content
View renee-jia's full-sized avatar
πŸ€
πŸ€

Block or report renee-jia

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
renee-jia/README.md

Hey there, I'm Renee πŸ‘‹

AI Research Engineer @ Meta

I work on AI systems β€” from ranking and recommendation to LLMs and agents. I like thinking about how models behave in messy, real-world environments. Previously worked at Google and Amazon Alexa AI. Visiting Scholar at Harvard and University of Waterloo.


πŸ“¨ Connect with me


✏️ Writing

I write about things I'm learning and researching. Here's what I've been covering on my blog:

πŸ›‘οΈ AI Safety & Alignment β€” How reward hacking evolved from classical RL specification gaming to jailbreaks and deceptive alignment in LLMs. What it means for RLHF and building systems we can trust.

🧠 LLM Reasoning β€” What "reasoning" actually means in the context of large language models, grounded in research from chain-of-thought prompting to inference-time compute scaling.

🌐 Browser Agents & Goal Fidelity β€” Why the web is an adversarial environment for agents, and why being capable is not the same as being hard to manipulate.

🎯 Ranking & Recommendation Systems β€” A deep-dive series covering the full evolution: from foundational collaborative filtering, through the deep learning era, to modern sequential learning and long user history modeling in ads systems.


πŸŒ„ Beyond the code

When I'm not thinking about AI:

πŸ‚ Snowboarding β€” PSIA-AASI Level 1 certified instructor

♠️ Tournament Poker β€” Part-time player who loves the intersection of game theory, math, and psychology. Check out my results on Hendon Mob.


πŸ“Š GitHub Stats

Pinned Loading

  1. scholar-loop scholar-loop Public

    An autonomous AI scientist: a multi-agent loop over literature, experiments, self-critique and write-up, with deterministic guards against reward-hacking and hallucination.

    Python 348 24

  2. latent-feed latent-feed Public

    Automated AI news aggregation that feeds directly into your Obsidian vault. Stay current on the latest LLM research, trending repos, and breakthroughs β€” curated daily

    Python 213 27

  3. alpha-agent alpha-agent Public

    An AI-driven multi-agent trading platform for options trading and stock trends analysis. This project leverages advanced machine learning, real-time market data, and a modular multi-agent framework.

    Python 21 3

  4. trading-bot trading-bot Public

    Multi-agent macro trading bot: multi-factor stock scoring, momentum portfolio construction, backtesting vs SPY/Nasdaq, and live Alpaca execution.

    Python 103 15