I am interested in building AI systems that are useful, adaptive, and reliable in real-world settings.

My current focus is on post-training, reward modeling, reinforcement learning, and agentic AI systems. I am especially interested in how learning from feedback, preferences, and verifiable outcomes can make models more steerable and effective after pretraining.

Previously, my work has spanned natural language processing, information extraction, knowledge representation, fraud detection, and production machine learning systems. I care about both the modeling side and the systems side: how models are trained, evaluated, deployed, monitored, and improved in practice.

Current Interests

Published Work

Writing

I write about reinforcement learning, post-training, LLM systems, and applied machine learning. Blog →