Process-Reward-Models on Rajat Patel

https://rajathpatel23.github.io/tags/process-reward-models/
Recent content in Process-Reward-Models on Rajat Patel
rpatel12@umbc.edu (Rajat Patel)
Sun, 07 Jun 2026 00:00:00 +0000

Evidence-Driven Deep Research Agent
https://rajathpatel23.github.io/posts/deep-research-agent/
Sun, 07 Jun 2026 00:00:00 +0000, rpatel12@umbc.edu (Rajat Patel)

How I built visibility into the research process itself: an explicit evidence state, step-level reward signals, and a planner that uses those signals deliberately. Without retraining.