Muhammad Ahmed Mohsin

I am a second year master’s student @ Stanford University, advised by Dr. Emily Fox and Prof. John M. Cioffi. My research focuses on applied reinforcement learning and machine learning for optimization, with an emphasis on building robust systems for complex, dynamic, and non-stationary decision-making. My interests also include LLM post-training and inference, including offline preference optimization for chain-of-thought enhancement, reinforcement learning for improving LLM reasoning, active alignment with Bayesian General Preference Models, and agentic frameworks for test-time compute budget allocation.

Research Interests

News

Reviewer

Research [representative | all]