
I'm a graduate student @ SAIL (Stanford AI Lab) co-advised by Dr. Emily Fox and Dr. John M. Cioffi
My research focuses on LLM post-training and inference, including preference optimization, active learning, and alignment for reasoning models, alongside reinforcement learning for high-diversity generation and adaptive agentic test-time compute.
I also develop Internet of Evolving Agents frameworks for self-evolving multi-agent systems with dynamic reputation modeling and social graph-based coordination, and work on applied reinforcement learning for complex, dynamic, and non-stationary decision-making, including RL methods tailored for LLM reasoning.
LLM Inference and Test-Time Scaling
Working on test-time training methods for scientific discovery under uncertainty, with a focus on adaptive compute allocation, agentic planning, and stratified scaling search for test-time reasoning in large language models and diffusion language models.
CoLM'26, NeurIPS'26, OngoingEvolving Agentic Systems
Developing Internet of Evolving Agents frameworks for self-evolving multi-agent systems with dynamic reputation modeling and social graph-based coordination mechanisms.
NeurIPS'26, OngoingReinforcement Learning for LLMs
Research on preference optimization, active learning, and alignment methods for large language model reasoning systems. Current work also explores reinforcement learning approaches for reward decomposition to mitigate sycophancy and improve alignment.
ICML'26, NeurIPS'26, OngoingStanford Artificial Intelligence Laboratory (SAIL)
December 2025 – Present · Advisor: Prof. Emily FoxProject: Internet of Evolving Agents
Project: Test-Time Compute and Reasoning in Large Language Models
Project: Bayesian Preference Alignment for Mathematical Reasoning
Intel Corporation
September 2024 – December 2024 · Advisor: Dr. John M. CioffiProject: Neural Gaussian Radio Fields for Environment Perception
Samsung Semiconductors
June 2024 – September 2025 · Advisor: Dr. John M. CioffiProject: Deep Reinforcement Learning Accelerated Optimization: Graph Neural Networks for Accelerating Low-Rank SDP Solvers (expected NeurIPS 2026)
M. A. Mohsin, A. Bilal, M. Umer, E. Fox
A. Bilal*, M. A. Mohsin*, M. Umer, D. F. Hougen
M. Umer*, M. A. Mohsin*, A. Bilal, Ellen Vitercik, J. M. Cioffi
M. A. Mohsin, M. Umer, A. Bilal, Ellen Vitercik, J. M. Cioffi
A. Bilal*, M. A. Mohsin*, M. Umer, D. F. Hougen, J. M. Cioffi
M. A. Mohsin, M. Umer, A. Bilal, J. M. Cioffi, Ellen Vitercik
M. A. Mohsin*, M. Umer*, A. Bilal, J. M. Cioffi

Cape Town, South Africa
Presenting research on AI-driven wireless networks
Served as an Area Chair for ICASSP.
Selected as a Qualcomm Fellowship finalist.
Serving as Workshop Co-Chair for VTC Fall 2026 in Boston.
Served as a member of the Technical Program Committee at NeurIPS 2025 and also as a NeurIPS reviewer.
Received the Exemplary Reviewer recognition for IEEE Wireless Communications Letters 2025.
Added as a founding member of the IEEE Special Interest Group on AI-driven TN-NTN Networks.
Paper accepted at ICML 2025 on "Continual Learning for Wireless Channel Estimation," along with a student travel grant to ICML.
Received an ICC student travel grant for Montreal and a best workshop paper award for RAG-optimized wireless environment perception.
Two papers on Hierarchical Deep RL and Joint Source Compression accepted at AAAI 2025 in Philadelphia.
Two papers on diffusion-based Langevin dynamics and minPMAC optimization accepted at IEEE ICASSP 2025 in India.
Awarded a Globecom 2024 travel grant for travel to Cape Town.
Received a Best Poster Award nomination at the 6G Summit in Abu Dhabi.
Awarded the Rector’s Gold Medal for best undergraduate thesis.
Accepted to Stanford with the Stanford Graduate Fellowship.
First paper accepted at ICDAR 2023, outperforming Microsoft’s DiT on table recognition tasks.
Received the ECAT Scholarship for ranking among the top 10 in Pakistan in the engineering category test.
Received the President’s Medal for ranking among the top 3 students across Pakistan at the HSSC level.
Awarded the NUST scholarship for maintaining a 4.0 GPA.
I am always open to discussing new research collaborations and opportunities.