About me

I am a fifth year PhD student in Applied Mathematics at York University, where I am fortunate to be supervised by Professor Hyejin Ku. My research focuses on risk-sensitive and distributional reinforcement learning. Previously, I worked as a quantitative researcher at Risklab AI, where I had the pleasure of collaborating with Professor Hamidreza Arian. I hold a BSc in Computer Science and an MBA in Finance from Sharif University of Technology. You can find my CV here.

I am generally interested in Reinforcment Learning and its applications. My main areas of research include:

Risk-sensitive and Safe RL
RL for LLM Reasoning

Publications:

Utility-Constrained Policy Optimization (Under review)
Decoupling Time and Risk: Risk-Sensitive Reinforcement Learning with General Discounting (Under review, arXiv)
Risk-sensitive Actor-Critic with Static Spectral Risk Measures for Online and Offline Reinforcement Learning (Expert Systems with Applications, Journal, arXiv)
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning (ICML 2025, PMLR, arXiv)
Encoded Value-at-Risk: A Predictive Machine for Financial Risk Management (Mathematics and Computers in Simulation 2022, Journal, arXiv)

News

May 2026: I joined RBC Borealis as a research intern to work on Knowledge distillation
May 2026: Preprint on “Utility-Constrained Policy Optimization”
March 2026: Our paper “Risk-sensitive Actor-Critic with Static Spectral Risk Measures for Online and Offline Reinforcement Learning” was published in the Expert Systems with Applications journal
February 2026: Preprint on “Decoupling Time and Risk: Risk-Sensitive Reinforcement Learning with General Discounting”
July 2025: Preprint on “Risk-sensitive Actor-Critic with Static Spectral Risk Measures for Online and Offline Reinforcement Learning”
May 2025: Our paper “Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning” was accepted at ICML 2025
January 2025: Preprint on “Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning”
December 2022: Our paper “Encoded Value-at-Risk: A Predictive Machine for Financial Risk Management” was published in the Mathematics and Computers in Simulation journal
September 2021: Joined York University, Mathematics and Statistics department.
November 2020: Preprint on “Encoded Value-at-Risk: A Predictive Machine for Financial Risk Management”
January 2020: Public talk on “Building Diversified Portfolios that Outperform Out-of-Sample” (Link to Presenation in Farsi)
September 2018: Received my bachelor’s degree in Computer Science, Ranked 1st in the class of 2018

Mehrdad Moghimi

Publications:

News