About me

I am a fifth year PhD student in Applied Mathematics at York University, where I am fortunate to be supervised by Professor Hyejin Ku. My research focuses on risk-sensitive and distributional reinforcement learning. Previously, I worked as a quantitative researcher at Risklab AI, where I had the pleasure of collaborating with Professor Hamidreza Arian. I hold a BSc in Computer Science and an MBA in Finance from Sharif University of Technology. You can find my CV here.


I am generally interested in Reinforcment Learning and its applications. My main areas of research include:

  • Risk-sensitive and Safe RL
  • RL for LLM Reasoning

Publications:

  • Utility-Constrained Policy Optimization (Under review)
  • Decoupling Time and Risk: Risk-Sensitive Reinforcement Learning with General Discounting (Under review, arXiv)
  • Risk-sensitive Actor-Critic with Static Spectral Risk Measures for Online and Offline Reinforcement Learning (Expert Systems with Applications, Journal, arXiv)
  • Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning (ICML 2025, PMLR, arXiv)
  • Encoded Value-at-Risk: A Predictive Machine for Financial Risk Management (Mathematics and Computers in Simulation 2022, Journal, arXiv)

News

  • May 2026: I joined RBC Borealis as a research intern to work on Knowledge distillation
  • May 2026: Preprint on “Utility-Constrained Policy Optimization”
  • March 2026: Our paper “Risk-sensitive Actor-Critic with Static Spectral Risk Measures for Online and Offline Reinforcement Learning” was published in the Expert Systems with Applications journal
  • February 2026: Preprint on “Decoupling Time and Risk: Risk-Sensitive Reinforcement Learning with General Discounting”
  • July 2025: Preprint on “Risk-sensitive Actor-Critic with Static Spectral Risk Measures for Online and Offline Reinforcement Learning”
  • May 2025: Our paper “Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning” was accepted at ICML 2025
  • January 2025: Preprint on “Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning”
  • December 2022: Our paper “Encoded Value-at-Risk: A Predictive Machine for Financial Risk Management” was published in the Mathematics and Computers in Simulation journal
  • September 2021: Joined York University, Mathematics and Statistics department.
  • November 2020: Preprint on “Encoded Value-at-Risk: A Predictive Machine for Financial Risk Management”
  • January 2020: Public talk on “Building Diversified Portfolios that Outperform Out-of-Sample” (Link to Presenation in Farsi)
  • September 2018: Received my bachelor’s degree in Computer Science, Ranked 1st in the class of 2018