Posts by Tags

Decoupling Time and Risk: Risk-Sensitive RL with General Discounting

5 minute read

Published: February 09, 2026

In standard Reinforcement Learning (RL), the discount factor (\(\gamma\)) is often treated as a fixed parameter of the Markov Decision Process or a tunable hyperparameter for training stability. We typically default to exponential discounting, where the value of a reward decays by a constant factor at every time step.

Decoupling Time and Risk: Risk-Sensitive RL with General Discounting

5 minute read

Published: February 09, 2026

In standard Reinforcement Learning (RL), the discount factor (\(\gamma\)) is often treated as a fixed parameter of the Markov Decision Process or a tunable hyperparameter for training stability. We typically default to exponential discounting, where the value of a reward decays by a constant factor at every time step.

Decoupling Time and Risk: Risk-Sensitive RL with General Discounting

5 minute read

Published: February 09, 2026

In standard Reinforcement Learning (RL), the discount factor (\(\gamma\)) is often treated as a fixed parameter of the Markov Decision Process or a tunable hyperparameter for training stability. We typically default to exponential discounting, where the value of a reward decays by a constant factor at every time step.

Mehrdad Moghimi

Posts by Tags

Distributional RL

Decoupling Time and Risk: Risk-Sensitive RL with General Discounting

General Discounting

Decoupling Time and Risk: Risk-Sensitive RL with General Discounting

Stock-augmentation

Decoupling Time and Risk: Risk-Sensitive RL with General Discounting