On Markov decision processes with the stochastic differential Bellman Equation

Cakir, Merve Nur

Files

PHD Ohne Lebenslauf.pdf (1.24 MB)

Date

2025

Authors

Cakir, Merve Nur

Supervisors

Möller, Ralf

Rössler, Andreas

Abstract

Stochastic differential equations play an important role in capturing the dynamics of complex systems in which uncertainty appears as noise. Noise is abundant in such systems and its exact behaviour is unknown, but it can be simulated with stochastic processes. Stochastic calculus provides tools, such as the Itô formula, for navigating these systems. In this work, the adaptation of the Bellman equation, a cornerstone of dynamic programming, to stochastic differential equations is explored, which facilitates the modeling of decision problems subject to noise. Value iteration and Q-learning, two well-known solution methods in machine learning, are extended to stochastic algorithms that approximate the solution of Markov decision processes whose uncertainties are modeled by the stochastic differential Bellman equation. These stochastic algorithms enable a realistic and efficient approach to modeling and solving decision problems in stochastic environments. Stochastic value iteration is applied when the environment is fully known, while stochastic Q-learning remains applicable when the transition probabilities are unknown. Theoretical analyses and case studies demonstrate the efficacy and applicability of these algorithms. In addition, stochastic Q-learning achieves higher rewards than the deterministic algorithm, indicating that it optimizes decision processes in stochastic environments more effectively by exploring more states. Finally, the stochastic differential Bellman equation is formulated as a system of ordinary equations, providing an alternative solution approach. For this purpose, the concept of a random dynamical system, of which a stochastic differential equation is an example, is explored.
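
For orientation, the two classical building blocks named in the abstract can be sketched as follows. This is a minimal illustration, not the thesis's stochastic algorithms: it shows textbook value iteration on a finite Markov decision process with known transition probabilities, and an Euler-Maruyama simulation of a scalar stochastic differential equation. The two-state MDP, the drift and diffusion functions, and all names (`value_iteration`, `euler_maruyama`, `P`, `R`) are assumptions chosen for the example.

```python
import math
import random


def value_iteration(P, R, gamma=0.9, tol=1e-8):
    """Textbook value iteration for a finite MDP (not the stochastic variant).

    P[s][a] is a list of (probability, next_state) pairs and
    R[s][a] is the immediate reward for action a in state s.
    """
    V = [0.0 for _ in P]
    while True:
        delta = 0.0
        for s in range(len(P)):
            # Bellman optimality update: best one-step lookahead value.
            best = max(
                R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a])
                for a in range(len(P[s]))
            )
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            return V


def euler_maruyama(x0, drift, diffusion, dt, n_steps, rng):
    """Simulate one path of dX = drift(X) dt + diffusion(X) dW."""
    path = [x0]
    for _ in range(n_steps):
        x = path[-1]
        # Brownian increment over a step of length dt: N(0, dt).
        dW = rng.gauss(0.0, math.sqrt(dt))
        path.append(x + drift(x) * dt + diffusion(x) * dW)
    return path


# Hypothetical two-state, two-action MDP used only for illustration.
P = [
    [[(1.0, 0)], [(0.8, 1), (0.2, 0)]],  # transitions from state 0
    [[(1.0, 1)], [(1.0, 0)]],            # transitions from state 1
]
R = [[0.0, 1.0], [2.0, 0.0]]
V = value_iteration(P, R)

# Hypothetical mean-reverting SDE with constant noise intensity.
path = euler_maruyama(
    1.0, lambda x: -0.5 * x, lambda x: 0.3, 0.01, 100, random.Random(0)
)
```

Value iteration needs the full transition model `P`, which corresponds to the "environment is fully known" setting in the abstract. Q-learning would instead estimate action values from sampled transitions. How the thesis injects noise into either update is not specified in this record and is not reproduced here.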

Keywords

Markov decision process, stochastic process, machine learning

URI

https://epub.uni-luebeck.de/handle/zhb_hl/3377
urn:nbn:de:gbv:841-202502181

Institute/Clinic

Institut für Informationssysteme

Section

Informatik/Technik
