Developing Optimal Decision Strategies with Markovian Decision Process

Jay Prakash; Sarvesh Yadav; Umesh Sharma

doi:10.30574/gjeta.2024.20.1.0112

Developing Optimal Decision Strategies with Markovian Decision Process

Jay Prakash ^*, Sarvesh Yadav and Umesh Sharma

Department of Mathematics, Sanskriti University, Mathura, 281401, India.

Review Article

Global Journal of Engineering and Technology Advances, 2024, 20(01), 216–230.

Article DOI: 10.30574/gjeta.2024.20.1.0112

DOI url: https://doi.org/10.30574/gjeta.2024.20.1.0112

Publication history:

Received on 20 May 2024; revised on 02 July 2024; accepted on 05 July 2024

Abstract:

The use of Markovian Decision Processes (MDPs) in creating the best possible decision strategies is examined in this research. When outcomes are partly controlled by a decision-maker and partially random, MDPs offer a mathematical framework for simulating decision-making. Finding strategies that optimize cumulative benefits over time while accounting for decision-making's immediate and long-term effects is the main goal. We go over the fundamental ideas of MDPs, such as states, actions, transitions, rewards, and the Bellman equation, which serves as the foundation for figuring out the best course of action. Practical aspects are also looked at, such as computer techniques for solving MDPs and situations in which MDPs work especially well. Decision-makers can systematically examine complicated decision situations by utilizing MDPs, which can result in well-informed and effective decision-making techniques across a variety of areas. Creating methods and algorithms for determining the best policies or decision-making processes within the context of MDPs is another goal. This entails formulating techniques to calculate ideal policies that traverse a series of states and acts in a way that maximizes long-term rewards. The study provides some information about MPs as well as specific conclusions pertaining to martingales. We show the obtained results on a general finite-dimensional filter. This work presents some findings on a general finite dimensional filter, semi-martingale decomposition, Zakai recursion, and related findings about smoothers in different contexts. After deriving the log-likelihood function, it is shown to be both increasing and convergent. Furthermore, the MLE of the parameter has been found.

Keywords:

Markovian Process; Transportation; Algorithms; Zakai Recursion.

Full text article in PDF:

Click here

You are here

Developing Optimal Decision Strategies with Markovian Decision Process

Jay Prakash *, Sarvesh Yadav and Umesh Sharma

Jay Prakash ^*, Sarvesh Yadav and Umesh Sharma