Markov decision process
Markovi otsustusprotsess, MDP
olemus
aegdiskreetse stohhastilise juhtimise protsess,
otsustuse modelleerimise matemaatiline karkass
olukordadeks, kus tulemused on
osalt otsustaja kontrolli all, osalt stohhastilised
=
a mathematical model of sequential decisions and a dynamic optimization
ülevaateid
https://www.youtube.com/watch?v=A12_bdqW6M8
https://www.cs.rice.edu/~vardi/dag01/givan1.pdf
https://people.eecs.berkeley.edu/~pabbeel/cs287-fa12/slides/mdps-exact-methods.pdf
https://en.wikipedia.org/wiki/Markov_decision_process
http://www.pitt.edu/~schaefer/papers/MDPTutorial.pdf
rakendusi
https://towardsdatascience.com/real-world-applications-of-markov-decision-process-mdp-a39685546026
http://www.it.uu.se/edu/course/homepage/aism/st11/MDPApplications3.pdf
https://www.mecsj.com/uplode/images/photo/OVERVIEW_OF_MARKOV_DECISION_PROCESS_APPLICATIONS.pdf
vt ka
- Markovi ahel