Symbolic algorithms for qualitative analysis of Markov decision processes with Büchi objectives
Chatterjee, Krishnendu
Henzinger, Monika
Joglekar, Manas
Shah, Nisarg
We consider Markov decision processes (MDPs) with Büchi (liveness) objectives. We consider the problem of computing the set of almost-sure winning states from where the objective can be ensured with probability 1. Our contributions are as follows: First, we present the first subquadratic symbolic algorithm to compute the almost-sure winning set for MDPs with Büchi objectives; our algorithm takes O(n · √ m) symbolic steps as compared to the previous known algorithm that takes O(n 2) symbolic steps, where n is the number of states and m is the number of edges of the MDP. In practice MDPs have constant out-degree, and then our symbolic algorithm takes O(n · √ n) symbolic steps, as compared to the previous known O(n 2) symbolic steps algorithm. Second, we present a new algorithm, namely win-lose algorithm, with the following two properties: (a) the algorithm iteratively computes subsets of the almost-sure winning set and its complement, as compared to all previous algorithms that discover the almost-sure winning set upon termination; and (b) requires O(n · √ K) symbolic steps, where K is the maximal number of edges of strongly connected components (scc's) of the MDP. The win-lose algorithm requires symbolic computation of scc's. Third, we improve the algorithm for symbolic scc computation; the previous known algorithm takes linear symbolic steps, and our new algorithm improves the constants associated with the linear number of steps. In the worst case the previous known algorithm takes 5×n symbolic steps, whereas our new algorithm takes 4×n symbolic steps.
Springer
2013
info:eu-repo/semantics/article
doc-type:article
text
https://research-explorer.app.ist.ac.at/record/2831
Chatterjee K, Henzinger M, Joglekar M, Shah N. Symbolic algorithms for qualitative analysis of Markov decision processes with Büchi objectives. <i>Formal Methods in System Design</i>. 2013;42(3):301-327. doi:<a href="https://doi.org/10.1007/s10703-012-0180-2">10.1007/s10703-012-0180-2</a>
eng
info:eu-repo/semantics/altIdentifier/doi/10.1007/s10703-012-0180-2
info:eu-repo/semantics/altIdentifier/arxiv/1104.3348
info:eu-repo/grantAgreement/FWF//P 23499-N23
info:eu-repo/grantAgreement/FWF//S11407
info:eu-repo/grantAgreement/EC/FP7/279307
info:eu-repo/semantics/openAccess