TY  - CONF
AB  - We provide a learning-based technique for guessing a winning strategy in a parity game originating from an LTL synthesis problem. A cheaply obtained guess can be useful in several applications. Not only can the guessed strategy be applied as best-effort in cases where the game’s huge size prohibits rigorous approaches, but it can also increase the scalability of rigorous LTL synthesis in several ways. Firstly, checking whether a guessed strategy is winning is easier than constructing one. Secondly, even if the guess is wrong in some places, it can be fixed by strategy iteration faster than constructing one from scratch. Thirdly, the guess can be used in on-the-fly approaches to prioritize exploration in the most fruitful directions.
In contrast to previous works, we (i) reflect the highly structured logical information in the game’s states, the so-called semantic labelling, coming from the recent LTL-to-automata translations, and (ii) learn to reflect it properly by learning from previously solved games, bringing the solving process closer to human-like reasoning.
AU  - Kretinsky, Jan
AU  - Meggendorfer, Tobias
AU  - Prokop, Maximilian
AU  - Rieder, Sabine
ID  - 14259
SN  - 0302-9743
T2  - 35th International Conference on Computer Aided Verification 
TI  - Guessing winning policies in LTL synthesis by semantic learning
VL  - 13964
ER  - 
TY  - CONF
AB  - A classic solution technique for Markov decision processes (MDP) and stochastic games (SG) is value iteration (VI). Due to its good practical performance, this approximative approach is typically preferred over exact techniques, even though no practical bounds on the imprecision of the result could be given until recently. As a consequence, even the most used model checkers could return arbitrarily wrong results. Over the past decade, different works derived stopping criteria, indicating when the precision reaches the desired level, for various settings, in particular MDP with reachability, total reward, and mean payoff, and SG with reachability. In this paper, we provide the first stopping criteria for VI on SG with total reward and mean payoff, yielding the first anytime algorithms in these settings. To this end, we provide the solution in two flavours: First through a reduction to the MDP case and second directly on SG. The former is simpler and automatically utilizes any advances on MDP. The latter allows for more local computations, heading towards better practical efficiency. Our solution unifies the previously mentioned approaches for MDP and SG and their underlying ideas. To achieve this, we isolate objective-specific subroutines as well as identify objective-independent concepts. These structural concepts, while surprisingly simple, form the very essence of the unified solution.
AU  - Kretinsky, Jan
AU  - Meggendorfer, Tobias
AU  - Weininger, Maximilian
ID  - 13967
SN  - 1043-6871
T2  - 38th Annual ACM/IEEE Symposium on Logic in Computer Science
TI  - Stopping criteria for value iteration on stochastic games with quantitative objectives
VL  - 2023
ER  - 
TY  - JOUR
AB  - Transforming ω-automata into parity automata is traditionally done using appearance records. We present an efficient variant of this idea, tailored to Rabin automata, and several optimizations applicable to all appearance records. We compare the methods experimentally and show that our method produces significantly smaller automata than previous approaches.
AU  - Kretinsky, Jan
AU  - Meggendorfer, Tobias
AU  - Waldmann, Clara
AU  - Weininger, Maximilian
ID  - 10602
JF  - Acta Informatica
KW  - computer networks and communications
KW  - information systems
KW  - software
SN  - 0001-5903
TI  - Index appearance record with preorders
VL  - 59
ER  - 
TY  - CONF
AB  - We consider the problem of approximating the reachability probabilities in Markov decision processes (MDP) with uncountable (continuous) state and action spaces. While there are algorithms that, for special classes of such MDP, provide a sequence of approximations converging to the true value in the limit, our aim is to obtain an algorithm with guarantees on the precision of the approximation.
As this problem is undecidable in general, assumptions on the MDP are necessary. Our main contribution is to identify sufficient assumptions that are as weak as possible, thus approaching the "boundary" of which systems can be correctly and reliably analyzed. To this end, we also argue why each of our assumptions is necessary for algorithms based on processing finitely many observations.
We present two solution variants. The first one provides converging lower bounds under weaker assumptions than typical ones from previous works concerned with guarantees. The second one then utilizes stronger assumptions to additionally provide converging upper bounds. Altogether, we obtain an anytime algorithm, i.e. yielding a sequence of approximants with known and iteratively improving precision, converging to the true value in the limit. Besides, due to the generality of our assumptions, our algorithms are very general templates, readily allowing for various heuristics from literature in contrast to, e.g., a specific discretization algorithm. Our theoretical contribution thus paves the way for future practical improvements without sacrificing correctness guarantees.
AU  - Grover, Kush
AU  - Kretinsky, Jan
AU  - Meggendorfer, Tobias
AU  - Weininger, Maximilian
ID  - 12775
SN  - 1868-8969
T2  - 33rd International Conference on Concurrency Theory 
TI  - Anytime guarantees for reachability in uncountable Markov decision processes
VL  - 243
ER  - 
TY  - CONF
AB  - Graph games played by two players over finite-state graphs are central in many problems in computer science. In particular, graph games with ω-regular winning conditions, specified as parity objectives, which can express properties such as safety, liveness, fairness, are the basic framework for verification and synthesis of reactive systems. The decisions for a player at various states of the graph game are represented as strategies. While the algorithmic problem for solving graph games with parity objectives has been widely studied, the most prominent data-structure for strategy representation in graph games has been binary decision diagrams (BDDs). However, due to the bit-level representation, BDDs do not retain the inherent flavor of the decisions of strategies, and are notoriously hard to minimize to obtain succinct representation. In this work we propose decision trees for strategy representation in graph games. Decision trees retain the flavor of decisions of strategies and allow entropy-based minimization to obtain succinct trees. However, decision trees work in settings (e.g., probabilistic models) where errors are allowed, and overfitting of data is typically avoided. In contrast, for strategies in graph games no error is allowed, and the decision tree must represent the entire strategy. We develop new techniques to extend decision trees to overcome the above obstacles, while retaining the entropy-based techniques to obtain succinct trees. We have implemented our techniques to extend the existing decision tree solvers. We present experimental results for problems in reactive synthesis to show that decision trees provide a much more efficient data-structure for strategy representation as compared to BDDs.
AU  - Brázdil, Tomáš
AU  - Chatterjee, Krishnendu
AU  - Kretinsky, Jan
AU  - Toman, Viktor
ID  - 297
TI  - Strategy representation by decision trees in reactive synthesis
VL  - 10805
ER  - 
TY  - JOUR
AB  - We present a new algorithm for the statistical model checking of Markov chains with respect to unbounded temporal properties, including full linear temporal logic. The main idea is that we monitor each simulation run on the fly, in order to detect quickly if a bottom strongly connected component is entered with high probability, in which case the simulation run can be terminated early. As a result, our simulation runs are often much shorter than required by termination bounds that are computed a priori for a desired level of confidence on a large state space. In comparison to previous algorithms for statistical model checking our method is not only faster in many cases but also requires less information about the system, namely, only the minimum transition probability that occurs in the Markov chain. In addition, our method can be generalised to unbounded quantitative properties such as mean-payoff bounds. 
AU  - Daca, Przemyslaw
AU  - Henzinger, Thomas A
AU  - Kretinsky, Jan
AU  - Petrov, Tatjana
ID  - 471
IS  - 2
JF  - ACM Transactions on Computational Logic (TOCL)
SN  - 15293785
TI  - Faster statistical model checking for unbounded temporal properties
VL  - 18
ER  - 
TY  - JOUR
AB  - We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. There exist two different views: (i) the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii) the satisfaction semantics, where the goal is to maximize the probability of runs such that the mean-payoff value stays above a given vector. We consider optimization with respect to both objectives at once, thus unifying the existing semantics. Precisely, the goal is to optimize the expectation while ensuring the satisfaction constraint. Our problem captures the notion of optimization with respect to strategies that are risk-averse (i.e., ensure certain probabilistic guarantee). Our main results are as follows: First, we present algorithms for the decision problems which are always polynomial in the size of the MDP. We also show that an approximation of the Pareto-curve can be computed in time polynomial in the size of the MDP, and the approximation factor, but exponential in the number of dimensions. Second, we present a complete characterization of the strategy complexity (in terms of memory bounds and randomization) required to solve our problem. 
AU  - Chatterjee, Krishnendu
AU  - Křetínská, Zuzana
AU  - Kretinsky, Jan
ID  - 466
IS  - 2
JF  - Logical Methods in Computer Science
SN  - 18605974
TI  - Unifying two views on multiple mean-payoff objectives in Markov decision processes
VL  - 13
ER  - 
TY  - CONF
AB  - Markov decision processes (MDPs) are standard models for probabilistic systems with non-deterministic behaviours. Long-run average rewards provide a mathematically elegant formalism for expressing long term performance. Value iteration (VI) is one of the simplest and most efficient algorithmic approaches to MDPs with other properties, such as reachability objectives. Unfortunately, a naive extension of VI does not work for MDPs with long-run average rewards, as there is no known stopping criterion. In this work our contributions are threefold. (1) We refute a conjecture related to stopping criteria for MDPs with long-run average rewards. (2) We present two practical algorithms for MDPs with long-run average rewards based on VI. First, we show that a combination of applying VI locally for each maximal end-component (MEC) and VI for reachability objectives can provide approximation guarantees. Second, extending the above approach with a simulation-guided on-demand variant of VI, we present an anytime algorithm that is able to deal with very large models. (3) Finally, we present experimental results showing that our methods significantly outperform the standard approaches on several benchmarks.
AU  - Ashok, Pranav
AU  - Chatterjee, Krishnendu
AU  - Daca, Przemyslaw
AU  - Kretinsky, Jan
AU  - Meggendorfer, Tobias
ED  - Majumdar, Rupak
ED  - Kunčak, Viktor
ID  - 645
SN  - 978-331963386-2
TI  - Value iteration for long run average reward in Markov decision processes
VL  - 10426
ER  - 
TY  - CONF
AB  - Transforming deterministic ω-automata into deterministic parity automata is traditionally done using variants of appearance records. We present a more efficient variant of this approach, tailored to Rabin automata, and several optimizations applicable to all appearance records. We compare the methods experimentally and find out that our method produces smaller automata than previous approaches. Moreover, the experiments demonstrate the potential of our method for LTL synthesis, using LTL-to-Rabin translators. It leads to significantly smaller parity automata when compared to state-of-the-art approaches on complex formulae.
AU  - Kretinsky, Jan
AU  - Meggendorfer, Tobias
AU  - Waldmann, Clara
AU  - Weininger, Maximilian
ID  - 13160
SN  - 0302-9743
T2  - Tools and Algorithms for the Construction and Analysis of Systems
TI  - Index appearance record for transforming Rabin automata into parity automata
VL  - 10205
ER  - 
TY  - JOUR
AB  - We consider the problem of computing the set of initial states of a dynamical system such that there exists a control strategy to ensure that the trajectories satisfy a temporal logic specification with probability 1 (almost-surely). We focus on discrete-time, stochastic linear dynamics and specifications given as formulas of the Generalized Reactivity(1) fragment of Linear Temporal Logic over linear predicates in the states of the system. We propose a solution based on iterative abstraction-refinement, and turn-based 2-player probabilistic games. While the theoretical guarantee of our algorithm after any finite number of iterations is only a partial solution, we show that if our algorithm terminates, then the result is the set of all satisfying initial states. Moreover, for any (partial) solution our algorithm synthesizes witness control strategies to ensure almost-sure satisfaction of the temporal logic specification. While the proposed algorithm guarantees progress and soundness in every iteration, it is computationally demanding. We offer an alternative, more efficient solution for the reachability properties that decomposes the problem into a series of smaller problems of the same type. All algorithms are demonstrated on an illustrative case study.
AU  - Svoreňová, Mária
AU  - Kretinsky, Jan
AU  - Chmelik, Martin
AU  - Chatterjee, Krishnendu
AU  - Černá, Ivana
AU  - Belta, Călin
ID  - 1407
IS  - 2
JF  - Nonlinear Analysis: Hybrid Systems
TI  - Temporal logic control for stochastic linear systems using abstraction refinement of probabilistic games
VL  - 23
ER  - 
TY  - CONF
AB  - We introduce a general class of distances (metrics) between Markov chains, which are based on linear behaviour. This class encompasses distances given topologically (such as the total variation distance or trace distance) as well as by temporal logics or automata. We investigate which of the distances can be approximated by observing the systems, i.e. by black-box testing or simulation, and we provide both negative and positive results. 
AU  - Daca, Przemyslaw
AU  - Henzinger, Thomas A
AU  - Kretinsky, Jan
AU  - Petrov, Tatjana
ID  - 1093
TI  - Linear distances between Markov chains
VL  - 59
ER  - 
TY  - CONF
AB  - We present a new algorithm for the statistical model checking of Markov chains with respect to unbounded temporal properties, including full linear temporal logic. The main idea is that we monitor each simulation run on the fly, in order to detect quickly if a bottom strongly connected component is entered with high probability, in which case the simulation run can be terminated early. As a result, our simulation runs are often much shorter than required by termination bounds that are computed a priori for a desired level of confidence on a large state space. In comparison to previous algorithms for statistical model checking our method is not only faster in many cases but also requires less information about the system, namely, only the minimum transition probability that occurs in the Markov chain. In addition, our method can be generalised to unbounded quantitative properties such as mean-payoff bounds.
AU  - Daca, Przemyslaw
AU  - Henzinger, Thomas A
AU  - Kretinsky, Jan
AU  - Petrov, Tatjana
ID  - 1234
TI  - Faster statistical model checking for unbounded temporal properties
VL  - 9636
ER  - 
TY  - CONF
AB  - We consider weighted automata with both positive and negative integer weights on edges and study the problem of synchronization using adaptive strategies that may only observe whether the current weight-level is negative or nonnegative. We show that the synchronization problem is decidable in polynomial time for deterministic weighted automata.
AU  - Kretinsky, Jan
AU  - Larsen, Kim
AU  - Laursen, Simon
AU  - Srba, Jiří
ID  - 1499
TI  - Polynomial time decidability of weighted synchronization under partial observability
VL  - 42
ER  - 
TY  - CONF
AB  - Quantitative extensions of temporal logics have recently attracted significant attention. In this work, we study frequency LTL (fLTL), an extension of LTL which allows one to speak about frequencies of events along an execution. Such an extension is particularly useful for probabilistic systems that often cannot fulfil strict qualitative guarantees on the behaviour. It has been recently shown that controller synthesis for Markov decision processes and fLTL is decidable when all the bounds on frequencies are 1. As a step towards a complete quantitative solution, we show that the problem is decidable for the fragment fLTL\GU, where U does not occur in the scope of G (but still F can). Our solution is based on a novel translation of such quantitative formulae into equivalent deterministic automata.
AU  - Forejt, Vojtěch
AU  - Krčál, Jan
AU  - Kretinsky, Jan
ID  - 1594
TI  - Controller synthesis for MDPs and frequency LTL\GU
VL  - 9450
ER  - 
TY  - CONF
AB  - We propose a flexible exchange format for ω-automata, as typically used in formal verification, and implement support for it in a range of established tools. Our aim is to simplify the interaction of tools, helping the research community to build upon other people’s work. A key feature of the format is the use of very generic acceptance conditions, specified by Boolean combinations of acceptance primitives, rather than being limited to common cases such as Büchi, Streett, or Rabin. Such flexibility in the choice of acceptance conditions can be exploited in applications, for example in probabilistic model checking, and furthermore encourages the development of acceptance-agnostic tools for automata manipulations. The format allows acceptance conditions that are either state-based or transition-based, and also supports alternating automata.
AU  - Babiak, Tomáš
AU  - Blahoudek, František
AU  - Duret-Lutz, Alexandre
AU  - Klein, Joachim
AU  - Kretinsky, Jan
AU  - Mueller, Daniel
AU  - Parker, David
AU  - Strejček, Jan
ID  - 1601
TI  - The Hanoi omega-automata format
VL  - 9206
ER  - 
TY  - JOUR
AB  - Modal transition systems (MTS) is a well-studied specification formalism of reactive systems supporting a step-wise refinement methodology. Despite its many advantages, the formalism as well as its currently known extensions are incapable of expressing some practically needed aspects in the refinement process like exclusive, conditional and persistent choices. We introduce a new model called parametric modal transition systems (PMTS) together with a general modal refinement notion that overcomes many of the limitations. We investigate the computational complexity of modal and thorough refinement checking on PMTS and its subclasses and provide a direct encoding of the modal refinement problem into quantified Boolean formulae, allowing us to employ state-of-the-art QBF solvers for modal refinement checking. The experiments we report on show that the feasibility of refinement checking is more influenced by the degree of nondeterminism rather than by the syntactic restrictions on the types of formulae allowed in the description of the PMTS.
AU  - Beneš, Nikola
AU  - Kretinsky, Jan
AU  - Larsen, Kim
AU  - Möller, Mikael
AU  - Sickert, Salomon
AU  - Srba, Jiří
ID  - 1846
IS  - 2-3
JF  - Acta Informatica
TI  - Refinement checking on parametric modal transition systems
VL  - 52
ER  - 
TY  - CONF
AB  - We provide a framework for compositional and iterative design and verification of systems with quantitative information, such as rewards, time or energy. It is based on disjunctive modal transition systems where we allow actions to bear various types of quantitative information. Throughout the design process the actions can be further refined and the information made more precise. We show how to compute the results of standard operations on the systems, including the quotient (residual), which has not been previously considered for quantitative non-deterministic systems. Our quantitative framework has close connections to the modal nu-calculus and is compositional with respect to general notions of distances between systems and the standard operations.
AU  - Fahrenberg, Uli
AU  - Kretinsky, Jan
AU  - Legay, Axel
AU  - Traonouez, Louis
ID  - 1882
TI  - Compositionality for quantitative specifications
VL  - 8997
ER  - 
TY  - CONF
AB  - We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. There exist two different views: (i) the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii) the satisfaction semantics, where the goal is to maximize the probability of runs such that the mean-payoff value stays above a given vector. We consider optimization with respect to both objectives at once, thus unifying the existing semantics. Precisely, the goal is to optimize the expectation while ensuring the satisfaction constraint. Our problem captures the notion of optimization with respect to strategies that are risk-averse (i.e., ensure certain probabilistic guarantee). Our main results are as follows: First, we present algorithms for the decision problems, which are always polynomial in the size of the MDP. We also show that an approximation of the Pareto curve can be computed in time polynomial in the size of the MDP, and the approximation factor, but exponential in the number of dimensions. Second, we present a complete characterization of the strategy complexity (in terms of memory bounds and randomization) required to solve our problem.
AU  - Chatterjee, Krishnendu
AU  - Komárková, Zuzana
AU  - Kretinsky, Jan
ID  - 1657
TI  - Unifying two views on multiple mean-payoff objectives in Markov decision processes
ER  - 
TY  - GEN
AB  - We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. 
There have been two different views: (i) the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii) the satisfaction semantics, where the goal is to maximize the probability of runs such that the mean-payoff value stays above a given vector.  
We consider the problem where the goal is to optimize the expectation under the constraint that the satisfaction semantics is ensured, and thus consider a generalization that unifies the existing semantics.
Our problem captures the notion of optimization with respect to strategies that are risk-averse (i.e., ensure certain probabilistic guarantee).
Our main results are algorithms for the decision problem which are always polynomial in the size of the MDP. We also show that an approximation of the Pareto-curve can be computed in time polynomial in the size of the MDP, and the approximation factor, but exponential in the number of dimensions.
Finally, we present a complete characterization of the strategy complexity (in terms of memory bounds and randomization) required to solve our problem.
AU  - Chatterjee, Krishnendu
AU  - Komarkova, Zuzana
AU  - Kretinsky, Jan
ID  - 5429
SN  - 2664-1690
TI  - Unifying two views on multiple mean-payoff objectives in Markov decision processes
ER  - 
TY  - GEN
AB  - We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. 
There have been two different views: (i) the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii) the satisfaction semantics, where the goal is to maximize the probability of runs such that the mean-payoff value stays above a given vector.  
We consider the problem where the goal is to optimize the expectation under the constraint that the satisfaction semantics is ensured, and thus consider a generalization that unifies the existing semantics. Our problem captures the notion of optimization with respect to strategies that are risk-averse (i.e., ensure certain probabilistic guarantee).
Our main results are algorithms for the decision problem which are always polynomial in the size of the MDP.
We also show that an approximation of the Pareto-curve can be computed in time polynomial in the size of the MDP, and the approximation factor, but exponential in the number of dimensions. Finally, we present a complete characterization of the strategy complexity (in terms of memory bounds and randomization) required to solve our problem.
AU  - Chatterjee, Krishnendu
AU  - Komarkova, Zuzana
AU  - Kretinsky, Jan
ID  - 5435
SN  - 2664-1690
TI  - Unifying two views on multiple mean-payoff objectives in Markov decision processes
ER  -