Policy optimization using a lexicographic preference ordering / P. Stork and J.-M. Viaene
