Stocco, TeoAlahi, Alexandre2020-02-132020-02-132020-02-132019-09-04https://infoscience.epfl.ch/handle/20.500.14299/165526Learning online combinatorial stochastic policies with deep reinforcementtext::conference output::conference proceedings::conference paper