Implementation of Hindsight Differentiable Policy Optimization, as described in the paper "Deep Reinforcement Learning for Inventory Networks: Toward Reliable Policy Optimization". Go to main_run.ipynb ...