Training Larger Networks for Deep Reinforcement Learning: https://arxiv.org/pdf/2102.07920.pdf
Visualizing the Loss Landscape of Neural Nets https://arxiv.org/pdf/1712.09913.pdf
http://lukemetz.com/exploring-hyperparameter-meta-loss-landscapes-with-jax/
ANALYZING REINFORCEMENT LEARNING BENCHMARKS WITH RANDOM WEIGHT GUESSING https://arxiv.org/pdf/2004.07707.pdf
Exploring Model-based Planning with Policy Networks https://arxiv.org/pdf/1906.08649.pdf
Understanding the Impact of Entropy on Policy Optimization https://arxiv.org/pdf/1811.11214.pdf
Fantastic Generalization Measures and Where to Find Them: https://arxiv.org/pdf/1912.02178.pdf — worst case sharpness (similar to volume) best predictor is sharpness relative to parameter magnitude
Implementation matters in deep policy gradients https://arxiv.org/pdf/2005.12729.pdf
A closer look at deep policy gradients https://arxiv.org/abs/1811.02553
- https://www.reddit.com/r/MachineLearning/comments/9v0r0c/r_are_deep_policy_gradient_algorithms_truly/