Personal project
List of relevant papers
- Training Larger Networks for Deep Reinforcement Learning: https://arxiv.org/pdf/2102.07920.pdf
- Visualizing the Loss Landscape of Neural Nets https://arxiv.org/pdf/1712.09913.pdf
- http://lukemetz.com/exploring-hyperparameter-meta-loss-landscapes-with-jax/
- ANALYZING REINFORCEMENT LEARNING BENCHMARKS WITH RANDOM WEIGHT GUESSING https://arxiv.org/pdf/2004.07707.pdf
- Exploring Model-based Planning with Policy Networks https://arxiv.org/pdf/1906.08649.pdf
- Understanding the Impact of Entropy on Policy Optimization https://arxiv.org/pdf/1811.11214.pdf
- Fantastic Generalization Measures and Where to Find Them: https://arxiv.org/pdf/1912.02178.pdf — worst case sharpness (similar to volume) best predictor is sharpness relative to parameter magnitude
- Implementation matters in deep policy gradients https://arxiv.org/pdf/2005.12729.pdf
- A closer look at deep policy gradients https://arxiv.org/abs/1811.02553
- https://www.reddit.com/r/MachineLearning/comments/9v0r0c/r_are_deep_policy_gradient_algorithms_truly/