YellowFin and the Art of Momentum Tuning

Jian Zhan, Ioannis Mitliagkas, Christopher Re

Stanford

Intro

Appendix

正好和上一次读的ArXiv paper: The Marginal Value of Adaptive Gradient Methods in Machine Learning 对应。都是Adam的应用有局限。

results matching ""

    No results matching ""