Agostinelli, Forest, Matthew Hoffman, Peter Sadowski, and Pierre Baldi. 2015.
βLearning Activation Functions to Improve Deep Neural Networks.β In
Proceedings of International Conference on Learning Representations (ICLR) 2015.
Anil, Cem, James Lucas, and Roger Grosse. 2018.
βSorting Out Lipschitz Function Approximation,β November.
Arjovsky, Martin, Amar Shah, and Yoshua Bengio. 2016.
βUnitary Evolution Recurrent Neural Networks.β In
Proceedings of the 33rd International Conference on International Conference on Machine Learning - Volume 48, 1120β28. ICMLβ16. New York, NY, USA: JMLR.org.
Balduzzi, David, Marcus Frean, Lennox Leary, J. P. Lewis, Kurt Wan-Duo Ma, and Brian McWilliams. 2017.
βThe Shattered Gradients Problem: If Resnets Are the Answer, Then What Is the Question?β In
PMLR, 342β50.
Cho, Youngmin, and Lawrence K. Saul. 2009.
βKernel Methods for Deep Learning.β In
Proceedings of the 22nd International Conference on Neural Information Processing Systems, 22:342β50. NIPSβ09. Red Hook, NY, USA: Curran Associates Inc.
Clevert, Djork-ArnΓ©, Thomas Unterthiner, and Sepp Hochreiter. 2016.
βFast and Accurate Deep Network Learning by Exponential Linear Units (ELUs).β In
Proceedings of ICLR.
Duch, WΕodzisΕaw, and Norbert Jankowski. 1999.
βSurvey of Neural Transfer Functions.βGlorot, Xavier, Antoine Bordes, and Yoshua Bengio. 2011.
βDeep Sparse Rectifier Neural Networks.β In
Aistats, 15:275.
Goodfellow, Ian J., David Warde-Farley, Mehdi Mirza, Aaron Courville, and Yoshua Bengio. 2013.
βMaxout Networks.β In
ICML (3), 28:1319β27.
Hayou, Soufiane, Arnaud Doucet, and Judith Rousseau. 2019.
βOn the Impact of the Activation Function on Deep Neural Networks Training.β In
Proceedings of the 36th International Conference on Machine Learning, 2672β80. PMLR.
He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015a.
βDelving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification.β arXiv:1502.01852 [Cs], February.
βββ. 2016.
βIdentity Mappings in Deep Residual Networks.β In
arXiv:1603.05027 [Cs].
Hochreiter, Sepp. 1998.
βThe Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions.β International Journal of Uncertainty Fuzziness and Knowledge Based Systems 6: 107β15.
Hochreiter, Sepp, Yoshua Bengio, Paolo Frasconi, and JΓΌrgen Schmidhuber. 2001.
βGradient Flow in Recurrent Nets: The Difficulty of Learning Long-Term Dependencies.β In
A Field Guide to Dynamical Recurrent Neural Networks. IEEE Press.
Klambauer, GΓΌnter, Thomas Unterthiner, Andreas Mayr, and Sepp Hochreiter. 2017.
βSelf-Normalizing Neural Networks.β In
Proceedings of the 31st International Conference on Neural Information Processing Systems, 972β81. Red Hook, NY, USA: Curran Associates Inc.
Laurent, Thomas. n.d. βThe Multilinear Structure of ReLU Networks,β 9.
Lee, Jaehoon, Yasaman Bahri, Roman Novak, Samuel S. Schoenholz, Jeffrey Pennington, and Jascha Sohl-Dickstein. 2018.
βDeep Neural Networks as Gaussian Processes.β In
ICLR.
Maas, Andrew L., Awni Y. Hannun, and Andrew Y. Ng. 2013.
βRectifier Nonlinearities Improve Neural Network Acoustic Models.β In
Proceedings of ICML. Vol. 30.
Pascanu, Razvan, Tomas Mikolov, and Yoshua Bengio. 2013.
βOn the Difficulty of Training Recurrent Neural Networks.β In
arXiv:1211.5063 [Cs], 1310β18.
Rahaman, Nasim, Aristide Baratin, Devansh Arpit, Felix Draxler, Min Lin, Fred A. Hamprecht, Yoshua Bengio, and Aaron Courville. 2019.
βOn the Spectral Bias of Neural Networks.β arXiv:1806.08734 [Cs, Stat], May.
Ramachandran, Prajit, Barret Zoph, and Quoc V. Le. 2017.
βSearching for Activation Functions.β arXiv:1710.05941 [Cs], October.
Sitzmann, Vincent, Julien N. P. Martel, Alexander W. Bergman, David B. Lindell, and Gordon Wetzstein. 2020.
βImplicit Neural Representations with Periodic Activation Functions.β arXiv:2006.09661 [Cs, Eess], June.
Srivastava, Rupesh Kumar, Klaus Greff, and JΓΌrgen Schmidhuber. 2015.
βHighway Networks.β In
arXiv:1505.00387 [Cs].
Unser, Michael. 2019.
βA Representer Theorem for Deep Neural Networks.β Journal of Machine Learning Research 20 (110): 30.
Wisdom, Scott, Thomas Powers, John Hershey, Jonathan Le Roux, and Les Atlas. 2016.
βFull-Capacity Unitary Recurrent Neural Networks.β In
Advances in Neural Information Processing Systems, 4880β88.
Yang, Greg, and Hadi Salman. 2020.
βA Fine-Grained Spectral Perspective on Neural Networks.β arXiv:1907.10599 [Cs, Stat], April.
No comments yet. Why not leave one?