Banerjee, Arindam, Srujana Merugu, Inderjit S Dhillon, Joydeep Ghosh, and John Lafferty. 2005. “Clustering with Bregman Divergences.” Journal of Machine Learning Research 6 (10).
Bansal, Nikhil, and Anupam Gupta. 2019. “Potential-Function Proofs for First-Order Methods.”
Benamou, Jean-David, Guillaume Carlier, Marco Cuturi, Luca Nenna, and Gabriel Peyré. 2014. “Iterative Bregman Projections for Regularized Transportation Problems.” arXiv:1412.5154 [Math]
Collins, Michael, S. Dasgupta, and Robert E Schapire. 2001. “A Generalization of Principal Components Analysis to the Exponential Family.”
In Advances in Neural Information Processing Systems
. Vol. 14. MIT Press.
Flammarion, Nicolas, and Francis Bach. 2017. “Stochastic Composite Least-Squares Regression with Convergence Rate O(1/n).” arXiv:1702.06429 [Math, Stat]
Gneiting, Tilmann, and Adrian E Raftery. 2007. “Strictly Proper Scoring Rules, Prediction, and Estimation.” Journal of the American Statistical Association
102 (477): 359–78.
Goldstein, Tom, Stanley Osher, Tom Goldstein, and Stanley Osher. 2009. “The Split Bregman Method for L1-Regularized Problems.” SIAM Journal on Imaging Sciences
2 (2): 323.
Gopalan, Parikshit, Lunjia Hu, Michael P. Kim, Omer Reingold, and Udi Wieder. 2022. “Loss Minimization Through the Lens of Outcome Indistinguishability.”
Harremoës, Peter. 2015. “Proper Scoring and Sufficiency.” arXiv:1507.07089 [Math, Stat]
Li, Housen, Johannes Schwab, Stephan Antholzer, and Markus Haltmeier. 2020. “NETT: Solving Inverse Problems with Deep Neural Networks.” Inverse Problems
36 (6): 065005.
Nielsen, Frank. 2018. “An Elementary Introduction to Information Geometry.” arXiv:1808.08271 [Cs, Math, Stat]
Nock, Richard, Aditya Krishna Menon, and Cheng Soon Ong. 2016. “A Scaled Bregman Theorem with Applications.” arXiv:1607.00360 [Cs, Stat]
Reid, Mark D., and Robert C. Williamson. 2011. “Information, Divergence and Risk for Binary Experiments.” Journal of Machine Learning Research
12 (Mar): 731–817.
Singh, Ajit P., and Geoffrey J. Gordon. 2008. “A Unified View of Matrix Factorization Models.”
In Machine Learning and Knowledge Discovery in Databases
, 358–73. Springer.
Sra, Suvrit, and Inderjit S. Dhillon. 2006. “Generalized Nonnegative Matrix Approximations with Bregman Divergences.”
In Advances in Neural Information Processing Systems 18
, edited by Y. Weiss, B. Schölkopf, and J. C. Platt, 283–90. MIT Press.
Wibisono, Andre, Ashia C. Wilson, and Michael I. Jordan. 2016. “A Variational Perspective on Accelerated Methods in Optimization.” Proceedings of the National Academy of Sciences
113 (47): E7351–58.
Yin, W, S Osher, D Goldfarb, and J Darbon. 2008. “Bregman Iterative Algorithms for \(\ell_1\)-Minimization with Applications to Compressed Sensing.” SIAM Journal on Imaging Sciences
1 (1): 143–68.