Agarwal, Arvind, and Hal Daumé Iii. 2011. “Generative Kernels for Exponential Families.”
In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics
Alexanderian, Alen. 2015. “A Brief Note on the Karhunen-Loève Expansion.” arXiv:1509.07526 [Math]
Álvarez, Mauricio A., Lorenzo Rosasco, and Neil D. Lawrence. 2012. “Kernels for Vector-Valued Functions: A Review.” Foundations and Trends® in Machine Learning
4 (3): 195–266.
Aronszajn, N. 1950. “Theory of Reproducing Kernels.” Transactions of the American Mathematical Society
68 (3): 337–404.
Bach, Francis. 2008. “Exploring Large Feature Spaces with Hierarchical Multiple Kernel Learning.”
In Proceedings of the 21st International Conference on Neural Information Processing Systems
, 105–12. NIPS’08. USA: Curran Associates Inc.
Bacry, Emmanuel, and Jean-François Muzy. 2016. “First- and Second-Order Statistics Characterization of Hawkes Processes and Non-Parametric Estimation.” IEEE Transactions on Information Theory
62 (4): 2184–2202.
Balog, Matej, Balaji Lakshminarayanan, Zoubin Ghahramani, Daniel M. Roy, and Yee Whye Teh. 2016. “The Mondrian Kernel.” arXiv:1606.05241 [Stat]
Bochner, Salomon. 1959. Lectures on Fourier Integrals. Princeton University Press.
Bohn, Bastian, Michael Griebel, and Christian Rieger. 2018. “A Representer Theorem for Deep Kernel Learning.” arXiv:1709.10441 [Cs, Math]
Bonilla, Edwin V., Karl Krauth, and Amir Dezfouli. 2019. “Generic Inference in Latent Gaussian Process Models.” Journal of Machine Learning Research
20 (117): 1–63.
Christoudias, Mario, Raquel Urtasun, and Trevor Darrell. 2009. “Bayesian Localized Multiple Kernel Learning.”
UCB/EECS-2009-96. EECS Department, University of California, Berkeley.
Cortes, Corinna, Patrick Haffner, and Mehryar Mohri. 2004. “Rational Kernels: Theory and Algorithms.” Journal of Machine Learning Research
5 (December): 1035–62.
Cressie, Noel, and Hsin-Cheng Huang. 1999. “Classes of Nonseparable, Spatio-Temporal Stationary Covariance Functions.” Journal of the American Statistical Association
94 (448): 1330–39.
Delft, Anne van, and Michael Eichler. 2016. “Locally Stationary Functional Time Series.” arXiv:1602.05125 [Math, Stat]
Duvenaud, David. 2014. “Automatic Model Construction with Gaussian Processes.”
PhD Thesis, University of Cambridge.
Duvenaud, David K., Hannes Nickisch, and Carl E. Rasmussen. 2011. “Additive Gaussian Processes.”
In Advances in Neural Information Processing Systems
Duvenaud, David, James Lloyd, Roger Grosse, Joshua Tenenbaum, and Ghahramani Zoubin. 2013. “Structure Discovery in Nonparametric Regression Through Compositional Kernel Search.”
In Proceedings of the 30th International Conference on Machine Learning (ICML-13)
Gaspari, Gregory, and Stephen E. Cohn. 1999. “Construction of Correlation Functions in Two and Three Dimensions.” Quarterly Journal of the Royal Meteorological Society
125 (554): 723–57.
Genton, Marc G. 2001. “Classes of Kernels for Machine Learning: A Statistics Perspective.” Journal of Machine Learning Research
2 (December): 299–312.
Genton, Marc G., and Olivier Perrin. 2004. “On a Time Deformation Reducing Nonstationary Stochastic Processes to Local Stationarity.” Journal of Applied Probability
41 (1): 236–49.
Girolami, Mark, and Simon Rogers. 2005. “Hierarchic Bayesian Models for Kernel Learning.”
In Proceedings of the 22nd International Conference on Machine Learning - ICML ’05
, 241–48. Bonn, Germany: ACM Press.
Gneiting, Tilmann. 2002a. “Nonseparable, Stationary Covariance Functions for Space–Time Data.” Journal of the American Statistical Association
97 (458): 590–600.
———. 2002b. “Compactly Supported Correlation Functions.” Journal of Multivariate Analysis
83 (2): 493–508.
Gneiting, Tilmann, William Kleiber, and Martin Schlather. 2010. “Matérn Cross-Covariance Functions for Multivariate Random Fields.” Journal of the American Statistical Association
105 (491): 1167–77.
Grosse, Roger, Ruslan R. Salakhutdinov, William T. Freeman, and Joshua B. Tenenbaum. 2012. “Exploiting Compositionality to Explore a Large Space of Model Structures.”
In Proceedings of the Conference on Uncertainty in Artificial Intelligence
Hartikainen, J., and S. Särkkä. 2010. “Kalman Filtering and Smoothing Solutions to Temporal Gaussian Process Regression Models.”
In 2010 IEEE International Workshop on Machine Learning for Signal Processing
, 379–84. Kittila, Finland: IEEE.
Hinton, Geoffrey E, and Ruslan R Salakhutdinov. 2008. “Using Deep Belief Nets to Learn Covariance Kernels for Gaussian Processes.”
In Advances in Neural Information Processing Systems 20
, edited by J. C. Platt, D. Koller, Y. Singer, and S. T. Roweis, 1249–56. Curran Associates, Inc.
Hofmann, Thomas, Bernhard Schölkopf, and Alexander J. Smola. 2008. “Kernel Methods in Machine Learning.” The Annals of Statistics
36 (3): 1171–1220.
Kom Samo, Yves-Laurent, and Stephen Roberts. 2015. “Generalized Spectral Kernels.” arXiv:1506.02236 [Stat]
Kondor, Risi, and Tony Jebara. 2006. “Gaussian and Wishart Hyperkernels.”
In Proceedings of the 19th International Conference on Neural Information Processing Systems
, 729–36. NIPS’06. Cambridge, MA, USA: MIT Press.
Krauth, Karl, Edwin V. Bonilla, Kurt Cutajar, and Maurizio Filippone. 2016. “AutoGP: Exploring the Capabilities and Limitations of Gaussian Process Models.”
Lloyd, James Robert, David Duvenaud, Roger Grosse, Joshua Tenenbaum, and Zoubin Ghahramani. 2014. “Automatic Construction and Natural-Language Description of Nonparametric Regression Models.”
In Twenty-Eighth AAAI Conference on Artificial Intelligence
Mercer, J. 1909. “Functions of Positive and Negative Type, and Their Connection with the Theory of Integral Equations.” Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character
209 (441-458): 415–46.
Micchelli, Charles A., and Massimiliano Pontil. 2005a. “Learning the Kernel Function via Regularization.” Journal of Machine Learning Research
6 (Jul): 1099–1125.
———. 2005b. “On Learning Vector-Valued Functions.” Neural Computation
17 (1): 177–204.
Minasny, Budiman, and Alex. B. McBratney. 2005. “The Matérn Function as a General Model for Soil Variograms.” Geoderma
, Pedometrics 2003, 128 (3–4): 192–207.
Murphy, Kevin P. 2012. Machine learning: a probabilistic perspective. 1 edition. Adaptive computation and machine learning series. Cambridge, MA: MIT Press.
Murray-Smith, Roderick, and Barak A. Pearlmutter. 2005. “Transformations of Gaussian Process Priors.”
In Deterministic and Statistical Methods in Machine Learning
, edited by Joab Winkler, Mahesan Niranjan, and Neil Lawrence, 110–23. Lecture Notes in Computer Science. Springer Berlin Heidelberg.
O’Callaghan, Simon Timothy, and Fabio T. Ramos. 2011. “Continuous Occupancy Mapping with Integral Kernels.”
In Twenty-Fifth AAAI Conference on Artificial Intelligence
Ong, Cheng Soon, Xavier Mary, Stéphane Canu, and Alexander J. Smola. 2004. “Learning with Non-Positive Kernels.”
In Twenty-First International Conference on Machine Learning - ICML ’04
, 81. Banff, Alberta, Canada: ACM Press.
Ong, Cheng Soon, and Alexander J. Smola. 2003. “Machine Learning Using Hyperkernels.”
In Proceedings of the Twentieth International Conference on International Conference on Machine Learning
, 568–75. ICML’03. Washington, DC, USA: AAAI Press.
Ong, Cheng Soon, Alexander J. Smola, and Robert C. Williamson. 2002. “Hyperkernels.”
In Proceedings of the 15th International Conference on Neural Information Processing Systems
, 495–502. NIPS’02. Cambridge, MA, USA: MIT Press.
———. 2005. “Learning the Kernel with Hyperkernels.” Journal of Machine Learning Research
6 (Jul): 1043–71.
Pérez-Abreu, Víctor, and Alfonso Rocha-Arteaga. 2005. “Covariance-Parameter Lévy Processes in the Space of Trace-Class Operators.” Infinite Dimensional Analysis, Quantum Probability and Related Topics
08 (01): 33–54.
Perrin, Olivier, and Rachid Senoussi. 1999. “Reducing Non-Stationary Stochastic Processes to Stationarity by a Time Deformation.” Statistics & Probability Letters
43 (4): 393–97.
Pfaffel, Oliver. 2012. “Wishart Processes.” arXiv:1201.3256 [Math]
Rakotomamonjy, Alain, Francis R. Bach, Stéphane Canu, and Yves Grandvalet. 2008. “SimpleMKL.” Journal of Machine Learning Research
9 (Nov): 2491–521.
Rasmussen, Carl Edward, and Christopher K. I. Williams. 2006. Gaussian Processes for Machine Learning
. Adaptive Computation and Machine Learning. Cambridge, Mass: MIT Press.
Remes, Sami, Markus Heinonen, and Samuel Kaski. 2017. “Non-Stationary Spectral Kernels.”
In Advances in Neural Information Processing Systems 30
, edited by I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, 4642–51. Curran Associates, Inc.
———. 2018. “Neural Non-Stationary Spectral Kernel.” arXiv:1811.10978 [Cs, Stat]
Sampson, Paul D., and Peter Guttorp. 1992. “Nonparametric Estimation of Nonstationary Spatial Covariance Structure.” Journal of the American Statistical Association
87 (417): 108–19.
Schölkopf, Bernhard, Ralf Herbrich, and Alex J. Smola. 2001. “A Generalized Representer Theorem.”
In Computational Learning Theory
, edited by David Helmbold and Bob Williamson, 416–26. Lecture Notes in Computer Science. Springer Berlin Heidelberg.
Schölkopf, Bernhard, and Alexander J. Smola. 2002. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press.
———. 2003. “A Short Introduction to Learning with Kernels.”
In Advanced Lectures on Machine Learning
, edited by Shahar Mendelson and Alexander J. Smola, 41–64. Lecture Notes in Computer Science 2600. Springer Berlin Heidelberg.
Sinha, Aman, and John C Duchi. 2016. “Learning Kernels with Random Features.”
In Advances in Neural Information Processing Systems 29
, edited by D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett, 1298–1306. Curran Associates, Inc.
Smith, Michael Thomas, Mauricio A. Alvarez, and Neil D. Lawrence. 2018. “Gaussian Process Regression for Binned Data.” arXiv:1809.02010 [Cs, Stat]
Stein, Michael L. 2005. “Space-Time Covariance Functions.” Journal of the American Statistical Association
100 (469): 310–21.
Sun, Shengyang, Guodong Zhang, Chaoqi Wang, Wenyuan Zeng, Jiaman Li, and Roger Grosse. 2018. “Differentiable Compositional Kernel Learning for Gaussian Processes.” arXiv Preprint arXiv:1806.04326.
Székely, Gábor J., and Maria L. Rizzo. 2009. “Brownian Distance Covariance.” The Annals of Applied Statistics
3 (4): 1236–65.
Uziel, Guy. 2020. “Nonparametric Sequential Prediction While Deep Learning the Kernel.”
In International Conference on Artificial Intelligence and Statistics
, 111–21. PMLR.
Vedaldi, A., and A. Zisserman. 2012. “Efficient Additive Kernels via Explicit Feature Maps.” IEEE Transactions on Pattern Analysis and Machine Intelligence
34 (3): 480–92.
Vert, Jean-Philippe, Koji Tsuda, and Bernhard Schölkopf. 2004. “A Primer on Kernel Methods.”
In Kernel Methods in Computational Biology
. MIT Press.
Vishwanathan, S. V. N., Nicol N. Schraudolph, Risi Kondor, and Karsten M. Borgwardt. 2010. “Graph Kernels.” Journal of Machine Learning Research
11 (August): 1201–42.
Wilk, Mark van der, Andrew G. Wilson, and Carl E. Rasmussen. 2014. “Variational Inference for Latent Variable Modelling of Correlation Structure.” In NIPS 2014 Workshop on Advances in Variational Inference.
Wilson, Andrew Gordon, and Ryan Prescott Adams. 2013. “Gaussian Process Kernels for Pattern Discovery and Extrapolation.”
In International Conference on Machine Learning
Wilson, Andrew Gordon, Christoph Dann, Christopher G. Lucas, and Eric P. Xing. 2015. “The Human Kernel.” arXiv:1510.07389 [Cs, Stat]
Wilson, Andrew Gordon, and Zoubin Ghahramani. 2011. “Generalised Wishart Processes.”
In Proceedings of the Twenty-Seventh Conference on Uncertainty in Artificial Intelligence
, 736–44. UAI’11. Arlington, Virginia, United States: AUAI Press.
———. 2012. “Modelling Input Varying Correlations Between Multiple Responses.” In Machine Learning and Knowledge Discovery in Databases, edited by Peter A. Flach, Tijl De Bie, and Nello Cristianini, 858–61. Lecture Notes in Computer Science. Springer Berlin Heidelberg.
Wilson, Andrew Gordon, Zhiting Hu, Ruslan Salakhutdinov, and Eric P. Xing. 2016. “Deep Kernel Learning.”
In Artificial Intelligence and Statistics
, 370–78. PMLR.
Wu, Zongmin. 1995. “Compactly Supported Positive Definite Radial Functions.” Advances in Computational Mathematics
4 (1): 283–92.
Yaglom, A. M. 1961. “Second-Order Homogeneous Random Fields.” Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 2: Contributions to Probability Theory
, January, 593–622.
———. 1987. Correlation Theory of Stationary and Related Random Functions. Volume II: Supplementary Notes and References. Springer Series in Statistics. New York, NY: Springer Science & Business Media.
Yu, Yaoliang, Hao Cheng, Dale Schuurmans, and Csaba Szepesvári. 2013. “Characterizing the Representer Theorem.”
In Proceedings of the 30th International Conference on Machine Learning (ICML-13)
Zhang, Aonan, and John Paisley. 2019. “Random Function Priors for Correlation Modeling.”
In International Conference on Machine Learning