Badii, and Politi. 1999. Complexity: Hierarchical Structures and Scaling in Physics. Cambridge Nonlinear Science Series.
Bolhuis, Tattersall, Chomsky, et al. 2014.
“How Could Language Have Evolved?” PLoS Biol.
Casacuberta, and de la Higuera. 2000.
“Computational Complexity of Problems on Probabilistic Grammars and Transducers.” In
Grammatical Inference: Algorithms and Applications.
Charniak. 1996a. “Tree-Bank Grammars.” In In Proceedings of the Thirteenth National Conference on Artificial Intelligence.
———. 1996b. Statistical Language Learning.
Chomsky, N. 1956.
“Three Models for the Description of Language.” IRE Transactions on Information Theory.
Chomsky, Noam. 2002. Syntactic Structures.
Clark, Alexander, and Eyraud. 2005.
“Identification in the Limit of Substitutable Context-Free Languages.” In
Algorithmic Learning Theory. Lecture Notes in Computer Science.
de la Higuera. 2000.
“Current Trends in Grammatical Inference.” In
Advances in Pattern Recognition. Lecture Notes in Computer Science.
———. 2010. Grammatical Inference : Learning Automata and Grammars.
de la Higuera, Piat, and Tantini. 2006.
“Learning Stochastic Finite Automata for Musical Style Recognition.” In
Implementation and Application of Automata. Lecture Notes in Computer Science.
Duvenaud, Lloyd, Grosse, et al. 2013.
“Structure Discovery in Nonparametric Regression Through Compositional Kernel Search.” In
Proceedings of the 30th International Conference on Machine Learning (ICML-13).
Elman. 1990.
“Finding Structure in Time.” Cognitive Science.
Fee, Kozhevnikov, and Hahnloser. 2004.
“Neural Mechanisms of Vocal Sequence Generation in the Songbird.” Annals of the New York Academy of Sciences.
Gold. 1967.
“Language Identification in the Limit.” Information and Control.
Grosse, Salakhutdinov, Freeman, et al. 2012.
“Exploiting Compositionality to Explore a Large Space of Model Structures.” In
Proceedings of the Conference on Uncertainty in Artificial Intelligence.
Grünwald. 1996.
“A Minimum Description Length Approach to Grammar Inference.” In
Connectionist, Statistical, and Symbolic Approaches to Learning for Natural Language Processing. Lecture Notes in Computer Science.
Hannun, Pratap, Kahn, et al. 2020.
“Differentiable Weighted Finite-State Transducers.” arXiv:2010.01003 [Cs, Stat].
Heckel, Lajios, and Menge. 2004.
“Stochastic Graph Transformation Systems.” In
Graph Transformations. Lecture Notes in Computer Science.
Hopcroft, and Ullman. 1979. Introduction to Automata Theory, Languages and Computation.
Jackendoff. 2002. Foundations of Language: Brain, Meaning, Grammar, Evolution.
Johnson. 2004.
“Gold’s Theorem and Cognitive Science.” Philosophy of Science.
Jones, Waring, Westwood, et al. 1868.
The grammar of ornament.
Keller, and Lutz. 1997. “Evolving Stochastic Context-Free Grammars from Examples Using a Minimum Description Length Principle.” In 1997 Workshop on Automata Induction Grammatical Inference and Language Acquisition.
Kleinberg, and Mullainathan. 2024.
“Language Generation in the Limit.”
Kontorovich, Cortes, and Mohri. 2006.
“Learning Linearly Separable Languages.” In
Algorithmic Learning Theory. Lecture Notes in Computer Science 4264.
López, Sempere, and García. 2004.
“Inference of Reversible Tree Languages.” IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics.
Manning. 2002. “Probabilistic Syntax.” In Probabilistic Linguistics.
Manning, and Schütze. 1999. Foundations of Statistical Natural Language Processing.
Mohri, and Riley. 2016.
“A Disambiguation Algorithm for Weighted Automata.” Theoretical Computer Science.
Nevill-Manning, and Witten. 1997. “Identifying Hierarchical Structure in Sequences: A Linear-Time Algorithm.”
Nowak, Komarova, and Niyogi. 2001.
“Evolution of Universal Grammar.” Science.
Pinker, and Bloom. 1990. “Natural Language and Natural Selection.” Behavioral and Brain Sciences.
Shalizi, and Crutchfield. 2000. “Computational Mechanics: Pattern and Prediction, Structure and Simplicity.”
Smith. 1984.
“Plants, Fractals, and Formal Languages.” In
SIGGRAPH Comput. Graph.
Talton, Yang, Kumar, et al. 2012.
“Learning Design Patterns with Bayesian Grammar Induction.” In
Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology - UIST ’12.
Tong, Bickett, Christiansen, et al. 2007.
“Learning Grammatical Structure with Echo State Networks.” Neural Networks.
Valiant, L. G. 1984.
“A Theory of the Learnable.” Commun. ACM.
Valiant, Leslie G. 2009.
“Evolvability.” J. ACM.
Vidal, Thollard, de la Higuera, et al. 2005a.
“Probabilistic Finite-State Machines - Part I.” IEEE Transactions on Pattern Analysis and Machine Intelligence.
———, et al. 2005b.
“Probabilistic Finite-State Machines - Part II.” IEEE Transactions on Pattern Analysis and Machine Intelligence.
Vinyals, Kaiser, Koo, et al. 2015.
“Grammar as a Foreign Language.” In
Advances in Neural Information Processing Systems 28.