Knowledge geometry

May 23, 2012 — October 20, 2021

collective knowledge
how do science
stringology
topology

See also:

Related question: What is the shape of the vocabulary of communicating people? When do we denote new things?

1 What is the shape of collected human knowledge?

Figure 1: Visualising the Neurips paper series by Hendrik Strobelt and Benjamin Hoover Lee Campbelland Marc’Aurelio Ranzato. See the blogpost for more info)

This is vague.

I do not know how to make this idea precise yet, but I would like to spitball a few ideas about how branches of knowledge might be adjacent to each other or not, in various senses.

For example, what metric space provides a natural embedding for the articles of an encyclopedia such that things that articles that are near in (the metric of) that space possess similar subject classifications? I have seen arguments that hyperbolic and other non-euclidean geometries arise here. But then, because our monkey minds work this way, we seem to project knowledge onto 2d euclidean maps, and it seems to be at least interesting:

Figure 2

What kind of knowledge relationship mechanisms are plausible? Could you mine patent networks or theorem networks to parameterise a stochastic process for this model which made it a plausible model for theorem growth? If not, what quality does knowledge posses which this could not encapsulate? If we have a good metric on knowledge that we can learn from (e.g.) the text of papers, can we use it to discover similar research? Or to identify lacunae in knowledge that we can fill?

Figure 3: Franka Miriam Brückler’s famous map of Middle Math

Or should we think about the mechanism of knowledge generation? Can we represent knowledge as a network (or a landscape?) that accretes around agent activity? Some kind of growth process around researchers? (keywords: “models of growth aggregation”, “rough interfaces”, “growth with surface diffusion”, “nucleation”, “morphogenesis”) Is this a constrained growth problem, like the one that governs coral drills?

Investigate configuration spaces of technologies. (See configuration space of the economy.) Genotype-phenotype interactions as a model of knowledge-economic systems? What is the most basic stochastic process that would serve as a model of these?

(Practical aside: How much area must a new thesis carve out from the unmade world?)

Now, going out on a limb, consider a problem domain that looks evolutionary if you squint at it: creating mathematical theorems. Certainly Gödel and Turing invite looking at the things as symbol strings. I saw a presentation (Leibon and Rockmore 2013) suggesting that there was a natural embedding of mathematical field onto hyperbolic geometry. Sure, his data set was Wikipedia mathematical article links, and the whole idea was tongue-in-cheek. But it feels like there is something in there, if not a whole-cloth topological theory of human knowledge. Is there some process driving mathematical innovation that means that the links between fields sit so naturally in hyperbolic space? Is it some characteristic of the subject matter itself? If either of these are true, would they be true of other fields? Science in general? Philosophy? Engineering? Design? Biological fitnesses?

4 References

Andjelković, Tadić, Mitrović Dankulov, et al. 2016. Topology of Innovation Spaces in the Knowledge Networks Emerging Through Questions-And-Answers.” PLoS ONE.
Bloom, Jones, Van Reenen, et al. 2020. Are Ideas Getting Harder to Find? American Economic Review.
Iacopini, Milojević, and Latora. 2018. Network Dynamics of Innovation Processes.” Physical Review Letters.
König, Battiston, Napoletano, et al. 2011. Recombinant Knowledge and the Evolution of Innovation Networks.” Journal of Economic Behavior & Organization.
Leibon, and Rockmore. 2013. Orienteering in Knowledge Spaces: The Hyperbolic Geometry of Wikipedia Mathematics.” PLoS ONE.
Loreto, Servedio, Strogatz, et al. 2016. Dynamics on Expanding Spaces: Modeling the Emergence of Novelties.” In Creativity and Universality in Language. Lecture Notes in Morphogenesis.
Napolitano, Evangelou, Pugliese, et al. n.d. Technology Networks: The Autocatalytic Origins of Innovation.” Royal Society Open Science.
Tadić, Dankulov, and Melnik. 2017. Mechanisms of self-organized criticality in social processes of knowledge creation.” Physical Review. E.
Thorngate, Liu, and Chowdhury. 2011. “The Competition for Attention and the Evolution of Science.” Journal of Artificial Societies and Social Simulation.