Validating and reproducing science
“Scientist, falsify thyself”. Peer review, academic incentives, credentials, evidence and funding
2020-05-16 — 2025-09-30
Wherein a catalogue of peer‑review mechanisms and experiments on reviewer‑assignment is presented, and reputation systems, open review overlays and algorithmic matching are considered.
Designing peer-review systems to validate scientific output validating scientific output.
Reputation systems, collective decision making, groupthink management, Bayesian elicitation, and other mechanism considerations for trustworthy science — our collective knowledge of reality.
- Related question: How do we discover research to peer-review?
1 Mathematical models of the reviewing process
E.g. (Cole1981Chance?);(Lindsey1988Assessing?);(Ragone2013Peer?);Nihar B. Shah et al. (2016);Whitehurst (1984).
Data from NeurIPS experiments might be useful: see, for example, Nihar B. Shah et al. (2016) or a blog post on the 2014 experiment (1, 2).
2 Rating
Su (2022),
3 Assignment for the peer-review process
There’s some fun mechanism design and algorithmic work involved in peer review; for example:
- Experiments with the ICML 2020 Peer-Review Process
- NeurIPS 2024 Experiment on Improving the Paper-Reviewer Assignment.
- Matthew Feeney, Markets in fact-checking
- Budish et al. (2009)
- Charlin, Zemel, and Boutilier (2011)
- Charlin and Zemel (2013)
- Flach et al. (2010)
- Gasparyan et al. (2015)
- Goldsmith and Sloan (2007)
- (Jan2018Recognition?)
- Jecmen et al. (2020)
- Jecmen et al. (2022)
- Littman (2021)
- Liu, Suel, and Memon (2014)
- Merrifield and Saari (2009)
- Mimno and McCallum (2007)
- Rodriguez and Bollen (2008)
- Nihar B. Shah (2022)
- (Solomon2007Role?)
- Stelmakh, Shah, and Singh (2021)
- Tang, Tang, and Tan (2010)
- Taylor (2008)
- Tran, Cabanac, and Hubert (2017)
- Vijaykumar (2020)
- Xiao, Dörfler, and van der Schaar (2014b)
- Y. Xu, Zhao, and Shi (2017)
- Y. E. Xu et al. (2024)
4 Incoming
Transparent Peer Review: A New Era for Scientific Publishing | The Scientist
Nihar B. Shah (2025) is the latest literature review, stitching together all the trendy, hip stuff
CS Paper Reviews — a tool to review papers and increase our odds of acceptance
Saloni Dattani, Real peer review has never been tried
Matt Clancy, What does peer review know?
Adam Mastroianni, The rise and fall of peer review
Science and the Dumpster Fire | Elements of Evolutionary Anthropology
F1000Research | Open Access Publishing Platform | Beyond a Research Journal
F1000Research is an Open Research publishing platform for scientists, scholars and clinicians offering rapid publication of articles and other research outputs without editorial bias. All articles benefit from transparent peer review and editorial guidance on making all source data openly available.
Reviewing is a Contract — Rieck on the social expectations around reviewing and chairing.
Jocelynn Pearl proposes some fun ideas — including blockchain-y ones — in Time for a Change: How Scientific Publishing is Changing For The Better.
The Black Spatula Project — Steve Newman
A 10 page paper caused a panic because of a math error. I was curious if Al would spot the error by just prompting: “carefully check the math in this paper” especially as the info is not in training data.
o1 gets it in a single shot. Should Al checks be standard in science?
Repository: nick-gibb/black-spatula-project: Verifying scientific papers using LLMs