To some extent, the production of academic knowledge is a public goods problem. Since academic publishing is part of this production, it inherits the oddities of such problems.

Cameron Neylon runs a cottage industry producing pragmatic publishing critiques from an institutional economics perspective:

e.g. The Marginal Costs of Article Publishing or A Journal is a Club:

We’d been talking about communities, cultures, economics, “public-making,” but it was the word ‘club’ and its associated concepts, both pejorative and positive, that crystallized everything. We were talking about the clubbishness of making knowledge — the term “Knowledge Clubs” emerged quickly — but also the benefits that such a club might gain in choosing to invest in wider sharing.

Working paper: Potts et al. (2016). Alternatively, see Afonso (2014), “How Academia resembles a drug gang”.

How to Get Something Out of Neoliberal Critique Without (Immediately) Overthrowing the Capitalist System:

In the business setting, this often leads incumbent publishers to a kind of spluttering defense of the value they create while simultaneously complaining that the customer doesn’t appreciate their work. Flip the target slightly, and we’d call this “missing the new market opportunity” or “failing to express the value offering clearly.” […]

Lingua, […] has gone from one of the most important journals in analytical linguistics to no longer being in the field and seems well on its way to becoming irrelevant. How does a company as competent in its business strategy as Elsevier let this happen? I would argue, as I did at the time that the former editorial board of Lingua resigned to form Glossa, that it was a failure to understand the assets.

The neoliberal analysis of Lingua showed an asset generating good revenues, with good analytics and a positive ROI. The capitalist analysis focused on the fixed assets and trademarks. But it turns out these weren’t what was creating value. What was creating value was the community, built around an editorial board and the goodwill associated with that.

Also, see Pushing costs downstream.

Here is an argument I wish were made a little better, but which I think is important: An Adversarial Review of “Adversarial Generation of Natural Language”: The argument is that even though it’s nice that arXiv avoids some of the problems of traditional publishing, it inherits some of the problems that traditional publishing tries to avoid. No free lunches.

2 Publishing as performance indicator

The classic academic problem: journal rank, journal impact factor, etc. We all know, and copiously complain, that this system is broken, that it is a waste of money, and that it greatly subsidises private publishers while doing little to assure quality relative to the price that people pay.

But until or unless we get so much fame we have nothing to prove, those of us aspiring to academic careers need to play the game. Your funders care, against your advice, but whatever, they have the money, so you need to care too in order that they will keep funding you.

Latrobe explains it. SCImago Journal Rank is the slightly hipper journal ranking inspired by Google PageRank. Their search tool is probably what you want. Impact factors date from the 1960s and are still around; the h-index is also a thing. journalrank might be a factor too?

According to Latrobe, we have the following indices and (partial list of) weaknesses.

2.1 h-Index

Hirsch index: The largest number [h] such that [h] articles in a journal have each received at least [h] citations over a citation period.

Weaknesses:

Editors can manipulate by requiring contributors to add citations from their journals

Increases with age, so it is biased towards researchers with long publication records
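The h-index definition above is easy to misread, so here is a minimal sketch of the computation in Python. The function name `h_index` and the example citation counts are made up for illustration; real indexing services differ in which citations and which citation period they count.

```python
def h_index(citations):
    """Return the largest h such that h items have at least h citations each.

    `citations` is an iterable of per-article citation counts
    (hypothetical input; not tied to any particular database).
    """
    # Rank articles from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            # The top `rank` articles all have at least `rank` citations.
            h = rank
        else:
            # Counts only decrease from here, so no larger h is possible.
            break
    return h


# Illustrative counts only: four articles have at least 4 citations each,
# but there are not five articles with at least 5.
example_counts = [10, 8, 5, 4, 3, 0]
print(h_index(example_counts))  # prints 4
```

Note that adding more articles or citations can never lower the value, which is one way to see the age bias listed above.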

2.2 JIF

Journal Impact Factor: Citations to a journal in the JCR year to items published in the previous two years, divided by the total number of citable items (articles and reviews) published in the journal in the previous two years.

Weaknesses:

Limited to journals within Web of Science

Cannot be used to compare journals across different subject categories
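As a worked version of the JIF definition above, here is a small Python sketch. The function name, the dictionary layout and the numbers are assumptions for illustration; the real figure depends on what Web of Science counts as a citation and as a citable item.

```python
def journal_impact_factor(year, citations_in_year, citable_items):
    """Two-year impact factor for `year`, following the definition above.

    citations_in_year[p]: citations received during `year` by items
        the journal published in year p (hypothetical input layout).
    citable_items[p]: number of citable items (articles and reviews)
        the journal published in year p.
    """
    # Only the two years before the JCR year enter the calculation.
    window = (year - 1, year - 2)
    cites = sum(citations_in_year.get(p, 0) for p in window)
    items = sum(citable_items.get(p, 0) for p in window)
    if items == 0:
        raise ValueError("no citable items in the two preceding years")
    return cites / items


# Made-up numbers: 400 citations in 2023 to 200 items from 2021 and 2022.
citations_2023 = {2022: 150, 2021: 250, 2020: 900}  # 2020 is outside the window
items_published = {2022: 100, 2021: 100, 2020: 80}
print(journal_impact_factor(2023, citations_2023, items_published))  # prints 2.0
```

Because the result is a mean over a short window, a handful of very highly cited items can move it substantially.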

2.3 SJR

SCImago Journal Rank: Average number of weighted citations received in a year, by articles published in a journal in the previous 3 years.

Weaknesses are that it is “complicated” and that the numbers are small.

So I guess if you must do a journal ranking this is the least bad method?

2.4 CORE/ICORE conference ratings

The ICORE rankings rank conferences by some notion of importance in fields. For example, in my workplace we can expect our travel to be only to “A*” conferences, which optimizes for prestige of the organization and defensibility of the funding. Are they the best conferences to go to in terms of outcomes? I do not know.

3 Other methods

Conference Ranks aggregates many measures at once.

4 Shadow libraries

Shadow libraries are “online databases of readily available content that is normally obscured or otherwise not readily accessible. Such content may be inaccessible for a number of reasons, including the use of paywalls, copyright controls, or other barriers to accessibility placed upon the content by its original owners.”

The biggest phenomenon in open access, as far as I can tell, is the massive pirate infrastructure providing open access to journals for free. See also Copyright activism.

Anecdotally, no work would happen in Indonesian academia, for example, without access to shadow libraries. They seem to be legal in some jurisdictions, but not in others. Check your local laws before accessing them. For some speculation and developments in the legality of shadow libraries, see

How illegal is scihub? : r/scihub
Sci-Hub legal status
Jonathan Basile’s essay on AAARG, Who’s Afraid of AAARG?.
Karaganis (2018):

From the top down, Shadow Libraries explores the institutions that shape the provision of educational materials, from the formal sector of universities and publishers to the broadly informal ones organized by faculty, copy shops, student unions, and students themselves. It looks at the history of policy battles over access to education in the post–World War II era and at the narrower versions that have played out in relation to research and textbooks, from library policies to book subsidies to, more recently, the several “open” publication models that have emerged in the higher education sector.

From the bottom up, Shadow Libraries explores how, simply, students get the materials they need. It maps the ubiquitous practice of photocopying and what are—in many cases—the more marginal ones of buying books, visiting libraries, and downloading from unauthorized sources. It looks at the informal networks that emerge in many contexts to share materials, from face-to-face student networks to Facebook groups, and at the processes that lead to the consolidation of some of those efforts into more organized archives that circulate offline and sometimes online—the shadow libraries of the title. If Alexandra Elbakyan’s Sci-Hub is the largest of these efforts to date, the more characteristic part of her story is the prologue: the personal struggle to participate in global scientific and educational communities, and the recourse to a wide array of ad hoc strategies and networks when formal, authorized means are lacking. If Elbakyan’s story has struck a chord, it is in part because it brings this contradiction in the academic project into sharp relief—universalist in principle and unequal in practice. Shadow Libraries is a study of that tension in the digital era.
The Dark Rule Utilitarian Argument For Science Piracy

Here are some popular shadow libraries:

Anna’s Archive/SciDB is a meta-index of sites that unpaywall journals etc.

📚 The largest truly open library in human history. ⭐️ We mirror Sci-Hub and LibGen. We scrape and open-source Z-Lib, DuXiu, and more. 📈 30,453,135 books, 100,357,111 papers — preserved forever. All our code and data are completely open source.

See Wikipedia and TorrentFreak coverage of this institution.
Sci-Hub is the pirate site that has been the most successful in providing free access to academic papers. It may be shut down due to an Indian court case (the geopolitics of this are fascinating!) and has been quasi-continued as 🧬 SciDB.
More free online libraries.

5 The ML conference ecosystem

A lot of the ML community is based around conferences, with a radically different design and culture than traditional journals. Much has been written on it. I should probably write some more and express my own personal takes on the pros and cons of these conferences.

But not right now, because I am struggling to meet some publication deadlines; that in itself tells you much of what you need to know about the cycle of such conferences.

5.1 Deadlines

The main question about ML conferences is: when is the next one? For many years, a site called aideadlin.es was the community’s frantic, shared clock, the digital heartbeat timing the research cycle. It appears to be down now, but we keep the link here as a small monument to a vital piece of infrastructure. Its spirit lives on in successors like:

AI Deadlines - a Hugging Face Space by huggingface

5.2 Experiments

Hugo Larochelle, in Announcing the Transactions on Machine Learning Research, describes a new journal by the niche it fills, rather than assuming it is complete in itself.

[…] we’re happy to announce that we are founding a new journal, the Transactions on Machine Learning Research (TMLR). This journal is a sister journal of the existing, well-known Journal of Machine Learning Research (JMLR), along with the Proceedings of Machine Learning Research (PMLR) and JMLR Machine Learning Open Source Software (MLOSS). However, it departs from JMLR in a few key ways, which we hope will complement our community’s publication needs. Notably, TMLR’s review process will be hosted by OpenReview, and therefore will be open and transparent to the community. Another differentiation from JMLR will be the use of double-blind reviewing, the consequence being that the submission of previously published research, even with extension, will not be allowed. Finally, we intend to work hard on establishing a fast-turnaround review process, focusing in particular on shorter-form submissions that are common at machine learning conferences.

As these are all features of conferences like NeurIPS or ICLR, we hope that TMLR will become a welcome and familiar complement to conferences for publishing machine learning research. TMLR will also depart from conferences’ review process in a few key ways.

Anytime submission Being a journal, TMLR will accept submissions throughout the year. For this, we will be implementing a rolling review process which will be executed on a per-paper timeline.

Fast turnaround We are implementing a review timeline that will provide reviews to papers within 4 weeks of submission and decisions within 2 months. To enable this, we will implement a capped workload for action editors (the equivalent of conference area chairs) and reviewers so as to remain lightweight throughout the year, while also requesting a commitment to accept all assignment requests.

Acceptance based on claims Acceptance to TMLR will avoid judgments that are based on more subjective, editorial or speculative elements of typical conference decisions, such as novelty and potential for impact. Instead, the two criteria that will drive our review process will be the answers to the following two questions:

Are the claims made in the submission supported by accurate, convincing and clear evidence?

Would some individuals in TMLR’s audience be interested in the findings of this paper?

The first question therefore asks that we focus the evaluation on whether claims are matched by evidence. If they are not, authors will be asked to either provide new evidence or simply adjust their claims, even if that means the implications of the work are reduced (that’s OK!). The second, though somewhat more subjective, aims at ensuring the journal features work that does contribute additional knowledge to our community. A reviewer that is unsure as to whether a submission satisfies this criterion will be asked to assume that it does.

Certifications This will be a unique feature of TMLR, which is aimed at separating editorial statements on submitted work from their claim-based scientific assessment. An accepted paper will have the opportunity of being tagged with certifications, which are distinctions meant to highlight submissions with additional merit. At launch, we will include the following certifications:

Outstanding Certification, for papers deemed to be of exceptionally high quality and broadly significant for the field (along the lines of a best paper award at a top-tier conference).

Featured Certification, for papers judged to be of very high quality, along the lines of a conference paper selected for an oral or spotlight.

Reproducibility Certification, for papers whose primary purpose is reproduction of other published work and that contribute significant added value through additional baselines, analysis, ablations, or insights.

Survey Certification, for papers that not only meet the criteria for acceptance but also provide an exceptionally thorough or insightful survey of the topic or approach.

6 Tools

Figure 2: Tom Gauld, Suggested methods of presenting your findings

See also academic reading workflow for reader-oriented tips.

researchers.one:
A platform for scholarly publishing and peer review that empowers researchers with the
- Autonomy to pursue their passions,
- Authority to develop and disseminate their work, and
- Access to engage with the international community of scholars.
unpaywall:

Millions of research papers are available for free on government and university web servers, legally uploaded by the authors themselves, with the express permission of publishers. Unpaywall automatically harvests these freely shared papers from thousands of legal institutional repositories, preprint servers, and publishers, making them all available to you as you read.
Zenodo “is an open dependable home for the long-tail of science, enabling researchers to share and preserve any research outputs in any size, any format and from any science.”
- Research. Shared. — all research outputs from across all fields of science are welcome!
- Citeable. Discoverable. — uploads get a Digital Object Identifier (DOI) to make them easily and uniquely citeable…
- Flexible licensing — because not everything is under Creative Commons.
- Safe — your research output is stored safely for the future in the same cloud infrastructure as research data from CERN’s Large Hadron Collider.
A major win is the easy DOI-linking of data and code for reproducible research. (for free)
Open Conference Systems (OCS)
is a free Web publishing tool that will create a complete Web presence for your scholarly conference. OCS will allow you to:
- create a conference Web site
- compose and send a call for papers
- electronically accept paper and abstract submissions
- allow paper submitters to edit their work
- post conference proceedings and papers in a searchable format
- post, if you wish, the original data sets
- register participants
- integrate post-conference online discussions
Peeriodicals

A peeriodical is a lightweight virtual journal with you as the Editor-in-chief, giving you complete freedom in setting editorial policy to select the most interesting and useful manuscripts for your readers.

I did not find that explanation as useful as the interview the creators gave.
The Winnower

is an open access online scholarly publishing platform that employs open post-publication peer review. You guessed it! We think transparency from start to finish is critical in scientific communication. […]
Retraction Watch is a watchdog blog that, for sufficiently high-profile research, has somehow ended up doing well-regarded gatekeeping/exposure.

7 Peer review in

See Peer review.

8 Open access

Various open access (and occasionally also open source) journals attempt to disrupt the incumbent publishers with new business models based around the low cost of internet stuff. As with legacy journals, they have varying degrees of success.

One cute boutique example:

Open Journals

Open Journals is a collection of open source, open access journals. We currently have four main publications:

The Journal of Open Source Software

The Journal of Open Source Education

The Open Journal of Astrophysics

The Journal of Brief Ideas

All of our journals run on open source software, which is available under our GitHub organization profile: github.com/openjournals.

All of our journals are open access publications with content licensed under a Creative Commons Attribution 4.0 International License. Copyright remains with the submitting authors.

9 Incoming

The Strain on Scientific Publishing (Hanson et al. 2024)

Scientists are increasingly overwhelmed by the volume of articles being published. The total number of articles indexed in Scopus and Web of Science has grown exponentially in recent years; in 2022 the article total was ∼47% higher than in 2016, which has outpaced the limited growth—if any—in the number of practicing scientists. Thus, publication workload per scientist has increased dramatically. We define this problem as “the strain on scientific publishing.” To analyze this strain, we present five data-driven metrics showing publisher growth, processing times, and citation behaviors. We draw these data from web scrapes, and from publishers through their websites or upon request. Specific groups have disproportionately grown in their articles published per year, contributing to this strain. Some publishers enabled this growth by hosting “special issues” with reduced turnaround times. Given pressures on researchers to “publish or perish” to compete for funding, this strain was likely amplified by these offers to publish more articles. We also observed widespread year-over-year inflation of journal impact factors coinciding with this strain, which risks confusing quality signals. Such exponential growth cannot be sustained. The metrics we define here should enable this evolving conversation to reach actionable solutions to address the strain on scientific publishing.
Scientific Publishing: Enough is Enough - by Seemay Chou
What methods work for evaluating the impact of public investments in RD&I
Is Frontiers predatory?
- Is Frontiers a potential predatory publisher? – For Better Science
- Predatory reports: Is Frontiers Media a Predatory Publisher?
tl;dr: Their review process is perceived by the community to be suspect, and their journals’ impact factors are generally decreasing.
PNAS is Not a Good Journal (and Other Hard Truths about Journal Prestige). At this point, most people in science are aware that journals are behaving more parasitically. They are sustained by the reputational capital of their self-fulfilling prophecies of prestige but are spending down that capital. The most generous interpretation of academia’s response is that we are too busy trying to buttress public trust in science to reform these undeservedly respected institutions. The least generous is that we are symbiotic parasites together with the journals, using them for our own reputations at the cost of public trust in science. In between is the story that we are all too busy trying to meet deadlines to solve the collective action problem of boycotting journals.
Identify trusted publishers for your research • Think. Check. Submit.
“Journal Evaluation Tool” by Shilpa Rele, Marie Kennedy et al.
Funder compliant publishers | OAPEN
Felix Schönbrodt, My personal reviewing policy: No more billion-dollar donations.
Journal of Negative Results
Étienne Fortier-Dubois, Why Is ‘Nature’ Prestigious?
Web of Science: A Web of Nonsense
Google Scholar Metrics
Time for a Change: How Scientific Publishing is Changing For The Better

10 References

Aczel, Szaszi, and Holcombe. 2021. “A Billion-Dollar Donation: Estimating the Cost of Researchers’ Time Spent on Peer Review.” Research Integrity and Peer Review.

Afonso. 2014. “How Academia Resembles a Drug Gang.” SSRN Scholarly Paper.

Björk, and Solomon. 2013. “The Publishing Delay in Scholarly Peer-Reviewed Journals.” Journal of Informetrics.

Bogich, Ballesteros, Berjon, et al. n.d. “On the Marginal Cost of Scholarly Communication.”

Hanson, Barreiro, Crosetto, et al. 2024. “The Strain on Scientific Publishing.” Quantitative Science Studies.

Heckman, and Moktan. 2020. “Publishing and Promotion in Economics: The Tyranny of the Top Five.” Journal of Economic Literature.

Himmelstein, Rubinetti, Slochower, et al. 2019. “Open Collaborative Writing with Manubot.” Edited by Dina Schneidman-Duhovny. PLOS Computational Biology.

Ioannidis, Klavans, and Boyack. 2018. “Thousands of Scientists Publish a Paper Every Five Days.” Nature.

Karaganis, ed. 2018. Shadow Libraries: Access to Knowledge in Global Higher Education. International Development Research Centre.

Keller. 2024. “On the Difference Between Conferences and Journals in Artificial Intelligence and Computer Security.”

Krikorian, and Kapczynski. 2010. Access to Knowledge in the Age of Intellectual Property.

Pensky, Richardson, Serrano, et al. 2021. “Disrupt and Demystify the Unwritten Rules of Graduate School.” Nature Geoscience.

Potts, Hartley, Montgomery, et al. 2016. “A Journal Is a Club: A New Economic Model for Scholarly Publishing.” SSRN Scholarly Paper ID 2763975.

Schimmer, Geschuhn, and Vogler. 2015. “Disrupting the Subscription Journals’ Business Model for the Necessary Large-Scale Transformation to Open Access.”

Sever. 2023. “Biomedical Publishing: Past Historic, Present Continuous, Future Conditional.” PLOS Biology.

van Noorden. 2013. “Open Access: The True Cost of Science Publishing.” Nature.

Wagenmakers, Sarafoglou, and Aczel. 2022. “One Statistical Analysis Must Not Rule Them All.” Nature.