Australia in data

2020-01-24 — 2025-11-18

Wherein Australia’s data is surveyed, national geodata collections are noted, and the AIATSIS Map of Indigenous Australia is presented as the conventional source for traditional‑owner boundaries.

data sets
place
Southeast Asia
straya

Data about Australia. Australia’s attempts to quantify itself vary in quality and usefulness. See Rob J. Hyndman’s piece We need more open data in Australia.

1 Cultural

There are some great cultural data sets in Australia, but I hear about them more for their fragility and controversy than for their content.

2 Geodata

There is in fact a vast amount of high quality open data about Australia, and I have completely lost track of this category. This section is woefully incomplete.

Figure 1

2.1 Real estate

Australians’ obsession with real estate makes microburbs a handy spatial dataset; it’s a loss leader for a real-estate-oriented data consultancy.

The microburbs dataset claims to include

  • 70,000 data points for every Microburb (55,000 of these, with an average of 400 residents), far more than publicly available on the site.

  • 10-300 data points on every address in Australia. It covers features to a fine-grained level:

    • Demographics
    • Geography
    • Amenities and businesses
    • Infrastructure
    • Voting patterns
    • Livability scores

This is ideal for:

  • Identifying real estate buying opportunities
  • Assessing the performance of real estate agents
  • Retail store placement and location analysis
  • Planning and development concerns

Their pitch explains the situation. I’d read it as: “We have not updated our data substantially since 2016, and our free services are increasingly stale.” I reckon if we need current info, they’ll cut us a deal.

2.2 Nature

2.3 Climate change

As noted under climate change, we can download projections data.

2.4 Industry

There’s a neat report Creative designs: geography of Australia’s digital technology industries (Hajkowicz et al. 2023) that plots where “tech happens” in Australia. Download here. Interactive map here.

3 Population data

3.1 Australian Bureau of Statistics

Traditionally, ABS data has been “open” but a total PITA to use automatically, because it arrives in idiosyncratic spreadsheets with an obscure filing system.

The data’s idiosyncrasies may be fixed by the new Data API; it’s an SDMX-REST:2.1-compliant repository that works with standard official statistics tools.

The more traditional spreadsheet-download method is still available, and there are some packages to help with that. Rob Hyndman points out the following:

  • Matt Cowgill’s readabs parses the awful spreadsheets the Australian government provides.
  • David Mitchell’s raustats does fancy downloading and handles Reserve Bank data too; however, as of 2022, it hadn’t been updated in 3 years.

3.2 Australian electoral data

eechidna, by Jeremy Forbes, Carson Sievert, Rob Hyndman, Diane Cook, Heike Hofmann, and many others, has spatiotemporal electorate data from 2001–2019. It’s an excellent package and the one I use myself. Recommended.

For a simple example, check out Where your vote counts (source code), which maps how marginal federal electorate votes in Australia have been, and therefore where we probably have the most power as voters.

Monash NUMBATs explain some tweaks: Hexmaps with sugar bag make it easier to see the electoral map.

Peter Ellis, on his Australian Federal Election 2019 forecasts post, introduces his useful ozfedelect package for R for modelling with electoral data. There’s more stuff from that author.

We can also DIY. Ben Raue advises:

The election results data published by the AEC is already pretty good. It’s tidy and easily interconnectable with unique IDs. Unlike the state and local election results which is why I started my own collection of those results in a tidy format. All the results websites are available from the AEC website then click through to downloads. In some cases datasets are broken up by geography (by state or electorate) but it’s trivial to merge them back together in R. The House of Reps downloads for 2019 are here.

5 Other fun stuff

  • Life tables.
  • OZdatasets is a large index of Australian datasets found by rOpenSci OzUnconf19 volunteers.
  • OpenNEM: NEM tracks energy-market stats, generation and supply.

6 Incoming

7 References

Bureau of Communications Research, and Deloitte Access Economics. 2016. Open Government Data and Why It Matters: A Critical Review of Studies on the Economic Impact of Open Government Data.”
Bureau of Meteorology. 2024. Climate Hazard Information for Australia.”
Forbes, Cook, and Hyndman. 2020. Spatial Modelling of the Two-Party Preferred Vote in Australian Federal Elections: 2001–2016.” Australian & New Zealand Journal of Statistics.
Guo, Mokany, Ong, et al. 2023. Plant Species Richness Prediction from DESIS Hyperspectral Data: A Comparison Study on Feature Extraction Procedures and Regression Models.” ISPRS Journal of Photogrammetry and Remote Sensing.
Hajkowicz, Evans, Cameron, et al. 2023. The Geography of Australia’s Digital Industries: Digital Technology Industry Clusters in Australia’s Capital Cities and Regions.”
Harwood, Williams, Lehmann, et al. 2021. 9 Arcsecond Gridded HCAS 2.1 (2001-2018) Base Model Estimation of Habitat Condition for Terrestrial Biodiversity, 18-Year Trend and 2010-2015 Epoch Change for Continental Australia.”
Houghton, John, and Gruen. 2012. Transparency and Productivity: The Effects of Open and Transparent Public Sector Information Management Practices on Costs and Productivity.” Occasional Paper 2.
Houghton, John W., and Gruen. 2014. Open Research Data: Report to the Australian National Data Service (ANDS).” Report.
Mokany, McCarthy, Falster, et al. 2022. Plant Diversity Spatial Layers for Australia.”
Williams, Harwood, Eric A., et al. 2021. Habitat Condition Assessment System (HCAS Version 2.1) Enhanced Method for Mapping Habitat Condition and Change Across Australia.”