Welcome to DIY Open Data
This is a collaborative list of 66 tools that the Open Data community use when exploring, processing and visualising data. The list focuses on tools that enable individuals and small teams to explore, process, prototype and publish data.
I hope this list helps you find some useful tools and maybe even inspires you to have a go at some data exploration.
The initial list was collaboratively generated by attendees at a session at Open Data Camp 9 in Manchester.
Awesome List
Coding Environment
Data Portal
-
DataHub Awesome collections
Curated list of open datasets
-
Dtechtive
Search engine for open and commercial data
-
Global Biodiversity Information Facility
Free and open access to biodiversity data
-
Google Dataset Search
Search engine for data
-
Mendeley Data
Communal repository for sharing and storing research data
-
OECD Data Explorer
Data covering key topics for many countries
-
ONS NOMIS
Official UK Census and Labour Market Statistics
-
Open Energy
Icebreaker One data portal
-
The Humanitarian Data Exchange
Find, share and use humanitarian data all in one place
-
UK Data Service
Collection of UK economic, population and social research data
-
Zenodo
Open science data portal
-
data.gouv.fr
Open platform for French public data
-
data.gov
The home of the US Government's open data
-
data.gov.uk
Find UK Government open data
Database
Desktop App
Integration Platform
-
ActivityPub
Protocol for federating notifications and content
-
IFTTT
Cloud-based event-driven automation service
-
Pipedream
Serverless API integration platform
-
Really Simple Syndication
Standard for syndicating content
-
Zapier
Powerful cloud-based automation
-
n8n
Workflow automation
Code Library
-
BallMapper
Topological data analysis platform
-
Leaflet
JavaScript mapping library
-
Matplotlib
Visualisation with Python
-
PETL
Python extract, transform, load library
-
Plotly
Python graphing
-
d3.js
JavaScript visualisation library
-
geopandas
Geospatial extension to pandas
-
janitor
R data cleansing
-
pandas
Data analysis and manipulation tool
-
pyjanitor
Python data cleansing
-
remark
In-browser slideshow
-
reveal.js
HTML presentation framework
-
seaborn
Python statistical data visualization
-
skimpy
Create summary statistics about pandas or Polars data frames
-
skimr
Summary statistics for R data
-
xaringan
Create remark.js presentations with R Markdown
Miscellaneous
-
Amazon EMR
Run and scale big data workloads
Orchestration
Programming Language
-
Python
The Python programming language
-
R
Statistical computing and visualisation software environment
-
Rust
Fast and memory efficient compiled language
-
The Julia Programming Language
High-performance programming language
-
Perl
Venerable and powerful programming language