Data as critical national infrastructure
Human societies change when resources become either plentiful or scarce. The agrarian revolution saw food become available at a scale, and with a reliability, never seen before, while the industrial revolution saw manufactured goods become available at levels and costs not previously seen. We are now in the midst of a data revolution, and data is following the same path as food and manufactured goods before it.
However, data is not like food or manufactured goods, which once consumed are no longer available to anyone else. One of the greatest challenges of the modern world is how to distribute resources equitably and use them sustainably.
We are now in the midst of a data revolution, and data is following the same path as food and manufactured goods before it
When infrastructures such as energy, transport, housing and agriculture are built for the 21st century, we worry about how we will afford them, make them available and do so in a way that does not diminish resources for future generations. The data revolution does not play by these rules. At the Open Data Institute (ODI), we have been working to promote the use of data at web-scale to support open innovation and provide knowledge for everyone.
Data does not have to be a rival good: your possession of it does not detract from my possession of it. If you use it, then it does not become unusable by me. Of course, we may choose to set up payment and protection systems around data and we must protect people’s privacy. But the ODI has been demonstrating the benefits of a data infrastructure that is as open as possible: certain data has greater use and value the more widely it is shared and used. Data is a new class of public infrastructure for the 21st century, and we need to view it as one that is as fundamental to modern society as power and transport, and that requires investment, curation and protection. Engineered correctly, data can generate extraordinary amounts of economic value.
Data does not have to be a rival good: your possession of it does not detract from my possession of it. If you use it, then it does not become unusable by me
When the human genome was first sequenced in 2000, there was a race to patent that data. In fact, it took a 2013 judgement of the US Supreme Court to rule that the genome could not be patented. The value that has accrued from that decision has eclipsed a monopolistic assignment of genomic data to a particular organisation.
When Transport for London (TfL) made its transport data openly available in 2010, it had a transformational impact on the data ecosystem. It led to a growing number of apps and companies using that data to provide services that we all use as we get around one of the great cities of the world more efficiently. Those apps and companies now form part of a global market as transport services around the world open up their data to a common standard. Moreover, TfL calculated that its decision had a return on investment of 58:1. These examples demonstrate that open data generates huge economic returns.
open data generates huge economic returns
Part of the duty of a modern state is to provide open data infrastructure and promote open standards to make them globally interoperable, and open data initiatives around the world are beginning to demonstrate what this common public data infrastructure might look like.
Geospatial data, such as administration regions, maps and addresses, are a vital building block for open data infrastructure. Lists of legal entities (both public and private sector), company directors and beneficial ownership registers help us link data together and understand which organisation is responsible for particular activities.
Meanwhile, specific sectors have their own infrastructural needs. Health needs lists of hospitals, surgeries, practitioners and procedures. Transport needs rail and bus stations, airports, timetables, fares and realtime travel information. Democracy needs lists of elections, results, candidates and places where people can vote.
Both research and experience show us that to maximise value, the data infrastructure must be open and used by as many people as possible. It should be viewed as an asset that needs to be invested in and curated to make sure that the data is as up-to-date and accurate as possible. Data needs to be protected from abuse by bad actors and, in many cases, recognised as critical national infrastructure. It needs to be designed for openness and the global web of data.
Both research and experience show us that to maximise value, the data infrastructure must be open and used by as many people as possible
An effective, efficient and equitable data infrastructure will generate value for this and succeeding generations. The UK needs to have the policy foresight to appreciate the importance of this enterprise, with public and private agencies or organisations coordinating and encouraging open innovation as it seeks to develop, maintain and deploy a national data infrastructure.
Essential infrastructure needs to be engineered to high standards, and must be robust, scalable, accessible, durable, secure, and of good provenance. The UK needs to develop the engineering skills needed to ensure that its data infrastructure has these characteristics.
***
This article has been adapted from "Data as critical national infrastructure", which originally appeared in the print edition of Ingenia 69 (December 2016).
Contributors
Professor Sir Nigel Shadbolt FREng is Professor of Computer Science at the University of Oxford and Principal of Jesus College. He is also the Chairman and Co-founder of the Open Data Institute. In 2009, Sir Nigel was appointed Information Adviser to the UK government, helping transform public access to government information.
Keep up-to-date with Ingenia for free
SubscribeRelated content
Software & computer science
Pushing the barriers to model complex processes
In 2007, Imperial College London spinout Process Systems Enterprise Ltd won the MacRobert Award for its gPROMS (general-purpose PROcess Modelling System) software. Costas Pantelides and Mark Matzopoulos, two of the key people behind the success of gPROMS, tell how they created a way in which engineers can harness physics, chemistry and engineering knowledge within a framework that solves highly complex mathematical problems.
Compact atomic clocks
Over the last five decades, the passage of time has been defined by room-sized atomic clocks that are now stable to one second in 100 million years. Experts from the Time and Frequency Group and the past president of the Institute of Physics describe a new generation of miniature atomic clocks that promise the next revolution in timekeeping.
The rise and rise of GPUs
The technology used to bring 3D video games to the personal computer and to the mobile phone is to take on more computing duties. How have UK companies such as ARM and ImaginationTechnologies contributed to the movement?
EU clarifies the European parameters of data protection
The European Union’s General Data Protection Regulation, due for adoption this year, is intended to harmonise data protection laws across the EU. What are the engineering implications and legal ramifications of the new regulatory regime?
Other content from Ingenia
Quick read
- Environment & sustainability
- Opinion
A young engineer’s perspective on the good, the bad and the ugly of COP27
- Environment & sustainability
- Issue 95
How do we pay for net zero technologies?
Quick read
- Transport
- Mechanical
- How I got here
Electrifying trains and STEMAZING outreach
- Civil & structural
- Environment & sustainability
- Issue 95