Little, unique codes hold our world together

This post was written by Ian Makgill, the founder of OpenOpps.com, the largest database of open tender opportunities in the world.

Have you noticed that your email address is completely unique? Its obvious that this would be true, but it bears thinking about. In all the millions and millions of email addresses there isn’t another like yours, if there was another like yours there would be no way for your colleagues and friends to send a note to just you. Uniqueness is a critical feature that is required for email to work. We’ve known that uniqueness is important for handling information: your address is unique, published books have a unique ISBN number and cars have unique registrations.

We do this so that we can tie our information together effectively, and once that’s in place we can build services on top of that. Imagine a postal service that didn’t have a way of differentiating between addresses, or a library that couldn’t differentiate between two books with the same title? Those long digit numbers and postcodes look geeky and impenetrable, but they’re actually the glue that holds the service together. To libraries ISBN codes are as important as the shelves and the roof, they are part of the infrastructure that our services are built on and without them, our services would struggle to function.

Government has only recently started to try and bring the codes that are used in the different parts of the public sector together into a single set of codes. The work of the registers team is an important start in building a full understanding of what the public sector is made up of, but it is a combination of lists that get updated, rather than a canonical register of every body in the public sector. Its a great start, but it needs more investment and proper support from within government.

The lack of well organised, unique codes for public sector bodies is especially problematic when you try to understand an activity that happens in lots of different parts of the public sector, like giving a grant for instance. Grants are awarded by all sorts of public bodies to all sorts of other bodies, but when there are no codes for the organisation that awarded the grant, it can be really hard to ‘zoom out’ from individual grants and try and understand how those grants might be grouped together. So, without codes you can’t know how many grants were made by local councils, or how many codes were made by organisations in London, or who received the money.

In fact, if you try to do any form of financial analysis across different parts of government, you quickly become unstuck, because we don’t have the underlying infrastructure to link data back to a single record. Which means that people use the data references that they think is best, but that good intent doesn’t work when you try and analyse the data.

Below is the list of different names that the Department of Health has used to describe itself itself in contract award notices in the last two years. Eleven of these records describe the same organisation, one describes two organisations:

  • Ambulance Radio Programme, Department of Health
  • Department Health
  • Department of Health
  • Department Of Health
  • DEPARTMENT OF HEALTH
  • Department of Health (DH)
  • Department Of Health (Government)
  • DoH — Public Health Agency
  • The Department of Health  c/o Ambulance Radio Programme
  • The Department of Health c/o Ambulance Radio Programme
  • The Secretary of State for Health acting as part of the Crown through the Commercial Medicines Unit (part of the Department of Health)
  • The Secretary of State for Health acting as part of the Crown through the Department of Health

In computing terms we call codes like ISBN and passport numbers identifiers, because the codes let us know that we’re referring to one, specific thing. In the same way that not having proper identifiers means that our letters go missing and our books don’t get returned, when we don’t have identifiers for public organisations, our understanding of grantmaking, procurement and budgets is always going to be compromised; data goes missing and simple questions like “Which councils have given grants to Barnardos?” can’t be answered.

Without identifiers, we can’t analyse what we should be able to analyse, but can’t even know what data is missing. Instead, we fumble around trying to guess what is happening. However, even when codes do exist, we’re not using them properly. Every company and every charity has a unique identifier that can be used when they receive a grant or a contract, but when public bodies publish data about who won those grants and contracts don’t add identifiers, so we can’t understand who awarded the grant or who received it.

You might think this doesn’t matter, but when we spend £300bn every year with suppliers, and when Carillion went into administration, the government had to ask Carillion where all of their contracts were. If we don’t know where the contracts are, how can we make sense of whether there’s a risk to the taxpayer and the service user from poor contracting?

At OpenOpps.com we spend all of our time trying to answer the question “who spent what with whom” and so we’ll be working with 360Giving to try and find out how to build a better version of the data required to answer that question.

This blog was edited on 25th May to include a reference to the government list of registers. It was not the author’s intent to suggest that the list didn’t exist, but that it lacked the investment needed to make it really useful.