Data hubs

Practical insights and inspiration for achieving better public service outcomes. Data lies at the heart of what we do. This collection seeks to help local public service organisations unlock the potential of data, whilst protecting the people and communities that they are there to serve.

Jump to section

Opening-up data
Data hubs
Data quality challenge – how to avoid ‘big bad data’
Data mashups
Benefits derived from harnessing data

Opening-up data

Opening-up data is not just concerned with avoiding unnecessary security constraints and costs. Rather, it involves sharing information openly, internally and externally to improve its quality and value. Most data is not personal, sensitive or confidential, and much data security is unnecessary. It comes from a culture of ‘closed’ not ‘open’, where it is cheaper and easier to provide a blanket of security over all data rather than being more sophisticated in determining which data requires higher levels of security and protection. The culture extends to the workforce in which many feel they don’t need to worry about data security, because IT has sorted it all out – any breach is the fault of a failure of IT protection.

In practice however, UK public services are moving towards greater openness, and have regulatory and legal requirements to do so. The open data initiative began nearly a decade ago, and local government have arguably led the way, finding value in sharing data openly, allowing it to be used in ways beyond the original purpose for which it was captured, opening up data linkage possibilities across disparate organisations and systems.

Whilst the original government aims of ‘armies of armchair auditors’ keeping public services in check was neither effective nor feasible, there have been a range of other benefits which public service organisations can list in their data strategies:

Increasing customer and citizen insight
Undertaking more sophisticated data matching to improve data quality, reduce data redundancy and reduce fraud
Providing better customer services and anticipation of need or risk
Improved management of resources, such as in being able to find underused buildings to share
Better delivery of complex relationship services such as health and social care integration

Opening-up data can also ensure that many versions of the same information can be exposed, both within an organisation but also across related services in other organisations. This can help remove redundant data and ensure decisions are taken using the most current and up to date data.

It also can fundamentally change the culture of an organisations in relations to data principles and use, moving beyond the traditional silos of data held protectively in departments or by professionals unwilling (sometimes with good reason) to share with the wider organisation, partner or the citizen whose data it is. It also helps to ensure that data and information are managed in their own right, rather than as appendages locked into propriety or bespoke systems with their own unique formats and standards.

Data hubs

The idea of pooling mountains of data and analysing to produce new insights is nothing new. Tesco began doing this with customer shopping habits in the 1980s but did not have the technical capability to analyse and use the data volumes. Today things are different. Powerful processing capabilities coupled with artificial intelligence engines and cloud infrastructure have transformed the possibilities into reality. ‘Big Data’ projects are being reborn.

But building a massive data warehouse and simply assuming everyone will use it in a uniform way could be a recipe for disaster of optimism over reality. Such a project can become unwieldy, costly and hard to realise value. Big data warehouses also do not always fit well with the model of public and private cloud systems.

What is emerging as a virtual data hub, often operating at a local level around the roles of health and look government, with data held across individual applications using a mixture of, standards, open eight dollars in data management tools. Defining this is a data architecture is not easy, and CIOs need to collaborate closely with individual business teams to understand analytics agenda and requirements, and often data linkages are more important than creating data mountains in a data project.

Public services today are using predictive data analytics to start to mine data and solve the most intractable problems facing society. For example, Brent Community Protection team and IBM have carried out a proof of concept, building a predictive model to identify vulnerable young people most at risk of criminal exploitation, such as child sexual exploitation and gang drug running.

A growing network of connected devices in our public buildings, in our streets and even on our wrists is helping the public sector to deliver better services that are less intrusive and reduce the need for human intervention. The Cloud and AI are essential components to make sense of this stream of new data, using powerful and distributed cognitive engines to process structured and unstructured data, analysing and linking data according to patterns and connections, forming a big data virtual backbone. These algorithmic routines and logical components such as RPA (robotic process automation) are transforming the prospects of big data analytics for the public sector, predicting events and opportunities in every aspect of our daily lives.

Big data is often seen as the same as master data management – but they are not the same. Neither is the concept of data warehouse:

“Big data” – typically, a project, defined around a specific function or purpose (such as customer service)
“Data warehouse” – typically, about bringing all core data together into one large repository shared across the business
“Master data management” – a concept for how core data is managed; whilst often centralised, some will also be in distributed systems

Data quality challenge – how to avoid ‘big bad data’

There is a need to live with data error – it cannot be eliminated altogether. This means focussing on the biggest risks and working to mitigate the effects of poor data quality, with timely intervention and strong policies and processes. Making responsibilities and data governance clear is the essential starting point to understanding the value of data and preventing preventable errors.

What is quality data?

“Quality data is useful data, and the reverse is true”
Consistency in formats
Properly maintained
Timely
Accurate and true
Relevant metadata held
Business rules for use documented
Provenance known and retained
GDPR compliant

Benefits of good data

Value increases in data mining activity for problem solving
Data sharing is more effective and justifiable
Process automation is only possible with good, clean data
Customer relationship management systems and access authentication require good data
Decision making is faster and with more confidence, empowering suppliers, staff and customers.

Causes of poor data quality

Lack of ownership for the level of quality in data sets
Specific data roles are not clear (SIRO, DPO, CDO, etc)
Suppliers are not held accountable
Sloppy data handing/sharing practices and weak accountability
Information governance is missing or not directed from the board
Poor processes for data management or MDM
Culture does not value data
Data standards lacking or inconsistently applied
Risk management for data assets is immature, including security policies
Digital maturity is low, such as how social media and new multi-cloud systems are used

Data mashups

The power of data lies in the ability to connect different data sets to create new insight. It is the basis of AI, which uses the intelligence in data to make predictive analysis and automated decision making. Today, sophisticated data tools can combine data from many sources – audio, video, RSS feeds and raw numeric or non-numeric data. These ‘mashups’ of disparate data sources are the basis of data mining tools which can trawl for new intelligence through enormous data volumes, looking for patterns and business value as well as for machine learning applications.

In the past much of this type of analysis depended on data warehousing – a large and carefully designed pool of data. Although there is still a place for selective data warehousing, perhaps in a CRM (customer relationships management system), many applications today do not depend on a single database managed on-premise. Instead, they make calls to multiple data files in a mix of hosted cloud, on-premise and open source data on the internet.

Clearly this depends on:

Common data formats and definitions that ensure that data schemes permit appropriate data linkages to be made
Methods for data collection
Data analysis to establish the provenance and accuracy of data
Appropriate tools that do the actual mashup.

Data mashup tools are not the sole domain of the information or IT professional. They also allow end-users to define their own data mashups and to visualise and analyse the resulting data blocks. Some will be concerned about this, worried that end-users can misuse data, create incorrect formulae and therefore make the wrong insights and conclusions. Business intelligence (BI) tools can be locked down, but this will arguably hold back the progress and learning necessary in a truly data-driven organisation. In any case, competent end-users will find ways of extracting data into spreadsheets if they choose.

Public service organisations must learn to grow data skills and trust the judgements of data specialists and end-users alike, learning openly when mistakes are inadvertently made. Better to concentrate on ensuring the quality of master/core data and support end-users in becoming intelligent data analysts.

Benefits derived from harnessing data

Discussions with some of those linked to this work have listed a range of benefits generated by their data projects

Quicker and better decision-making through the availability of a high-quality easy to retrieve data base
Improving the response to customer citizens needs
Developing commercial solutions to increase income generation
Identifying data patterns, trends and new service opportunities
Developing efficiency of internal processes such as automation and business process improvement
Increasing the accuracy of data, especially through data matching techniques
Better understanding of demand management, forecasting and trends
Better understanding of citizens, and customers and communications with them
Lower operational costs
Improve risk management
Optimize marketing and communication to citizens and customers
Being able to simplify and consolidate business data interrogation tools and reporting
Having a better understanding of the scale and potential benefit of total corporate data assets
Improving data governance and information management cultures
The ability to be able to capture, store, retrieve and analyse data in a flexible and agile fashion
The ability to link data held across different data formats
Improving the tracking and recording of data origins and movements between databases and organisations – data provenance
Building more sophisticated analytical and predictive models, including supporting the basis of artificial intelligence development.

How to be an Effective Coach

How to be an Effective Coach

Open data products

Data Lab collaboration projects

Council and fire and rescue authority data

Unlocking the value of London’s public sector data

City Tools: shining a light on the technologies powering London’s boroughs

Worcestershire Office of Data Analytics (WODA)

Using data in the service of humanity

Vulnerable children

Suffolk authorities launch data analytics operation

West Midlands to get office for data analytics

Barking and Dagenham – social progress index source

Open data in Calderdale

Predicting child abuse

New approaches to community pain points

Civic hacking initiative

Relevant Socitm resources

Realising the potential of big data analytics

Helpful external resources

Harnessing data resources

London Office of Technology and Innovation (LOTI)

New service delivery models – R&D project

Tools to support open data publication and use

Open Data in a Day training course

Open data essentials – free eLearning

Data and Public Services Toolkit

5 steps to transform your IT organisation with AI

Checklist: How to design to scale

Big Data or Big Hype?

What’s the ideal model for an Office of Data Analytics?

Wise Council: Insights from the cutting edge of data-driven local government

How to be an Effective Coach

Jump to section

Opening-up data

Data hubs

Data quality challenge – how to avoid ‘big bad data’

What is quality data?

Benefits of good data

Causes of poor data quality

Data mashups

Benefits derived from harnessing data

Discussions with some of those linked to this work have listed a range of benefits generated by their data projects

Case Studies

Relevant Socitm resources

Helpful external resources