This site uses cookies

Some of these cookies are essential, while others help us to improve your experience by providing insights into how the site is being used.

For more detailed information please check our Cookie notice


Necessary cookies

Necessary cookies enable core functionality. This website cannot function properly without these cookies.


Cookies that measure website use

If you provide permission, we will use Google Analytics to measure how you use the website so we can improve it based on our understanding of user needs. Google Analytics sets cookies that store anonymised information about how you got to the site, the pages you visit, how long you spend on each page and what you click on while you’re visiting the site.

Synthetic Data Code Demo

15 Jun 2021 12:00 pm - 1:00 pm
Online
Training
Data skills
Other
In this free training series, we cover the advantages and disadvantages of synthetic data. We explore the variety of methods available to generate synthetic data. Finally, we discuss the nuanced definitions comprising synthetic data itself.
In this code demo we will showcase some data synthesis tooling. We work through a manual process of creating synthetic data in Python. We explore a web-based data generation library, Mockaroo. We replicate this dataset using a data generation library for Python called Faker. We simulate the rolling of dice and children’s shoe sizes.
Presenter: Joe Allen, UK Data Service
In the first webinar, we introduce synthetic data and explore some reasons why we want to use it.
In the second webinar, we explore two categories of data synthesis methods: Masking and Redaction.
In the third webinar, we explore three categories of data synthesis methods: Coarsening, Mimicking and Simulation.
Recordings of UK Data Service webinars are made available on our YouTube channel and, together with the slides, on our past events pages soon after the webinar has taken place.

Event resources