Using agent-based models as a simulation tool to generate synthetic data

18 Feb 2026 2:00 pm - 4:00 pm

Online

Training

Synthetic data

Workshop

This free online workshop explores the intersection of two computational social science ideas, synthetic data and agent-based models:

Synthetic data – which is generated data rather than observed data. This can be useful when balancing research needs with data availability or security.
Agent-based models (ABM) – which simulate interactions and can produce synthetic data for diverse or even counterfactual scenarios.

This workshop will cover:

why create synthetic data using ABM
when it would be appropriate to do so
how to create the data using open source software NetLogo
what the resulting data can be used for

We will be using NetLogo for the live software demonstration, which is an open-source programming language and integrated development environment for agent-based modelling.

Workshop details

The workshop is structured into four parts:

Introduction to synthetic data
What is and is not synthetic data? What is it good for? What are some common ways to generate it?
Introduction to top-down versus bottom-up
What do top-down and bottom-up mean? How do these relate to research problems and data generation methods?
Introduction to agent-based models
What are agent-based models? How do they work? What data do they generate? This part includes demonstrations of free and user-friendly agent-based modelling software.
Q&A session
Participants can engage directly with the content, ask questions related to the presentation or live coding demonstration, and get links to useful resources.

There will be a comfort break during the workshop.

Presenter

Jools Kasmire, UK Data Service
Jools is a Senior Research Fellow and Computational Social Science Training Leader for the UK Data Service Training and User Support team. Her training themes include data mining, text-mining, machine learning, synthetic data, reproducibility, writing code, programming languages, data management, data visualization, web-scraping, API’s, agent-based models, natural language processing, and social network analysis. She has an undergraduate degree in linguistics, MSc in evolution of language and cognition, PhD in evolution of socio-technical networks, and has spent 10 years with the University of Manchester on computational methods for social science research. Jools enjoys showing people methods that allow them to create new research questions or to think about established research questions in new ways.

Resources

Participants will have access to all workshop materials (slide decks, web-only agent-based models, and links to free agent-based modelling software). Additionally, a recording of the workshop will be made available post-event.

Prerequisites

No formal prerequisites are required to attend. However, those who wish to actively participate in the software demonstration should have access to a computer with NetLogo installed or have a browser open to the NetLogo web environment.

Level: Suitable for all. Participants who want to find out more about agent-based modelling and/or synthetic data in advance of the workshop can watch our YouTube videos:

Experience/knowledge required: None, although participants who want to follow along with the live software demonstration may benefit from exploring NetLogo tutorials

Target audience: Anyone who wants to learn more about how to:

generate synthetic data
use agent-based models to generate data for social science research

Any questions regarding this workshop can be sent to  booking@ukdataservice.ac.uk.

This event will be livestreamed on our UK Data Service YouTube channel but the chat will be disabled. By registering and attending the Zoom event you will be able to ask questions and interact.

Recordings of UK Data Service events are made available on our YouTube channel and, together with the slides, on our past events pages soon after the event has taken place.

Event resources

Recording (YouTube)

Presentation (PDF)

This site uses necessary cookies

Website stats

Using agent-based models as a simulation tool to generate synthetic data

Event resources