This site uses cookies

Some of these cookies are essential, while others help us to improve your experience by providing insights into how the site is being used.

For more detailed information please check our Cookie notice


Necessary cookies

Necessary cookies enable core functionality. This website cannot function properly without these cookies.


Cookies that measure website use

If you provide permission, we will use Google Analytics to measure how you use the website so we can improve it based on our understanding of user needs. Google Analytics sets cookies that store anonymised information about how you got to the site, the pages you visit, how long you spend on each page and what you click on while you’re visiting the site.

Introduction to Data Linkage

22 Mar 2018 12:00 am
Training
Data skills
Other

We recommend this course is booked in conjunction with ADRC-E Short Course T057 Evaluating linkage quality for the analysis of linked data on the 23rd March 2018, but it can also be booked as a separate one day course.

This short course is designed to give participants a practical introduction to data linkage and is aimed at researchers either intending to use data linkage themselves or those who want to understand more about the process so that they can analyse linked data. Introduction to Data Linkage will cover examples of the uses of data linkage, data preparation, and methods for linkage (including deterministic and probabilistic approaches and privacy-preserving linkage).

The main focus of this course will be health data, although the concepts will apply to many other areas. This course includes a mixture of lectures and practical sessions that will enable participants to put theory into practice.

Evaluating linkage quality for the analysis of linked data is a separate course on the 23rd March, which will cover processing of linked data, concepts of linkage error and bias, and handling linkage error in analysis.

Further course details are on the NCRM website.

Course Tutors: Dr Katie Harron, Dr James Doidge

Course Contents:

The course covers

  • Overview of data linkage (data linkage systems, benefits of data linkage, types of projects)
  • Linkage methods (deterministic and probabilistic, privacy-preserving)
  • The linkage process (data preparation, blocking, classification)
  • Overview of linkage error
  • Practical sessions

Learning Outcomes:

By the end of the course participants will:

  • Understand the background and theory of data linkage methods
  • Prepare data for linkage
  • Perform deterministic and probabilistic linkage

Computer Software and Computer workshops:

This event includes computer workshops.

Participants will need to bring their own laptops with a Windows operating system (Macs will not work) with Excel, and LinkPlus software.

Pre-requisites:

The course does not assume any prior knowledge of data linkage. Some experience of using Excel will be useful for the practical session.