This site uses cookies

Some of these cookies are essential, while others help us to improve your experience by providing insights into how the site is being used.

For more detailed information please check our Cookie notice

Necessary cookies

Necessary cookies enable core functionality. This website cannot function properly without these cookies.

Cookies that measure website use

If you provide permission, we will use Google Analytics to measure how you use the website so we can improve it based on our understanding of user needs. Google Analytics sets cookies that store anonymised information about how you got to the site, the pages you visit, how long you spend on each page and what you click on while you’re visiting the site.

Webinar: Introduction to QAMyData ‘health-check’ tool for numeric data

2 Dec 2019 2:00 pm - 3:00 pm
Data skills

This webinar is an introduction to the new QAMyData tool for health-checking your numeric data, recently launched in November 2019.

The tool uses automated methods to detect and report on some of the most common problems found in survey or numeric data, such as missingness, duplication, outliers and direct identifiers. The open source tool helps data creators and users quality assess a numeric data file using a comprehensive list of ‘tests’, classified into types: file, metadata, data integrity, and direct identifiers. Popular file formats can be tested, including SPSS, Stata, SAS and CSV. The test configuration feature allows the creation of your own unique Data Quality Profile, that can play a useful role in your ‘FAIR’ data checking.

The webinar will describe the tests that are included in the tool, how to configure these to meet your own quality thresholds, and how to download the software from our Github page. We will also show our teaching exercise using messy data that can help promote data management skills.

The webinar will consist of a 30 minute presentation followed by 20 minutes for questions.

Presenters: Louise Corti, Cristina Magder and Myles Offord

Level: Intermediate
Experience/knowledge required: Some knowledge of survey or numeric data
Target audience: Data publishers/data archivists, users of numeric data, peer reviewers of data, quantitative research lecturers