Metadata
Metadata
Metadata are an important subset of core data documentation
Collating and recording metadata is important for the purposes of cataloguing, citing, discovering and retrieving data collections. Metadata are a subset of core data documentation providing standardised, structured information.
Metadata are intended for reading by machines, and help to explain the purpose, origin, time references, geographic location, creator, access conditions and terms of use of a data collection. Without this essential documentation, collections become of limited value simply because researchers and reusers will not be able to search for or cite the data collection.
With new policies and initiatives asking researchers to pre-register their studies and to share their data, catalogued information about research studies has shown the importance of having metadata clearly documented from the very start of a research study.
Example schemes
- Dublin Core
- ISO 19115 for geographic information
- Data Documentation Initiative (DDI)
- Statistical Data and Metadata eXchange (SDMX)
- Metadata Encoding and Transmission Standard (METS)
- General International Standard Archival Description (ISAD(G))
- DataCite metadata schema for the publication and citation of digital datasets with a persistent identifier.
Data Documentation Initiative
The DDI is a rich and detailed metadata standard originally designed for describing social, behavioural and economic sciences data. It is used by most social science data archives in the world.
DDI catalogue records contain mandatory and optional metadata elements relating to study description, data file description and variable description. The study description contains information about the context of the data collection, such as bibliographic citation of the study and data, the scope of the study, like topics, geography, time, method of data collection, sampling and processing, data access information, and information on accompanying materials.
The data file description indicates data format, file type, file structure, missing data, weighting variables and software used. Variable descriptions indicate the variable labels and codes.
The UK Data Service uses DDI to structure catalogue records. The use of standardised records in eXtensible Mark-up Language (XML) brings key data documentation together into a single document, creating rich and structured content about the data.
Metadata can be harvested for data sharing through the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH).
The UK Data Service catalogue records
Our DDI records contain mandatory and optional metadata elements on the following:
- Study description – information about the context of the data collection, such as bibliographic citation of the study and data, the scope of the study (topics, geography, time), methodology of data collection, sampling and processing, data access information, and information on accompanying materials.
- Data file description – information on data format, file type, file structure, missing data, weighting variables and software.
- Variable descriptions.
We collect the initial catalogue record information from our data deposit form, which is completed by the data depositor. We then enhance information from the accompanying documentation to create a conformant metadata record.
Where researchers can provide detailed and meaningful data collection titles, descriptions, keywords, contextual and methodological information in the deposit form, it helps us to create rich resource-discovery metadata for their deposited collections. We assign keywords from our own HASSET thesaurus.
Data citations
Depositors are encouraged to provide information about original and subsequent reports and publications or presentations based on our data collections, so these references can be added as further documentation.
We are always interested in capturing case studies of data reuse in our collections to encourage further use of data. We prepare a standard bibliographic citation for each data collection so that users can correctly cite the data sources in research outputs.
Example extract from a UK Data Service DDI catalogue record
<stdyInfo> <subject> <keyword> </keyword> <topcClas Vocab=”unknown”>Economic processes and indicators – Economics</topcClas> <topcClas Vocab=”unknown”>Economic systems and development – Economics</topcClas> <topcClas Vocab=”unknown”>General – Employment and labour</topcClas> <topcClas Vocab=”unknown”>Elites and leadership – Social stratification and groupings</topcClas> <topcClas Vocab=”unknown”>Management and organisation – Industry and management</topcClas> </subject> <abstract>This project aims to develop knowledge and understanding of the contemporary globalisation of the headhunting industry in Europe and its implications for new forms and geographies of executive search and selection. Europe has become the most complex and sophisticated pan-regional market for executive search, fuelled by free labour mobility within the EU, thereby offering a unique environment in which to study the changing practices of the headhunting industry.</abstract> <sumDscr> <nation>UK</nation> <geogCover>London</geogCover> <nation>France</nation> <geogCover>Paris</geogCover> <nation>The Netherlands</nation> <geogCover>Amsterdam</geogCover> <nation>Germany</nation> <geogCover>Frankfurt</geogCover> <nation>Belgium</nation> <geogCover>Brussels</geogCover> <universe>Executive search consultants, researchers and associations in London, Paris, Frankfurt, Amsterdam and Brussels, 2006-2007</universe> <anlyUnit>Individuals</anlyUnit> <anlyUnit>Institutions/organisations</anlyUnit> <collDate event=”start” date=”01/2006″>January 2006</collDate> <collDate event=”end” date=”08/2007″>August 2007</collDate> <timePrd event=”start” date=”01/1980″>January 1980</timePrd> <timePrd event=”end” date=”08/2007″>August 2007</timePrd> </sumDscr> <notes> </notes> <method> <dataColl> <sources> <dataSrc rule=”Sources used”>The Executive Grapevine, The Directory of Executive Recruitment, published by The Executive Grapevine International Ltd. Editions consulted: 1980, 1985, 1990, 1994, 2000, 2005</dataSrc> <dataSrc rule=”Source location and access”>Copies are held at the British Library and the most recent edition is available for private purchase.</dataSrc> </sources> <collMode>Face-to-face interview</collMode> <collMode rule=”Other”>Time series for search firm and office data collated from the Executive Grapevine Directories of International Recruitment</collMode> <sampProc>Purposive selection/case studies</sampProc> <timeMeth>Cross-sectional (one-time) study</timeMeth> </dataColl> </method> <othrStdyMat> |