Data management for social survey research: 2-day training workshop

..or, "Progress in data managment: To Stata and beyond"


University of Stirling, 24-25 August 2009

This workshop seeks to give an introduction to issues in - and the payoff to - data management for social survey research. It has a special focus on the Stata package, as we argue that this provides the most suitable approach to combining complex data management and data analysis for social scientists. Below are links to workshop materials including slides used in the various presentations, and the files used in the programme of 3 training lab sessions.

Download the workshop programme (pdf)

Workshop materials

In addition to those materials listed below which are provided free open-access, the lab sessions of the workshop make use of survey microdata files which we can't distribute ourselves, but which are available for download via the UK Data Archive (details on the necessary files are found within the relevant command files).


Session 1 Slides: 'The significance of data management..', (pdf / ppt)
Session 2 Slides: 'Discipline and data management..', (pdf / ppt)
Session 3 Lab 1: Core data management with Stata

Principle stata format 'do' files: lab1_master.do ; lab0.do; lab1_highlights.do

Additional data and command files used within the lab: voting_example_data.do ; seglabelsv1.do ; LDA example data folder.

See also the pdf guide to the lab sessions, and links from the related Longitudinal Data Analysis for Social Science Researcher's website).
Session 4 Slides: 'Data management and variable operationalisations', (pdf / ppt)
Session 5 Slides: 'What is e-Social Science?', (pdf / ppt)
Session 6 Slides: 'Documentation and Workflows', (pdf / ppt)
Session 7 Lab 2: Advanced data management with Stata

Principle stata format 'do' files: lab2_master.do ; profile.do; file_matching_extensions.do; documentation_for_replication.do ; variable_operationalisations.do; rae_exemplar.do;

Additional data and command files used within the lab: add_varnames_to_labels.do ; soc90_labels.do ; sub-files invoked within 'documentation for replication' ; folder with further data on occupations ; folder with RAE example data files.

See also the pdf guide to the lab sessions, and links from the related Longitudinal Data Analysis for Social Science Researcher's website).
Session 8 Slides: 'Standardisation, harmonisation and measurement', (pdf / ppt)
Session 9 Slides: 'Data management and frontier in survey research', (pdf / ppt)
Session 10 Lab 3: 'Online resources for data management':

(handout pdf / video demos (DAMES - eHealth) / GEODE practical materails)








This page was last updated on 23 August 2009.