Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Presentation . 2017
License: CC BY SA
Data sources: Datacite
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Presentation . 2017
License: CC BY SA
Data sources: Datacite
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2017
License: CC BY SA
Data sources: ZENODO
versions View all 2 versions
addClaim

Metadata-Driven Scientific Use File Data Management

Authors: Bela, Daniel;

Metadata-Driven Scientific Use File Data Management

Abstract

With several excellent software tools emerging for the task, survey instrument creation in became more and more structured and automated in recent years. However, after field work has been done, it usually is up to one or several data managers in research institutions to process the data files and create ready-to-use analysis datasets and documentation. This data management process often is badly structured and documented, and seldom automated in social research. Many of the procedures that have to be run in order to create usable datasets, however, contain the potential for full- or semi-automation as soon as the procedures themselves are structured appropriately. In order to deal with the vast of incoming field data from the German National Educational Panel Study (NEPS), the data management team at LIfBi (in cooperation with partners across Germany) implemented such a structured and semi-automated approach for creating and updating the Scientific Use Files for the six panel cohorts of NEPS. This happened by conceptually separating several data management tasks from each other, and creating interface steps for interchanging data extracts (e.g. for coding text answers from the surveys or generating additional variables) with external partners. Additionally, every step of the data management process that could be automated by re-using information from the survey instruments or field documentation (e.g. renaming of variables, labeling, translation), has been designed to make use of this potential for automation. This led to a large amount of additional meta-information that now directly is integrated into NEPS Scientific Use File datasets, such as full questionnaire texts. Based on these experiences, a sketch of 'best practice' solutions to implement a metadata-driven data management workflow can be established. This presentation will focus on conceptual solutions to improve data management procedures in order to make them more structured, better documented, and less error-prone. Eventually, this approach can lead to better survey data for analyses, and reduce unsystematic variance in data management procedures---which otherwise necessarily constitute (in the best case) a large workload of fixing data afterwards or (in the worst case) biased research results.

Keywords

Stata, NEPS, data management, social sciences

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
    OpenAIRE UsageCounts
    Usage byUsageCounts
    visibility views 6
    download downloads 12
  • 6
    views
    12
    downloads
    Powered byOpenAIRE UsageCounts
Powered by OpenAIRE graph
Found an issue? Give us feedback
visibility
download
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
views
OpenAIRE UsageCountsViews provided by UsageCounts
downloads
OpenAIRE UsageCountsDownloads provided by UsageCounts
0
Average
Average
Average
6
12
Green