DDI-RDF Discovery Vocabulary

This specification defines the DDI Discovery Vocabulary, an RDF Schema vocabulary that enables discovery of research and survey data on the Web. It is based on DDI (Data Documentation Initiative) XML formats.

Thomas Hartmann, Richard Cyganiak, Joachim Wackerow, Benjamin Zapilko
Thomas Hartmann, Sarven Capadisli, Franck Cotton, Richard Cyganiak, Arofan Gregory, Benedikt Kämpgen, Olof Olsson, Heiko Paulheim, Joachim Wackerow, Benjamin Zapilko

Version 0.6 - 2013-09-30

Analysis Unit / Analyseeinheit: The data collection process focuses on the analysis of a particular type of subject. If, for example, the adult population of Finland is being studied, the AnalysisUnit would be individuals or persons.
Data element / Élément de donnée: RepresentedVariables encompass study-independent, re-usable parts of variables, such as an occupation classification.
Data file / Fichier de données: The class DataFile, which is also a dcmitype:Dataset, represents all the data files containing the microdata datasets.
Descriptive statistics / Statistique descriptive: SummaryStatistics pointing to variables and CategoryStatistics pointing to categories and codes are both DescriptiveStatistics.
Summary statistics: For SummaryStatistics, maximum values, minimum values, and standard deviations can be defined.
Category statistics: For CategoryStatistics, frequencies, percentages, and weighted percentages can be defined.
Instrument / Instrument de collecte: The data for the study are collected by an Instrument. An Instrument is an interview, a questionnaire, or another entity used as a means of data collection. In the case of a survey, its purpose is to record the flow of a questionnaire, its use of questions, and additional component parts. A questionnaire contains a flow of questions.
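The Summary statistics entry above names three values that can be attached to a variable: maximum, minimum, and standard deviation. The following is a minimal sketch of how such values might be computed before being published as metadata. The function name `summary_statistics` and the dictionary keys are hypothetical, and the use of the population standard deviation is an assumption, since the text does not say which variant is meant.

```python
import statistics


def summary_statistics(values):
    """Compute the three summary values named in the text for one variable.

    Assumption: the population standard deviation is used; the vocabulary
    text does not state whether a sample or population formula is intended.
    """
    if not values:
        raise ValueError("at least one observation is required")
    return {
        "minimum": min(values),
        "maximum": max(values),
        "standardDeviation": statistics.pstdev(values),
    }


if __name__ == "__main__":
    # Hypothetical observations of a single numeric variable.
    print(summary_statistics([2, 4, 4, 4, 5, 5, 7, 9]))
```

If a sample-based estimate is wanted instead, `statistics.stdev` could be swapped in; it requires at least two observations.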
LogicalDataSet / Ensemble de données: Each study has a set of logical metadata associated with the processing of data, at the time of collection or later during cleaning and re-coding. LogicalDataSet represents the microdata dataset.
Question: A Question is designed to get information on a subject, or sequence of subjects, from a respondent.
responseDomain: The response domain of questions. The response domain has to be an instance of the class Representation.
Questionnaire / Fragebogen: A questionnaire contains a flow of questions. Questionnaires must contain 1 to n questions using the object property question. Particular questions may be contained in 0 to n questionnaires.
Study / Étude: A Study represents the process by which a data set was generated or collected.
Study Group / Studiengruppe: In some cases, where data collection is cyclic or on-going, data sets may be released as a StudyGroup, where each cycle or wave of the data collection activity produces one or more data sets. This is typical for longitudinal studies, panel studies, and other types of series (to use the DDI term). In this case, a number of Study objects would be collected into a single StudyGroup.
Variable: Variables provide a definition of the column in a rectangular data file. A Variable is a characteristic of a unit being observed. A variable might be the answer to a question, have an administrative source, or be derived from other variables.
Universe / Univers: A Universe is the total membership or population of a defined class of people, objects or events.
Mapping: This class is for representing mappings between DDI-RDF and DDI-XML. See Section 10 in the specification for more details and examples.
number of cases / nombre d'observations: This property is used for representing the case quantity of a DataFile.
frequency / fréquence: This property is used to describe the frequencies within category statistics. See Sections 6 and 7 for more details and examples.
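The Questionnaire entry above states two cardinalities: a questionnaire must contain 1 to n questions, while a question may appear in 0 to n questionnaires. A minimal sketch of checking these rules on in-memory data follows. The classes and function names are hypothetical stand-ins for illustration and are not part of the vocabulary.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Question:
    """Hypothetical stand-in for a Question resource."""
    question_text: str


@dataclass
class Questionnaire:
    """Hypothetical stand-in for a Questionnaire resource."""
    label: str
    questions: list = field(default_factory=list)


def cardinality_errors(questionnaires):
    """Return labels of questionnaires that break the 1..n question rule."""
    return [q.label for q in questionnaires if len(q.questions) < 1]


def questionnaires_containing(question, questionnaires):
    """Return labels of questionnaires containing the question.

    An empty result is valid, because a question may be contained
    in 0 to n questionnaires.
    """
    return [q.label for q in questionnaires if question in q.questions]


if __name__ == "__main__":
    age = Question("How old are you?")
    wave1 = Questionnaire("wave-1", [age])
    draft = Questionnaire("draft", [])
    print(cardinality_errors([wave1, draft]))
    print(questionnaires_containing(age, [wave1, draft]))
```

Only the questionnaire side needs validation here; the question side has no lower bound, so an unused question is never an error.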
is public / ist öffentlich: This property is used as a flag indicating whether the microdata dataset is publicly available. The value true indicates that the dataset can be accessed (usually downloaded) by anyone.
is valid: Indicates whether the code (represented by skos:Concept) is valid or missing. Please note that this property is a feature at risk, since the domain is not a class of Disco. Maintainers of the domain ontology may define their own property.
question text / Fragetext: This property contains the actual text of a question as a string. See Section 8.2 for examples.
percentage / pourcentage: This property is used to describe the percentages within category statistics. See Sections 6 and 7 for more details and examples.
computation base / pourcentage: computationBase expresses whether the cases that form the basis for computing a statistics value are valid, invalid, or the total of both. The usage of computationBase for frequency differs from the usage for the percentage statistics and the summary statistics. A distinction regarding computationBase doesn’t apply to frequency as a category statistic. Please find more details in Section 6.3 of the specification.
cumulative percentage: This property is used to describe the cumulative percentages within category statistics. See Sections 6 and 7 for more details and examples.
purpose / Grund: The purpose of a Study or a StudyGroup.
subtitle / Untertitel: The sub-title of a Study or a StudyGroup.
start date: Defines the start date of a period of time. Please note that this property is a feature at risk, since the domain is not a class of Disco. Maintainers of the domain ontology may define their own property.
end date: Defines the end date of a period of time. Please note that this property is a feature at risk, since the domain is not a class of Disco. Maintainers of the domain ontology may define their own property.
Mapping from and to DDI-L: Mapping from and to DDI-L. See Section 10 in the specification for more details and examples.
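The entries above describe frequency, percentage, and cumulative percentage as category statistics, with computationBase selecting whether valid cases, invalid cases, or all cases form the denominator. The sketch below shows one plausible reading of that relationship. The function name, the argument names, and the exact way the base is applied are assumptions; Section 6.3 of the specification is authoritative and is not reproduced here.

```python
from collections import Counter


def category_statistics(codes, valid_codes, computation_base="valid"):
    """Compute per-code frequency, percentage and cumulative percentage.

    Assumption: computation_base ("valid", "invalid" or "total") selects
    which codes are reported and which cases form the denominator.
    The frequency itself is a plain count and does not depend on the base.
    """
    freq = Counter(codes)
    if computation_base == "valid":
        reported = [c for c in freq if c in valid_codes]
    elif computation_base == "invalid":
        reported = [c for c in freq if c not in valid_codes]
    elif computation_base == "total":
        reported = list(freq)
    else:
        raise ValueError("computation_base must be valid, invalid or total")

    base = sum(freq[c] for c in reported)
    rows = []
    cumulative = 0.0
    for code in sorted(reported):
        percentage = 100.0 * freq[code] / base
        cumulative += percentage
        rows.append({
            "code": code,
            "frequency": freq[code],
            "percentage": percentage,
            "cumulativePercentage": cumulative,
        })
    return rows


if __name__ == "__main__":
    # Hypothetical answers; code 9 stands for a missing (invalid) value.
    answers = [1, 1, 2, 3, 9]
    for row in category_statistics(answers, valid_codes={1, 2, 3}):
        print(row)
```

Switching `computation_base` to `"total"` changes the percentages but leaves each frequency untouched, which matches the statement that the distinction does not apply to frequency.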
Mapping from and to DDI-C: Mapping from and to DDI-C. See Section 10 in the specification for more details and examples.
context: Specifies conditions which have to be fulfilled for particular mappings. Context information can be either a SPARQL query or an informal description as a plain literal.
variable quantity: This property can be used when (1) no variable-level information is available or (2) only a stub of the RDF is requested. For example, when returning basic information on a study or file, no information on potentially hundreds or thousands of variable references or their metadata has to be returned.
analysis unit / Analyseeinheit: This property links to the analysis unit of a Study, a StudyGroup, or a Variable.
based on / utilise l'élément de donnée: This property points to the RepresentedVariable the Variable is based on.
collection mode / Datenerfassungsmodus: This property points to the mode of collection of a Questionnaire, which is a skos:Concept.
concept / a pour concept: This property points to the DDI concept of a RepresentedVariable, a Variable, or a Question.
aggregation: This property points to the aggregated data set of a microdata data set. The aggregated data set is a qb:DataSet of the RDF Data Cube Vocabulary.
data file / a pour fichier de données: This property points to the DataFile of a Study or a LogicalDataSet.
DDI file / DDI-Datei: This property points from a Study or a StudyGroup to the original DDI file, which is a foaf:Document.
external documentation / externe Dokumentation: This property points from an Instrument to a foaf:Document which is the external documentation of the Instrument.
funded by: This property points from a Study or a StudyGroup to the funding foaf:Agent, which is either a foaf:Person or an org:Organization.
had role: This property indicates the role of an Agent, e.g. analyst, data modeler, programmer, co-investigator or others.
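The variable quantity entry above describes returning a stub that reports how many variables exist instead of listing them. A minimal sketch of that idea on a plain dictionary follows. The function name `study_stub` and the keys `variable` and `variableQuantity` are hypothetical choices for this illustration, not a serialization defined by the vocabulary.

```python
def study_stub(study):
    """Return a shallow copy of a study description without its variables.

    The (possibly very long) list under the hypothetical key "variable"
    is replaced by a count under "variableQuantity".
    """
    stub = {key: value for key, value in study.items() if key != "variable"}
    stub["variableQuantity"] = len(study.get("variable", []))
    return stub


if __name__ == "__main__":
    # Hypothetical study description with three variable references.
    study = {"title": "Example study", "variable": ["v1", "v2", "v3"]}
    print(study_stub(study))
```

The original description is left unchanged, so a client can still request the full variable list later.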
in group: This property points from a Study to the StudyGroup which contains the Study.
input variable / variable en entrée: This property indicates the original Variable of an aggregated qb:DataSet. Please note that this property is a feature at risk, since the domain is not a class of Disco. Maintainers of the domain ontology may define their own property.
instrument / a comme instrument: This property indicates the Instrument of a Study or a LogicalDataSet.
kind of data: The general kind of data (e.g. geospatial, register, survey) collected in this study, given either as a skos:Concept or as a blank node with an attached free-text rdfs:label.
product / Produkt: This property indicates the LogicalDataSets of a Study.
question / a comme question: This property indicates the Questions associated with Variables or contained in Questionnaires.
representation / a pour représentation: RepresentedVariables and Variables can have a Representation whose individuals are either of the class rdfs:Datatype (to represent values) or skos:ConceptScheme (to represent code lists).
statistics category / a pour concept statistique: This property points to the skos:Concept (representing codes and categories) of a specific CategoryStatistics individual.
statistics data file / a pour fichier statistique: This property indicates the DataFile of a specific DescriptiveStatistics individual. DescriptiveStatistics may have statisticsDataFile relations to 0 to n data files (DataFile), and a DataFile may be in 0 to n statisticsDataFile relations to DescriptiveStatistics individuals.
statistics variable / a pour variable statistique: This property indicates the Variable of a specific SummaryStatistics individual. SummaryStatistics point to 0 to n variables (Variable) using the object property statisticsVariable.
weighted by: SummaryStatistics or CategoryStatistics resources may be weighted by a specific Variable.
universe / a comme univers: This property indicates the Universe(s) of Studies, StudyGroups, RepresentedVariables, Variables, Questions, and LogicalDataSets.
variable / Variable: This property indicates the Variable of a Study and points to the Variable contained in the LogicalDataSet.
summary statistics type: This property points to the summary statistics type of a Questionnaire, which is a skos:Concept.