Dataset: Generations and Gender Survey Italy Wave 1 & Wave 2

Abstract

The Generations and Gender Survey (GGS) provides micro-level data with the aim of significantly improving the knowledge base for social science and policymaking in Europe and developed countries elsewhere.  
In Europe 2020, the European Union develops a strategy "to help us come out stronger from the crisis and turn the EU into a smart, sustainable and inclusive economy delivering high levels of employment, productivity and social cohesion". The economic crisis affects not only day-to-day decisions, but also fundamental choices at all stages of people's lives: marriage and childbearing, the combination of employment and caring responsibilities for the young and the old, retirement, housing, and ageing well. The GGS has been developed to provide scientists with high-quality data to contribute scientifically grounded answers to these key policy questions. Survey content focuses on intergenerational and gender relations between people, expressed in care arrangements and the organization of paid and unpaid work. Key features of the survey are:  
- Cross-national comparability. In each country data is collected on the basis of a common international questionnaire and guidelines about the methodology. Data processing includes central harmonization of national datasets.  
- A broad age range. It includes respondents between the ages of 18 and 80.
- A longitudinal design. It has a panel design, collecting information on the same persons at three-year intervals.  
- A large sample size. It has an average of 9,000 respondents per country at Wave 1.
- A theory-driven and multidisciplinary questionnaire. It provides data for policy relevant research by demographers, economists, sociologists, social policy researchers, social psychologists and epidemiologists. The questionnaire is inspired by the theory of planned behavior.
- Possibility to combine the survey data with macro data provided by the GGP Contextual Database. This combination enables analyses of individuals and families in their cultural, economic, political, social and policy contexts.

Variable Groups

Document Description

Full Title

Generations and Gender Survey Italy Wave 1 & Wave 2

Alternative Title

GGS Italy Wave 1 & Wave 2

Identification Number

GGSW1.W2.17

Date of Distribution

2016-08-23

Version

Working Version: GGS Wave 1 Version 4.3 and GGS Wave 2 Version 1.3.  

Update of variable categories and documentation with the release of Poland Wave 2 Version 1.3.

Date: 2018-02-26

Guide To Codebook

In the field “Study Description”, users can find metadata about surveys. This includes the distributors, keywords, abstract, and guidelines on the bibliographic citation.  
Country specific metadata include information on survey producers, methodology and processing. This information was provided by the GGP-country teams, based on a metadata grid with pre-structured questions. Links to relevant references (e.g., working papers and questionnaires) are also provided.  


The field “Data Files Description” provides metadata about the data file, such as file contents, missing values, as well as changes across different GGS versions.

The field "Variable Description" provides information on each variable, such as question text, descriptions of country specific categories and variables, universe (i.e., subset of respondents to whom the question was asked), country specific deviations to GGS routing, descriptions of the ways in which consolidated and derived variables are calculated. Variables are ordered according to the sections of the GGS codebook.

PLEASE NOTE THAT WE DOCUMENT ONLY VARIABLES HAVING VALID CASES.  
VARIABLES HAVING ALL SYSTEM MISSING CASES ARE NOT DOCUMENTED.  
This is the reason why the total number of variables in the documentation is smaller than the total number of variables in the SPSS and STATA files.

Full Title

GGS_W1-V.4.2.&W2-V.1.3_Italy

Producer

Name Affiliation Abbreviation Role
Arianna Caporali Institut national d'études démographiques (INED) AC

Study Description

Full Title

Generations and Gender Survey Italy Wave 1 & Wave 2

Alternative Title

GGS Italy Wave 1 & Wave 2

Parallel Title

Famiglia e soggetti sociali (FSS) (2003); Criticità dei percorsi lavorativi in un'ottica di genere (2007)

Identification Number

GGSW1.W2.17

Authoring Entity

Name Affiliation
Italian National Institute of Statistics (Istat)

Other identifications and acknowledgments

Name Affiliation Role
Istat Central Directorate for the study of conditions and quality of life (DCCV), Department of Social Structure and Dynamics (SDS) Data treatment, cleaning and weighting

Producer

Name Affiliation Abbreviation Role
Italian National Institute of Statistics Istat

Funding Agency/Sponsor

Name Abbreviation Role Grant
Italian National Institute of Statistics Istat
Italian Ministry of Labour and Social Policy

Data Distributor

Name Affiliation Abbreviation
Institut national des études démographiques - 133 boulevard Davout 75980 Paris Cedex 20, France. INED
Netherlands Interdisciplinary Demographic Institute - Lange Houtstraat 19, NL-2511 CV The Hague, The Netherlands NIDI

Depositor

Name Affiliation Abbreviation
Italian National Institute of Statistics Istat

Bibliographic Citation

United Nations 2005. Generations & Gender Programme: Survey Instruments. New York and Geneva: UN, 2005.

List of Keywords

Date of Collection

Start End Cycle
2003-11-01 2004-01-01 Wave 1
2007-02-01 2007-03-15 Wave 2

Country

ITALY  (ITA)

Geographic Coverage

Whole territory of the country.

Geographic Unit

WAVE 1
Region (20 regions and 2 autonomous provinces).

WAVE 2
Geographical areas: North-west, North-east, Centre, South, Islands and the municipality size.

Unit of Analysis

Individuals

Universe

WAVE 1
People of all ages as of December 31st, 2002, living in private households in Italy.

WAVE 2
People aged 18-64 years in wave 1 (i.e. in 2003), living in private households.

Kind of Data

Survey data

Time Method

Panel

Data Collector

Sampling Procedure

WAVE 1 SAMPLING PROCEDURE  
1. Sampling frame
1.1 Type of frame: Population register from municipalities selected at first stage.
1.2  Frame coverage: not available
1.3 Frame size: 24,000 households selected from 755 primary units (municipalities), about 10% of total municipalities.
1.4 Level of units available: Municipalities.

2. Sampling method
2.1 Sampling method type: Two-stage sampling with stratification of primary sampling unit.
First stage: selection of primary sampling units (municipalities) stratified by region, municipal typology and population size. Inclusion probabilities are proportional to demographic size (PPS) of municipalities in each stratum; the selection is performed according to the Madow method (Madow, W. G. 1949. "On the Theory of Systematic Sampling, II," Annals of Mathematical Statistics, 20, 333-354.)
Second stage: elementary sampling units (households) are selected from Municipal population offices by systematic sampling with equal inclusion probabilities.
2.2 Sampling stage definition
- PSU: Municipalities.
- SSU: Households.
- TSU: not applicable
2.3 Sampling stage size
- PSU: About 800.
- SSU: 24,453.
- TSU: not applicable
2.4 Unit selection: The primary sampling units of the sample design (the municipalities) are stratified by region, municipal typology and size in terms of population. Inclusion probabilities are proportional to demographic size and the selection scheme is random.
2.5 Final stage unit selection: Final units are not stratified and are selected by systematic random sampling (i.e. using a step) with a random seed.
2.6 Within Household unit selection: All persons in the household were selected.
2.7 Stratification: Only primary units are stratified as described above, with the aim of obtaining a self-weighting sampling design at regional level.
2.8 Sample size
- Starting size sample: 24,453 households.
- Aimed total size at Wave 1: About 50,000 individuals of all ages.
- Aimed total size at Wave 3: No wave 3 planned.  
2.9 Estimated Non-response
- Initial non-response: About 15%.
- Yearly attrition: None.  
- Non response measures: None, post stratification is applied in weighting process.  
- Within household non-responses measures: Proxy response was used. The woman is usually selected as proxy of other household members.
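
The two selection stages described above can be illustrated with a minimal Python sketch. It is not Istat's implementation: the function names, the toy municipality sizes and the handling of very large units are assumptions made for illustration. The first function draws units with probability proportional to size by stepping through the cumulated sizes from a random start (the idea behind the systematic scheme cited above); the second draws an equal-probability systematic sample from a list.

```python
import random


def systematic_pps(sizes, n, rng=None):
    """Systematic selection with probability proportional to size.

    sizes: dict mapping unit name -> positive size measure (hypothetical data).
    A unit larger than the sampling interval can be hit more than once;
    real designs usually treat such units as self-representing.
    """
    rng = rng or random.Random()
    step = sum(sizes.values()) / n          # sampling interval
    start = rng.random() * step             # random start in [0, step)
    points = iter(start + k * step for k in range(n))

    selected = []
    cumulative = 0.0
    point = next(points, None)
    for unit, size in sizes.items():
        cumulative += size
        # Every selection point falling inside this unit's range selects it.
        while point is not None and point < cumulative:
            selected.append(unit)
            point = next(points, None)
    return selected


def systematic_equal(units, n, rng=None):
    """Equal-probability systematic sample of n items from a list."""
    rng = rng or random.Random()
    step = len(units) / n
    start = rng.random() * step
    return [units[int(start + k * step)] for k in range(n)]


# Toy example: four municipalities, then households within one of them.
municipalities = {"A": 100, "B": 300, "C": 50, "D": 550}
psus = systematic_pps(municipalities, n=2, rng=random.Random(1))
households = systematic_equal(list(range(1000)), n=10, rng=random.Random(1))
```

In the toy data, municipality "D" is larger than the sampling interval, so it is always selected; this is why large municipalities are often handled as a separate, self-representing stratum.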

WAVE 2 SAMPLING PROCEDURE  
1. Sampling frame
1.1 Type of frame: Name list of respondents aged 18-64 years in wave 1 (2003) with an available telephone number (mobile or not)
1.2  Frame coverage: Target population: 10,000 individuals
1.3 Frame size: 19,565  individuals
1.4 Level of units available: individuals

2. Sampling method
2.1 Sampling method type: There are 11,511 households, and a total of 25,102 individuals (aged 18-64 years in 2003), with an available telephone number. As the aim was to reach a target of 10,000 valid interviews, it was decided to randomly select 19,565 individuals from all households.
2.2 Sampling stage definition
  - PSU: The sample design is a complex one, as it derives from the sample design adopted for wave 1 (i.e. two-stage sampling procedure) and the selection of people aged 18-64 in wave 1 to be interviewed in wave 2 (tertiary sampling unit).
  - SSU:
  - TSU:  
2.3 Sampling stage size
  - PSU:  
  - SSU:  
  - TSU: 19,565 individuals
2.4 Unit selection: At least one individual selected from each household.
2.5 Final stage unit selection: According to the name list
2.6 Within Household unit selection: According to the name list
2.7 Stratification:  
2.8 Sample size
  - Starting size sample: 19,565 individuals
  - Aimed total size at Wave 1: 10,000 individuals
  - Aimed total size at Wave 3:  
2.9 Estimated Non-response
  - Initial non-response:  
  - Yearly attrition:  
  - Non response measures: Oversampling. In order to reach 10,000 valid interviews, the selected sample is 19,565 individuals
  - Within household non-responses measures: Proxy answers were accepted only if it was impossible to book an appointment with the target individual.
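
One way to draw a fixed number of individuals while keeping at least one person per household, as items 2.1 and 2.4 describe, is sketched below in Python. The codebook does not state how the constraint was implemented, so the two-step approach (one random person per household, then a simple random top-up) and all names are assumptions.

```python
import random


def select_with_household_floor(households, target, rng=None):
    """Select `target` persons with at least one person per household.

    households: dict mapping household id -> non-empty list of person ids
    (hypothetical structure). Step 1 picks one random member per household;
    step 2 tops up with a simple random sample of the remaining members.
    """
    rng = rng or random.Random()
    chosen, remaining = [], []
    for members in households.values():
        members = list(members)
        rng.shuffle(members)
        chosen.append(members[0])
        remaining.extend(members[1:])

    extra = target - len(chosen)
    if extra < 0:
        raise ValueError("target is smaller than the number of households")
    if extra > len(remaining):
        raise ValueError("target exceeds the number of available persons")
    chosen.extend(rng.sample(remaining, extra))
    return chosen


# Toy frame: three households, four persons to select.
frame = {1: ["p1", "p2"], 2: ["p3"], 3: ["p4", "p5", "p6"]}
sample = select_with_household_floor(frame, target=4, rng=random.Random(0))
```

With the figures reported above, `households` would hold 11,511 entries covering 25,102 persons and `target` would be 19,565.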

Mode of Data Collection

WAVE 1
Method: Face-to-Face (personal interview) + self-administered questionnaire.
Technique: Paper and Pencil.

WAVE 2
Method: Telephone interview
Technique: CATI

Type of Research Instrument

WAVE 1
Structured questionnaire in Italian and German for people living in Bolzano/Bozen.

WAVE 2
Blaise

Characteristics of Data Collection Situation

WAVE 1 DATA COLLECTION  
1. Interviewers
1.1 Total number of interviewers: 1,645.
1.2 Number of interviewers in the field: 1,645.
1.3 Network organization: 22 regional coordinators.
1.4 Working arrangement of interviewers: Istat makes use of the Municipal Council networks for selecting interviewers, responding to the criteria defined by Istat. Interviewers are mainly employees of the municipal office itself, but also students and other part-time workers. The interviewers belong to a list of household surveys' interviewers that is regionally and centrally monitored. The archive of interviewers is centrally supervised and collects information about interviewers' performance in their past assignments.
1.5 Payment of interviewers: Interviewers were paid per successful interview. Moreover, they received a reimbursement for training and transmission of quality information.

2. Interviewer training
2.1 General interviewing: There were instruction meetings in each province, run by Istat regional offices, held with the interviewers from the sampled municipalities (first step training). Istat regional offices were, in turn, trained by the Istat central office (second step training).  
2.2 Survey specific: Yes. General information about the Programme was presented.
2.3 Length: The first step training (Istat central office to regional offices) took two days. The second step training took one day and documentation and slides were disseminated. At the end of the training meetings, a form was filled in collecting the participants' assessments.
2.4 Control of performance: Respondents were randomly recontacted to verify that the interview had taken place. There were two attempts to evaluate each interviewer and, in case of a false interview, all families assigned to the same interviewer were contacted. Only a small portion, less than 4% of the interviews, was considered irregular. Moreover, Istat checked the questionnaires in order to evaluate their quality.  
2.5 Interviewer survey: No.  

3. Contact protocols
3.1 Advance letter: In order to limit the refusal rate, Istat sent a letter before starting the survey. The letter informed families about the interview and aimed at stimulating the interest of the respondents, explaining the reasons why their collaboration is important, reassuring people about their most common fears and allowing them to understand the expected goals and results.
3.2 Cold contacts: Telephone.
3.3 Scheduling / scattering: The interviewer organized the contacts by him/herself.  
3.4 Contact history: No.  
3.5 Min number of contacts: No.  
3.6 Max number of contacts: No.  

4. Questionnaire localization
4.1 Validation: No.
4.2 Pre-test: No. However, the survey was previously conducted in 1998 on an independent sample and, in that case, a pilot study was performed. New questions pertaining to GGS were tested independently.
4.3 Length of interview: Average length of interview per household: 70 minutes.

WAVE 2 DATA COLLECTION  
1. Interviewers
1.1 Total number of interviewers: about 20 interviewers, scheduled in 3 groups per day
1.2 Number of interviewers in the field:  
1.3 Network organization: centralized
1.4 Working arrangement of interviewers: Fully employed and Contracted - other
1.5 Payment of interviewers: Per interview

2. Interviewer training
2.1 General interviewing: Yes, interviewers were trained in general interviewing techniques, with a particular focus on the follow-up conditions and the aim of the survey. They were trained by Istat researchers. They had experience with survey interviews because most of them had already worked for Istat.  
2.2 Survey specific: Yes, interviewers were also trained about GGS issues. Generally speaking, they received a briefing from Istat researchers with PowerPoint presentations. A FAQ document was also prepared to support them. After the training and some test interviews, there was also a de-briefing.
2.3 Length: Two days
2.4 Control of performance: Yes, Istat personnel and researchers supervised the fieldwork to support and monitor the adequacy of the interviews.
2.5 Interviewer survey: Not a survey, but de-briefings were organized to clarify doubts and provide general aid. Moreover, the monitoring system made it possible to link interviewer performance to contacts and valid interviews.

3. Contact protocols
3.1 Advance letter: Yes, an advance letter was sent to all sampled individuals. It thanked them for their participation in wave 1, explained the aim of the follow-up survey and its relevance for the study of generations and gender, informed them about the protection of personal data, and provided contact information and a toll-free number for further information. The letter was signed by the President of Istat.
3.2 Cold contacts:  
3.3 Scheduling / scattering: Yes, the scheduling procedure distinguishes between morning, afternoon and evening. Moreover, an appointment could be fixed with respondents on any day of the week, including Saturdays and Sundays.
3.4 Contact history: Yes, there was the so-called "Scheda contatti"
3.5 Min number of contacts:  
3.6 Max number of contacts: Yes, 11.

4. Questionnaire localization
4.1 Validation: No re-translation was done
4.2 Pre-test: Yes, normal survey route. Moreover the questionnaire was also tested on part of the available sample.
4.3 Length of interview: Average duration about 1 hour.  

If the respondent was busy or not available, we attempted to agree upon a new appointment. If the respondent could not be reached on the landline, we also offered to call him/her back on the mobile.

Actions to Minimize Losses

WAVE 1 ACTIONS  
1.  Dealing with nonresponse
1.1 Screening: Respondents were asked about their availability to participate in subsequent surveys.
1.2 Refusal conversion: Usual techniques of refusal conversion (an advance letter by Istat was sent to the selected households).
1.3 Incentives: No.

2. Tracking of sampled units
2.1 Respondent contact information: Yes, contact details of the respondent were collected, such as addresses, home telephone number and mobile number of each family member.
2.2 Other contact information: Yes, contact information of each family member.
2.3 Cards: On-line thank-you letter was sent showing the main results of the survey on www.istat.it.
2.4 Additional surveys: No
2.5 Administrative records: For records without a telephone number, Istat asked the municipality to provide information about any change in residential address in order to track them elsewhere.

WAVE 2 ACTIONS  
1.  Dealing with nonresponse
1.1 Screening: At first it was asked whether the respondent named … was at home and available to be interviewed.
1.2 Refusal conversion:
1.3 Incentives: No incentives

2. Tracking of sampled units
2.1 Respondent contact information: Yes
2.2 Other contact information: No, but the information about the family was used to ask about him/her in order to minimize attrition.  
2.3 Cards: No, but the report of the survey was disseminated on the web site.
2.4 Additional surveys: No
2.5 Administrative records: Yes, especially to look for individuals who moved from the household in wave1.

Weighting

WAVE 1 WEIGHT VARIABLES  
The final weights assigned to the responding sample units are defined in the following steps:
1) The direct weights are calculated as the reciprocal of the inclusion probability of the units;
2) The non-response correction factors are calculated for total non-response, as the inverse of the response rate at the level of the municipality to which each unit belongs;  
3) The base weights, or weights corrected for total non-response, are obtained by multiplying the direct weights by the corresponding correction factors for total non-response;  
4) The final correction factors are constructed so that, at regional level, the known totals of the auxiliary variables equal the corresponding sample estimates;  
5) Finally, the final weights are calculated as the product of the base weights and the correction factors obtained in step 4.
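
The five steps can be sketched in Python as follows. This is a simplified illustration, not the Istat procedure: step 4 is replaced here by a plain ratio adjustment within a single calibration cell (for example a region), whereas the actual calibration uses several auxiliary variables. The dictionary keys and the toy figures are assumptions.

```python
def final_weights(units, known_totals):
    """Compute final weights for responding units.

    units: list of dicts with hypothetical keys
        'inclusion_prob' - inclusion probability of the unit
        'response_rate'  - response rate in the unit's municipality
        'cell'           - calibration cell, e.g. a region
    known_totals: dict mapping cell -> known population total.
    """
    base = []
    for u in units:
        direct = 1.0 / u["inclusion_prob"]             # step 1
        nonresponse_factor = 1.0 / u["response_rate"]  # step 2
        base.append(direct * nonresponse_factor)       # step 3

    # Step 4 (simplified): one ratio factor per cell, chosen so that the
    # weighted sample count in the cell equals the known total.
    estimated = {}
    for u, w in zip(units, base):
        estimated[u["cell"]] = estimated.get(u["cell"], 0.0) + w
    factor = {cell: known_totals[cell] / estimated[cell] for cell in estimated}

    # Step 5: final weight = base weight x correction factor.
    return [w * factor[u["cell"]] for u, w in zip(units, base)]


# Toy example with two respondents in one cell.
respondents = [
    {"inclusion_prob": 0.01, "response_rate": 0.8, "cell": "North"},
    {"inclusion_prob": 0.01, "response_rate": 0.8, "cell": "North"},
]
weights = final_weights(respondents, {"North": 500})
```

In the toy example each base weight is 100 x 1.25 = 125, the estimated cell total is 250, and the correction factor of 2 brings the weighted total to the known 500.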

WAVE 2 WEIGHT VARIABLES  
Yes. The final weights to be applied to the sample units interviewed in 2007 are obtained starting from the final weights for the wave 1 survey.
1) the direct weights are taken from the final weights defined in wave 1 (2003 survey);
2) the final correction factors are constructed so that, by macro-area, sex and age group, the known totals of the auxiliary variables (calculated on the longitudinal population 2003-2007) equal the corresponding sample estimates;
3) finally, the final weights are calculated as the products of the base weights and the correction factors obtained above.

Cleaning Operations

WAVE 1 CLEANING OPERATIONS
Cleaning operations were performed. Moreover, deterministic and probabilistic non-response imputation methods were applied.

WAVE 2 CLEANING OPERATIONS   
Yes, cleaning operations were performed and consistency with wave 1 was also checked.

Response Rate

WAVE 1
Frequency of final disposition codes:
I = complete interview: 19,227 households.
P = partial interview: 0
NE = non-eligible: DK
NC = non-contact: DK
R = refusal: 1,560
O = other non-response: DK  
UC = unknown eligibility, contacted: DK
UN = unknown eligibility, non-contact: DK
eC = estimated proportion of contacted cases of unknown eligibility that are eligible: DK
eN = estimated proportion of non-contacted cases of unknown eligibility that are eligible: DK
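
Disposition codes of this kind are commonly combined into a response rate of the form I / (I + P + R + NC + O + eC x UC + eN x UN), where UN stands for the non-contacted cases of unknown eligibility. The Python sketch below uses that formula as an assumption; the codebook itself does not state which formula applies. Because most Wave 1 components are reported as DK, the function returns None for them rather than a misleading figure.

```python
def response_rate(I, P, R, NC, O, UC, UN, eC, eN):
    """Response rate from final disposition codes.

    Assumed formula: I / (I + P + R + NC + O + eC*UC + eN*UN).
    Any component passed as None (a 'DK' in the codebook) makes the
    result None, because the rate cannot be computed without it.
    """
    components = (I, P, R, NC, O, UC, UN, eC, eN)
    if any(c is None for c in components):
        return None
    denominator = I + P + R + NC + O + eC * UC + eN * UN
    return I / denominator


# Wave 1 as documented: only I, P and R are known, so no rate is returned.
wave1 = response_rate(I=19227, P=0, R=1560, NC=None, O=None,
                      UC=None, UN=None, eC=None, eN=None)

# Hypothetical complete set of counts, for illustration only.
example = response_rate(I=80, P=0, R=10, NC=5, O=5, UC=0, UN=0, eC=1.0, eN=1.0)
```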

WAVE 2
Information forthcoming.

Restrictions

In order to access micro data files, users have to sign and submit a Statement of affiliation, confidentiality and acceptable usage. They also have to submit a title and abstract of their research project. They can use the data for all their research projects, except for datasets from Australia and Norway. Users of these datasets need to submit a new application form if they want to use the data in a different research project. The access rights from Wave 1 data are transferred to the Wave 2 data.

Access Authority

Name Affiliation E-mail address Universal Resource Identifier
UNECE Population Unit - Palais des Nations - CH-1211 Geneva 10 - Switzerland. Tel: +41 22 917 24 77 - fax: +41 22 917 01 07 ggp@unece.org http://www.unece.org/pau/

Citation Requirement

In any work emanating from research based on the Generations and Gender Survey micro-data, I will acknowledge that these data were obtained from the GGP Data Archive and refer to the publication that describes the model survey instruments: United Nations 2005. Generations & Gender Programme: Survey Instruments. New York and Geneva: UN, 2005.

Deposit Requirement

Users of GGS micro-data are required to send any research papers based on the Generations and Gender Survey micro-data or aggregate tabulations to the Population Activities Unit of the UN Economic Commission for Europe, for inclusion in the GGP publications archive.

Conditions

In order to access the data, it is necessary to subscribe to the GGP Data User Space, and to follow the instructions available on the GGP data access webpage.

Disclaimer

The authors and producers bear no responsibility for the uses of the GGS data, or for interpretations or inferences based on these uses. The producers accept no liability for indirect, consequential or incidental damages or losses arising from use of the data collection, or from the unavailability of, or break in access to the service for whatever reason.

Related Materials

Italy_Questionnaire_W1_en

Italy_Questionnaire_W2_it

Other References Note

Italian country presentations at the GGP International Working Group Meetings

Data Files Description

File Name

GGS_Wave1_Italy_V.4.3..NSDstat

Contents of Files

VARIABLES HAVING ALL SYSTEM MISSING CASES ARE DROPPED BEFORE PUBLICATION IN NESSTAR.
This is the reason why the total no. of variables in the Nesstar data file is smaller than the total number of variables in the SPSS and STATA files.

Variables are ordered according to the sections of the GGS codebook: Household, Children, Partnerships, Household Organisation and Partnership Quality, Parents and Parental Home, Fertility, Health and Well-Being, Respondent's Activity and Income, Partner's Activity and Income, Household Possessions, Income and Transfers, Value Orientations and Attitudes, Interviewers' report.
The variables begin with a letter designating the wave of data collection ("a" for the first wave, "b" for the second wave). We have attempted to keep the names of variables the same across the waves; new variables are identified as ["wave letter"]n, e.g. bn301.  
Although we encourage the countries to strictly follow the GGS Questionnaire, countries might implement a question that differs to a considerable extent from the GGS Questionnaire. In this case either we add country specific response values, or we introduce a country specific variable.  
Country specific values are added when the question follows the model questionnaire, but the answers are not, or only partly, compatible. They are at least 4 digits long (F4 format) and begin with the country code: e.g., Australia 2401. Hence the country code for Australia, as an example, is 24.  
A country specific variable is introduced when the question differs from the model questionnaire albeit measuring the same concept. Such variables are identified by a suffix consisting of the country code plus a number, e.g., Australia a119_2401.
For an overview of GGS country codes, please refer to the variable "acountry".
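
The naming conventions above can be turned into a small parser, sketched here in Python. The regular expressions are assumptions derived only from the examples given (bn301, a119_2401): a two-digit country code is assumed, and names that do not follow the "wave letter + question number" pattern (constructed variables such as fertintent, or grid variables such as ahg3_#) are returned as None.

```python
import re

# Wave letter, optional "n" for variables new in that wave, question number.
NAME = re.compile(r"^(?P<wave>[ab])(?P<new>n?)(?P<number>\d+)(?P<rest>.*)$")
# Assumed country-specific suffix: "_" + 2-digit country code + serial number.
COUNTRY_SUFFIX = re.compile(r"_(?P<country>\d{2})(?P<serial>\d{2,})$")


def parse_variable_name(name):
    """Split a GGS-style variable name into its documented components."""
    match = NAME.match(name)
    if match is None:
        return None
    suffix = COUNTRY_SUFFIX.search(match.group("rest"))
    return {
        "wave": 1 if match.group("wave") == "a" else 2,
        "new_in_wave": match.group("new") == "n",
        "question": match.group("number"),
        "country_code": suffix.group("country") if suffix else None,
    }


parsed_new = parse_variable_name("bn301")
parsed_country = parse_variable_name("a119_2401")
```

A short grid suffix such as the "_1" in a109_1 is deliberately not read as a country code, since it has fewer than four digits.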

File Structure

Record Group

Overall Case Count

9570

Overall Variable Count

656

Type of File

Nesstar 200801

Extent of Processing Checks

DATA HARMONISATION
The data is submitted in an already pre-harmonised form. It is prepared and organised according to the GGS standards.  
Harmonisation aims at achieving a clear and comparable format of the GGS micro-data files that is adequate for cross-country comparison. The harmonisation procedure consists of the following steps:
1. Label checks  
This step makes sure that all the variables are named the same across the countries and refer to a particular question in the GGS Questionnaire. Also the value labels are checked. They should be the same across GGS datasets.  
2. Dealing with grids
The GGS Questionnaire contains several grids, holding either event history information or the members of the household. Such data need to be harmonized with specific attention to the order and logical consistency of grid rows (which represent either household members or events such as births). In the data, each row of the grid is represented by a variable name followed by a numeric suffix ("_#"). Each suffix thus represents one household member or one event. Part of the grid harmonization is grid sorting: grid rows are sorted according to a pre-defined key. For example, in the household grid, the household members are sorted according to their relationship to the respondent, i.e. the relation to respondent variable (ahg3_# or bhg3_#). Respondents appear first, followed by their partners and children, if any, and then by other household members. As there may be more than one child (or other relative) living in the household, they also need to be sorted. In the case of the household grid, age is used as the secondary sorting key (from the oldest person to the youngest).
3. Routing
The routing check ensures that the structure of the underlying dataset matches the structure of the GGS questionnaire. Its main goal is to code any given variable in the dataset as either a valid response, a nonresponse or a skip, as indicated in the questionnaire. Consequently, a skip indicated in the questionnaire is represented with a system missing code (. in STATA, sysmis in SPSS), while information missing for other reasons is coded as non-applicable/no response (i.e. codes 7, 8, 9 in SPSS or .a, .b, .c in STATA).  
4. Consolidation  
The process consolidates the information scattered over several variables into a single one. The consolidation procedure is carried out in the Children Section, the Partnership Section and the Parents and Parental Home Section.
5. Imputation  
Because income is a sensitive topic, respondents are often reluctant to share income information with the interviewer. In order to be able to use income information in a cross-country comparative study without losing too many observations, it is necessary to impute the approximately correct distribution of the income variable in each country.  
6. Calculation of derived variables
We calculate derived variables out of the following variables:
- grid variables (i.e., household grid, children grid, and partnership history grid); the codebook starts with the constructed variables that sum the key socio-demographic characteristics of the respondent.
- month and year variables,  
- hours and minutes variables,
- frequency and unit variables.  
Occupation variables are recoded into ISCO-88 1 digit.
Explanations of the ways in which consolidated and derived variables are obtained are available under the field "Note" of the "Variable Description" sections.
For a more detailed and technical procedure please refer to the Data Cleaning and Harmonisation Guidelines.
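
The grid sorting described in step 2 can be sketched in Python as follows. The relationship categories and their ranking are hypothetical placeholders; the actual codes of ahg3_# / bhg3_# are documented in the variable descriptions and may differ. The sketch sorts household members by relationship to the respondent, then by age from oldest to youngest, and renumbers the rows with the "_#" suffix.

```python
# Hypothetical ranking of relationship categories (respondent first).
RELATION_RANK = {"respondent": 0, "partner": 1, "child": 2, "other": 3}


def sort_household_grid(rows):
    """Sort grid rows by relationship to respondent, then by descending age.

    rows: list of dicts with hypothetical keys 'relation' and 'age'.
    """
    return sorted(rows, key=lambda r: (RELATION_RANK[r["relation"]], -r["age"]))


def to_wide(rows, wave_letter="a"):
    """Renumber sorted rows as <wave letter>hg3_1, <wave letter>hg3_2, ..."""
    ordered = sort_household_grid(rows)
    return {f"{wave_letter}hg3_{i}": r["relation"]
            for i, r in enumerate(ordered, start=1)}


household = [
    {"relation": "child", "age": 5},
    {"relation": "respondent", "age": 40},
    {"relation": "child", "age": 12},
    {"relation": "partner", "age": 42},
    {"relation": "other", "age": 70},
]
ordered = sort_household_grid(household)
wide = to_wide(household)
```

Here the 70-year-old "other" member still comes last, because age only breaks ties within the same relationship category.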

Missing Data

The following missing values have been assigned:
- 6, 96, 996, etc. = Unknown (only for consolidated variables in the group "administrative variables")
- 7, 97, 997, etc. = Don't know
- 8, 98, 998, etc. = Refusal
- 9, 99, 999, etc. = Not-applicable/no response
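
The "etc." in the list above means that the missing codes scale with the width of the variable (7 for a one-digit variable, 97 for two digits, 997 for three, and so on). The Python sketch below classifies a value under that reading; the function names, the `administrative` flag and the use of None for a system-missing value are assumptions. The variable width has to be taken from the codebook, since a value such as 7 can be a valid answer in a wider variable.

```python
MISSING_LABELS = {
    7: "Don't know",
    8: "Refusal",
    9: "Not applicable/no response",
}


def missing_codes(width, administrative=False):
    """Missing codes for a variable stored with `width` digits.

    Code 6/96/996/... ("Unknown") is only added for consolidated variables
    in the group "administrative variables".
    """
    base = 10 ** width - 10            # 0 for width 1, 90 for width 2, ...
    codes = {base + last: label for last, label in MISSING_LABELS.items()}
    if administrative:
        codes[base + 6] = "Unknown"
    return codes


def classify(value, width, administrative=False):
    """Return the missing-value label of `value`, or 'Valid'."""
    if value is None:                  # None stands in for system missing
        return "System missing"
    return missing_codes(width, administrative).get(value, "Valid")


labels = [classify(v, width=2) for v in (45, 97, 98, 99, None)]
```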

Version

Harmonized dataset, GGS Wave1, version 4.3.

Notes

IMPROVEMENTS INTRODUCED WITH V.4.3. (August 2016):
Variables corrected with Version 4.3.
- fertintent (no more ambiguous labelling)
- a1101 (corrected error in coding)
- aweight (now available also for NLD CZE SWE POL)
- aregion (now available also for HUN)
- aplace (now available also for HUN)
- a5112 (corrected routing error for ROU)
- a5113 (corrected routing error for ROU)
- a5114 (corrected routing error for ROU)
- a5115 (corrected routing error for ROU)
- a211b_ (corrected error for POL & GEO)
- ankids (corrected error for POL & GEO)
- a1008mnth (corrected error for NGR & BEL)
- a108 (now available for SWE)
- a109_1 (now available for SWE)
- a109_2 (now available for SWE)
- a149 (now available for SWE)
- a309 (now available for SWE)
- aregion (now available for SWE)
- a620_ (corrected error for DEU & CZE)
- a402 (corrected error for POL)
- a149 (corrected routing error in NOR)
- a344 (corrected routing error in NOR)
- a256_ (corrected error for POL & GEO)

IMPROVEMENTS INTRODUCED WITH V.4.2. (February 2014):
The update from v4.1 to v4.2 does not include corrections of existing variables.  
The update only includes additional variables which are derived from the pre-existing datasets
- Variables derived from grid variables and variables which concern the respondents and his/her partner: numdissol numdivorce nummarriage numpartners livingwithpartner childprevp femage maleage femeduc maleeduc fertintent numbiol numres numnonres numstep numallchild ageyoungest ageoldest numrespleave numotherparentleave coreschild coresparen coresgrandp coressibl.
- Variables derived from month and year variable:  
a808Dur a822Dur a907Dur a911Dur a914Dur; a303cAgeP a315AgeP a316cAgeP a374cAgeP a608AgeP a610AgeP a617bAgeP a621AgeP a914AgeP a941AgeP; a107AgeR a121AgeR a150AgeR a239aAgeR a239bAgeR a240AgeR a301AgeR a302bAgeR; a311AgeR a314bAgeR a314dAgeR a371AgeR a372bAgeR a603AgeR a608AgeR a610AgeR a613AgeR a614AgeR a619AgeR a621AgeR a816AgeR a822AgeR a871AgeR a5116AgeR a5117bAgeR; a302bTdiff a314bTdiff a314dTdiff a372bTdiff.
- Variables derived from hours and minutes variables: a324_hour a520_hour a540_hour.
- Variables derived from frequency and unit variables: a205mnth,a241mnth,a325mnth,a355mnth,a359mnth,a363mnth,a367mnth,a521mnth,a541mnth,a1008mnth,a1102mnth; a203c_?w a204c_?w.
- Occupation variables recoded into ISCO-88 1 digit: a828_1dig a832_1dig a861_1dig a917_1dig a921_1dig a933_1dig a5112_1dig a5114_1dig.

IMPROVEMENTS INTRODUCED WITH V.4.0 (March 2012):
- New constructed variables: asex aage abyear aeduc aactstat aparstat amarstat anpartner ankids ahhsize ahhtype.
- New consolidated variables on respondents' current activity: a870, a871m, a871y, a873, a874, a875.
- New consolidated variables on respondents' partners current activity: a940, a941m, a941y, a945.
- Variables corrected: a601 and a602 (corrected, with consequences on the response rate of subsequent variables), a622, a624, a626, a627*, a628*, a629*, a383 (now rounded).
- Corrected weight variables: aweight (composed post-stratification weight that combines two weights: a) based on age, gender, region (full matrix) and b) based on age, sex, household size (margins))

FIRST DATASET RELEASED: V. 3.0 (December 2011).

Notes

Before publication in Nesstar, GGS micro data files are further processed so as to ease online data browsing and analysis.
We delete variables in which all cases are system missing.

File Name

GGS_Wave2_Italy_V.1.3..NSDstat

Contents of Files

VARIABLES HAVING ALL SYSTEM MISSING CASES ARE DROPPED BEFORE PUBLICATION IN NESSTAR.
This is the reason why the total no. of variables in the Nesstar data file is smaller than the total number of variables in the SPSS and STATA files.

Variables are ordered according to the sections of the GGS codebook: Household, Children, Partnerships, Household Organisation and Partnership Quality, Parents and Parental Home, Fertility, Health and Well-Being, Respondent's Activity and Income, Activity and Education History, Partner's Activity and Income, Household Possessions, Income and Transfers, Value Orientations and Attitudes, Interviewers' report.
Variable names begin with a letter designating the wave of data collection ("a" for the first wave, "b" for the second wave). We have attempted to keep variable names the same across waves; new variables are identified by the pattern ["wave letter"]n, e.g. bn301.
Although we encourage countries to follow the GGS Questionnaire strictly, a country might implement a question that differs considerably from it. In this case we add either country-specific response values or a country-specific variable.
Country-specific values are added when the question follows the model questionnaire but the answers are not, or are only partly, compatible with it. These values are at least 4 digits long (F4 format) and begin with the country code, e.g. 2401 for Australia, whose country code is 24.
A country-specific variable is introduced when the question differs from the model questionnaire but measures the same concept. Such variables are identified by a suffix consisting of the country code plus a number, e.g. a119_2401 for Australia.
For an overview of the GGS country codes, please refer to the variable "acountry".
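
The naming conventions above can be sketched in Python as a small parser. This is only an illustration under stated assumptions: the function name parse_variable_name and the regular expression are hypothetical, the country code is assumed to be two digits followed by a two-digit number (as in the a119_2401 example), and names that do not fit the pattern, such as constructed variables like asex or derived variables like a324_hour, are simply returned as None.

```python
import re
from typing import NamedTuple, Optional

# Assumed pattern (not part of the GGS documentation):
#   wave letter ("a" or "b"), optional "n" for a variable new in that wave,
#   a base name starting with a digit, and an optional country-specific
#   suffix "_CCNN" (two-digit country code plus two-digit number).
_NAME = re.compile(
    r"(?P<wave>[ab])(?P<new>n?)(?P<base>[0-9][0-9a-z]*)"
    r"(?:_(?P<cc>\d{2})(?P<num>\d{2}))?"
)


class ParsedName(NamedTuple):
    wave: int                      # 1 for "a", 2 for "b"
    is_new: bool                   # True if the name carries the "n" marker
    base: str                      # question identifier, e.g. "119"
    country_code: Optional[int]    # e.g. 24, or None if no suffix
    country_item: Optional[int]    # number following the country code


def parse_variable_name(name: str) -> Optional[ParsedName]:
    """Split a variable name into its parts, or return None if it does not fit."""
    m = _NAME.fullmatch(name)
    if m is None:
        return None
    has_suffix = m.group("cc") is not None
    return ParsedName(
        wave=1 if m.group("wave") == "a" else 2,
        is_new=m.group("new") == "n",
        base=m.group("base"),
        country_code=int(m.group("cc")) if has_suffix else None,
        country_item=int(m.group("num")) if has_suffix else None,
    )


print(parse_variable_name("a119_2401"))
print(parse_variable_name("bn301"))
```

The actual country codes should always be taken from the variable "acountry" rather than inferred from a variable name.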

File Structure

Record Group

Overall Case Count

6476

Overall Variable Count

204

Type of File

Nesstar 200801

Extent of Processing Checks

WAVE 2 DATA HARMONISATION: see "Extent of Processing Checks" "WAVE 1 DATA HARMONISATION".

Missing Data

The following missing values have been assigned:
- 6, 96, 996, etc. = Unknown (only for consolidated variables in the group "administrative variables")
- 7, 97, 997, etc. = Don't know
- 8, 98, 998, etc. = Refusal
- 9, 99, 999, etc. = Not-applicable/no response
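
As an illustration of the coding scheme above, the following Python sketch builds the missing-value codes for a variable of a given digit width and recodes them to None. The function names missing_codes and recode_missing are hypothetical, and the digit width of each variable is an assumption that has to be taken from the codebook. The "Unknown" code is included only on request, since it applies only to consolidated administrative variables.

```python
from typing import Dict, Optional

# Last digit of each missing code and its meaning, as listed above.
_LABELS = {
    6: "Unknown",
    7: "Don't know",
    8: "Refusal",
    9: "Not-applicable/no response",
}


def missing_codes(width: int, include_unknown: bool = False) -> Dict[int, str]:
    """Return {code: label} for a variable stored with `width` digits.

    width=1 gives 7, 8, 9; width=2 gives 97, 98, 99; and so on.
    """
    if width < 1:
        raise ValueError("width must be at least 1")
    # Leading nines followed by a zero: 0, 90, 990, ...
    prefix = (10 ** (width - 1) - 1) * 10
    return {
        prefix + digit: label
        for digit, label in _LABELS.items()
        if include_unknown or digit != 6
    }


def recode_missing(value: int, width: int,
                   include_unknown: bool = False) -> Optional[int]:
    """Return None if `value` is a missing code for this width, else the value."""
    if value in missing_codes(width, include_unknown):
        return None
    return value


print(missing_codes(2))
print(recode_missing(98, width=2))
```

Because the codes depend on the variable's format, a value such as 8 is a refusal in a one-digit variable but a valid answer in a two-digit one, so the width must be set per variable.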

Version

Harmonized dataset, GGS Wave2, version 1.3.

Notes

FIRST DATASET RELEASED: V. 1.3. (August 2016).

Notes

Before publication in Nesstar, GGS micro data files are further processed to ease online data browsing and analysis.
Variables in which all cases are system missing are deleted.

WAVE 2 DATASETS - Main differences compared to WAVE 1 datasets
Wave 2 datasets include a new section that was not implemented in the Wave 1 data collection: section no. 8, "Activity and Education History", in which respondents report comprehensively on their activity and education history since age 16. Two additional sections are also present at the end of the Wave 2 dataset: "Interviewer observations" and "Interviewer report" (sections no. 13 and 14, respectively).
A set of constructed variables at the top of the data file increases the usability of the GGS data by summarizing key socio-demographic characteristics of the respondent (age, birth year, sex, level of educational attainment, activity status, partnership status, number of co-resident partners, number of children, household size, household type). An additional set of variables consolidates information on the current activity of the respondent and his/her partner that is otherwise spread over the questionnaire. Another set of consolidated variables concerns respondents' parents and parental home.

WAVE 2 DATASETS - Variables names
Variables in the Wave 2 datasets that are consistent with variables implemented in the Wave 1 questionnaire are named identically, except that Wave 2 variable names start with the letter "b" instead of the letter "a" used in Wave 1. Variables that were not implemented in Wave 1 but were collected in Wave 2 begin with "bn".
In Wave 2 datasets published in Nesstar, the variable "brid - R identification number" has been renamed to "arid" (the same variable name as in Wave 1). This allows the user to merge Wave 1 and Wave 2 datasets in Nesstar.
In Wave 2 datasets published in Nesstar, variable labels have the indication "(W2)". This allows the user to distinguish Wave 2 variables from Wave 1 variables, on the basis of the variable labels.
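
For users working with downloaded files rather than in Nesstar, the same idea of linking the waves through the shared respondent identifier "arid" can be sketched in plain Python. This is a minimal sketch, not the procedure used by Nesstar: the function merge_waves is hypothetical, the records are toy values, and "bage" is only an assumed Wave 2 counterpart of the Wave 1 variable "aage". Only respondents present in both waves are kept.

```python
from typing import Any, Dict, List

Record = Dict[str, Any]


def merge_waves(wave1: List[Record], wave2: List[Record],
                key: str = "arid") -> List[Record]:
    """Inner-join two lists of records on the respondent identifier."""
    # Index Wave 1 records by respondent ID for constant-time lookup.
    wave1_by_id = {row[key]: row for row in wave1}
    merged = []
    for row in wave2:
        match = wave1_by_id.get(row[key])
        if match is not None:
            # Wave letters keep "a" and "b" variables from colliding,
            # so the two records can be combined into one.
            merged.append({**match, **row})
    return merged


# Toy records (illustrative values only).
wave1 = [{"arid": 1, "aage": 34}, {"arid": 2, "aage": 51}]
wave2 = [{"arid": 2, "bage": 54}, {"arid": 3, "bage": 40}]

print(merge_waves(wave1, wave2))
```

An inner join is used here because a panel analysis typically needs both observations of each respondent; keeping unmatched cases would require a different join.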
