Dataset: Generations and Gender Survey Lithuania Wave 1 & Wave 2

Abstract

The Generations and Gender Survey (GGS) provides micro-level data with the aim of significantly improving the knowledge base for social science and policymaking in Europe and developed countries elsewhere.  
In Europe 2020, the European Union develops a strategy "to help us come out stronger from the crisis and turn the EU into a smart, sustainable and inclusive economy delivering high levels of employment, productivity and social cohesion". The economic crisis affects not only day-to-day decisions, but also fundamental choices at all stages of people's lives: marriage and childbearing, the combination of employment and caring responsibilities for the young and the old, retirement, housing, and ageing well. The GGS has been developed to provide scientists with high-quality data to contribute scientifically grounded answers to these key policy questions. Survey content focuses on intergenerational and gender relations between people, expressed in care arrangements and the organization of paid and unpaid work. Key features of the survey are:  
- Cross-national comparability. In each country data is collected on the basis of a common international questionnaire and guidelines about the methodology. Data processing includes central harmonization of national datasets.  
- A broad age range. It includes respondents between the ages of 18 and 80.
- A longitudinal design. It has a panel design, collecting information on the same persons at three-year intervals.  
- A large sample size. It has an average of 9,000 respondents per country at Wave 1.
- A theory-driven and multidisciplinary questionnaire. It provides data for policy relevant research by demographers, economists, sociologists, social policy researchers, social psychologists and epidemiologists. The questionnaire is inspired by the theory of planned behavior.
- Possibility to combine the survey data with macro data provided by the GGP Contextual Database. This combination enables analyses of individuals and families in their cultural, economic, political, social and policy contexts.

Variable Groups

Document Description

Full Title

Generations and Gender Survey Lithuania Wave 1 & Wave 2

Alternative Title

GGS Lithuania Wave 1 & Wave 2

Identification Number

GGS.W1.W2.25

Date of Distribution

2014-04-04

Version

Working Version: GGS Wave 1 Version 4.3 and GGS Wave 2 Version 1.3.

Update of variable categories and documentation with the release of Poland Wave 2 Version 1.3.

Date: 2018-02-26

Guide To Codebook

In the field "Study Description", users can find metadata about surveys. This includes the distributors, keywords, abstract, and guidelines on the bibliographic citation.  
Country specific metadata include information on survey producers, methodology and processing. This information was provided by the GGP-country teams, based on a metadata grid with pre-structured questions. Links to relevant references (e.g., working papers and questionnaires) are also provided.  

The field "Data Files Description" provides metadata about the data file, such as file contents, missing values, as well as changes across different GGS versions.

The field "Variable Description" provides information on each variable, such as question text, descriptions of country specific categories and variables, universe (i.e., subset of respondents to whom the question was asked), country specific deviations to GGS routing, descriptions of the ways in which consolidated and derived variables are calculated. Variables are ordered according to the sections of the GGS codebook.

PLEASE NOTICE THAT WE DOCUMENT ONLY VARIABLES HAVING VALID CASES.  
VARIABLES HAVING ALL SYSTEM MISSING CASES ARE NOT DOCUMENTED.  
This is the reason why the total no. of variables in the documentation is smaller than the total number of variables in the SPSS and STATA files.

Full Title

GGS_W1-V.4.3.&W2-V.1.3_Lithuania

Producer

Name Affiliation Abbreviation Role
Arianna Caporali Institut national d'études démographiques (INED) AC

Study Description

Full Title

Generations and Gender Survey Lithuania Wave 1 & Wave 2

Alternative Title

GGS Lithuania Wave 1 & Wave 2

Parallel Title

Kartu ir lyciu tyrimas

Identification Number

GGS.W1.W2.25

Authoring Entity

Name Affiliation
Vlada Stankuniene Institute for Demographic Research, Lithuanian Social Research Centre

Other identifications and acknowledgments

Name Affiliation Role
Baltic Surveys Ltd Sampling, interviewing, data entry, cleaning, weighting
Dalia Ambrozaitiene Statistics Lithuania Consultations

Producer

Name Affiliation Abbreviation Role
Vlada Stankuniene Institute for Demographic Research, Lithuanian Social Research Centre IDR LSRC Head

Funding Agency/Sponsor

Name Abbreviation Role Grant
Lietuvos valstybinis mokslo ir studiju fondas - Lithuanian State Science and Studies Foundation LVMSF Main funding agency (Wave 1 and Wave 2, until 2010) C-14/2007
Ministry of Science and Education Funding (Wave 1)

Data Distributor

Name Affiliation Abbreviation
Institut national des études démographiques - 133 boulevard Davout 75980 Paris Cedex 20, France. INED
Netherlands Interdisciplinary Demographic Institute - Lange Houtstraat 19, NL-2511 CV The Hague, The Netherlands NIDI

Depositor

Name Affiliation Abbreviation
Institute for Demographic Research, Lithuanian Social Research Centre.

Bibliographic Citation

United Nations 2005. Generations & Gender Programme: Survey Instruments. New York and Geneva: UN, 2005.

List of Keywords

Date of Collection

Start End Cycle
2006-04-01 2006-12-30 Wave 1
2009-06 2009-12 Wave 2

Country

Lithuania  (LTU)

Geographic Coverage

Whole territory of the Republic of Lithuania.

Geographic Unit

County (10 administrative divisions of Lithuania - counties).

Unit of Analysis

Individuals

Universe

WAVE 1
Target population: permanent residents of Lithuania aged 18-79 years.

WAVE 2  
Persons who took part at Wave 1 and agreed to be re-contacted for Wave 2 (5810 persons, i.e., 58% of the 10,036 persons interviewed at Wave 1).
- SAMPLE REFRESHMENT: In addition to these persons, 12882 individuals were also contacted before Wave 2. These persons were NOT interviewed at Wave 1 and are NOT included in the harmonised Wave 2 file.

Kind of Data

Survey data

Time Method

Panel

Data Collector

Sampling Procedure

WAVE 1 SAMPLING PROCEDURE
1. Sampling frame
1.1 Type of frame: List of geographic units and population as of 1 January 2006, Statistics Lithuania.
1.2  Frame coverage:  Individuals residing in Lithuania aged 18-79 years, excluding institutionalised persons (prisons, army, hospitals, etc.).
1.3 Frame size: 2,557,085 individuals
1.4 Level of units available: Settlements and districts within urban areas.

2. Sampling method
2.1 Sampling method type: A multistage random sample design was used for the study. The first stage of the sampling procedure consists of defining the administrative regions of Lithuania: 10 such regions are defined. To identify the sampling clusters, sampling points are selected according to the size of the settlements, which are grouped into big, medium and small towns and villages. The number of sampling points in each category depends on the percentage of the population living in settlements of that category.
2.2 Sampling stage definition
  - PSU: Administrative regions (counties).
  - SSU: Settlements.
  - TSU: Individuals aged 18-79.
2.3 Sampling stage size
  - PSU: 10.
  - SSU: 318.
  - TSU: 10,036.
2.4 Unit selection: Systematic random.
2.5 Final stage unit selection: SRS (Simple Random Sampling).
2.6 Within Household unit selection: Last birthday method.
2.7 Stratification: Implicit.
2.8 Sample size
  - Starting size sample: 912 PSU's (912 x 11 = 10,032)
  - Aimed total size at Wave 1: 10,036.
  - Aimed total size at Wave 3: None.
2.9 Estimated Non-response
  - Initial non-response: Not estimated.
  - Yearly attrition: Not estimated.
  - Non response measures: Not applicable for random route methodology.
  - Within household non-responses measures: Three call-backs were used in order to contact the household or eligible respondent. If after three call-backs there was no contact, no other measures were used and hence, the household was marked as non-response.
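
The allocation rule in 2.1 (sampling points per settlement category in proportion to the population living there) can be illustrated with a short sketch. The category population counts below are hypothetical placeholders (they were only chosen to add up to the frame size in 1.3), the use of 912 starting points as the total is an assumption, and the largest-remainder rounding is one possible choice, not necessarily the procedure the fieldwork agency applied.

```python
def allocate_points(population_by_category, total_points):
    """Proportional allocation with largest-remainder rounding (illustrative)."""
    total_pop = sum(population_by_category.values())
    # Exact (fractional) share of sampling points for each category.
    quotas = {c: total_points * p / total_pop
              for c, p in population_by_category.items()}
    # Start from the integer part of each quota.
    alloc = {c: int(q) for c, q in quotas.items()}
    leftover = total_points - sum(alloc.values())
    # Hand the remaining points to the categories with the largest remainders.
    by_remainder = sorted(quotas, key=lambda c: quotas[c] - alloc[c], reverse=True)
    for c in by_remainder[:leftover]:
        alloc[c] += 1
    return alloc


# Hypothetical population split by settlement size (NOT from the documentation).
example_population = {
    "big towns": 1_000_000,
    "medium towns": 500_000,
    "small towns": 300_000,
    "villages": 757_085,
}

points = allocate_points(example_population, total_points=912)
for category, n in points.items():
    print(category, n)
```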

WAVE 2 SAMPLING PROCEDURE
Sampling: panel (wave 1 database of addresses). All the Wave 1 persons who had agreed to be recontacted for a second interview (i.e., 5810 respondents, which is equal to 58% of Wave 1 respondents).
SAMPLE REFRESHMENT: In addition to these persons, 12882 individuals were also contacted before Wave 2. These persons were NOT interviewed at Wave 1 and they represent a new sample. This made it possible to carry out 5748 additional completed interviews. These interviews are NOT included in the released harmonised Wave 2 file.
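
As a quick check of the figures quoted above, the share of Wave 1 respondents who agreed to be re-contacted and the share of refreshment-sample contacts that ended in a completed interview can be recomputed as follows. The variable names are illustrative only; the counts are those stated in this section.

```python
# Counts taken from the Wave 2 sampling description above.
wave1_interviews = 10_036     # completed Wave 1 interviews
agreed_recontact = 5_810      # Wave 1 respondents who agreed to a second interview
refresh_contacted = 12_882    # new-sample individuals contacted before Wave 2
refresh_completed = 5_748     # completed interviews in the new sample

consent_share = agreed_recontact / wave1_interviews
refresh_completion_share = refresh_completed / refresh_contacted

print(f"Agreed to re-contact: {consent_share:.1%}")
print(f"Refreshment contacts completed: {refresh_completion_share:.1%}")
```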
ESTIMATED NON-RESPONSE
- Initial non-response: 5810 respondents of the 1st wave agreed to be interviewed during the 2nd wave of GGS in 2009. While collecting updated contact information about respondents after the 1st wave, we received the following information:
272 respondents died/ were ill / not able to communicate;
449 respondents moved to other country;
691 respondents moved to other location, contact information not available;
443 respondents - no contact with them, no information about them available.
- Yearly attrition: None, the yearly attrition was not estimated
- Non response measures: Post cards were sent after the fieldwork, and letters and address cards for checking the contact information were sent in 2007 and 2008.  
If no response was received via ordinary mail, respondents were contacted by phone using the address database. If a telephone number was not available, interviewers visited the addresses from the database and collected updated information. Respondents with e-mail addresses were sent e-mails requesting updated contact information.
- Within household non-responses measures: In wave 2, there was no specified number of visits for contacting the required person and conducting the interview. In the ad hoc survey, three call-backs were used in order to contact the household or eligible respondent. If there was no contact after three call-backs, no other measures were used and the household was marked as non-response.

Mode of Data Collection

Method: Face-to-Face (personal interview).
Technique: Paper and Pencil.

Type of Research Instrument

Structured questionnaire in Lithuanian and in Russian.

Characteristics of Data Collection Situation

WAVE 1 DATA COLLECTION
1. Interviewers
1.1 Total number of interviewers: 191.
1.2 Number of interviewers in the field: Each interviewer had to conduct particular number of interviews during specific period. Some of them could finish the work earlier than others.
1.3 Network organization:  Field coordinators: field manager in central office;  three regional supervisors in central office; four regional supervisors in regions.
1.4 Working arrangement of interviewers: Contracted - other.
1.5 Payment of interviewers: Per interview.

2. Interviewer training
2.1 General interviewing: Interviewers are trained before each survey. They always are instructed on general interviewing techniques. Gallup has developed interviewers' manual including instructions how to behave in possible complicated situations, how to speak with potential respondent, how to persuade him/her to take part in the survey, how to ask questions, etc.
2.2 Survey specific: All interviewers were instructed before GGS survey. First step of training was performed with regional supervisors. They were informed about the purpose of the survey and about the necessity strictly to follow requirements. Regional supervisors were asked to read through whole questionnaire, noticing and commenting all the difficulties, and to ask questions if there were any unclear things. Regional supervisors were given prepared instruction for interviewers and they used it while reading the questionnaire. After that they had to make one interview each themselves and commented difficulties experienced. All their comments and difficult situations were discussed with project manager and field supervisor.  
Second step of training was performed by regional supervisors with interviewers in separate regions. The procedure was the same - they were informed about the purpose of the survey and about the necessity strictly to follow the requirements. Interviewers were asked to read through whole questionnaire, noticing and commenting all the difficulties, and to ask questions if there were any unclear things. Interviewers were given prepared instruction for interviewers and they used it while reading the questionnaire. After that they had to make interview themselves and to comment difficulties experienced. All their comments and difficult situations were discussed with regional supervisor. Interviewers were trained in persuading the respondents to participate in next wave of the survey also. After training each interviewer was given one questionnaire and was asked to interview one person. These questionnaires were brought to regional supervisors and checked. If everything was done well, questionnaires were filled correctly, interviewers were allowed to conduct GGS survey interviews. If questionnaires were filled not correctly, interviewer either had to repeat training course, or he/she was not allowed to do interviews in this survey.
2.3 Length: Trainings - 8 full working days. Training contained both general interviewing questions and survey specific issues. Approximately one third of time was devoted to general interviewing and two thirds of time to survey specific issues.
2.4 Control of performance: After the completed questionnaires were brought to the supervisor, they were ordered by interview number on the random route administration sheet. The demographic characteristics of the respondent in the questionnaire were compared with the demographic characteristics in the random route administration sheet.  
The next step was to check whether interviewers had chosen the correct person for the interview according to the birthday rule.  
In the third step of quality control, all questionnaires were checked by supervisors for accuracy of data recording and adherence to skip rules. If any parts were inaccurate or unclear, interviewers had to re-contact the respondent and clarify them.
A randomly selected 10% of each interviewer's respondents were re-contacted in order to check whether the interview was really conducted and whether the interviewer gave the respondent correct information about the survey. Respondents were most often re-contacted by phone. Regional supervisors followed the random route descriptions of randomly selected interviewers and visited respondents at home to check the interviewers' work. All respondents were asked about their willingness to take part in the next waves of GGS and, if they agreed, their names and addresses were recorded. After the fieldwork finished, they were re-contacted by mail, phone or in person and their data were checked on a regular basis.
2.5 Interviewer survey: No.
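
The birthday rule checked by supervisors (last birthday method, see Sampling Procedure 2.6) can be sketched as below. This is a minimal illustration under stated assumptions: the household records, field names and the handling of 29 February birthdays are hypothetical, and the age bounds are taken from the Wave 1 target population (18-79).

```python
from datetime import date


def days_since_last_birthday(birth_month, birth_day, today):
    """Number of days between the most recent birthday and the interview date."""
    def birthday_in(year):
        try:
            return date(year, birth_month, birth_day)
        except ValueError:
            # Assumption: a 29 February birthday counts as 28 February
            # in non-leap years.
            return date(year, 2, 28)

    birthday = birthday_in(today.year)
    if birthday > today:
        birthday = birthday_in(today.year - 1)
    return (today - birthday).days


def select_last_birthday(members, today, min_age=18, max_age=79):
    """Pick the eligible household member whose birthday was most recent."""
    eligible = [m for m in members if min_age <= m["age"] <= max_age]
    if not eligible:
        return None
    return min(
        eligible,
        key=lambda m: days_since_last_birthday(m["birth_month"], m["birth_day"], today),
    )


# Hypothetical household, interviewed on 15 May 2006.
household = [
    {"name": "Ona", "age": 45, "birth_month": 3, "birth_day": 10},
    {"name": "Jonas", "age": 47, "birth_month": 6, "birth_day": 1},
    {"name": "Rasa", "age": 16, "birth_month": 5, "birth_day": 1},  # under 18
]
selected = select_last_birthday(household, today=date(2006, 5, 15))
print(selected["name"])
```

A supervisor-style check would compare the person actually interviewed with the person returned by such a rule.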

3. Contact protocols
3.1 Advance letter: No.
3.2 Cold contacts: Face-to-face.
3.3 Scheduling / scattering: Yes. Contact attempts were scattered over different days of the week (week-days and week-ends) and different time of the day.
3.4 Contact history: Yes, in a regular contact sheet.
3.5 Min number of contacts: Number of interviews that should be conducted in PSU was prescribed (10-12 interviews per PSU, 3 call-backs per potential respondent).
3.6 Max number of contacts: Number of interviews that should be conducted in PSU was prescribed, not number of contacts.

4. Questionnaire localization
4.1 Validation: Yes, back-translation.
4.2 Pre-test: A pilot was carried out in March 2006 (150 respondents; 75 men + 75 women;  95 aged 18-49 + 55 aged 50-79;  number of interviewers : 104; number of settlements: 67 ; mean length of the interview: 139 minutes, min  54 minutes,  max 280 minutes,  mode - 95 minutes,  25th percentile 105 minutes,  median 130 minutes,  75th percentile 172 minutes). Major findings of the pilot: questionnaire is too long and too complicated. Two unacceptable themes: (1) possessions in household and inheritance, (2) partners and fertility (for older people). The majority of questionnaire adaptation was done in the Section 10 of the GGS Wave 1 Core questionnaire, namely some more general questions on the eagerness of respondent to answer the questions on possessions were added, while some questions were dropped (i.e. 1010-1018).
4.3 Length of interview: In the majority of cases respondents were co-operative. The average duration of interview was about 80 minutes depending on person interviewed - young or old, single or living with family, having several children or none, etc. In approximately one fifth of interviews other persons participated - children (most often small), parents or partner.

WAVE 2 DATA COLLECTION
1. Interviewers
1.1 Total number of interviewers: 179
1.2 Number of interviewers in the field: 118 for wave 2 and 151 in ad-hoc of new respondents.
1.3 Network organization:  Field coordinators: field manager in central office,  three regional supervisors in central office, four regional supervisors in regions.
1.4 Working arrangement of interviewers: Contracted - other
1.5 Payment of interviewers: Per interview

2. Interviewer training
2.1 General interviewing: Interviewers are trained before each survey. They are always instructed on general interviewing techniques. Gallup has developed an interviewers' manual including instructions on how to behave in potentially complicated situations, how to speak with a potential respondent, how to persuade him/her to take part in the survey, how to ask questions, etc. Before the 2nd wave, interviewers were instructed on how to behave if a person who had agreed to participate in the longitudinal survey refused at the last moment.  
2.2 Survey specific:  same as for Wave 1
2.3 Length: same as for Wave 1
2.4 Control of performance: In wave 2, demographics of each completed interview was checked with demographics from personal card of the participant of the 1st wave.  
Ad-hoc survey of new respondents: After the completed questionnaires were brought to the supervisor, they were ordered by interview number on the random route administration sheet. The demographic characteristics of the respondent in the questionnaire were compared with the demographic characteristics in the random route administration sheet. The next step was to check whether interviewers had chosen the correct person for the interview according to the birthday rule.  
In wave 2 and in ad hoc survey all questionnaires were checked by supervisors - accuracy of data recording and following skip rules was checked. If there were some inaccurate or not clear parts, interviewers had to re-contact the respondent and clarify obscurities.
Randomly selected 10% of respondents of each interviewer were re-contacted in order to check if interview was really conducted and if interviewer presented respondent correct information about the survey. Respondents most often were re-contacted by phone. Regional supervisors followed description of random route by randomly selected interviewers and visited respondents at their home for interviewers work control. All respondents were asked about the willingness to take part in the next waves of GGS and if agreed, their names and addresses were recorded.  
2.5 Interviewer survey: None

3. Contact protocols
3.1 Advance letter: Respondents who agreed to participate in wave 2 were contacted several times during the period between the 1st and 2nd waves. After the first wave, a post card was sent thanking them for their participation. After that, 2 letters were sent via ordinary mail reminding them about the second wave and requesting updated contact information. If no response was received via ordinary mail, respondents were contacted by phone, by e-mail or face-to-face.
In the ad hoc survey of new respondents, a letter was sent introducing the GGS survey, the subject and the aim of the survey, the agency conducting interviews in Lithuania, the future use of the data, etc., and asking potential respondents to take part in it.
3.2 Cold contacts: Face-to-face
3.3 Scheduling / scattering: Yes. Contact attempts were scattered over different days of the week (week-days and week-ends) and different time of the day.
3.4 Contact history: In ad hoc survey - in a regular contact sheet
3.5 Min number of contacts: In wave 2 - no minimum number of contacts.
In ad hoc survey - number of interviews that should be conducted in PSU was prescribed (10-12 interviews per PSU, 3 call-backs per potential respondent)
3.6 Max number of contacts: In wave 2 - no maximum number of contacts.
In ad hoc survey - number of interviews that should be conducted in PSU was prescribed, not number of contacts

4. Questionnaire localization
4.1 Validation: back-translation
4.2 Pre-test: DK
4.3 Length of interview: DK

Actions to Minimize Losses

WAVE 1 & WAVE 2 ACTIONS
1.  Dealing with nonresponse:  
1.1 Screening: No.
1.2 Refusal conversion: Gallup (Baltic Surveys Ltd. is member of Gallup Inc.) has developed an Interviewers' Manual where possible complicated situations are described, most often asked questions by respondents and possible answers to them are listed. During interviewers' training this issue is usually widely discussed.
1.3 Incentives: No.

2. Tracking of sampled units
2.1 Respondent contact information: Yes, contact details of the respondent were collected.
2.2 Other contact information: Yes, contact details of a proxy (such as parents, friend) were collected.
2.3 Cards: A "thank-you-letter" and a "change-of-address-notification" card (with paid postage fee, included in all the mailings as a standard part of the letter) were sent to all the respondents in 2007 and in 2008. If there were e-mail addresses given, "change-of-address-notification" messages were sent to these addresses also. If there was no postal information, respondents were called by phone or visited.
2.4 Additional surveys: No.
2.5 Administrative records: No.

Control operations

Yes, SPSS.

Weighting

WAVE 1
A weighting procedure was used in order to adjust the sample. The weight coefficient combines information about sex, age and type of settlement (rural-urban).

WAVE 2
Wave 2 - no.
Ad hoc survey - yes. A weighting procedure was used in order to adjust the sample. The weight coefficient combines information about sex, age and type of settlement (rural-urban).
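
The documentation does not describe how the weight coefficient was computed. As a hedged illustration only, one common way to build a weight from sex, age and type of settlement is cell-based post-stratification, where each respondent's weight is the population share of their cell divided by the sample share of that cell. The cells, shares and function name below are hypothetical and are not taken from the Lithuanian weighting procedure.

```python
from collections import Counter


def poststratification_weights(sample_cells, population_shares):
    """Weight = population share of the cell / sample share of the cell."""
    n = len(sample_cells)
    counts = Counter(sample_cells)
    return [population_shares[cell] / (counts[cell] / n) for cell in sample_cells]


# Hypothetical cells: (sex, age group, settlement type).
sample = [
    ("F", "18-49", "urban"),
    ("F", "18-49", "urban"),
    ("F", "18-49", "urban"),
    ("M", "18-49", "urban"),
]
# Hypothetical population shares for the same cells (must sum to 1).
shares = {
    ("F", "18-49", "urban"): 0.5,
    ("M", "18-49", "urban"): 0.5,
}

weights = poststratification_weights(sample, shares)
print(weights)
```

With weights of this form, over-represented cells are weighted down and under-represented cells are weighted up, and the weights sum to the sample size.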

Cleaning Operations

WAVE 1 & WAVE 2 OPERATIONS
Consistency of all questionnaires was checked by supervisors first. In data entry process skip rules were used in order to avoid mistakes. When data was entered, data file was cleaned by detecting errors and correcting them.

Response Rate

WAVE 1
Response rate - Final disposition codes:
I = complete interview: I = 10,036
P = partial interview: P = 0
NE = non-eligible : NE = 1,687
NC = non-contact : NC = 7,634
R = refusal: R = 7,685
O = other non-response: O = 2,842
UC = unknown eligibility, contacted: UC = DK
UN = unknown eligibility, non-contact: UN = DK
eC = estimated proportion of contacted cases of unknown eligibility that are eligible: eC = DK
eN = estimated proportion of non-contacted cases of unknown eligibility that are eligible: eN = DK
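
The codebook lists the Wave 1 disposition counts but no response rate. A minimal sketch of one way to derive a rate from these codes is given below. The formula (completed interviews divided by all cases known to be eligible) is an assumption, and because the unknown-eligibility counts are reported as DK they are simply left out, so the result is only indicative.

```python
# Wave 1 final disposition counts as listed above.
wave1 = {"I": 10_036, "P": 0, "NE": 1_687, "NC": 7_634, "R": 7_685, "O": 2_842}


def simple_response_rate(disposition):
    """Completed interviews over known-eligible cases (I + P + NC + R + O).

    Non-eligible cases (NE) are excluded from the denominator; cases of
    unknown eligibility are ignored because their counts are not available.
    """
    eligible = (disposition["I"] + disposition["P"] + disposition["NC"]
                + disposition["R"] + disposition["O"])
    return disposition["I"] / eligible


wave1_rate = simple_response_rate(wave1)
print(f"Wave 1 indicative response rate: {wave1_rate:.1%}")
```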

WAVE 2
Response rate - Final disposition codes:
I = Complete interview: I= Wave 2 - 2294 (total number of addresses used - 3176); In ad-hoc survey - 5748
P = Partial interview: P=0
NE = non-eligible: NE= Wave 2 - 0; In ad-hoc survey - 539
NC = non-contact: NC= Wave 2 - 293 (total number of addresses used - 3176); In ad-hoc survey - 2427
R = refusal: R= Wave 2 - 376 (total number of addresses used - 3176); In ad-hoc survey - 2695
O = Other non-response: O= Wave 2 - 213 (total number of addresses used - 3176); In ad-hoc survey - 1147
UC = unknown eligibility, contacted: UC= dk
UN = unknown eligibility, non-contact: UN= dk
eC = estimated proportion of contacted cases of unknown eligibility that are eligible: eC= dk
eN = estimated proportion of non-contacted cases of unknown eligibility that are eligible: eN= dk
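
The Wave 2 counts can be checked in the same spirit. The sketch below verifies that the panel disposition codes add up to the 3176 addresses used and computes an indicative rate for the panel and for the ad hoc survey. As before, the rate formula is an assumption and the unknown-eligibility categories (reported as dk) are left out.

```python
# Wave 2 final disposition counts as listed above.
panel = {"I": 2294, "P": 0, "NE": 0, "NC": 293, "R": 376, "O": 213}
ad_hoc = {"I": 5748, "P": 0, "NE": 539, "NC": 2427, "R": 2695, "O": 1147}
addresses_used = 3176  # total number of panel addresses used


def simple_response_rate(disposition):
    """Completed interviews over known-eligible cases (I + P + NC + R + O)."""
    eligible = (disposition["I"] + disposition["P"] + disposition["NC"]
                + disposition["R"] + disposition["O"])
    return disposition["I"] / eligible


panel_total = sum(panel.values())
panel_rate = simple_response_rate(panel)
ad_hoc_rate = simple_response_rate(ad_hoc)

print("Panel dispositions match addresses used:", panel_total == addresses_used)
print(f"Panel indicative response rate: {panel_rate:.1%}")
print(f"Ad hoc indicative response rate: {ad_hoc_rate:.1%}")
```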

Completeness of Study Stored

WAVE 1  
The GGS core questionnaire was adapted to the Lithuanian context: some questions were dropped or modified.

WAVE 2
The cases belonging to the new Lithuanian W2 sample, which consists of 5748 completed interviews, have not been harmonised yet.  
These respondents answered a mix of Wave 1 and Wave 2 variables. These cases are therefore NOT included in the released harmonised Wave 2 file.

Restrictions

In order to access micro data files, users have to sign and submit a Statement of affiliation, confidentiality and acceptable usage. They also have to submit a title and abstract of their research project. They can use the data for all their research projects, except for datasets from Australia and Norway. Users of these datasets need to submit a new application form if they want to use the data in a different research project. The access rights from Wave 1 data are transferred to the Wave 2 data.

Access Authority

Name Affiliation E-mail address Universal Resource Identifier
UNECE Population Unit - Palais des Nations - CH-1211 Geneva 10 - Switzerland. Tel: +41 22 917 24 77 - fax: +41 22 917 01 07 ggp@unece.org http://www.unece.org/pau/

Citation Requirement

In any work emanating from research based on the Generations and Gender Survey micro-data, I will acknowledge that these data were obtained from the GGP Data Archive and refer to the publication that describes the model survey instruments: United Nations 2005. Generations & Gender Programme: Survey Instruments. New York and Geneva: UN, 2005.

Deposit Requirement

Users of GGS micro-data are required to send any research papers based on the Generations and Gender Survey micro-data or aggregate tabulations to the Population Activities Unit of the UN Economic Commission for Europe, for inclusion in the GGP publications archive.

Conditions

In order to access the data, it is necessary to subscribe to the GGP Data User Space, and to follow the instructions available on the GGP data access webpage.

Disclaimer

The authors and producers bear no responsibility for the uses of the GGS data, or for interpretations or inferences based on these uses. The producers accept no liability for indirect, consequential or incidental damages or losses arising from use of the data collection, or from the unavailability of, or break in access to the service for whatever reason.

Related Materials

Lithuania_Questionnaire_W1_lt

Lithuania_Questionnaire_W2_lt

Other References Note

Lithuanian country presentations at the GGP International Working Group Meetings

Data Files Description

File Name

GGS_Wave1_Lithuania_V.4.3..NSDstat

Contents of Files

GGS Wave 1

VARIABLES HAVING ALL SYSTEM MISSING CASES ARE DROPPED BEFORE PUBLICATION IN NESSTAR.
This is the reason why the total no. of variables in the Nesstar data file is smaller than the total number of variables in the SPSS and STATA files.
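
The effect of this rule can be illustrated with a small sketch that drops every variable whose cases are all system missing. The data structure (a mapping from variable name to a list of values, with None standing for system missing) and the example variable names are hypothetical and only serve to show why the Nesstar file holds fewer variables than the SPSS and STATA files.

```python
def drop_all_missing(columns):
    """Keep only variables that have at least one non-missing case.

    `columns` maps a variable name to its list of values; None stands for
    a system-missing case.
    """
    return {
        name: values
        for name, values in columns.items()
        if any(v is not None for v in values)
    }


# Hypothetical three-case extract.
data = {
    "asex": [1, 2, 2],
    "a1010": [None, None, None],   # all system missing: not published
    "aage": [34, None, 61],        # partly missing: kept
}

published = drop_all_missing(data)
print(sorted(published))
```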

Variables are ordered according to the sections of the GGS codebook: Household, Children, Partnerships, Household Organisation and Partnership Quality, Parents and Parental Home, Fertility, Health and Well-Being, Respondent's Activity and Income, Partner's Activity and Income, Household Possessions, Income and Transfers, Value Orientations and Attitudes, Interviewers' report.
The variables begin with a letter designating the wave of data collection ("a" for the first wave and "b" for the second wave). We have attempted to keep the names of variables the same across the waves, and all the new variables are identified as follows: ["wave letter"]n, e.g. bn301.  
Although we encourage the countries to strictly follow the GGS Questionnaire, countries might implement a question that differs to a considerable extent from the GGS Questionnaire. In this case either we add country specific response values, or we introduce a country specific variable.  
Country specific values are added when the question follows the model questionnaire, but the answers are not at all or partly compatible. They are at least 4 digits long (F4 format) and begin with the country code: e.g., Australia 2401. Hence, the country code, as an example, for Australia is 24.  
A country specific variable is introduced when the question differs from the model questionnaire albeit measuring the same concept. This kind of variables is identified with a suffix given by the country code plus a number, e.g., Australia a119_2401.
In order to have an overview of GGS country code, please refer to the variable "acountry".
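
The naming conventions described above can be sketched as a small parser. This is an illustration under assumptions, not an official GGP tool: it assumes that a four-digit suffix is a two-digit country code followed by a two-digit number (as in a119_2401), that a shorter numeric suffix is a grid row, and that the "n" marker for new variables is always followed by a digit. Derived variables whose names happen to start with "a" or "b" (for example ageyoungest) would be misread by such a simple pattern.

```python
import re

# wave letter, optional "n" for new variables, stem, optional numeric suffix
NAME_PATTERN = re.compile(
    r"(?P<wave>[ab])(?P<new>n(?=\d))?(?P<stem>[A-Za-z0-9]+?)(?:_(?P<suffix>\d+))?"
)


def parse_variable_name(name):
    """Split a GGS-style variable name into its parts (illustrative only)."""
    match = NAME_PATTERN.fullmatch(name)
    if match is None:
        return None
    suffix = match.group("suffix")
    country_code = country_item = grid_row = None
    if suffix is not None:
        if len(suffix) == 4:
            # Assumption: country code (2 digits) plus a number (2 digits).
            country_code, country_item = int(suffix[:2]), int(suffix[2:])
        else:
            # Assumption: short suffixes are grid rows ("_#").
            grid_row = int(suffix)
    return {
        "wave": 1 if match.group("wave") == "a" else 2,
        "new_variable": match.group("new") is not None,
        "stem": match.group("stem"),
        "country_code": country_code,
        "country_item": country_item,
        "grid_row": grid_row,
    }


for example in ("bn301", "a119_2401", "bhg3_2", "fertintent"):
    print(example, parse_variable_name(example))
```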

File Structure

Record Group

Overall Case Count

10036

Overall Variable Count

1399

Type of File

Nesstar 200801

Extent of Processing Checks

WAVE 1 DATA HARMONISATION
The data is submitted in an already pre-harmonised form. It is prepared and organised according to the GGS standards.  
Harmonisation aims at achieving a clear and comparable format of the GGS micro-data files that would be adequate for cross-country comparison.  The harmonisation procedure basically is composed of:
1. Label checks  
This step makes sure that all the variables are named the same across the countries and refer to a particular question in the GGS Questionnaire. Also the value labels are checked. They should be the same across GGS datasets.  
2. Dealing with grids
The GGS Questionnaire holds several grids of either event history information or members of the household. Such data needs to be harmonized with specific attention to the order and logical consistency of grid rows (which are either household members or events such as births). In the data, each row of the grid is represented by a variable name followed by a subscripted number ("_#"). Each subscript thus represents one household member or one event. Part of the grid harmonization is grid sorting. Grid rows are sorted according to a pre-defined key. For example, in the household grid, the household members are sorted according to their relationship to the respondent, i.e. the relation to respondent variable (ahg3_# or bhg3_#). Respondents appear first, followed by their partners and children, if any, and then by other household members. As there may be more than one child (or other relative) living in the household, they also need to be sorted. In the case of the household grid, age is used as the secondary sorting key (from the oldest person to the youngest).
3. Routing
The routing check ensures that the structure of the underlying data set matches the structure of the GGS questionnaire. Its main goal is to code any given variable in the dataset to either a valid response, nonresponse or skip as indicated in the questionnaire. Consequently, the indicated skip in the questionnaire is represented with a system missing code (. in STATA, sysmis in SPSS), while information missing for other reasons is coded into non-applicable/no response (i.e. codes 7, 8, 9 in SPSS or .a, .b, .c in STATA).  
4. Consolidation  
The process consolidates the information scattered over several variables into a single one. The consolidation procedure is carried out in the Children Section, the Partnership Section and the Parents and Parental Home Section.
5. Imputation  
Because income information is sensitive, respondents are reluctant to share it with the interviewer. In order to be able to use income information in a cross-country comparative study and not to lose too many observations in the process, it is necessary to impute the approximately correct distribution of the income variable in each country.  
6. Calculation of derived variables
We calculate derived variables out of the following variables:
- grid variables (i.e., household grid, children grid, and partnership history grid); the codebook starts with the constructed variables that sum the key socio-demographic characteristics of the respondent.
- month and year variables,  
- hours and minutes variables,
- frequency and unit variables.  
Occupation variables are recoded into ISCO-88 1 digit.
Explanations of how consolidated and derived variables are obtained are available under the field "Note" of the "Variable Description" sections.
For a more detailed and technical procedure please refer to the Data Cleaning and Harmonisation Guidelines.

Missing Data

The following missing values have been assigned:
- 6, 96, 996, etc. = Unknown (only for consolidated variables in the group "administrative variables")
- 7, 97, 997, etc. = Don't know
- 8, 98, 998, etc. = Refusal
- 9, 99, 999, etc. = Not-applicable/no response
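The missing-value scheme above scales with the number of digits of a variable. The Python sketch below builds the codes for a given width and looks up a value; the function names are hypothetical, and the width of each variable is assumed to be known from the codebook.

```python
# Last digit of the code and its meaning, as listed above.
MISSING_LABELS = {
    6: "Unknown",  # only for consolidated administrative variables
    7: "Don't know",
    8: "Refusal",
    9: "Not-applicable/no response",
}


def missing_codes(width):
    """Build the missing-value codes for a variable stored with `width` digits."""
    prefix = "9" * (width - 1)
    return {int(prefix + str(digit)): label for digit, label in MISSING_LABELS.items()}


def missing_label(value, width):
    """Return the missing-value label for `value`, or None for a valid response."""
    return missing_codes(width).get(value)


two_digit_codes = missing_codes(2)
refusal = missing_label(998, 3)
valid = missing_label(45, 2)
```

Whether the "Unknown" code applies to a given variable depends on its group, so a real recode would need to check that separately.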

Version

Harmonized dataset, GGS Wave1, version 4.3.

Notes

IMPROVEMENTS INTRODUCED WITH V.4.3. (August 2016):
Variables corrected with Version 4.3.
- fertintent (no more ambiguous labelling)
- a1101 (corrected error in coding)
- aweight (now available also for NLD CZE SWE POL)
- aregion (now available also for HUN)
- aplace (now available also for HUN)
- a5112 (corrected routing error for ROU)
- a5113 (corrected routing error for ROU)
- a5114 (corrected routing error for ROU)
- a5115 (corrected routing error for ROU)
- a211b_ (corrected error for POL & GEO)
- ankids (corrected error for POL & GEO)
- a1008mnth (corrected error for NGR & BEL)
- a108 (now available for SWE)
- a109_1 (now available for SWE)
- a109_2 (now available for SWE)
- a149 (now available for SWE)
- a309 (now available for SWE)
- aregion (now available for SWE)
- a620_ (corrected error for DEU & CZE)
- a402 (corrected error for POL)
- a149 (corrected routing error in NOR)
- a344 (corrected routing error in NOR)
- a256_ (corrected error for POL & GEO)

IMPROVEMENTS INTRODUCED WITH V.4.2. (February 2014):
The update from v4.1 to v4.2 does not include corrections of existing variables.  
The update only includes additional variables which are derived from the pre-existing datasets:
- Variables derived from grid variables and variables which concern the respondent and his/her partner: numdissol numdivorce nummarriage numpartners livingwithpartner childprevp femage maleage femeduc maleeduc fertintent numbiol numres numnonres numstep numallchild ageyoungest ageoldest numrespleave numotherparentleave coreschild coresparen coresgrandp coressibl.
- Variables derived from month and year variables:  
a808Dur a822Dur a907Dur a911Dur a914Dur; a303cAgeP a315AgeP a316cAgeP a374cAgeP a608AgeP a610AgeP a617bAgeP a621AgeP a914AgeP a941AgeP; a107AgeR a121AgeR a150AgeR a239aAgeR a239bAgeR a240AgeR a301AgeR a302bAgeR; a311AgeR a314bAgeR a314dAgeR a371AgeR a372bAgeR a603AgeR a608AgeR a610AgeR a613AgeR a614AgeR a619AgeR a621AgeR a816AgeR a822AgeR a871AgeR a5116AgeR a5117bAgeR; a302bTdiff a314bTdiff a314dTdiff a372bTdiff.
- Variables derived from hours and minutes variables: a324_hour a520_hour a540_hour.
- Variables derived from frequency and unit variables: a205mnth a241mnth a325mnth a355mnth a359mnth a363mnth a367mnth a521mnth a541mnth a1008mnth a1102mnth; a203c_?w a204c_?w.
- Occupation variables recoded into ISCO-88 1 digit: a828_1dig a832_1dig a861_1dig a917_1dig a921_1dig a933_1dig a5112_1dig a5114_1dig.

FIRST DATASET RELEASED: V. 4.1 (April 2012).

File Name

GGS_Wave2_Lithuania_V1.3..NSDstat

Contents of Files

GGS Wave 2

VARIABLES HAVING ALL SYSTEM MISSING CASES ARE DROPPED BEFORE PUBLICATION IN NESSTAR.
This is the reason why the total no. of variables in the Nesstar data file is smaller than the total number of variables in the SPSS and STATA files.

Variables are ordered according to the sections of the GGS codebook: Household, Children, Partnerships, Household Organisation and Partnership Quality, Parents and Parental Home, Fertility, Health and Well-Being, Respondent's Activity and Income, Partner's Activity and Income, Household Possessions, Income and Transfers, Value Orientations and Attitudes, Interviewers' report.
The variables begin with a letter designating the wave of data collection ("a" for the first wave, "b" for the second wave). We have attempted to keep variable names the same across waves; new variables are identified by the wave letter followed by "n", e.g. bn301.
Although we encourage the countries to strictly follow the GGS Questionnaire, countries might implement a question that differs to a considerable extent from the GGS Questionnaire. In this case we either add country specific response values or introduce a country specific variable.
Country specific values are added when the question follows the model questionnaire but the answer categories are only partly compatible or not compatible at all. They are at least 4 digits long (F4 format) and begin with the country code, e.g. 2401 for Australia, whose country code is 24.
A country specific variable is introduced when the question differs from the model questionnaire while still measuring the same concept. Such variables are identified by a suffix consisting of the country code plus a number, e.g. a119_2401 for Australia.
For an overview of GGS country codes, please refer to the variable "acountry".
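The country specific conventions can be illustrated with a small Python sketch that splits a value such as 2401, or a variable name such as a119_2401, into its parts. The function names are hypothetical, and the sketch assumes two-digit country codes and exactly four-digit values or suffixes, which is the simplest case consistent with the example for Australia.

```python
import re

# Assumption: the suffix is exactly four digits, two for the country code
# and two for the running number.
_COUNTRY_VARIABLE = re.compile(r"^(?P<base>.+)_(?P<country>\d{2})(?P<number>\d{2})$")


def split_country_value(value):
    """Split a four-digit country specific value into (country code, number)."""
    return divmod(value, 100)


def parse_country_variable(name):
    """Return (base name, country code, number), or None for ordinary variables."""
    match = _COUNTRY_VARIABLE.match(name)
    if match is None:
        return None
    return match["base"], int(match["country"]), int(match["number"])


value_parts = split_country_value(2401)
variable_parts = parse_country_variable("a119_2401")
ordinary = parse_country_variable("a109_1")
```

Ordinary grid or derived variables such as a109_1 or a5112_1dig do not match the pattern and are returned as None.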

File Structure

Record Group

Overall Case Count

2292

Overall Variable Count

1184

Type of File

Nesstar 200801

Extent of Processing Checks

WAVE 2 DATA HARMONISATION: see "Extent of Processing Checks" "WAVE 1 DATA HARMONISATION".

Missing Data

The following missing values have been assigned:
- 6, 96, 996, etc. = Unknown (only for consolidated variables in the group "administrative variables")
- 7, 97, 997, etc. = Don't know
- 8, 98, 998, etc. = Refusal
- 9, 99, 999, etc. = Not-applicable/no response

Version

Harmonized dataset, GGS Wave2, version 1.3.

Notes

IMPROVEMENTS INTRODUCED WITH GGS_Wave2_V.1.3 (August 2016)
Correction of the following variables that were previously erroneous: b343_*, bnnumdissol, bnumdissol, bnnumdivorce, bnumdivorce, bnnummarriage, bnummarriage.

IMPROVEMENTS INTRODUCED WITH GGS_Wave2_V.1.2 (April 2015)
The update from v1.1 to v1.2 does not include corrections of existing variables. The update only includes additional variables which are derived from the pre-existing datasets.  

- Variables derived from grid variables and variables which concern the respondent and his/her partner: bnumdissol, bnnumdissol, bnumdivorce, bnnumdivorce, bnnummarriage, bnummarriage, bnumpartnerships, bnnumpartnerships, bnrespartafterw1, blivingwithpartner, bchildprevp, bnchildprevp, bfemage, bmaleage, bfemeduc, bmaleeduc, bfertintent, bnumbiol, bnumnonres, bnumres, bnumstep, bnumallchild, bageoldest, bageyoungest, bcoreschild, bcoresgrandp, bcoresparen, bcoressibl, bhhtype.
- Variables derived from month and year variables: b121AgeR, b150AgeR, bn152AgeR, b239aAgeR, b239bAgeR, b240AgeR, bn304Agb303cAgeP, b311AgeR, b315AgeP, b316cAgeP, b371AgeR, b372bAgeR, b372bTdiff, b374cAgeP, b5116AgeR, b5117bAgeR, b603AgeR, b608AgeP, b608AgeR, b610AgeP, b610AgeR, b621AgeP, b621AgeR, b871AgeR, b907Dur, b911Dur, b914AgeP, b914Dur, b941AgeP.
- Variables derived from hours and minutes variables: b324hour, b520hour, b540hour, b221hour_x.
- Variables derived from frequency and unit variables: b203c_xw, b204c_xw, b205mnth, b241mnth, b325mnth, b521mnth, b1008mnth.
- Occupation variables recoded into ISCO-88 1 digit: b828_1dig, b832_1dig, b861_1dig, b917_1dig, b921_1dig, b933_1dig.
- Three groups of variables derived from section no. 8 "Activity and Education History": 1) variables counting the total number of different activity and education situations R has had since age 16 (i.e., bnnumworkstatuses, bnnumstudentstatuses, bnnumemplstatuses, bnnumselfemplstatuses, bnnumhelpfamstatuses, bnnumunemplstatuses, bnnumretiredstatuses, bnnummilitarystatuses, bnnumhomestatuses, bnnummatleavestatuses, bnnumparleavestatuses, bnnumdisabilitystatuses, bnnumotherstatuses, bnnum1401, bnnum1501, bnnum1801, bnnum1301, bnnumparttime, bnnumfulltime, bnnumboth, bnnumparttime_1801, bnnumparttime_1802, bnnumpartfulltime_1803, bnnumfulltime_1804); 2) the total duration in months of each of the different situations (i.e., bndurstudentstatuses, bnduremplstatuses, bndurselfemplstatuses, bndurhelpfamstatuses, bndurunemplstatuses, bndurretiredstatuses, bndurmilitarystatuses, bndurhomestatuses, bndurmatleavestatuses, bndurparleavestatuses, bndurilldisabledstatuses, bndurotherstatusstatuses, bndur1501, bndur1401, bndur1301, bndurparttime, bndurlastparttime, bndurstudwhilework); 3) the age of R at the beginning and end of part-time employments (i.e., bn876_xAgeR, bn877_xAgeR, bn878xAgeR, bn879_xAgeR).

The availability of these variables in each different country-specific file depends on the availability of variables used for their calculation.

IMPROVEMENTS INTRODUCED WITH V.1.2. in Nesstar GGS micro data files (April 2015):
Publication of variables that were previously deleted before dataset release in Nesstar. The following variables are concerned: grid variables, month and year variables, hours and minutes variables, frequency and unit variables, and occupation variables.

FIRST DATASET RELEASED: V.1.1. (April 2014).

Notes

WAVE 2 DATASETS - Main differences compared to WAVE 1 datasets
Wave 2 datasets include an additional section that was not implemented in the Wave 1 data collection: section no. 8, "Activity and Education History", in which respondents report comprehensively on their activity and education history since age 16. Two additional sections are also present at the end of the Wave 2 dataset: "Interviewer observations" and "Interviewer report" (sections no. 13 and 14, respectively).
A set of constructed variables at the top of the data file increases the usability of the GGS data by summarizing key socio-demographic characteristics of the respondent (age, birth year, sex, level of educational attainment, activity status, partnership status, number of co-resident partners, number of children, household size, household type). An additional set of variables consolidates information on the current activity of the respondent and his/her partner that is otherwise spread over the questionnaire. Another set of consolidated variables concerns respondents' parents and parental home.

WAVE 2 DATASETS - Variables names
Variables in the Wave 2 data sets that are consistent with variables implemented in the Wave 1 questionnaire are named identically. Wave 2 variable names start with the letter "b" compared to letter "a" in Wave 1. Variables that have not been implemented in Wave 1 but collected in Wave 2 begin with "bn".  
In Wave 2 datasets published in Nesstar, the variable "brid - R identification number" has been renamed to "arid" (the same variable name as in Wave 1). This allows the user to merge Wave 1 and Wave 2 datasets in Nesstar.
In Wave 2 datasets published in Nesstar, variable labels have the indication "(W2)". This allows the user to distinguish Wave 2 variables from Wave 1 variables, on the basis of the variable labels.
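For users who work with exported files rather than in Nesstar, the same merge can be sketched in plain Python. The example assumes that both waves have been loaded as lists of records sharing the respondent identifier arid; the function name and all values are made up, and ankids and bnumbiol are used only as example Wave 1 and Wave 2 variables.

```python
def merge_waves(wave1_rows, wave2_rows, key="arid"):
    """Join Wave 2 records onto Wave 1 records by respondent ID.

    Only respondents present in both waves are kept (an inner join).
    """
    wave2_by_id = {row[key]: row for row in wave2_rows}
    merged = []
    for row in wave1_rows:
        match = wave2_by_id.get(row[key])
        if match is not None:
            merged.append({**row, **match})
    return merged


# Made-up records: respondent 2 did not take part in Wave 2.
wave1 = [
    {"arid": 1, "ankids": 2},
    {"arid": 2, "ankids": 0},
    {"arid": 3, "ankids": 1},
]
wave2 = [
    {"arid": 1, "bnumbiol": 2},
    {"arid": 3, "bnumbiol": 2},
]
panel = merge_waves(wave1, wave2)
```

Because Wave 1 names start with "a" and Wave 2 names with "b", the two sets of variables do not collide in the merged records; only the key arid is shared.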
