Dataset: Generations and Gender Survey Georgia Wave 1 & Wave 2

Abstract

The Generations and Gender Survey (GGS) provides micro-level data with the aim of significantly improving the knowledge base for social science and policymaking in Europe and developed countries elsewhere.  
In Europe 2020, the European Union develops a strategy "to help us come out stronger from the crisis and turn the EU into a smart, sustainable and inclusive economy delivering high levels of employment, productivity and social cohesion". The economic crisis affects not only day-to-day decisions, but also fundamental choices at all stages of people's lives:  marriage and childbearing, the combination of employment and caring responsibilities for the young and the old, retirement, housing, and ageing well. The GGS has been developed to provide scientists with high-quality data to contribute scientifically grounded answers to these key policy questions. Survey content focuses on intergenerational and gender relations between people, expressed in care arrangements and the organization of paid and unpaid work. Key feature of the survey are:  
- Cross-national comparability. In each country data is collected on the basis of a common international questionnaire and guidelines about the methodology. Data processing includes central harmonization of national datasets.  
- A broad age range. It includes respondents between the ages of 18 and 80.
- A longitudinal design. It has a panel design, collecting information on the same persons at three-year intervals.  
- A large sample size. It has an average of 9,000 respondents per country at Wave 1.
- A theory-driven and multidisciplinary questionnaire. It provides data for policy relevant research by demographers, economists, sociologists, social policy researchers, social psychologists and epidemiologists. The questionnaire is inspired by the theory of planned behavior.
- Possibility to combine the survey data with macro data provided by the GGP Contextual Database. This combination enables analyses of individuals and families in their cultural, economic, political, social and policy contexts.

Variable Groups

Document Description

Full Title

Generations and Gender Survey Georgia Wave 1 & Wave 2

Alternative Title

GGS Georgia Wave 1 & Wave 2

Identification Number

GGS.W1.W2.13

Date of Distribution

2012-12-20

Version

Working Version: GGS Wave 1 Version 4.3 and GGS Wave 2 Version 1.3.

Update of variable catagories and documentation with the release of Poland Wave 2 Version 1.3.

Date: 2018-02-26

Guide To Codebook

In the field “Study Description”, users can find metadata about surveys. This includes the distributors, keywords, abstract, and guidelines on the bibliographic citation.  
Country specific metadata include information on survey producers, methodology and processing. For Wave 1, this information was provided by GGP-country teams, based on a metadata grid with pre-structured questions. For Wave 2, it was taken from the references listed under “Other References Note”.

The field “Data Files Description” provides metadata about the data file, such as file contents, missing values, as well as changes across different GGS versions.

The field "Variable Description" provides information on each variable, such as question text, descriptions of country specific categories and variables, universe (i.e., subset of respondents to whom the question was asked), country specific deviations to GGS routing, descriptions of the ways in which consolidated and derived variables are calculated. Variables are ordered according to the sections of the GGS codebook.

PLEASE NOTICE THAT WE DOCUMENT ONLY VARIABLES HAVING VALID CASES.  
VARIABLES HAVING ALL SYSTEM MISSING CASES ARE NOT DOCUMENTED.  
This is the reason why the total no. of variables in the documentation is smaller than the total number of variables in the SPSS and STATA files.

Full Title

GGS_W1-V.4.3.&W2-V.1.3_Georgia

Producer

Name Affiliation Abbreviation Role
Arianna Caporali Institut national d'études démographiques (INED) AC

Study Description

Full Title

Generations and Gender Survey Georgia Wave 1 & Wave 2

Alternative Title

GGS Georgia Wave 1 & Wave 2

Parallel Title

Family Relations and Intergenerational Support

Identification Number

GGS.W1.W2.13

Authoring Entity

Name Affiliation
Dr. Irina Badurashvili Director of the Georgian Centre of Population Research (GCPR)

Other identifications and acknowledgments

Name Affiliation Role
GGP Project team in Georgia GCPR GGP implemetation in Georgia

Producer

Name Affiliation Abbreviation Role
Georgian Centre of Population Research GCPR

Funding Agency/Sponsor

Name Abbreviation Role Grant
Institute national d'études démographiques (National institute of demographic studies) INED 85% of fieldwork for Wave 1 - 20% of fieldwork for Wave 2
United Nations Population Fund in Georgia UNFPA-Georgia 15% of fieldwork for Wave 1 - 80% of fieldwork for Wave 2

Data Distributor

Name Affiliation Abbreviation
Institut national des études démographiques - 133 boulevard Davout 75980 Paris Cedex 20, France. INED
Netherlands Interdisciplinary Demographic Institute - Lange Houtstraat 19, NL-2511 CV The Hague, The Netherlands NIDI

Depositor

Name Affiliation Abbreviation
Georgian Centre of Population Research GCPR

Bibliographic Citation

United Nations 2005. Generations & Gender Programme: Survey Instruments. New York and Geneva: UN, 2005.

List of Keywords

Date of Collection

Start End Cycle
2006-03 2006-05 Wave 1
2009-04 2009-06 Wave 2

Country

Georgia  (GEO)

Geographic Coverage

WAVE 1
Whole territory of Georgia except of autonomy republic of Abkhazia and part of former South-Ossetia, territories that were not covered by population census of 2002.

WAVE 2
Same territory and population than Wave 1. However, 3 villages in Shida Kartli region with 75 respondents from Wave 1 are not covered anymore by the survey. As a result of the military conflicts in August 2008, they are not anymore under the jurisdiction of Georgian governments. Close collaboration of GCPR with Georgian governmental structure yielded to contact with 64% of them.

Geographic Unit

Region (11 administrative regions on the whole territory of Georgia under control of Georgian authorities).

Unit of Analysis

Individuals

Universe

WAVE 1
Resident non-institutionalized population aged 18-79 around whole Georgia covered by population census of 2002.

WAVE 2
Persons who took part to WAVE 1 willing to participate againt to the survey.

Kind of Data

Survey data

Time Method

Panel

Data Collector

Georgian Centre of Population Research, Independent organisation  (GCPR)

Sampling Procedure

WAVE 1 SAMPLING PROCEDURE  
1. Sampling frame
1.1 Type of frame: Name lists constructed several months prior to the fieldwork on the basis of micro-census of population residing in PSU selected from the complete list of census units by population census of 2002. Georgian statistical office has prepared the electronic version of households' address information based on initial paper documentation of census 2002. Then interviewers have visited all the households (more than 31,000) and recorded the personal information (name, age and sex) on all household members. This information was computerized in GCPR and used at the second stage of sampling for sample of appropriate individuals from the list.
1.2  Frame coverage: Total target population residing on the whole territory of Georgia aside of autonomy republic of Abkhazia and part of former South Ossetia, territories where jurisdiction of Georgian authorities is not expanding in fact and that were not covered by Georgian population census of 2002.
1.3 Frame size: 16,852 population census units.
1.4 Level of units available: Individuals.

2. Sampling method
2.1 Sampling method type: Multistage- 2 stages' design of sampling, at the first stage: sampling of PSU, at the second stage: individuals for interview
2.2 Sampling stage definition
  - PSU: is equal to population census unit by population census of 2002; those census units with less than 40 households were merged with neighbouring census unit and considered as one primary sampling unit.
  - SSU: Potential respondent.
  - TSU: NA.
2.3 Sampling stage size
  - PSU: 400.
  - SSU: 10,000 respondents in basic list and about 4,000 in the reserve list upon request
  - TSU: NA.
2.4 Unit selection: PPS-method.
2.5 Final stage unit selection: SRS (Simple Random Sampling).
2.6 Within Household unit selection: NA.
2.7 Stratification: Explicit: stratification at the first stage of sampling (sampling of PSU) by type of settlements: city (30,000 inhabitants and more), town (less than 30,000 inhabitants), rural settlement.
2.8 Sample size:
  - Starting size sample: 10,000.
  - Aimed total size at Wave 1: 10,000.
  - Aimed total size at Wave 3: None.
2.9 Estimated Non-response
  - Non response measures: Substitutes - selected ad-hoc: reserve list of potential respondents
  - Within household non-responses measures: None - the household was marked as non-response.

WAVE 2 SAMPLING PROCEDURE
To conduct as many repeated interviews as possible of 10.000 respondents at GGS Wave 1.

Mode of Data Collection

Method: Face-to-Face (personal interview)
Technique: Paper and pencil (PAPI)

Type of Research Instrument

Structured questionnaire in Georgian and Russian.

Characteristics of Data Collection Situation

WAVE 1 DATA COLLECTION
1. Interviewers
1.1 Total number of interviewers: 131.
1.2 Number of interviewers in the field: Each interviewer had a fixed number of interviews to carry out. Some of them could finish the work earlier than others.
1.3 Network organization: Field coordinators - 10 regional coordinators in all administrative regions of Georgia, aside of Tbilisi, coordination of fieldwork in Tbilisi, capital city - by project team.
1.4 Working arrangement of interviewers: Fully contracted for period of fieldwork people, mostly professional interviewers, some specially trained and fully employed.
1.5 Payment of interviewers: Per interview, some additional benefits for those working in montain areas and areas with poor transport communication.

2. Interviewer training:  
2.1 General interviewing: No.
2.2 Survey specific: Interviewers in Tbilisi, capital city, were mostly the experienced interviewers, so one-week of training in GGS-specific issues has been conducted for them with home works during the training course and exams at the end. In regions, two stages-training has been implemented. At the first stage, interviewers were trained by regional supervisors, then the project team visited each region with purpose of organizing of exams for regional interviewers, and for those successfully passed this exam additional two-day training with home works and experimental interviewing. All regional supervisors were trained in advance in Tbilisi, provided with instruction of procedure of training at place. Groups of interviewers were designed with taking into account their experience. People having the experience of dealing with the economic surveys, as interviewers of household surveys of Georgian statistical office were trained by distinctive program than those worked before mainly on sociological surveys.
2.3 Length: Duration of each training session was one week, some interviewers who did not successfully pass exams were invited repeatedly to attend part of the training session on definite questionnaire chapters with the next group
2.4 Control of performance: Yes, 3 respondents in each PSU consisted 25 interviewed households have been visited for control. In case of problems found all addresses were visited for control. All filled questionnaires before the data-entry passed the procedure of logical control by the special group of trained staff.
2.5 Interviewer survey: Permanent discussions with interviewers and supervisors during fieldwork allowed us to adjust the operating procedures during the fieldwork.

3. Contact protocols
3.1 Advance letter: Household visit during micro-census of population in selected PSU several months before survey. Interviewers were required to mark any family members unable to be interviewed due to health conditions.
3.2 Cold contacts: During micro-census of population in selected PSU several months prior to survey using face-to-face contact with any family member where respondent had chance to be chosen for survey.
3.3 Scheduling / scattering: The scheduling procedure was up to interviewers and it was different in urban and rural areas and dependent on respondent's employment schedule and his/her personal preferences. Interviewers in big cities were advised not to call to potential respondents in advance, as pilot survey has shown that probability of refusal is higher in this case, while if potential respondents meet with interviewers they are more likely to do a favour and accept interviewing, or in case of time-limits for the respondent to make an appointment for another time.
3.4 Contact history: No, as interviewers were allowed to replace the potential respondent. However, regional supervisors were responsible for monitoring of all cases of replacement and proper following the procedure of replacement.
3.5 Min number of contacts: 3 visits.
2.6 Max number of contacts: NA.

4. Questionnaire localization
4.1 Validation: Re-translation was used.
4.2 Pre-test: A pre-test was carried out in September 2004 (150 respondents). The design effect characteristics and response rates found by the pilot study confirmed that proposed sampling design is sufficient to meet the GGP requirement. Furthermore, it was found that some questions in the Core Questionnaire needed to be modified and reformulated.
4.3 Length of interview: Average length of interview: 71.1 minutes.

WAVE 2 DATA COLLECTION
- Total number of interviewers: 165
- Network organization: 10 regional supervisors in all administrative regions of Georgia, aside of Tbilisi, coordination of fieldwork in Tbilisi, capital city - by project team.
- Control of performance: Three stages of control:  
1) Controllers have visited 8 randomly chosen addresses in each of 400 PSU (Nmax=25) provided by GCPR staff – 3200 addresses in total.  
2) Interviewers having at least one problem on at least one controlled address were additionally screened. In total the second stage of control have been implemented in 36 PSU.  
3) In case of significant discrepancies in age of respondents between two waves GCPR staff was making comparison of personal information on respondent and his household composition   by I and II wave.

Actions to Minimize Losses

WAVE 1 ACTIONS
1.  Dealing with nonresponse
1.1 Screening: Yes, as a mini-census of population in selected PSU has been conducted.
1.2 Refusal conversion: Majority of the interviewers were working in the regions where they permanently live and were able to establish good contacts with people due to the knowledge of local pecularities and attitudes between people. This knowledge made easier for them to convince potential respondents for interview.
1.3 Incentives: None.

2. Tracking of sampled units
2.1 Respondent contact information: Yes, contact details mainly fix and mobile telephone numbers of the respondent were collected for following contacts.
2.2 Other contact information: Yes, contact details of any person who we may apply later were collected.
2.3 Cards: Only a brochure with results of Wave 1 was sent.
2.4 Additional surveys: No.
2.5 Administrative records: No

WAVE 2 ACTIONS: metadata are forthcoming

Control operations

Routine manual checking of all filled questionnaires on paper by group for logical control (3 persons). When necessary contacts with interviewers and request to clarify the situation even three additional short contacts with respondents were practiced.

Weighting

WAVE 1 WEIGHTING
Regional weights (for 11 administrative regions of Georgia) by gender and 2 aggregated age groups of respondents (18-44 and 45-79) have been built based on the results of implemented fieldwork. Interviewers in case of interview's failure were allowed to replace the potential respondent from the name-list of basic sample by the appropriate (same gender and age group) respondent from the reserve list in the frame of same PSU. But when no one was left anymore, the appropriate respondents in reserve list (the name list in some less populated areas was limited due to the small number of inhabitants in definite PSU) first respondents of other age group and next of opposite gender have been interviewed. Finally there is an slight under-representation of young (in age group 18-44), women (49.04% instead of 50.6 %), and men (53.7% instead of 54.7%) in the sample of interviewed respondents compared with the structure of the target population.
In the harmonised dataset the variables aweight is provided, which is a standardized weight based on the country specific population weight (recommended for use).

WAVE 2 WEIGHTING: metadata are forthcoming.

Cleaning Operations

The datafile was cleaned (consistency checking, wildcode checking, ….) by SPSS specialist before being sent to data harmonization center.

Response Rate

WAVE 1  
Response rate - Final disposition codes:
I = complete interview: 10,000
P = partial interview: 0 (There were 2 partial interviews, but with less than 67% of all in-scope questions answered)
NE = non-eligible : 16
NC = non-contact : 604
R = refusal: 363
O = other non-response: 802
UC = unknown eligibility, contacted: DK
UC = unknown eligibility, non-contact: DK
eC = estimated proportion of contacted cases of unknown eligibility that are eligible: DK
eN = estimated proportion of non-contacted cases of unknown eligibility that are eligible DK

WAVE 2  
Response rate - Final disposition codes:
- I = complete interview: 8,303
- P = partial interview: 6 (in capital city Tbilisi)
NE = non-eligible : DK
NC = non-contact : 21
R = refusal: 20
O = other non-response: DK
UC = unknown eligibility, contacted: DK
UC = unknown eligibility, non-contact: DK
eC = estimated proportion of contacted cases of unknown eligibility that are eligible: DK
eN = estimated proportion of non-contacted cases of unknown eligibility that are eligible DK
- Response Rate: 83%. The lowest response rate was in the capital-city Tbilisi-79%. Among other regions with the lowest response rate there is Kvemo-Kartli region (82%). The highest response rate was in Kakheti region (89%).
- Attrition rate: Attrition between two waves did not exceed 17% - of which 3% were refusals; 0.6% were absence of individual (military service, prison, sailor etc.); 0.4% were illness of respondent that made impossible the interview; 4% were deaths of respondents between the two waves; 4% were temporary away respondents; 5% were new addresses of respondents that could not be obtained.

Information on Wave 2 is not complete. Further information is forthcoming.

Completeness of Study Stored

WAVE 1
GGS questionnaire in Georgia is fully comparable with the Core Questionnaire and also one optional sub-module - "Intention of breaking up" - was added. About 25 country-specific questions have a distinctive numeration and information. They are not stored in the harmonized GGS dataset.

WAVE 2
About 22 country-specific questions have a distinctive numeration and information. They concern completeness of death registration, abortions, religious marriage, and migration. They are not stored in the harmonized GGS dataset.

Restrictions

In order to access micro data files, users have to sign and submit a Statement of affiliation, confidentiality and acceptable usage. They also have to submit a title and abstract of their research project. They can use the data for all their research projects, except for datasets from Australia and Norway. Users of these datasets need to submit a new application form if they want to use the data in a different research project. The access rights from Wave 1 data are transferred to the Wave 2 data.

Access Authority

Name Affiliation E-mail address Universal Resource Identifier
UNECE Population Unit - Palais des Nations - CH-1211 Geneva 10 - Switzerland. Tel: +41 22 917 24 77 - fax: +41 22 917 01 07 ggp@unece.org http://www.unece.org/pau/

Citation Requirement

In any work emanating from research based on the Generations and Gender Survey micro-data, I will acknowledge that these data were obtained from the GGP Data Archive and refer to the publication that describes the model survey instruments: United Nations 2005. Generations & Gender Programme: SurveyInstruments. New York and Geneva: UN, 2005

Deposit Requirement

Users of GGS micro-data are required to send any research papers based on the Generations and Gender Survey micro-data or aggregate tabulations to the Population Activities Unit of the UN Economic Commission for Europe, for inclusion in the GGP publications archive.

Conditions

In order to access, it is necessary to subscribe to the GGP Data User Space, and to follow the instructions available on the GGP data access webpage.

Disclaimer

The authors and producers bear no responsibility for the uses of the GGS data, or for interpretations or inferences based on these uses. The producers accept no liability for indirect, consequential or incidental damages or losses arising from use of the data collection, or from the unavailability of, or break in access to the service for whatever reason.

Notes

Metadata on Wave 2 survey methodology were retrieved mainly from Georgian country report at the seventh GGP Network of National Focal Points meeting which took place in Marrakech, Morocco, 27 September 2009.

Related Materials

Website of Georgian Centre of Population Research (GCPR) - Page on Georgian GGS

Georgia_Questionnaire_W1_en

Georgia_Questionnaire_W2_en

Other References Note

Georgian country presentations at the GGP International Working Group Meetings

Data Files Description

File Name

GGS_Wave1_Georgia_V.4.3..NSDstat

Contents of Files

GGS Wave 1  

VARIABLES HAVING ALL SYSTEM MISSING CASES ARE DROPPED BEFORE PUBLICATION IN NESSTAR.
This is the reason why the total no. of variables in the Nesstar data file is smaller than the total number of variables in the SPSS and STATA files.

Variables are ordered according to the sections of the GGS codebook: Household, Children, Partnerships, Household Organisation and Partnership Quality, Parents and Parental Home, Fertility, Health and Well-Being, Respondent's Activity and Income, Partner's Activity and Income, Household Possessions, Income and Transfers, Value Orientations and Attitudes, Interviewers' report.
The variables begin with a letter designating the wave of data collection ("a" for the first wave likewise "b" for the second wave). We have attempted to keep the names of variables the same across the waves, and all the new variables would be identified as follows ["wave letter"]n e.g.  bn301.  
Although we encourage the countries to strictly follow the GGS Questionnaire, countries might implement a question that differs to a considerable extent from the GGS Questionnaire. In this case either we add country specific response values, or we introduce a country specific variable.  
Country specific values are added when the question follows the model questionnaire, but the answers are not at all or partly compatible. They are at least 4 digits long (F4 format) and begin with the country code: e.g., Australia 2401. Hence, the country code, as an example, for Australia is 24.  
A country specific variable is introduced when the question differs from the model questionnaire albeit measuring the same concept. This kind of variables is identified with a suffix given by the country code plus a number, e.g., Australia a119_2401.
In order to have an overview of GGS country code, please refer to the variable "acountry".

File Structure

Record Group

Overall Case Count

10000

Overall Variable Count

1443

Type of File

Nesstar 200801

Extent of Processing Checks

WAVE 1 DATA HARMONISATION
The data is submitted in an already pre-harmonised form. It is prepared and organised according to the GGS standards.  
Harmonisation aims at achieving a clear and comparable format of the GGS micro-data files that would be adequate for cross-country comparison.  The harmonisation procedure basically is composed of:
1. Label checks  
This step makes sure that all the variables are named the same across the countries and refer to a particular question in the GGS Questionnaire. Also the value labels are checked. They should be the same across GGS datasets.  
2. Dealing with grids
The GGS Questionnaire holds several grids of either event history information or members of the household. Such data needs to be harmonized with specific attention to order and logical consistency of grid-rows (be either household members or events such as births). In data sense each row of the grid is represented by variable name followed by a subscripted number ("_#"). Each subscript thus represents one household member or one event. Part of the grid harmonization is grid sorting. Grid rows are sorted according to pre-defined key. For example in the household grid, the household members are sorted according to their relationship to the respondent i.e. the relation to respondent variable (ahg3_# or bhg3_# ). Respondents would appear, first, followed by their partners and children if any and then followed by other household members. As there may be more then one child (or other relative) living in the household they also would need to be sorted. In the case of the household grid, age is used as the secondary sorting key (starting with the oldest person to the youngest).
3. Routing
Routing check ensures that the structure of underlying data set matches the structure of the GGS questionnaire. Its main goal is to code any given variable in the dataset to either a valid response, nonresponse or skip as indicated in the questionnaire. Consequently, the indicated skip in the quetionnaire is represented with a system missing code (. in STATA, sysmis in SPSS), while the missing information for other reasons is coded into non-applicable/no response (i.e. codes 7, 8, 9 in SPSS or .a, .b, .c in STATA).  
4. Consolidation  
The process consolidates the information scattered over several variables into a single one. The consolidation procedure is carried out in the Children Section, the Partnership Section and the Parents and Parental Home Section.
5. Imputation  
Due to its sensitive nature, the respondents are reluctant to share income information with the interviewer. In order to be able to use income information in a cross country comparative study and not to loose too many observations in the process it is necessary to impute the approximately correct distribution of the income variable in each country.  
6. Calculation of derived variables
We calculate derived variables out of the following variables:
- grid variables (i.e., household grid, children grid, and partnership history grid); the codebook starts with the constructed variables that sum the key socio-demographic characteristics of the respondent.
- month and year variables,  
- hours and minutes variables,
- frequency and unit variables.  
Occupation variables are recoded into ISCO-88 1 digit.
Explanations of the ways in which consolidated and derived variables are obtained, are available under the field "Note" of the "Variable Description" sections.
For a more detailed and technical procedure please refer to the Data Cleaning and Harmonisation Guidelines.

Missing Data

The following missing values have been assigned:
- 6, 96, 996, etc. = Unknown (only for consolidated variables in the group "administrative variables")
- 7, 97, 997, etc. = Don't know
- 8, 98, 998, etc. = Refusal
- 9, 99, 999, etc. = Not-applicable/no response

Version

Harmonized dataset, GGS Wave1, version 4.3.

Notes

IMPROVEMENTS INTRODUCED WITH V.4.3. (August 2016):
Variables corrected with Version 4.3.
- fertintent (no more ambiguous labelling)
- a1101 (corrected error in coding)
- aweight (now available also for NLD CZE SWE POL)
- aregion (now available also for HUN)
- aplace (now available also for HUN)
- a5112 (corrected routing error for ROU)
- a5113 (corrected routing error for ROU)
- a5114 (corrected routing error for ROU)
- a5115 (corrected routing error i for ROU)
- a211b_ (corrected error for POL & GEO)
- ankids (corrected error for POL & GEO)
- a1008mnth (corrected error for NGR & BEL)
- a108 (now available for SWE)
- a109_1 (now available for SWE)
- a109_2 (now available for SWE)
- a149 (now available for SWE)
- a309 (now available for SWE)
- aregion (now available for SWE)
- a620_ (corrected error for DEU & CZE)
- a402 (corrected error for POL)
- a149 (corrected error routing error in NOR)
- a344 (corrected error routing error in NOR)
- a256_ (corrected error for POL & GEO)

IMPROVEMENTS INTRODUCED WITH V.4.2. (February 2014):
The update from v4.1 to v4.2 does not include corrections of existing variables.  
The update only includes additional variables which are derived from the pre-existing datasets
- Variables derived from grid variables and variables which concern the respondents and his/her partner: numdissol numdivorce nummarriage numpartners livingwithpartner childprevp femage maleage femeduc maleeduc fertintent numbiol numres numnonres numstep numallchild ageyoungest ageoldest numrespleave numotherparentleave coreschild coresparen coresgrandp coressibl.
- Variables derived from month and year variable:  
a808Dur a822Dur a907Dur a911Dur a914Dur; a303cAgeP a315AgeP a316cAgeP a374cAgeP a608AgeP a610AgeP a617bAgeP a621AgeP a914AgeP a941AgeP; a107AgeR a121AgeR a150AgeR a239aAgeR a239bAgeR a240AgeR a301AgeR a302bAgeR; a311AgeR a314bAgeR a314dAgeR a371AgeR a372bAgeR a603AgeR a608AgeR a610AgeR a613AgeR a614AgeR a619AgeR a621AgeR a816AgeR a822AgeR a871AgeR a5116AgeR a5117bAgeR; a302bTdiff a314bTdiff a314dTdiff a372bTdiff.
- Variables derived from hours and minutes variables: a324_hour a520_hour a540_hour.
- Variables derived from frequency and unit variables: a205mnth,a241mnth,a325mnth,a355mnth,a359mnth,a363mnth,a367mnth,a521mnth,a541mnth,a1008mnth,a1102mnth; a203c_?w a204c_?w.
- Occupation variables recoded into ISCO-88 1 digit: a828_1dig a832_1dig a861_1dig a917_1dig a921_1dig a933_1dig a5112_1dig a5114_1dig.

IMPROVEMENTS INTRODUCED WITH V.4.1. (April 2012):
- Variables corrected: amarstat, ankids, aweight, a540t, a925*
- Value labels defined: a5108_2

IMPROVEMENTS INTRODUCED WITH V.4.0 (March 2012):
- New constructed variables: asex, aage, abyear, aeduc, aactstat, aparstat, amarstat, anpartner, ankids, ahhsize, ahhtype
- New consolidated variables on respondents' current activity: a870, a871m, a871y, a873, a874, a875
- New consolidated variables on respondents' partners current activity: a940, a941m, a941y, a943, a945
- Variables previously unavailable, now available: aweight
- Variables corrected: a602 (with consequences on subsequent variables), a611, a612, a623, a624, a625, a627*, a628*, a629*, a631, a383 (age of current partner now rounded)
- Changes in the variable/value labels: a203b_* (country specific values "public nursery" and "private nursery" recoded into GGS values "nursery"; country specific value "person caring for children after school" recoded into GGS value "babysitter").

IMPROVEMENTS INTRODUCED WITH V.3.0 (August 2010):
- New consolidated variables: a370, a379, a383, a384
- Changes in the variable/value labels: a309, a322, a380 (missing labels have been added); a828, a832, a861, a917, a921, a5112, a5114, a828, a832 (all labels are now ISCO codes)
- Variables renamed: a203c_1u..._`i'u and a204c_1u..._`i'u renamed into respectively a204c_1u..._`i'u and a204cu_1...`i'
- Variables corrected: a808m y (date of reaching educational degree (a150m y) implemented for those who have never worked), a822m y (date of reaching educational degree (a150m y) implemented for those who always have been a homemaker), a1003 and a1004 (previously was coded as first, second, third...mentionned; in the new version is coded per item as a yes/no question).

FIRST DATASET RELEASED: V.2.6. (March 2010).

File Name

GGS_Wave2_Georgia_V.1.3..NSDstat

Contents of Files

GGS Wave 2

VARIABLES HAVING ALL SYSTEM MISSING CASES ARE DROPPED BEFORE PUBLICATION IN NESSTAR.
This is the reason why the total no. of variables in the Nesstar data file is smaller than the total number of variables in the SPSS and STATA files.

Variables are ordered according to the sections of the GGS codebook: Household, Children, Partnerships, Household Organisation and Partnership Quality, Parents and Parental Home, Fertility, Health and Well-Being, Respondent's Activity and Income, Partner's Activity and Income, Household Possessions, Income and Transfers, Value Orientations and Attitudes, Interviewers' report.
The variables begin with a letter designating the wave of data collection ("a" for the first wave likewise "b" for the second wave). We have attempted to keep the names of variables the same across the waves, and all the new variables would be identified as follows ["wave letter"]n e.g.  bn301.  
Although we encourage the countries to strictly follow the GGS Questionnaire, countries might implement a question that differs to a considerable extent from the GGS Questionnaire. In this case either we add country specific response values, or we introduce a country specific variable.  
Country specific values are added when the question follows the model questionnaire, but the answers are not at all or partly compatible. They are at least 4 digits long (F4 format) and begin with the country code: e.g., Australia 2401. Hence, the country code, as an example, for Australia is 24.  
A country specific variable is introduced when the question differs from the model questionnaire albeit measuring the same concept. This kind of variables is identified with a suffix given by the country code plus a number, e.g., Australia a119_2401.
In order to have an overview of GGS country code, please refer to the variable "acountry".

File Structure

Record Group

Overall Case Count

8292

Overall Variable Count

1385

Type of File

Nesstar 200801

Extent of Processing Checks

WAVE 2 DATA HARMONISATION: see "Extent of Processing Checks" "WAVE 1 DATA HARMONISATION".

Missing Data

The following missing values have been assigned:
- 6, 96, 996, etc. = Unknown (only for consolidated variables in the group "administrative variables")
- 7, 97, 997, etc. = Don't know
- 8, 98, 998, etc. = Refusal
- 9, 99, 999, etc. = Not-applicable/no response

Version

Harmonized dataset, GGS Wave2, version 1.3.

Notes

IMPROVEMENTS INTRODUCED WITH GGS_Wave2_V.1.3 (August 2016)
Correction of the following variables that were previously  erronous: b343_*, bnnumdissol, bnumdissol, bnnumdivorce, bnumdivorce, bnnummarriage, bnummarriage.

IMPROVEMENTS INTRODUCED WITH GGS_Wave2_V.1.2 (April 2015)
The update from v1.1 to v1.2 does not include corrections of existing variables. The update only includes additional variables which are derived from the pre-existing datasets.  

- Variables derived from grid variables and variables which concern the respondents and his/her partner: bnumdissol, bnnumdissol, bnumdivorce, bnnumdivorce, bnnummarriage, bnummarriage, bnumpartnerships, bnnumpartnerships, bnrespartafterw1, blivingwithpartner, bchildprevp, bnchildprevp, bfemage, bmaleage, bfemeduc , , bmaleeduc, bfertintent, bnumbiol, bnumnonres, bnumres, bnumstep, bnumallchild, bageoldest, bageyoungest, bcoreschild, bcoresgrandp, bcoresparen, bcoressibl , bhhtype.
- Variables derived from month and year variable: b121AgeR, b150AgeR , bn152AgeR, b239aAgeR, b239bAgeR, b240AgeR, bn304Agb303cAgeP, b311AgeR, b315AgeP, b316cAgeP, b371AgeR, b372bAgeR, b372bTdiff, b374cAgeP, b5116AgeR, b5117bAgeR, b603AgeR, b608AgeP, b608AgeR, b610AgeP, b610AgeR, b621AgeP, b621AgeR, b871AgeR, b907Dur, b911Dur, b914AgeP, b914Dur, b941AgeP.
- Variables derived from hours and minutes variables: b324hour, b520hour, b540hour, b221hour_x.
- Variables derived from frequency and unit variables: b203c_xw, b204c_xw, b205mnth, b241mnth, b325mnth, b521mnth, b1008mnth.
- Occupation variables recoded into ISCO-88 1 digit: b828_1dig, b832_1dig, b861_1dig, b917_1dig, b921_1dig, b933_1dig.
- Three groups of variables derived from section no. 8 "Activity and Education History": 1) variables counting the total number of different activity and education situations Rs has had since age 16 (i.e., bnnumworkstatuses, bnnumstudentstatuses, bnnumemplstatuses, bnnumselfemplstatuses, bnnumhelpfamstatuses, bnnumunemplstatuses, bnnumretiredstatuses, bnnummilitarystatuses, bnnumhomestatuses, bnnummatleavestatuses, bnnumparleavestatuses, bnnumdisabilitystatuses, bnnumotherstatuses, bnnum1401, bnnum1501, bnnum1801, bnnum1301, bnnumparttime, bnnumfulltime, bnnumboth, bnnumparttime_1801, bnnumparttime_1802, bnnumpartfulltime_1803, bnnumfulltime_1804, 2) the total duration in month of each of the different situation (i.e., bndurstudentstatuses, bnduremplstatuses, bndurselfemplstatuses, bndurhelpfamstatuses, bndurunemplstatuses, bndurretiredstatuses, bndurmilitarystatuses, bndurhomestatuses, bndurmatleavestatuses, bndurparleavestatuses, bndurilldisabledstatuses, bndurotherstatusstatuses, bndur1501, bndur1401, bndur1301, bndurparttime, bndurlastparttime, bndurstudwhilework), 3) the age of R at the beginning and end of part-time employments (i.e., bn876_xAgeR, bn877_xAgeR, bn878xAgeR, bn879_xAgeR).

The availability of these variables in each different country-specific file depends on the availability of variables used for their calculation.

IMPROVEMENTS INTRODUCED WITH V.1.1. (March 2013):
- Variables corrected: bage, b203* (unit of frequency now available), b510, b530, b626, b904a, b905,b906a, bn871 (added a country specific response category)
- Changes in the variable/value labels: b309, b322, b380 (value labels correspond to ISCO codes); bfloor, b5106b_s, b5106b_b; b861, b917, b921, b933 (some values which were not ISCO have been recoded into user missings), bn884 (bn884_1 to bn884_3 now available)
- Variables dropped because not part of the GGS Core Questionnaire: bn1401u_1
- Variables now available: b149 (ISCO codes)
IMPROVEMENTS INTRODUCED WITH V.1.1. in the derived variables calculated for Nesstar GGS micro data files (March 2013):
- Variables corrected: b203c_*w and b204c_*w (previously frequencies greater than 7 were put equal to system missing, now they are put as equal to seven).
- Variables previously dropped and now available: bint_st and bint_pr.

FIRST DATASET RELEASED: V. 1.0. (November 2012).

Notes

WAVE 2 DATASETS - Main differences compared to WAVE 1 datasets
Wave 2 datasets include an additional new section that had not been implemented in the Wave 1 data collection. It is the section no. 8 "Activity and Education History". Respondents report comprehensively on their activity and education history since age 16. Two additional sections are also present at the end of wave 2 dataset: "Interviewer observations" and "Interviewer report" (respectively sections no. 13 and 14).
A set of constructed variables at the top of the data file increase the usability of the GGS data by summarizing key socio-demographic characteristics of the respondent (age, birth year, sex, level of educational attainment, activity status, partnership status, number of co-resident partners, number of children, household size, household type). An additional set of variables consolidates information on the current activity of the respondent and his/her partner that is otherwise spread over the questionnaire. Another set of consolidated variables concern respondents' parents and parental home.

WAVE 2 DATASETS - Variables names
Variables in the Wave 2 data sets that are consistent with variables implemented in the Wave 1 questionnaire are named identically. Wave 2 variable names start with the letter "b" compared to letter "a" in Wave 1. Variables that have not been implemented in Wave 1 but collected in Wave 2 begin with "bn".  
In Wave 2 datasets published in Nesstar, the variable "brid - R identification number" has been renamed into "arid" (same variable name than Wave 2). This allows the user to merge Wave 1 and Wave 2 datasets in Nesstar.
In Wave 2 datasets published in Nesstar, variable labels have the indication "(W2)". This allows the user to distinguish Wave 2 variables from Wave 1 variables, on the basis of the variable labels.

Download

Metadata Index

This is the Metadata Index for a Nesstar Server.
Nesstar is a tool used for analysing, visualising and downloading datasets.

Click the "Explore Dataset" button to open the dataset.