Harmonized Datafiles and Variables for High-Frequency Phone Surveys on COVID-19
Version 1 (February 2021)
Prepared by the Living Standards Measurement Study (LSMS) team

Objective

To facilitate the use of data collected through the high-frequency phone surveys on COVID-19, the Living Standards Measurement Study (LSMS) team has created harmonized datafiles from two household surveys: 1) the country’s latest face-to-face survey, which served as the sample frame for the phone survey, and 2) the country’s high-frequency phone survey on COVID-19. The LSMS team extracted variables from these surveys and harmonized them so that they share the same definitions and the same variable names. The variables cover demography as well as housing, household consumption expenditure, food security, and agriculture. Inevitably, many of the original variables were collected using questions that were asked differently from one survey to another. The harmonized datafiles include the best available variables under the harmonized definitions.

Two harmonized datafiles are prepared for each survey:

1. HH: This datafile contains household-level variables. The information includes basic household characteristics, housing, water and sanitation, asset ownership, consumption expenditure, consumption quintile, food security, and livestock ownership. It also contains information on agricultural activities such as crop cultivation, use of organic and inorganic fertilizer, hired labor, use of a tractor, and crop sales.
2. IND: This datafile contains individual-level variables. It includes basic characteristics of individuals such as age, sex, marital status, disability status, literacy, education, and work.

Harmonization Guidelines

• Household and individual IDs are not harmonized. The variable names and values are extracted and kept as they are in the original dataset.
• When a survey does not include the variables necessary to create a harmonized variable, the harmonized variable is still created, with its values set to missing.
• Any assumptions made during the harmonization process are noted in the do-file as comments. When possible, these are also attached as a note to the harmonized variables.
• When data from a new round of the phone surveys become available, they will be harmonized, and the harmonized variables will be added to the end of each harmonized datafile.
• Once the variables are harmonized, a quality check is performed using programs called hh_validation and ind_validation. They test whether all the variables are in the harmonized datafile, whether all the variables are named as in the data dictionary, whether all the variables have the correct format, whether all the variables take plausible values or fall within plausible ranges, and whether certain variables are mutually consistent.
• Once the quality check is performed, the harmonized variables are labeled using programs called hh_label and ind_label. The programs apply variable labels and value labels, ensuring their consistency with the data dictionary.

Note

• The high-frequency phone survey on COVID-19 has multiple rounds of data collection. When variables are extracted from multiple rounds of the survey, the originating round is noted with “_rX” in the variable name, where X represents the number of the round. For example, a variable with “_r3” indicates that the variable was extracted from Round 3 of the high-frequency phone survey. Round 0 refers to the country’s latest face-to-face survey, which served as the sample frame for the high-frequency phone surveys on COVID-19. Variables without “_rX” were extracted from Round 0.
• All harmonized datasets contain a variable that is a unique identifier for the household (Household ID). This variable is used as the unique key variable when merging the two harmonized datafiles.
• The harmonized datasets can be merged with (1) the country’s latest face-to-face survey, which served as the sample frame for the phone survey, and (2) the country’s high-frequency phone survey on COVID-19. The household’s unique identifier variable (Household ID) is used to merge the household-level datasets, and the individual’s unique identifier variables (both Household ID and Individual ID) are used to merge the individual-level files.
• Individuals who were added after Round 0 (during the country’s high-frequency phone survey on COVID-19) are missing certain information that was collected only in Round 0 (the country’s latest face-to-face survey). These individuals also cannot be merged with the Round 0 dataset. They can be identified with the variable “member_r0” (member_r0=0) in the individual-level datafile.
• There are 2 variables available for the age of household members: (1) age at Round 0, and (2) age at Round 1+. The former is not available for individuals who were added after Round 0 (during the country’s high-frequency phone survey on COVID-19). The latter is updated with the most recent round of the survey.

Data Dictionary

Household-level file

The datafile is named CCC_HH, where CCC is the three-letter ISO country code. In addition to the harmonized variables, the datafile also includes country-specific geographic variables.

Name | Labels and codes | Instructions / notes
hhid | Household ID | Household unique identifier. Production note: The variable name and the values are kept as they are in the Round 0 original dataset.
year | Survey year (Round 0) | The year when data collection started for Round 0 (format YYYY).
rural | Rural/Urban. 1=Rural, 2=Urban | Urban/rural jurisdiction (based on country-specific definition). Production note: The ‘semi-urban’ category is assimilated to ‘urban’.
ea_latitude | EA Latitude (Modified) | Average of household GPS coordinates in each EA taken at Round 0 (a random offset within a specified range is applied following the MeasureDHS methodology).
ea_longitude | EA Longitude (Modified) | Average of household GPS coordinates in each EA taken at Round 0 (a random offset within a specified range is applied following the MeasureDHS methodology).
dwelling | Ownership of dwelling. 0=No, 1=Yes | Ownership of the dwelling that the household resides in. Yes includes ownership whether or not full payment has yet been made. No includes free (authorized and not authorized) and rented.
roof | Modern roof. 0=No, 1=Yes | The dwelling that the household resides in has a modern roof. Yes includes corrugated iron sheets, clay tiles, concrete, cement, plastic sheet, asbestos sheet, step tiles, long/short span sheets, zinc, and aluminum. No includes thatch and mud.
floor | Modern floor. 0=No, 1=Yes | The dwelling that the household resides in has a modern floor. Yes includes cement, concrete, wood, and tile. No includes sand, dirt, straw, and smoothed mud.
walls | Modern exterior walls. 0=No, 1=Yes | The dwelling that the household resides in has modern exterior walls. Yes includes burnt bricks, cement, concrete, and iron sheets. No includes mud, stone, unburnt bricks, wood, bamboo, and cardboard.
toilet | Access to improved toilet. 0=No, 1=Yes | The household has access to an improved toilet, based on the JMP standard definition. Yes includes flush/pour-flush toilet to sewers, septic tanks or pit latrines, pit latrine with slab, composting toilet, twin pit latrine with slab, and container-based sanitation. This also includes flush/pour-flush toilet to “don’t know where”. No includes flush/pour-flush to open drain, flush/pour-flush to elsewhere, pit latrine without slab/open pit, bucket, hanging toilet/hanging latrine, and no facility/bush/field.
water | Access to improved drinking water source. 0=No, 1=Yes | The household has access to an improved water source, based on the JMP standard definition. Yes includes piped, public tap or standpipe, borehole or tube well, protected well, protected spring, rainwater collection, tanker-truck, cart with small tank/drum, water kiosk, bottled water, and sachet water. No includes unprotected well, unprotected spring, and surface water. If the main source of water differs between the wet and dry seasons, the variable refers to the water source during the dry season.
rooms | Number of rooms | Number of habitable rooms occupied by the household. It excludes storerooms, toilets, bathrooms, kitchens, and garages.
elect | Connection to electricity. 0=No, 1=Yes | The household has a connection to electricity, irrespective of its source or its use.
tv | Ownership of television. 0=No, 1=Yes | Ownership of a television, irrespective of who owns it in the household.
radio | Ownership of radio. 0=No, 1=Yes | Ownership of a radio, irrespective of who owns it in the household.
refrigerator | Ownership of refrigerator. 0=No, 1=Yes | Ownership of a refrigerator, irrespective of who owns it in the household.
bicycle | Ownership of bicycle. 0=No, 1=Yes | Ownership of a bicycle, irrespective of who owns it in the household.
mcycle | Ownership of motorcycle. 0=No, 1=Yes | Ownership of a motorcycle, irrespective of who owns it in the household.
car | Ownership of car or other vehicle. 0=No, 1=Yes | Ownership of a car or any other vehicle, irrespective of who owns it in the household.
mphone | Ownership of mobile phone. 0=No, 1=Yes | Ownership of a mobile phone, irrespective of who owns it in the household.
computer | Ownership of computer. 0=No, 1=Yes | Ownership of a computer, irrespective of who owns it in the household.
internet | Access to Internet. 0=No, 1=Yes | Access to the Internet through a device owned by the household, irrespective of who owns it in the household.
generator | Ownership of generator. 0=No, 1=Yes | Ownership of a generator, irrespective of who owns it in the household.
land | Ownership of land. 0=No, 1=Yes | Ownership of any type of land by the household, irrespective of its use. This includes residential land, agricultural land (cultivated, fallow, rented out), pastureland, forest, and business/commercial plots.
land_tot | Total land size owned (ha) | Includes both residential and agricultural land. Land size should be in hectares. By convention 1 ha = 2.471 acres. Production note: Equals 0 if the household does not own any land (land=0).
land_cultivated | Total land size cultivated (ha) | Total land area cultivated by the household in hectares, regardless of ownership. By convention 1 ha = 2.471 acres. Production note: Equals 0 if the household does not cultivate any crops (crop=0).
cons_quint | Consumption quintile. 1=Poorest, 2=Poorer, 3=Middle, 4=Richer, 5=Richest | The household’s quintile ranking, based on the household’s annual normalized per-capita consumption expenditure with price (spatial and temporal) adjustments. This covers food and non-food items, whether purchased, own-produced, or received (gifts and other sources). Production note: Consumption aggregates used are retrieved from the Round 0 original dataset and are not recalculated for the harmonized dataset.
totcons | Total annual per capita consumption | Total annual per capita consumption in local currency, without price (spatial and temporal) adjustments. Production note: Consumption aggregates used are retrieved from the Round 0 original dataset and are not recalculated for the harmonized dataset.
foodcons | Total annual per capita food consumption | Total annual per capita food consumption in local currency, without price (spatial and temporal) adjustments. Production note: Consumption aggregates used are retrieved from the Round 0 original dataset and are not recalculated for the harmonized dataset.
nonfoodcons | Total annual per capita non-food consumption | Total annual per capita non-food consumption in local currency, without price (spatial and temporal) adjustments. Production note: Consumption aggregates used are retrieved from the Round 0 original dataset and are not recalculated for the harmonized dataset.
totcons_adj | Total per capita consumption, spatially and temporally adjusted | Total annual per capita consumption in local currency, with price (spatial and temporal) adjustments. Production note: Consumption aggregates used are retrieved from the Round 0 original dataset and are not recalculated for the harmonized dataset.
foodcons_adj | Total annual per capita food consumption, spatially and temporally adjusted | Total annual per capita food consumption in local currency, with price (spatial and temporal) adjustments. Production note: Consumption aggregates used are retrieved from the Round 0 original dataset and are not recalculated for the harmonized dataset.
nonfoodcons_adj | Total annual per capita non-food consumption, spatially and temporally adjusted | Total annual per capita non-food consumption in local currency, with price (spatial and temporal) adjustments. Production note: Consumption aggregates used are retrieved from the Round 0 original dataset and are not recalculated for the harmonized dataset.
rent | Rental income. 0=No, 1=Yes | The household has income from rent.
remit | Received remittance. 0=No, 1=Yes | The household reported having received remittances (international and/or domestic).
assist | Received assistance. 0=No, 1=Yes | The household reported having received assistance from institutions or government.
finance | Account from financial institutions. 0=No, 1=Yes | Ownership of an account from financial institutions, irrespective of who owns it in the household.
any_work | % of working adults working | Percentage of the working-age (15-64) members of the household who worked in any income-generating activities in the last 7 days.
ag_work | % of working adults working in agriculture | Percentage of the working-age (15-64) members of the household who worked in agricultural activities in the last 7 days.
nfe_work | % of working adults working in non-farm family enterprise | Percentage of the working-age (15-64) members of the household who worked in a non-farm family enterprise in the last 7 days.
ext_work | % of working adults working in wage work | Percentage of the working-age (15-64) members of the household who worked in wage work in the last 7 days.
nfe | Ownership of non-farm family enterprise. 0=No, 1=Yes | Ownership of a non-farm family enterprise, irrespective of who owns it in the household.
crop | Crop cultivation. 0=No, 1=Yes | The household cultivates any crops, irrespective of the final destination of the output.
crop_number | Number of crops cultivated | Number of crops cultivated by the household. Production note: Calculate only if the household cultivates any crops (crop=1).
cash_crop | Cash crop cultivation. 0=No, 1=Yes | The household cultivates at least one of the main cash crops in the country. Production note: Calculate only if the household cultivates any crops (crop=1).
org_fert | Use of organic fertilizer. 0=No, 1=Yes | The household uses organic fertilizer in at least one plot. Production note: Calculate only if the household cultivates any crops (crop=1).
inorg_fert | Use of inorganic fertilizer. 0=No, 1=Yes | The household uses inorganic fertilizer in at least one plot. Production note: Calculate only if the household cultivates any crops (crop=1).
pest_fung_herb | Use of pesticides, fungicides or herbicides. 0=No, 1=Yes | The household uses pesticides, fungicides or herbicides in at least one plot. Production note: Calculate only if the household cultivates any crops (crop=1).
hired_lab | Use of hired labor. 0=No, 1=Yes | The household uses hired labor in at least one plot. This excludes post-harvest activities. Production note: Calculate only if the household cultivates any crops (crop=1).
ex_fr_lab | Use of exchange and/or free labor. 0=No, 1=Yes | The household uses exchange and/or free labor in at least one plot. This excludes post-harvest activities. Production note: Calculate only if the household cultivates any crops (crop=1).
hired_lab_ph | Use of hired labor for post-harvest activities. 0=No, 1=Yes | The household uses hired labor for at least one of the harvested crops. Production note: Calculate only if the household cultivates any crops (crop=1).
ex_fr_lab_ph | Use of exchange and/or free labor for post-harvest activities. 0=No, 1=Yes | The household uses exchange and/or free labor for at least one of the harvested crops. Production note: Calculate only if the household cultivates any crops (crop=1).
tractor | Use of tractor. 0=No, 1=Yes | The household uses a tractor in at least one plot. Production note: Calculate only if the household cultivates any crops (crop=1).
ph_loss | Post-harvest crop loss. 0=No, 1=Yes | The household lost a portion of any crop after harvest. Production note: Calculate only if the household cultivates any crops (crop=1).
sell_crop | Sale of crop. 0=No, 1=Yes | The household sells a portion of the harvest of at least one crop in any state. Yes includes sales of unprocessed and/or processed crops. Production note: Calculate only if the household cultivates any crops (crop=1).
sell_process | Sale of processed crop. 0=No, 1=Yes | The household sells a portion of processed crops. Production note: Calculate only if the household cultivates any crops (crop=1).
sell_unprocess | Sale of unprocessed crop. 0=No, 1=Yes | The household sells a portion of unprocessed crops. Production note: Calculate only if the household cultivates any crops (crop=1).
livestock | Ownership of livestock. 0=No, 1=Yes | Ownership of any livestock by the household.
lruminant | Ownership of large ruminant. 0=No, 1=Yes | Ownership of any large ruminants (cattle) by the household. Production note: lruminant equals 0 (No) if the household does not own any livestock (livestock=0).
sruminant | Ownership of small ruminant. 0=No, 1=Yes | Ownership of any small ruminants (sheep/goat) by the household. Production note: sruminant equals 0 (No) if the household does not own any livestock (livestock=0).
poultry | Ownership of poultry. 0=No, 1=Yes | Ownership of any chicken or other poultry by the household. It refers to any type of bird, including geese and doves. Production note: poultry equals 0 (No) if the household does not own any livestock (livestock=0).
equines | Ownership of equine. 0=No, 1=Yes | Ownership of any horses/donkeys by the household. Production note: equines equals 0 (No) if the household does not own any livestock (livestock=0).
camelids | Ownership of camelid. 0=No, 1=Yes | Ownership of any camelids by the household. Production note: camelids equals 0 (No) if the household does not own any livestock (livestock=0).
pig | Ownership of pig. 0=No, 1=Yes | Ownership of any pigs by the household. Production note: pig equals 0 (No) if the household does not own any livestock (livestock=0).
bee | Ownership of bee colonies. 0=No, 1=Yes | Ownership of any bee colonies by the household. Production note: bee equals 0 (No) if the household does not own any livestock (livestock=0).
phone_sample | Phone sample. 0=No, 1=Yes | The household is selected for the phone survey.
contact_rX | Successfully contacted (Round X). 0=No, 1=Yes | The household was successfully contacted (Round X). Production note: Calculate only if the household is selected for the phone survey (phone_sample=1).
interview_rX | Interviewed (Round X). 0=No, 1=Yes | The household was interviewed (Round X). Yes includes interviews that were completed and partially completed. Production note: Calculate only if the household is successfully contacted (contact_rX=1).
complete_rX | Interview completed (Round X). 0=No, 1=Yes | The interview was completed for the household (Round X). Production note: Calculate only if the household is interviewed (interview_rX=1).
hhsize_rX | Household size (Round X) | Number of household members (based on the country-specific definition of a household) from Round X. Production note: hhsize_rX should equal m0_14_rX + m15_64_rX + m65_rX + f0_14_rX + f15_64_rX + f65_rX. Calculate only if the household completed the interview (complete_rX=1).
m0_14_rX | Number of males aged 0 to 14 (Round X) | Number of male household members aged 0 to 14 years from Round X. Production note: Undefined age is counted as aged 15 to 64 years. Calculate only if the household completed the interview (complete_rX=1).
m15_64_rX | Number of males aged 15 to 64 (Round X) | Number of male household members aged 15 to 64 years from Round X. Production note: Undefined age is counted as aged 15 to 64 years. Calculate only if the household completed the interview (complete_rX=1).
m65_rX | Number of males aged 65 and above (Round X) | Number of male household members aged 65 years and above from Round X. Production note: Undefined age is counted as aged 15 to 64 years. Calculate only if the household completed the interview (complete_rX=1).
f0_14_rX | Number of females aged 0 to 14 (Round X) | Number of female household members aged 0 to 14 years from Round X. Production note: Undefined age is counted as aged 15 to 64 years. Calculate only if the household completed the interview (complete_rX=1).
f15_64_rX | Number of females aged 15 to 64 (Round X) | Number of female household members aged 15 to 64 years from Round X. Production note: Undefined age is counted as aged 15 to 64 years. Calculate only if the household completed the interview (complete_rX=1).
f65_rX | Number of females aged 65 and above (Round X) | Number of female household members aged 65 years and above from Round X. Production note: Undefined age is counted as aged 15 to 64 years. Calculate only if the household completed the interview (complete_rX=1).
adulteq_rX | Adult equivalence (Round X) | Number of adult equivalents in the household, computed based on the standard FAO scale. Production note: Calculate only if the household completed the interview (complete_rX=1). Calculate for each household by summing the following adult equivalent factors, assigned to each member according to his/her age and sex:

    Age             Male    Female
    <1 yr           0.27    0.27
    1-3 yrs         0.45    0.45
    4-6 yrs         0.61    0.61
    7-9 yrs         0.73    0.73
    10-12 yrs       0.86    0.78
    13-15 yrs       0.96    0.83
    16-19 yrs       1.02    0.77
    20 and above    1.00    0.73

fies_mod_rX | Probability of being moderately/severely food insecure >= 50% (Round X) | The probability of the household being moderately or severely food insecure is at least 50% (p>=0.5) (Round X). This variable is computed based on the Food Insecurity Experience Scale (FIES) methodology.
fies_sev_rX | Probability of being severely food insecure >= 50% (Round X) | The probability of the household being severely food insecure is at least 50% (p>=0.5) (Round X). This variable is computed based on the Food Insecurity Experience Scale (FIES) methodology.
head_chg_rX | Household head changed (Round X). 0=No, 1=Yes | The household member identified as head of household in Round X is different from the head identified in the previous round of the survey. Production note: Calculate only if the household is successfully interviewed (complete_rX=1).
respond_chg_rX | Respondent changed (Round X). 0=No, 1=Yes | The respondent in Round X differs from the respondent in the previous round of the survey in which the household was successfully contacted. Production note: Calculate only if the household is interviewed (interview_rX=1).
wt_rX | Cross section household weight (Round X) | Cross-section weighting coefficient to be used in all calculations referring to household-level data.
wt_panel_rX | Panel household weight (Round X) | Panel weighting coefficient that applies only to households that were successfully interviewed in all X rounds.
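The production note for adulteq_rX describes summing a factor per household member by age and sex. A minimal Python sketch of that calculation follows, under stated assumptions: the function names and the (age, sex) input layout are hypothetical and are not taken from the LSMS do-files; sex uses the 1=Male, 2=Female coding of the individual-level file; and because the dictionary does not say how members with a missing age are handled for this variable, the sketch simply returns None for such households.

```python
# Adult-equivalent factors from the adulteq_rX entry:
# (lower age bound in years, male factor, female factor).
AE_SCALE = [
    (0, 0.27, 0.27),   # <1 yr
    (1, 0.45, 0.45),   # 1-3 yrs
    (4, 0.61, 0.61),   # 4-6 yrs
    (7, 0.73, 0.73),   # 7-9 yrs
    (10, 0.86, 0.78),  # 10-12 yrs
    (13, 0.96, 0.83),  # 13-15 yrs
    (16, 1.02, 0.77),  # 16-19 yrs
    (20, 1.00, 0.73),  # 20 and above
]


def adult_equivalent_factor(age, sex):
    """Return the factor for one member (sex: 1=Male, 2=Female)."""
    if sex not in (1, 2):
        raise ValueError("sex must be 1 (Male) or 2 (Female)")
    if age < 0:
        raise ValueError("age must be non-negative")
    factor = None
    # Bands are sorted by lower bound, so the last band whose
    # lower bound does not exceed the age is the matching one.
    for lower, male, female in AE_SCALE:
        if age >= lower:
            factor = male if sex == 1 else female
    return factor


def household_adult_equivalents(members):
    """Sum the factors over (age, sex) pairs for one household.

    Assumption: returns None when any member's age is missing (None).
    """
    total = 0.0
    for age, sex in members:
        if age is None:
            return None
        total += adult_equivalent_factor(age, sex)
    return round(total, 2)


# Example household: man aged 35, woman aged 32, girl aged 8, infant boy.
example = [(35, 1), (32, 2), (8, 2), (0, 1)]
print(household_adult_equivalents(example))
```

For the example household the factors are 1.00, 0.73, 0.73 and 0.27, so the sum is 2.73 adult equivalents.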
Individual-level file

The datafile is named CCC_IND, where CCC is the three-letter ISO country code. In addition to the harmonized variables, the datafile also includes country-specific geographic variables.

Name | Labels and codes | Instructions / notes
hhid | Household ID | Household unique identifier. Production note: The variable name and the values are kept as they are in the Round 0 original dataset.
indiv | Individual ID | Individual identifier. It uniquely identifies the individual when combined with the household unique identifier. Production note: The variable name and the values are kept as they are in the Round 0+ original dataset.
sex | Sex. 1=Male, 2=Female | Sex of the household member.
age | Age (Round 0) | Age (in years) of the household member at Round 0. Production note: The values are taken from the Round 0 original dataset. Unknown values are saved as missing.
age_p | Age (Round 1+) | Age (in years) of the household member at Round 1+. Production note: The values are taken from the Round 1+ (phone survey) dataset. Unknown values are saved as missing. The values are updated with the latest data available.
married | Currently married. 0=No, 1=Yes | The household member is currently married. Yes includes married (monogamous and polygamous) and informal/loose/civil union.
form_married | Formerly married. 0=No, 1=Yes | The household member was formerly married. Yes includes divorced, separated, and widowed.
nev_married | Never married. 0=No, 1=Yes | The household member has never been married. Yes includes single and never married.
disability | With disability. 0=No, 1=Yes | The household member has a disability. This variable is calculated based on the Washington Group on Disability Statistics methodology. Production note: Calculate it based on the cut-off recommended by the Washington Group for the Washington Group Short Set on Functioning (WG-SS): a person is counted as having a disability if at least one domain/question is coded A LOT OF DIFFICULTY or CANNOT DO AT ALL.
religion | Religion. 1=Christianity, 2=Islam, 3=Other | Religion of the household member. This variable is obtained by recoding country-specific information.
literacy | Literacy. 0=No, 1=Yes | The household member can read and write in any language.
educ | Highest level of education completed. 0=None, 1=Primary, 2=Secondary, 3=Tertiary | Highest level of education completed by the household member. This variable is obtained by recoding country-specific information. The best possible match is sought, but the correspondence between country-specific values and the standardized codes may be imperfect.
work | Working status. 0=No, 1=Yes | The household member has worked in the last 7 days.
member_rX | Member of household (Round X). 0=No, 1=Yes | The person is identified as a member of the household (Round X). Production note: Calculate only if the household is successfully interviewed (complete_rX=1).
head_rX | Head of household (Round X). 0=No, 1=Yes | The household member is identified as head of the household (Round X). Production note: For this dataset, there should be one and only one head per household. If there is more than one household head, the oldest one is considered the head. Calculate only if the household is successfully interviewed (complete_rX=1).
respond_rX | Respondent (Round X). 0=No, 1=Yes | The household member is a respondent of the survey (Round X). Production note: Calculate only if the household is interviewed (interview_rX=1).
relation_rX | Relationship to head of household (Round X). Country-specific codes | Relationship of the household member to the head of the household (Round X). The country-specific information is kept as is. Production note: Calculate only if the household is successfully interviewed (complete_rX=1).

Reference

Dupriez, Olivier (2005). Poverty PPPs: Building a Database on Household Consumption Profiles by ICP Basic Heading, Description of Work. Unpublished document, World Bank Development Data Group.