Policy Research Working Paper                          9622




          When Distance Drives Destination,
          Towns Can Stimulate Development
                                  Joachim De Weerdt
                                   Luc Christiaensen
                                     Ravi Kanbur




Social Protection and Jobs Global Practice
April 2021
Policy Research Working Paper 9622


  Abstract
 While city migrants see their welfare increase much more                           trumps the attraction from their promise of greater wealth,
 than those moving to towns, many more rural-urban                                  making towns more appealing destinations. Education
 migrants end up in towns. This phenomenon, docu-                                   mitigates these effects (lesser deterrence from distance,
 mented in detail in Kagera, Tanzania, begs the question                            greater attraction from wealth), while poverty reduces the
 why migrants move to seemingly suboptimal destinations.                            attraction of wealth, consistent with the notion of urban
 Using an 18-year panel of individuals from this region and                         sorting. With about two-thirds of the rural population in
 information on the possible destinations from the census,                          low-income countries living within two hours from a town,
 this study documents, through dyadic regressions and con-                          these findings underscore the importance of vibrant towns
 trolling for individual heterogeneity, how the deterrence                          for inclusive development.
 of further distance to cities (compared to towns) largely




 This paper is a product of the Social Protection and Jobs Global Practice. It is part of a larger effort by the World Bank to
 provide open access to its research and make a contribution to development policy discussions around the world. Policy
 Research Working Papers are also posted on the Web at http://www.worldbank.org/prwp. The authors may be contacted
 at lchristiaensen@worldbank.org.




         The Policy Research Working Paper Series disseminates the findings of work in progress to encourage the exchange of ideas about development
         issues. An objective of the series is to get the findings out quickly, even if the presentations are less than fully polished. The papers carry the
         names of the authors and should be cited accordingly. The findings, interpretations, and conclusions expressed in this paper are entirely those
         of the authors. They do not necessarily represent the views of the International Bank for Reconstruction and Development/World Bank and
         its affiliated organizations, or those of the Executive Directors of the World Bank or the governments they represent.


                                                       Produced by the Research Support Team
                   When Distance Drives Destination,
                                                                                    *
                  Towns Can Stimulate Development

                                    „                               …                         §
        Joachim De Weerdt                  Luc Christiaensen               Ravi Kanbur




      JEL Codes: J61, O15, O55.
      Key Words: Africa, internal migration, urbanization, secondary towns.




   * We thank Kalle Hirvonen for useful comments and Economic Development Initiatives (EDI), the Tanza-
nian National Bureau of Statistics (NBS) and the Organisation for Economic Co-operation and Development
(OECD) for making the underlying data publically available. We acknowledge funding from Excellence of
Science (EOS) Research Project 30784531 at the Research Foundation – Flanders (FWO).
   „ IOB, University of Antwerp & LICOS, KU Leuven. E-mail joachim.deweerdt@uantwerp.be
   … The World Bank. E-mail lchristiaensen@worldbank.org
   § Cornell University. Email sk145@cornell.edu
1    Introduction

In most of the developing world, wealth is unequally spread geographically and increases
with city size (Ferre et al. 2012; Young, 2013; Gollin et al., 2021). This suggests migration
as an important avenue for growth and poverty reduction, affecting both those who move
(McKenzie et al. 2010, Beegle et al. 2011, Bryan et al. 2014) and those who remain behind
(Giles and Yoo 2007, Kinnan et al. 2018, De Weerdt and Hirvonen 2016, Morten 2019).
Moves to big cities are, on average, much more lucrative than moves to small towns. But
survey data show that few people make it to the cities and many more migrants end up with
more modest income gains realized through moves to smaller towns (Christiaensen et al.,
2019). A key question is therefore how migrants choose their destinations: why do migrants
move to seemingly suboptimal locations?

One consideration is that there are perhaps a number of nonmonetary amenities, such as
health, public goods, crime or pollution that serve to compensate for the lower consumption
levels (Dustmann and Okatenko, 2014). Gollin et al. (2021) argue against this possibility by
showing that in 20 developing countries and across a range of non-monetary amenities the
urban gradient remains clearly visible: practically every amenity they can measure seems
to improve with population density. The authors interpret this as evidence in favor of a
friction model in which people’s mobility is somehow restricted, which allows the spatial
disequilibrium to persist. Support for this view from our study setting in Tanzania comes
from Ingelaere et al. (2018) who show that there is a widespread desire to move out of
the rural areas and into larger urban agglomerations, especially by younger people. But
despite their acute awareness of the spatial differences in living standards, for the average
young person living in a remote village, moving is also considered a daunting, sometimes
impossible, endeavor.

Much of the migration literature over the past couple of decades has further emphasized the
importance of selection, with the younger, better educated and richer found more likely to
move. They are often better equipped to overcome migration costs and could stand to benefit
more from the employment opportunities for the skilled that urban areas offer (Young, 2013;
Lucas 2015; Hamory, 2020). As those categories are also smaller in number, this could help


                                             1
explain the smaller number eventually making it to the city, compared to the towns.

We use household survey and census data from Tanzania to explore our key question, why
many more migrants move to seemingly suboptimal towns. As in other settings documented
in the literature, the data confirm the existence of rising living standards across the urban
spectrum, from rural areas over towns to cities. That urban gradient is visible both in terms
of assets and housing wealth as well as amenities. Overlaying the wealth distributions, we
find that living standards of the 90th percentile in rural areas still lie below those of the 10th
percentile in urban areas.

This is problematic for the basic Todaro (1969) model, which assumes that the rural wage
needs to equal the weighted average of the high and low urban wage, with the weights being
the likelihood of getting each. Kanbur et al. (2019) consider an extension of the basic
model that introduces migration costs.1 These can bring the Todaro model back in line
with the empirical observation in our setting that the rural wage is lower than both the
lower and higher urban wage. Their second extension to the model is to consider multiple
destinations (town or city), each characterized by different formal and informal destination
wages (Kanbur et al., 2019).2

Guided by this model, we use a novel data set on migrants for whom we have pre-migration
and post-migration survey data to explore how migration costs and expected income at
destination affect destination choice. At baseline in the early 1990s, the respondents all lived
in Kagera, a large, remote and primarily rural region in the northwestern part of Tanzania.
By 2010 about half of those who were still alive had moved out to other villages, towns
or cities. We use the pre-migration survey data to create a dyadic data set of all possible
migrant-destination pairs, among which the migrant will have chosen one and not chosen all
others. We then use census data to construct measures of the basic Todaro building blocks
(expected wages and migration costs), allowing us to test the relative importance of each.

Particularly, we proxy migration costs with distance. The use of distance as a proxy for
   1
     Migration costs are broadly defined here, including transport costs but also the risk associated with the
move as well as setup and job search costs at the destination, and so forth. They typically increase with
distance.
   2
     Similarly, Lagakos et al. (2020) emphasize the importance of heterogeneous migration costs in under-
standing observed migration patterns and returns across destinations (and individual attributes).


                                                     2
migration costs (broadly defined) and migration deterrent has a long history in the migra-
tion literature (Ravenstein, 1885; Stewart, 1941; Schwarz, 1973; Lucas, 2001; Lucas 2015).3
Migration networks (such as the presence of a family member at destination) could help
overcome migration costs, even over large distances (Munshi, 2003; Bao et al. 2007; Stuart
and Taylor, 2019), though the presence of a migration network is often itself inversely related
with distance.

The results confirm that destinations that are closer and have higher local living standards
are more likely to be chosen, but distance is, by quite some margin, the most important
determinant. Expected wages would need to go up by 5.7 standard deviations to offset a
1 standard deviation increase in distance. Education increases the attractiveness of higher
living standards at destination. Nonetheless, simulations show that towns still trump cities
in our sample over most of the education gradient. Only after completing upper secondary
education do cities become more attractive. The poor, on the other hand, are even more
deterred by the effects of distance, while being less inclined to respond to the benefits from a
higher (expected) living standard at destination. With the larger share of rural populations
being poorer, less educated and living closer to towns than to large urban centers, it does
not surprise that a smaller than expected share of rural migrants eventually ends up in the
larger, more distant cities.

The paper proceeds as follows. We first present the data and characterize the study area
of Kagera and its migrants in more detail (section 2). Section 3 motivates the regression
specification and identification strategy analyzing the effects of distance and destination
attractiveness on destination choice, as well as how they change given key migrant charac-
teristics. Section 4 presents the empirical results and interprets them with respect to the
observed migration patterns of Kagera’s population (more to towns than to cities). Section
5 concludes.
   3
    Ravenstein’s first law of migration, formulated in 1885 and derived from observing place of birth in the
British 1871 and 1881 censuses, states that most migrants move only a short distance (Lucas, 2001). The
importance of distance for migration has further been generalized into a gravity model of migration, in which
the number of people migrating is inversely proportional to the square of the distance between origin and
destination and proportional to the product of their populations (Stewart, 1941).




                                                     3
2     Context, data and the urban gradient

This section introduces our two data sources: the Kagera Health and Development Survey
(KHDS) and the 2002 Tanzania Population and Housing Census. Within each we demon-
strate rising living standards from villages across towns to cities, which we refer to as the
urban gradient. In KHDS the gradient is demonstrated for changes in migrant welfare in a
panel data set, while in the census it is shown to hold cross-sectionally for all residents of
these areas.



2.1    Kagera

The migrants we study all originate from the Kagera region of Tanzania. Figure 1 provides a
map showing where the region is situated within Tanzania. Kagera, being primarily depen-
dent on agriculture and far away from the ocean, is badly connected to the global economy.
It is typical of remote parts of Africa where policy prescriptions for economic growth are
elusive (De Weerdt, 2010). Given small plot sizes (typically 3 or 4 acres per household)
and relatively low-productivity agriculture, internal migration will be an important aspect
of growth and poverty reduction for this region.

In the context of this paper it is useful to highlight some of the geographical specificities of
the Kagera region, which may be relevant for the study of destination choice and in particular
whether urban migrants end up in large cities or smaller towns. The top panel of Figure 1
shows the baseline villages from the KHDS survey where all our respondents originate (the
survey is described in the next subsection) and the bottom panel shows the urban areas in
the country, split into small towns (10-100k), big towns (100-500k) and cities (>500k).

We see that Kagera lies at the periphery of the country and has, within its borders, only
one large town (Bukoba) and no cities. The closest city is Mwanza on the southern shores
of Lake Victoria, while Tanzania’s prime city, Dar es Salaam, is located at the other end of
the country. In short, Kagera is a relatively remote region and that should be kept in mind
when analyzing urban destination choices of migrants originating from Kagera.


                                              4
2.2     KHDS

The Kagera Health and Development Survey (KHDS) is a study into the long-run wealth
dynamics of households and individuals within Northwest Tanzania. This study entails the
resurvey of a panel of households, originally interviewed for 4 rounds from 1991 to 1994.
Resurveys were then organized in 2004 and 2010. A multi-topic household questionnaire is
administered to all split-off households originating from the baseline households, including
those that have moved out of the baseline location.

This constitutes one of the longest-running African panel data sets of this nature and offers an
unprecedented set of research opportunities for examining long-run (nearly 20 years) trends
in and mechanisms of poverty persistence and economic growth in rural households. As the
children of the original respondents have now formed their own households, intergenerational
and migration issues can be addressed by the survey data. The data is of particularly high
quality and the 2010 round of the survey was conducted using electronic survey questionnaires
administered on handheld computers, with automated skips and validation checks run during
the interview when errors could be corrected at source. The resulting improvements in data
quality have been formally assessed by Caeyers et al. (2012).

KHDS has maintained a highly successful tracking rate. Table 1 shows that in 2010 88%
of the original 6,353 respondents had either been located and interviewed, or, if deceased,
sufficient information regarding the circumstances of their death collected.

In 2010, households were found in three cities: Dar es Salaam, Mwanza and Kampala. This
is defining cities as agglomerations with more than 500,000 inhabitants. The 2012 census
puts the population of Dar es Salaam at 4.36m and Mwanza at 0.7m.4 There are a further 21
respondents who moved to areas that, while administratively recognized within Tanzania as
cities, have a population below 500,000 (Arusha, Tanga and Mbeya). Results do not change
   4
    For Mwanza this is the sum of Nyamagana Municipal Council (363,452) and Ilemela Municipal Council
(343,001). More detailed analysis using 2002 census data (Wenban-Smith and Ambroz, 2014) showed that
while the municipal districts of Mwanza counted a population of 596,885, only 65% of these lived in strictly
urban wards (others lived in rural or mixed wards). The current census does not yet disclose the new
categorizations, but applying the same rural-urban ratio of 65% to the current census figure gives us an
urban population of 456,631. This estimate lies under 500,000, but we decided to treat Mwanza as a city
based on the district-level figures.



                                                     5
if we change the definition from population-based to administrative. All KHDS locations
were matched with their census ward-level classification, which distinguish between urban,
mixed and rural areas. All urban and mixed areas are classified as towns (or cities if in
Mwanza, Dar or Kampala); all rural areas as rural. The KHDS CAPI application had a
series of conditional drop-down menus where the interviewer chose the region, district, ward
and village in which the household was located. These were pre-populated with the exact
census locations and codes, so that the matching exercise is perfect. The census classification
is based on 2002 data and the average migrant respondent in our survey moved in 2003.

The consumption data originate from extensive food and non-food consumption modules
in the survey, carefully designed to maintain comparability across survey rounds and con-
trolling for seasonality. The consumption aggregate includes home produced and purchased
food and non-food expenditure. The non-food component includes a range of non-food pur-
chases, as well as utilities, expenditure on clothing/personal items, transfers out, and health
expenditures. Funeral expenses and health expenses prior to the death of an ill person were
excluded. Conservatively, rent is also excluded from the aggregate to avoid large differences
in prices for similar quality housing being the driver of any measured urban-rural disparities.
The aggregates are temporally and spatially deflated using data from the price question-
naires included in each survey round. As household size may differ between urban and rural
households, consumption is expressed in per adult equivalent units rather than per capita.
The poverty line is set at 326,474.2 Tanzanian shillings (TSh),5 calibrated to yield for our
sample of respondents who remained in Kagera the same poverty rate as the 2007 National
Household Budget Survey estimate for rural areas (37.6 percent).

Unless otherwise indicated the analysis below is conducted at the individual level.

Decomposition analysis by Christiaensen et al. (2019), illustrated again in Table 2, shows
that moves to cities are more lucrative for those who make them, but that many more
migrants end up in secondary towns. Furthermore, while there was similarly clear stochastic
dominance between future town migrants and future city migrants by 2010, there was no
stochastic dominance during the early 1990s at baseline (Figure 2), suggesting that town
and city migrants were observationally equivalent (at least by consumption per capita). For
  5
      At the time of the survey, one US dollar was worth around TSh 1,450.


                                                     6
migrants to rural areas the stochastic dominance was already present at baseline and gets
exacerbated by the end line (except for consumption per capita values to the far right of the
distribution, where one would unlikely want to place a poverty line). There is a clear urban
gradient to poverty, just as in Ferre et al. (2012) and Lanjouw and Marra (2018).



2.3    Census

The KHDS survey informs us on the plight of the average Kagera migrant at the different
destinations. This section considers the general population characteristics across the whole
of Tanzania. In 2002, about halfway between the baseline and endline KHDS rounds, the
Tanzania Population and Housing Census was conducted. The Integrated Public Use Mi-
crodata Series (IPUMS) provides access to the long-form census data on over 3.7 million
individuals in the country, which can be disaggregated by district, and within each district
by rural and urban areas, giving a total of 254 geographical areas to potentially consider
(Minnesota Population Center, 2014).

The long-form sample contains basic census information, such as the age and sex of all
members, from which we can derive household size and a dependency ratio. Information on
housing characteristics includes the number of bedrooms in the dwelling and the materials
out of which the roof, walls and floors are made. Further details on utilities allow us to
calculate who has electricity, piped water and a flush toilet. We further know about asset
ownership. For each individual there is information on their employment status and the
industry or sector they work in. The location means for each variable are presented in
Table 3.

Many of the variables collected reflect the wealth of the household. Lacking weights, such
as prices, to meaningfully add up the various wealth components available in our data, we
reduce the dimensionality of our wealth data by extracting the first principal component of
a number of housing, utility and asset outcomes of sampled households.

We use the following variables for the principal component analysis: number of bedrooms
per person, whether the dwelling has an iron sheet roof (or better), walls from baked bricks


                                             7
(or better), a tile or cement floor, electric lighting, piped water, a flush toilet and whether
it owns a radio, owns a phone and owns an iron.

Our asset index combines housing and other assets with amenities. Yet, amenities (such
as contentment with local public services and area security) often explain more of people’s
intention to migrate than satisfaction with their personal assets and living standards (Dust-
mann and Okatenko, 2014). To help distinguish between the effect of these two types of
wealth (private/public) on attracting migrants, we also create a separate asset index based
on variables that relate purely to housing and other asset wealth (number of bedrooms per
person, type of roof, wall, or floor and ownership of radio, iron and phone), as well as an
amenities index based purely on the subset of variables relating to amenities (availability of
electric lighting, piped water and flush toilet).

Housing, utilities and assets outcomes improve as one moves from villages over towns to
cities, in line with what Gollin et al. (2021) document for much of the developing world
(Table 3). This is reflected in the constructed wealth index which is much higher in towns
than in rural areas and higher still in cities compared to towns (see also Figure 2). We
also see that, with respect to the constructed wealth index, the 10th percentile in cities
already outperforms the 90th percentile in rural areas. More broadly, the town distribution
of wealth mostly dominates this of rural areas, while the city wealth distribution dominates
both (Figure 3).

Furthermore, urban households are smaller, younger and their members have more years of
formal education. City households are also slightly more educated than town households (0.6
more years of formal education on average – maximum years of education in the household is
8.64 years on average in the city versus 8.01 in towns). The census asks whether the household
experienced any death among its members in the 12 months preceding the interview, followed
by a question on the sex and age of the deceased. Somewhat fewer urban households were
bereaved and in cities the age of the deceased is higher, although the size effects are not
large. Very few rural households have a non-seasonally employed member, but respectively
24% and 38% of town and city households do. Finally, while only 9% of households have
a member working outside of the agricultural sector in rural areas, 54% and 74% do so in
towns and cities.


                                               8
3     Determinants of destination choice

3.1    The extended Todaro model

The previous analysis has highlighted at least two stylized facts, which are not in line with
the basic Todaro (1969) model. First, urban welfare levels clearly dominate rural welfare
levels, across the whole distribution. This is true when we look at data on those who
migrate to these locations in the KHDS, or on all those residing there in the census data.
In the census, for example, Table 3 shows the constructed wealth index in the rural areas
going from -0.99 at the 10th percentile to 0.29 at the 90th percentile, while in cities these
same percentiles span 0.33 to 2.73: even the lowest city welfare level, measured this way,
outperforms that of the highest rural ones. Similarly, KHDS shows that moves to towns or
cities clearly outperform not moving or moves to rural areas.

These facts are problematic for the original Todaro model, as the model requires rural wages
to equate the expected city wages in equilibrium. Kanbur et al. (2019) extend the model
by introducing a cost to moving for each migrant (financial, social, psychological,...). The
equilibrium condition now requires that rural wages plus migration costs need to equal the
expected urban wage, so that there is a range of costs that can make the Todaro model fit
the stylized facts.

Second, both KHDS and the census show that the economy is not dualistic, but displays a
clear gradient from villages, over towns to cities. Thus, the second extension the authors
make to the Todaro model is the introduction of multiple destinations.

The model highlights the role of a number of key variables when it comes to destination
choice: the cost of moving and the destination wages. The next section introduces these
variables into a regression framework to assess their relative importance and their interaction
with poverty, education and age.

We use distance to destination as a proxy for the cost of moving. While the use of distance
has been somewhat ignored in the more recent development literature, it has a long tradition



                                              9
in migration (Lucas, 2001). The focus on distance as proxy for migration deterrent in this
study, is further motivated by the work of Ingelaere et al. (2018), who conduct detailed life
history interviews with 75 respondents sampled from the Kagera Health and Development
Survey (KHDS), the same survey underlying the analysis in this paper. They show how
respondents themselves, when narrating their life histories, talked about distance as an
impediment to moving: it raises the cost of moving, it complicates returning home to visit
or when things go wrong at destination, it makes it difficult to maintain ties, can erode
informal property rights and typically comes with higher cultural barriers to moving.



3.2    Econometric specification

If we are to understand destination choice among migrants then it is as important to know
where the migrant moved to as it is to know which potential destinations the migrant did
not migrate to. Within the census we can identify 78 locations that were destinations for
KHDS migrants, representing either urban or rural locations in 57 districts. Our analysis
makes the assumption that these 78 locations are the potential destinations for our sample
of migrants. They revealed themselves as practically feasible destinations for at least one of
our migrants.

Each of the migrants in our sample has chosen to move to one potential destination; and
has therefore also chosen not to move to the 77 other potential destinations. In order to
understand that choice better, we create a dyadic data set that contains 78 observations for
each migrant i; one observation for each potential destination d. Our dependent variable
Yid is a dummy equal to one if i was found in location d during the last survey round and
zero otherwise. Our analysis will consist of studying the correlates of Yid . This kind of
dyadic analysis has been used before to understand destination choice by Fafchamps and
Shilpi (2013). We follow these authors in analyzing only those who have actually migrated
in order to not confound the destination choice with the migration choice. We also analyze
separately those who have moved to rural areas and those who have moved to urban areas.

Destination choice Yid will depend on the observed and unobserved characteristics of the
individual (including those related to household and community circumstances), the desti-

                                             10
nation (such as the standard of living the migrant would expect to achieve at destination)
and the relation between the individual and the location (such as distance to destination or
the interaction between the individual’s wealth level and the destination’s wealth level).

Our data setup has 78 observations per individual, which allows us to address observed and
unobserved individual characteristics of the individual by including an individual fixed effect
αi in the regression. Another econometric concern is whether the structure of the error
terms is influenced by the baseline survey’s two-stage sampling design. We therefore report
standard errors that account for clustering within each of the 51 KHDS villages of origin,
which were the primary sampling units in the baseline survey (Abadie et al., 2017).

The key independent variables of interest include the characteristics of the destination district
in vector Dd and relational variables in vector Rid that are specific to the i − d pair, such as
distance of d to i’s baseline village and how the effect of destination characteristics changes
with individual characteristics such as education interacted with destination characteristics.
We do not include any individual, household or community level effects. These are controlled
for by the inclusion of the individual fixed effect αi , which helps protect our estimated
coefficients on the destination and relational variables of interest from bias due to unobserved
individual heterogeneity (selection).

We then estimate the following equation:


                                 Yid = Dd β1 + Rid β2 + αi +   id ,                          (1)


where   id   is an error term.

In our most basic regression setup we populate Dd average wealth at destination and Rid
with the natural logarithm of the km distance of the destination district to the baseline
community.




                                               11
4     Results

4.1    Distance is not dead

In the first column of Table 4 we estimate Equation 1 on a dyadic data set that links all
migrants to all destinations. Consistent with the insights from the literature, both wealth and
distance are significantly correlated to destination choice, with opposite signs: the further
away is the potential destination, the less likely it will be chosen; the wealthier the location
is, the more likely it will be chosen. The (absolute) magnitude of the distance coefficient is,
however, 4 times that of the wealth coefficient. Multiplying those coefficients by the standard
deviations of the corresponding variables in the dyadic data set (0.97 for distance and 0.71
for wealth), we see that average wealth at destination would need to go up by 5.7 standard
deviations to offset the negative effect of a one standard deviation increase in distance.

These results hold controlling for individual characteristics. However, this does not exclude
that the deterring and appealing effects of distance and wealth, respectively, differ by indi-
vidual characteristics. We explore this by interacting distance and wealth at destination with
baseline poverty status (1=poor) (Table 4, column 2), years of formal education (column 3),
and baseline age (column 4). Coming from a poor household increases the negative effect of
distance and decreases the attraction of high wealth at destination. Similarly, education at-
tenuates the friction related to distance and enlarges the pull of high wealth areas, consistent
with high wealth areas offering more opportunities for higher educated people. Interactions
with age show that older people display a slightly lower pull-effect of wealth and are equally
affected by distance as young people.

Our wealth index includes both housing and other asset wealth, as well as amenities. We
investigate whether either of these components dominates destination choice and enter sep-
arate indices for each of these components of wealth as well as their interaction (column 5).
Only the interaction of the two indices is statistically significant, suggesting that neither the
assets nor the amenities component dominate the effects of the wealth index, but both are
complementary to each other.



                                              12
Finally, we explore whether distance and wealth have different effects depending on whether
the potential destination is urban or rural (column 6). To do so, we include a dummy variable
indicating the urban status of the destination as a regressor, as well as its interactions with
distance and wealth. Urban destinations are substantially less likely to be chosen, but
controlling for whether the destination is urban or rural, we do not see any differential effect
in the pull effect of the destination’s wealth. The deterring effect of distance also remains,
but is twice as strong for rural than for urban destinations.

We explore these differences between urban and rural migrants further by altering how we
form the dyads. In Table 5 we link all urban migrants to all 47 urban locations. For urban
migrants selecting urban locations distance and wealth are both significantly associated with
destination choice, with coefficients roughly of equal absolute magnitude. At a per standard
deviation change the absolute magnitude for distance (sd=0.91 in this dataset pairing urban
migrants with urban destinations) is 1.6 times greater than that for wealth (sd=0.50). In
Table 6 we link all rural migrants to all 31 rural locations. Again, both distance and wealth
matter, but expressed in standard deviation change (sd=1.03 for distance and a much lower
sd=0.20 for wealth in the rural-rural dyadic dataset) distance outweighs wealth by a factor
of 7.5.

For urban migrants, interactions with poverty and educational status at the outset further
show that the pulling power of urban destination wealth is reduced for poorer households.
Better educated individuals, on the other hand, tend to find wealthier urban destinations
more attractive than lesser educated individuals and also tend to be less deterred by distance.
Nonetheless, the deterring effect of distance remains, even for highly educated individuals
(Table 5). It is only by around three years of postsecondary education that distance loses its
effect completely. Urban migrants with such high levels of education would not be deterred
by distance and, furthermore, the pull effect of destination wealth would be 50% higher
for them, compared to those who only completed primary education (the modal level of
education in our sample).

Poor rural migrants, on the other hand, are more deterred by distance than poor urban
migrants, and as their urban counterparts, they are also less inclined to respond to larger
wealth at destination. As with urban migrants, more education reduces the deterring effect


                                              13
of distance for rural migrants (albeit at a lesser gradient) and increases the appeal of wealth
at destination (this time at a steeper gradient). There is also a small negative interaction
effect between age and wealth.

A Shapley decomposition for the regressions in columns 1 of Table 4 shows that distance
accounts for 98% of explained variation of destination choice in the full sample. A similar
exercise for column 1 of Table 5 shows that distance accounts for 75% of the explained
variation once we restrict the analysis to dyads formed by urban migrants choosing between
urban locations. We should bear in mind that the R-squares are quite low in all regressions
(between 4 and 17 percent). This is in large part due to the fact that we are considering a
large number of possible destinations for each person (47 urban and 31 rural destinations).
But it also a reminder that distance does not tell the whole story. Despite the low R-square,
it is obvious from the F-test for joint significance of all regressors reported in the table that,
taken together, wealth and distance are important determinants of destination choice.



4.2     Distance, towns and cities

For all three dyadic samples of Table 4, Table 5 and Table 6 the regressions highlight the
dominance of distance over wealth as a correlate of destination choice, both as a share of
explained variation and when comparing the size of standardized regression coefficients. In
this subsection we ask to what extent distance and wealth differences between cities and
towns explain the observed preference for towns as a destination choice in our study setting,
and how that varies across types of households.6

Our baseline villages lie, on average, 3.06 logged kilometers from the closest town and 5.64
from the closest city, so the difference is 2.58. Plugging that difference into the regression
equation estimated in column 1 of Table 5 (urban migrants only) shows, ceteris paribus,
an 8.5 (=0.033*2.58) percentage point higher likelihood of choosing towns because they
are closer. Only 2.6 percentage points of this advantage of towns over cities gets undone
   6
    As pointed out in subsection 2.2, Kagera is a relatively remote region located far from the nearest cities
(Figure 1). The choice between city or town would play out differently, obviously, for a migrant living in
close proximity to a big city.



                                                     14
through higher destination wealth in cities, which we know, from Table 3, is 0.69 in favor
of cities (0.038*0.69=2.6). The first panel (‘ALL’) in Figure 4 visualizes this horse race
between distance and wealth for urban migrants. We can also conduct this exercise using
all migrant-destination pairs in Table 4. Here towns have a 6.5 percentage point higher
likelihood of being chosen (0.025*2.58) and the wealth effect does little in favor of cities
reducing that number by only 0.4 percentage points (0.006*0.69). Clearly, the pull effect of
higher wealth in cities is much lower than the discouragement effect of the corresponding
increase in distance. We conjecture that that is the reason many people prefer towns over
cities.

Who does end up going to the city? Our heterogeneity analysis in Table 5 shows how
wealth at destination has a larger pull effect for the non-poor and the educated and how the
negative effect of distance on destination choice is attenuated by schooling, but exacerbated
by poverty. Our simulations depicted in Figure 4 show that the distance effect becomes very
attenuated for high levels of education. The deterrent effect of distance is over five times
stronger for those without education compared to those with higher secondary education.
In fact only for this latter group is the distance effect is so attenuated that the wealth effect
(which increases with education, although much more modestly) becomes stronger and cities
become more attractive than towns. As the regression results do not simultaneously interact
with other individual characteristics, we do not conclude from this analysis that raising
education levels would increase migration to cities. Rather the results indicate that there
exists a type of respondent for whom the trade-off between distance and wealth no longer
favors towns as a destination choice. Such differential frictions would lead to the kind of
sorting documented by Hamory et al. (2020) and Young (2013). This may result from
education itself as well as other characteristics of the household where the more educated
person grew up, such as wealth levels, access to networks, as well as mindset which may all
help overcome migration costs.




                                              15
5     Concluding remarks

This paper starts from the observation that in our study area of Kagera, Tanzania, much
larger shares of the rural population tend to migrate to towns than to cities, despite greater
welfare gains from moving to the latter than from moving to the former. With many of the
rural migrants originally also poor, town migration contributed more to poverty reduction
than city migration. To shed light on this seemingly suboptimal destination choice of rural-
urban migrants, we draw on an extension of the Todaro model which explicitly considers
different destinations (towns and cities) as well as spatial friction/migration costs to motivate
an empirical specification to examine the drivers of this equilibrium.

In particular, using a dyadic regression structure applied to our two decade long panel of
individuals from Kagera, augmented with the wealth to be expected at the different pos-
sible destinations from the census, we estimate the differential effect on destination choice
of migration costs and expected wealth gain at destination controlling for individual char-
acteristics (i.e. selection) through individual fixed effects. Migration costs were proxied by
distance to the destination, following a longstanding (though recently somewhat ignored)
tradition in the migration literature (Lucas, 2001). Acknowledging the well-documented
importance of selection in migration, individual heterogeneity in the anticipated deterrent
effect of distance and the expected pull effects of expected wealth at destination were further
explored through interaction with the migrants’ poverty status, education level, and age be-
fore migrating. Simulations were finally conducted to explore the empirical strengths of the
different factors in explaining the migration flows.

The findings underscore the continuing relevance of distance for destination choice (rural
and urban); in our sample, expected wealth at destination would have to increase by 5.7
standard deviations on average to offset a 1 standard deviation increase in (log) distance. In
this, it is consistent with Ravenstein’s first law of migration, put forward in 1885, stating that
most migrants move only a short distance. When confined to urban destinations only, the
deterrent effect of distance remains, but the attraction of wealth increases, with the strength
of these effects in turn affected by the migrant’s poverty (reduced attraction of wealth) and
educational status (reduced deterrence of distance, increased attraction of wealth). With the


                                               16
closest cities on average much further away for most of the rural population than the closest
towns and rural populations often poorer and less educated (Beegle and Christiaensen 2019),
the finding that more rural-urban migrants end up in towns than in cities, contributing in
the aggregate more to poverty reduction than city migration, does not surprise in light of the
regression findings. The importance of distance in understanding the decomposition results
is also confirmed by the simulations.

Broadly interpreted, these findings could be seen as supporting the New Urban Agenda
(United Nations, 2017), which calls for balanced territorial development policies and plans
that strengthen the role of small and intermediate cities and towns, especially since a large
share of the global rural population gravitates around small and intermediate urban centers.
Globally, 28% of the rural population lives within 1 hour travel time from a city of 1 million
or more; 27% within one hour travel time from an intermediate city (250,000-1 million)
and 27% within one hour traveling from a town (250,000 or less) (Cattaneo, Nelson, and
McMenomy, 2021). In low-income countries, the share of the rural population living within
one hour of a town rises to 43%, with another 20% living within 2 hours; 13% live within one
hour from an intermediate city and 7% within one hour from a city (>1 million). Among
the extreme poor, 80 percent are rural (82% in Sub-Saharan Africa) (Beegle et al. 2019).
African cities thus face a double burden to act as engines of poverty reduction; they need
to become economically dense—not merely crowded, to generate the virtuous agglomeration
economies associated with cities in the developed world (Lall, Henderson, and Venables,
2017), and they are also far from where the poor live.

At a minimum, the findings indicate that the role of distance in migration decisions, high-
lighted in the earlier work on migration, continues to be important empirically, especially in
more remote areas, and that it may have been neglected prematurely more generally. The
paper did not try to uncover why distance plays such an important role. The aforementioned
in-depth life histories with 75 respondents from the KHDS survey (Ingelaere et al. 2018)
point among other things to transport costs and the liquidity constraints to overcome this,
the socio-cultural affinity with the destination, and the ability to maintain ties with the
home community and return home when things go wrong (safety net). Networks can reduce
the deterrence of distance (especially over long distances and including deterrence related
to socio-cultural affinity) (Lucas, 2001; 2015). It explains their importance in determining

                                             17
migration and destination. But their emergence often also depends on distance in the first
place. Some distance-related aspects, such as transport costs and information, may be more
amenable to policy, and can reduce friction even over short distance, as illustrated by the ef-
fect of cash transfers on seasonal migration in Bangladesh (Bryan, Chowdhury and Mobarak,
2014). Others may take more time (such as the development of social safety nets). Together
it would suggest that the effect of distance on destination choice is likely to continue for
some time.

Finally, two potential spill-overs following the interplay between distance, towns and mi-
gration may further accelerate the structural transformation and deserve further attention.
First, many new towns are arising in Tanzania, as they are across much of the African con-
tinent. OECD/SWAC (2020) combines demographic data with satellite and aerial imagery
to document a total of 7,617 urban agglomerations of at least 10,000 people in Africa in
2015. In 1950 that number stood at just 624. Growth in the number of urban agglomera-
tions has been especially strong in recent years. From 2000 to 2015, 2,475 new urban areas
came into existence through in-situ urbanization. With every new small town (a subset
of) rural dwellers will see the distance between their home village and the nearest urban
agglomeration shrink. Given the primacy of distance as a determinant of destination choice
documented here, we would expect frictions to rural-urban migration to come down through
the rise of small towns. Such rural-urban migration is an important component of structural
transformation, so the rise of small towns is not only a consequence of this process, but could
actually speed it up. Relatedly, there are good reasons to believe other frictions, for example
related to the flow of goods, services and information between urban and rural areas will
also come down with the emergence of small towns. While important studies like Dorosh
and Thurlow (2013, 2014) and Gibson et al. (2017) have studied how urban growth in large
versus small towns affects rural poverty reduction, we still know very little about the exact
mechanisms through which this happens.

Second, given their proximity, towns can be particularly important in facilitating the first
move, which is often the hardest, thereby potentially instigating a virtuous cycle of physical
and economic mobility. The results from the Kagera life histories highlight the difficultly
and importance of the first move, an oft ignored phenomenon. It is difficult because it is
often a step in the dark that prospective migrants need to undertake with little preparation

                                              18
and before all the right elements (finances, skills, networks and the like) are in place to
ensure that the move will be successful. It often comes with many risks and hardships. It is
important because it often sets in motion a virtuous circle of physical and economic mobility.
In this sense we should not just judge the value of towns as points of final destinations, but
also as enablers of a first move that builds networks, financial resources and human capital
potentially (but not automatically) leading to further migration - and in particular, further
migration that would not have been possible without the intermediary step. With towns
being more easily accessible, rural dwellers will become more physically mobile, which will
open opportunities to make life-improving choices. The importance of transit migration has
been highlighted by Artuc and Ozden (2018) in the context of international migration, but
it remains under-documented in low-income contexts as in Africa.




                                             19
References
Abadie, Alberto, Susan Athey, Guido Imbens and Jeffrey Wooldridge. 2017. When Should
  You Adjust Standard Errors for Clustering? NBER Working Paper No. 24003.
Artuc, Erhan and Caglar Ozden. 2018. Transit Migration: All Roads Lead to America.
  Economic Journal 128(July):F306–F334.
Bao, Shuming, Orn B. Bovarsson, Jack, W., Hou, and Yaohui Zhao. 2007. Interprovincial
  Migration in China: The Effects of Investment and Migrant Networks. IZA Discussion
  Paper 2924.
Beegle, K., J., De Weerdt, and S., Dercon, 2011, Migration and Economic Mobility in
  Tanzania: Evidence from a Tracking Survey, Review of Economics and Statistics, 93-3:
  1010-1033.
Beegle, Kathleen, Luc Christiaensen. 2019. Accelerating Poverty Reduction in Africa. Wash-
  ington, DC: World Bank. https://openknowledge.worldbank.org/handle/10986/32354
Bryan, G., Chowdhury S. and Mobarak A. 2014. Underinvestment in a Profitable Technol-
  ogy: The Case of Seasonal Migration in Bangladesh, Econometrica 82(5): 1671–1748.
Caeyers, Bet, Joachim De Weerdt and Neil Chalmers. 2012. Improving Consumption Mea-
  surement and Other Survey Data Through CAPI: Evidence from a Randomized Experi-
  ment. Journal of Development Economics 98:19–33.
Cattaneo, Andrea, Andrew, Nelson, and Theresa, McMenomy. 2021. Global Mapping of
  Urban-Rural Catchment Areas Reveals Unequal Access to Services. PNAS 118(2): 1-8.
Christiaensen, L., J. De Weerdt, and Y. Todo, 2013. Urbanization and Poverty Reduction:
  the Role of Rural Diversification and Secondary Towns,” Agricultural Economics, 44(4-
  5):435-447.
Christiaensen, Luc, Joachim De Weerdt and Ravi Kanbur. 2019. Decomposing the con-
  tribution of migration to poverty reduction: methodology and application to Tanzania.
  Applied Economics Letters 26(12):978-982. DOI: 10.1080/13504851.2018.1527436
De Weerdt, Joachim. 2010. Moving out of Poverty in Tanzania: Evidence from Kagera.
  Journal of Development Studies 46(2): 331-349.
De Weerdt, Joachim and Kalle Hirvonen. 2016. Risk Sharing and Internal Migration.
  Economic Development and Cultural Change 65(1):63-86.
Dorosh Paul and James Thurlow. 2013. Agriculture and small towns in Africa. Agricultural
  Economics 44(2013):449–459.
Dorosh, Paul, and James, Thurlow, 2014. Can Cities or Towns Drive African Development?
  Economy-wide Analysis for Ethiopia and Uganda. World Development 63:11.
Dustmann, C., and A. Okatenko. 2014. “Out-migration, Wealth Constraints, and the
  Quality of Local Amenities.” Journal of Development Economics 110: 52–63.
Fafchamps, Marcel and Forhad Shilpi. 2013. Determinants of the Choice of Migration

                                           20
  Destination. Oxford Bulletin of Economics and Statistics, 75(3): 388-409.
    e, C´
Ferr´   eline, Francisco, H.G., Ferreira, and Peter Lanjouw. 2012. Is There a Metropolitan
   Bias? The Relationship between Poverty and City Size in a Selection of Developing
   Countries. World Bank Economic Review 26-3: 351-382.
Gibson, John, Gaurav Datt, Rinku Murgai and Martin Ravallion. 2017. For India’s Rural
  Poor, Growing Towns Matter More Than Growing Cities. World Development 98(2017):413-
  429.
Giles, John and Kyeongwon Yoo. 2007. Precautionary behavior, migrant networks, and
   household consumption decisions: An empirical analysis using household panel data from
   rural China, Review of Economics and Statistics 89(3):534-551.
Gollin, Douglas, Martina Kirchberger and David Lagakos. 2021. In search of a spatial
  equilibrium in the developing world. Journal of Urban Economics 121, 103301.
Hamory, Joan, Marieke Kleemans, Nicholas Li and Edward Miguel. 2020. Reevaluating
  agricultural productivity gaps with longitudinal microdata. Journal of the European
  Economic Association, jvaa043.
Ingelaere Bert, Luc Christiaensen, Joachim De Weerdt and Ravi Kanbur. 2018. Why Sec-
   ondary Towns Can Be Important for Poverty Reduction – A Migrant Perspective. World
   Development 105:273-282.
Kanbur, Ravi, Luc Christiaensen and Joachim De Weerdt. 2019. Where to Create Jobs to
  Reduce Poverty: Cities or Towns? Journal of Economic Inequality 17(4):543-564.
Kinnan, Cynthia, Shing-Yi Wang and Yongxiang Wang. 2018. Access to Migration for Rural
  Households. American Economic Journal: Applied Economics 10(4):79-119.
Lagakos, David, Samuel Marshall, Ahmed Mushfiq Mobarak, Corey Vernot, and Michael E.
  Waugh. 2020. Migration costs and observational returns to migration in the developing
  world. Journal of Monetary Economics 113: 138-154.
Lanjouw, P. and Marra M. 2018. Urban poverty across the spectrum of Vietnam’s towns
  and cities. World Development 110(2018):295-306.
Lucas, Robert. 2001. The Effects of Proximity and Transportation on Developing Country
  Population Migrations. Journal of Economic Geography 1(3): 323-339.
Lucas, Robert. 2015. Internal Migration in Developing Economies: An Overview. KNO-
  MAD Working Paper 6: Washington D.C.
McKenzie, David, John Gibson and Steven Stillman. 2010. How Important Is Selection? Ex-
  perimental vs. Non-Experimental Measures of the Income Gains from Migration. Journal
  of the European Economic Association 8(4):913–945.
Minnesota Population Center. 2014. Integrated Public Use Microdata Series, International:
  Version 6.3 [Machine-readable database]. Minneapolis: University of Minnesota.
Morten, Melanie. 2019. Temporary Migration and Endogenous Risk Sharing in Village


                                           21
  India. Journal of Political Economy. Forthcoming doi10.1086/700763.
Munshi, Kaivan. 2003. Networks in the Modern Economy: Mexican Migrants in the U. S.
  Labor Market. The Quarterly Journal of Economics, 118(2), 549-599.
OECD/SWAC (2020), Africa’s Urbanisation Dynamics 2020: Africapolis, Mapping a New
  Urban Geography, West African Studies, OECD Publishing, Paris.
Ravenstein, E.G.. 1885. The Laws of Migration. Journal of the Royal Statistical Society 48:
  167-235.
Stewart, John. 1941. An inverse distance variation for certain social influences. Science 93:
   89-90.
Stuart, Bryan and Evan Taylor. 2019. Migration Networks and Location Decisions: Evidence
   from U.S. Mass Migration. IZA Discussion Paper No. 12709.
Todaro, Michael. 1969. A Model of Labor Migration and Urban Unemployment in Less
  Developed Countries. American Economic Review 59(1):138-148.
United Nations. 2017. New Urban Agenda. Habitat III. https://uploads.habitat3.org/hb3/NUA-
  English-With-Index-1.pdf.
Wenban-Smith, H. and Ambroz, A. 2014. Urbanisation in Tanzania. IGC Working Paper
  E-40104-TZA-1/2/3, International Growth Centre.
Young, Alwyn. 2013. Inequality, the Urban-Rural Gap and Migration. Quarterly Journal
  of Economics 128(4):1727-1785.




                                            22
Tables and Figures




                     23
     Table 1: KHDS response rates

                     2004           2010
Interviewed     4430 (70%)     4339 (68%)
Deceased         961 (15%)     1275 (20%)
Untraced         962 (15%)      739 (12%)
TOTAL         6353 (100%)    6353 (100%)




                    24
                            Figure 1: Tanzania and Kagera




This map shows the location of baseline KHDS enumeration areas, as well as the spatial
distribution of urban agglomerations in Tanzania. Agglomeration and population data are
from Africapolis (OECD/SWAC, 2020).

                                          25
         Table 2: Decomposing growth and poverty reduction by 2010 location

                                      Growth (consumption per capita)
                                                                        Share in
                           1991-94                        Change in     total
2010 Location   N          average       2010 average     average       growth
Rural           1,086      347,433       573,281          225,848       0.29
Town            702        390,934       906,228          515,293       0.43
City            285        400,836       1,229,495        828,659       0.28
TOTAL           2,073      369,617       776,247          406,630       1.00

                                              Poverty headcount
                                                                        Share in
                                                                        total net
                                                          Change in     poverty
2010 Location   N          1991-94       2010             headcount     reduction
Rural           1,086      0.56          0.35             -0.21         0.40
Town            702        0.45          0.14             -0.31         0.38
City            285        0.45          0.02             -0.42         0.21
TOTAL           2,073      0.50          0.23             -0.27         1.00




                                         26
Figure 2: Comparison of baseline and endline consumption per capita, by 2010 location
(KHDS)
                                    CDF of baseline KHDS cons pc, by future location                                                                                   CDF of endline cons pc, by current location
                           1                                                                                                                            1


                           .8                                                                                                                           .8
  Cumulative Probability




                                                                                                                               Cumulative Probability
                           .6                                                                                                                           .6


                           .4                                                                                                                           .4


                           .2                                                                                                                           .2


                           0                                                                                                                            0
                                0                      200000                  400000                             600000                                     0                      500000                 1000000                             1500000
                                            annualized HH total consumption per capita in 2010 TZS                                                                       annualized HH total consumption per capita in 2010 TZS

                                                                            c.d.f. of village   c.d.f. of town                                                                         c.d.f. of village                   c.d.f. of town
                                                                            c.d.f. of city                                                                                             c.d.f. of city
                                The 10% highest cons pc values were dropped for a better rendition of the graph                                              The 10% highest cons pc values were dropped for a better rendition of the graph



                                                                              Baseline CDF                                                                                                Endline CDF




                                Figure 3: CDF of census-based wealth index, by location (population-weighted)
                                                                       1



                                                                       .8
                                              Cumulative Probability




                                                                                                                                                                                                   Rural
                                                                       .6


                                                                                                                                                                                                   Town
                                                                       .4

                                                                                                                                                                                                   City
                                                                       .2



                                                                       0
                                                                            -2                   0                         2                                                      4
                                                                                                           Wealth index




                                                                                                                   27
                                        Table 3: Census location means

                                                                           Rural      Town      City
                     At least corrugated iron sheet roof                     0.33     0.84       0.99
                     At least baked brick walls                             0.18       0.52     0.88
                     At least tile or cement floor                           0.11       0.62     0.88
                     Electric lighting                                       0.01     0.29       0.45
                     Piped water                                             0.21      0.68     0.77
                     Flush Toilet                                           0.00       0.11     0.16
                     No. bedrooms                                            2.29     2.20       2.08
                     No. bedrooms per person                                0.71       0.71     0.72
                     GoodElec                                                0.01     0.27       0.42
                     Owns Radio                                              0.45     0.67       0.75
                     Owns Phone                                              0.01      0.08     0.20
                     Owns Iron                                               0.03      0.07      0.09
                     No. members age 19 or younger                           2.68      2.14     1.83
                     No. members age 20-60                                   1.82     1.86       2.11
                     No. members age 61 or older                             0.26     0.15       0.10
                     Household size                                          4.76      4.15      4.04
                     Dep. ratio (as share of 20-60 year olds)                1.71      1.27     0.94
                     Maximum years of formal education in HH                 6.70     8.01       8.64
                     HH experienced death past 12 months                    0.06       0.05     0.05
                     Age died (if deceased past 12 months)                  23.20     27.81     27.13
                     At least one member non-seasonal employee               0.05     0.24      0.38
                     At least one member works in non-ag sector              0.09      0.54      0.74
                     Wealth index                                           -0.46      0.79     1.48
                     Wealth index at 10th percentile                        -0.99     -0.65      0.33
                     Wealth index at 20th percentile                        -0.99     -0.23      0.75
                     Wealth index at 80th percentile                        -0.09      1.80      2.34
                     Wealth index at 90th percentile                         0.29      2.65     2.73
                     Amenities index                                        -0.37      0.73     1.18
                     Asset index                                            -0.44     0.69       1.38
 Notes: Averages are constructed using the census long form survey weights. Only Dar es Salaam and Mwanza are considered
cities. The constructed wealth index is the first principal component using the following variables: number of bedrooms per
person, whether the dwelling has an iron sheet roof (or better), walls from baked bricks (or better), a tile or cement floor,
electric lighting, piped water, a flush toilet and whether it owns a radio, owns a phone and owns an iron.




                                                            28
                     Table 4: Destination choice of migrants - dyadic regressions


                                               (1)            (2)            (3)             (4)            (5)             (6)
Distance to destination (ln km)            -0.025***      -0.023***      -0.039***       -0.023***       -0.027***      -0.035***
                                           (-21.176)      (-18.077)      (-31.178)       (-14.512)       (-17.531)      (-13.328)
   (-”-) * (poor HH)                                       -0.003*
                                                           (-1.825)
   (-”-) * (yrs schooling)                                                0.002***
                                                                          (12.043)
   (-”-) * (age in years)                                                                 -0.0001
                                                                                          (-1.589)
   (-”-) * (urban destination)                                                                                           0.019***
                                                                                                                          (5.527)
Wealth index                                0.006***       0.009***      -0.006***       0.009***                        0.022***
                                             (6.062)        (8.244)       (-5.880)        (6.031)                         (4.390)
   (-”-) * (poor HH)                                      -0.006***
                                                           (-4.642)
   (-”-) * (yrs schooling)                                                0.002***
                                                                          (12.496)
   (-”-) * (age in years)                                                               -0.0001***
                                                                                          (-2.821)
   (-”-) * (urban destination)                                                                                            -0.003
                                                                                                                         (-0.535)
Asset index                                                                                                -0.003
                                                                                                         (-0.856)
Amenities index                                                                                            -0.002
                                                                                                         (-0.486)
(Asset index)*(Amenities index)                                                                          0.020***
                                                                                                          (7.231)
Urban destination                                                                                                       -0.141***
                                                                                                                         (-6.455)
R-square                                     .0443          .0449          .0497           .0444            .049          .0561
F                                             827            427            568             414              590           433
p-value F                                    0.000          0.000          0.000           0.000           0.000          0.000
N                                           156,936        155,688        156,936         156,936         156,936        156,936
Notes: LPM estimates of Equation 1 with standard errors clustered by the 51 origin enumeration areas. Regression coefficients
with t-statistics in parentheses. The dyads are formed by linking all 2,012 KHDS migrants to all possible 78 destinations.
Destination wealth is the average household wealth index at destination calculated from the census. The wealth index is the
first principal component using the following variables: number of bedrooms per person, whether the dwelling has an iron sheet
roof (or better), walls from baked bricks (or better), a tile or cement floor, electric lighting, piped water, a flush toilet and
whether it owns a radio, owns a phone and owns an iron. The amenities index is constructed in a similar fashion using a subset
of these variables (electric lighting, piped water and flush toilet) and the asset index is constructed using all the other variables.
Distance is the natural logarithm of distance in kilometers between the baseline location and the potential destination, as the
crow flies.




                                                               29
     Table 5: Destination choice of urban migrants - interactions

                                            (1)           (2)            (3)            (4)
Distance to destination (ln km)         -0.033***      -0.035***     -0.063***      -0.031***
                                        (-12.249)      (-10.987)     (-15.508)       (-5.657)
   (-”-) * (poor HH)                                     0.003
                                                        (0.786)
   (-”-) * (yrs schooling)                                            0.004***
                                                                       (8.516)
   (-”-) * (age in years)                                                             -0.000
                                                                                     (-0.327)
Wealth index                             0.038***      0.042***       0.024***       0.040***
                                         (11.866)      (12.680)        (3.124)       (10.626)
   (-”-) * (poor HH)                                   -0.008**
                                                       (-2.274)
   (-”-) * (yrs schooling)                                            0.002***
                                                                       (2.787)
   (-”-) * (age in years)                                                             -0.000
                                                                                     (-0.383)
R-square                                  .0485            .049         .0547         .0485
F                                          240             130           294           133
p-value F                                 0.000           0.000         0.000         0.000
N                                         45,449         45,120        45,449         45,449
Notes: LPM estimates of Equation 1 with standard errors clustered by the 51 origin enumer-
ation areas. The dyads in the first and fourth column are all migrants linked to all possible
destinations. Regression coefficients with t-statistics in parentheses. The dyads in the second
column are all urban migrants linked to all potential urban destinations. The dyads in the third
column are all rural migrants linked to all potential rural destinations. Destination wealth is
the average household wealth index at destination calculated from the census. The wealth index
is the first principal component using the following variables: number of bedrooms per person,
whether the dwelling has an iron sheet roof (or better), walls from baked bricks (or better), a
tile or cement floor, electric lighting, piped water, a flush toilet and whether it owns a radio,
owns a phone and owns an iron. The amenities index is constructed in a similar fashion using
a subset of these variables (electric lighting, piped water and flush toilet) and the asset index
is constructed using all the other variables. Distance is the natural logarithm of distance in
kilometers between the baseline location and the potential destination. Poverty information
comes from the baseline household in which i was residing in 1991-94. Education is measured
as years of formal education at baseline.




                                              30
      Table 6: Destination choice of rural migrants - interactions

                                             (1)            (2)           (3)            (4)
Distance to destination (ln km)          -0.071***      -0.066***      -0.083***      -0.071***
                                         (-29.180)      (-26.434)      (-22.623)      (-23.494)
   (-”-) * (poor HH)                                     -0.007**
                                                         (-2.181)
   (-”-) * (yrs schooling)                                             0.002***
                                                                        (5.180)
   (-”-) * (age in years)                                                               0.000
                                                                                       (0.146)
Wealth index                              0.049***      0.065***        0.017*        0.063***
                                           (6.186)       (6.401)        (1.961)        (6.282)
   (-”-) * (poor HH)                                    -0.026**
                                                        (-2.214)
   (-”-) * (yrs schooling)                                             0.006***
                                                                        (4.612)
   (-”-) * (age in years)                                                              -0.000**
                                                                                       (-2.203)
R-square                                    .172            .172           .174          .172
F                                            740            380             390           376
p-value F                                  0.000           0.000          0.000          0.000
N                                          32,395         32,116         32,395         32,395
Notes: LPM estimates of Equation 1 with standard errors clustered by the 51 origin enumer-
ation areas. Regression coefficients with t-statistics in parentheses. The dyads are all rural
migrants linked to all potential rural destinations. Destination wealth is the average household
wealth index at destination calculated from the census. The wealth index is the first principal
component using the following variables: number of bedrooms per person, whether the dwelling
has an iron sheet roof (or better), walls from baked bricks (or better), a tile or cement floor,
electric lighting, piped water, a flush toilet and whether it owns a radio, owns a phone and owns
an iron. The amenities index is constructed in a similar fashion using a subset of these variables
(electric lighting, piped water and flush toilet) and the asset index is constructed using all the
other variables. Distance is the natural logarithm of distance in kilometers between the baseline
location and the potential destination. Poverty information comes from the baseline household
in which i was residing in 1991-94. Education is measured as years of formal education at
baseline.




                                               31
                     Figure 4: Likelihood of choosing towns over cities




This figure shows how much more likely it is for KHDS migrants to choose a town over a
city as an urban destination because of differences in distance to and wealth at the urban
destination. The y-axis measures this as a predicted likelihood by plugging into the regres-
sions estimates of Table 5 (i) the difference between average distance to the nearest town
and average distance to the nearest city, and (ii) the difference between average town wealth
and average city wealth. The solid bars add these two effects together. The figure shows
that towns are attractive because they are closer and cities because they are wealthier. The
distance effect dominates the wealth, effect explaining the preference for towns, for all except
the most highly educated.




                                             32