Rural Population Largely Absent in Global Estimates

Aalto University

Global population datasets, crucial for decision-making by governments and institutions, may underestimate rural populations by as much as 53% to 84%, reveals an Aalto University study.

Governments, international bodies and researchers rely on global population data for resource allocation and infrastructure planning to disease epidemiology and disaster risk management. In a new study published in Nature Communications, researchers from Aalto University in Finland show the profound and systematic extent to which these datasets underestimate rural population figures worldwide.

'For the first time, our study provides evidence that a significant proportion of the rural population may be missing from global population datasets,' says postdoctoral researcher Josias Láng-Ritter from Aalto University.

'We were surprised to find that the actual population living in rural areas is much higher than the global population data indicates — depending on the dataset, rural populations have been underestimated by between 53% to 84% over the period studied. The results are remarkable, as these datasets have been used in thousands of studies and extensively support decision-making, yet their accuracy has not been systematically evaluated,' he says.

Population censuses are root cause for biases

Due to a lack of comprehensive reference data, previous attempts to gauge the accuracy of global population datasets have tended to focus on individual countries or urban areas.

In their study, the Aalto University researchers compared the five most widely used global population datasets*— maps that divide the planet into evenly spaced, high-resolution grid cells with population counts based on census data — with resettlement numbers from over 300 rural dam projects in 35 countries.

The study focused on population maps for the period 1975-2010, due to a lack of dam data from later years. Findings showed 2010 datasets had the least bias, missing between one-third (32%) to three-quarters (77%) of the rural population. Although global population datasets from subsequent years (2015 and 2020) have been taken into use, the team says there is strong reason to believe the latest data still misses part of the global population.

'While our study shows accuracy has somewhat improved over decades, the trend is clear: global population datasets miss a significant portion of the rural population. With the same basic practices in place, it's unlikely that slightly improved input data could correct for this level of bias. And even if the most recent population maps reflected reality, earlier datasets have influenced decision-making for decades and are still used to monitor change, for instance providing a distorted picture of movement over time from the countryside to urban areas,' says Láng-Ritter.

Population censuses, the key input for the population datasets in use, are considered to be incomplete to varying degrees across regions. Not all nations have the resources for precise data collection and rural regions can be difficult to travel to, with populations spread thinly over vast areas.

Resettlement data from dam projects, in turn, provide multinational, independently gathered comparison points.

'When dams are built, large areas are flooded and people need to be relocated,' says Láng-Ritter. 'The relocated population is usually counted precisely because dam companies pay compensation to those affected. Unlike global population datasets, such local impact statements provide comprehensive, on-the-ground population counts that are not skewed by administrative boundaries. We then combined these with spatial information from satellite imagery."

Researchers found that the negative bias was systemic worldwide, with discrepancies particularly noticeable across datasets in China, Brazil, Australia, Poland and Colombia, where information was more readily available.

Population data critical for planning and development

With current estimates placing 43 percent of the world's 8.2 billion people in rural areas, the new findings have far-reaching consequences. Official population estimates published by the UN and the World Bank rely on the same national censuses that are the basis for the population maps.

According to Láng-Ritter, the findings suggest that the needs of people living in rural areas have been under-represented in decision-making globally compared to urban areas. Inadequate demographic data may have led to insufficient resources for health care or transport in rural regions, for example.

'In many countries, there may not be sufficient data available on a national level, so they rely on global population maps to support their decision-making: Do we need an asphalted road or a hospital? How much medicine is required in a particular area? How many people could be affected by natural disasters such as earthquakes or floods?' he says.

The necessity of the information may affect countries and regions differently, the team acknowledges, and even with inaccuracies the datasets hold crucial insight for policy and research.

'For example here in Finland, the population data is nowadays very reliable even in rural regions, as we were the second country in the world to start keeping digital population records already in 1990. But especially in crisis-hit countries, the shift towards digital population registers could take years, even decades,' says Láng-Ritter. 'To provide rural communities with equal access to services and other resources, we need to have a critical discussion about the past and future applications of these population maps.'

* The population datasets included in the study were WorldPop, GWP, GRUMP, LandScan, and GHS-POP.

/Public Release. This material from the originating organization/author(s) might be of the point-in-time nature, and edited for clarity, style and length. Mirage.News does not take institutional positions or sides, and all views, positions, and conclusions expressed herein are solely those of the author(s).View in full here.