Lecture 1-2
Introduction to
Computational
Urban
Science

Esteban Moro

Network Science Institute | Northeastern University

NETS 7983 Computational Urban Science

2025-02-07

Intro to Computational Social Science

In this lecture, we will present the main concepts of Computational Social Science (CUS) and its applications to Urban Science. Here is the summary

  • Introduction
  • Urban
    • Opportunities
    • Challenges
  • Computational
    • Sources of large-scale data
    • AI and Machine Learning for Urban Science
    • Advantages of large-scale data in Urban Science
    • Challenges of large-scale data
  • Science
    • Urban Science vs Computational Urban Science
    • “Soft” vs “hard” problems in CUS
    • Cities as complex systems

Introduction

What is Computational Urban Science?

  • Interdisciplinary, emerging field that combines:

Large-scale datasets + computational methods/models + in urban contexts

to explore the dynamics of cities and urban challenges.

  • A branch of Computational Social Science and related to Urban Science

  • Integrate many disciplines like sociology, geography, economics, urban planning, computer science, mathematics and many more.

  • Interested on the “why” rather than the “what” of urban phenomena.

[URBAN]

Why urban?

Urban areas are the epicenter of human activity. Living in cities has many advantages, but also many challenges.

People

  • 54% of the world’s population now lives in cities.
  • In North America or Europe, around 80% people in urban areas, compared to just 40% in 1900.

Resources

  • cities generate over 80% of the world’s GDP
  • NYC = 12\(^{th}\) largest economy.
  • 63% of all U.S. patents in just 20 metro areas

Growth of population in Urban areas

Why urban?

Resources

  • Higher productivity, higher wages.
  • Higher innovation, higher patents.

Social

  • Higher number of friends
  • Higher social capital

Distribution of social interconnectivity, Productivity and Innovation in small vs large cities. From [1]

Why urban?

But cities are also home to most critical problems in our society

  • Pollution: Urban areas are responsible for 70% of global CO2 emissions (transport + buildings)
  • Inequality: Even the most prosperous cities have high levels of inequality. In the US, roughly one in 6 urban residents live in proverty.
  • Crime: Urban areas have higher crime rates than rural areas. In the US, 80% of violent crimes occur in urban areas.
  • Traffic: Urban areas have higher traffic congestion. In the US, the average urban commuter spends 42 hours a year in traffic.

Carbon Footprint and population of different cities. From [2]

Why urban?

But cities are also home to most critical problems in our society

  • Housing: Urban areas have higher housing costs. In the US, the average urban resident spends 30% of their income on housing.
  • Health: Urban areas have higher rates of obesity and mental health problems and death rates by leading causes.

Death rates by leading causes in urban vs rural areas. From [3]

Why urban?

But cities are also home to most critical problems in our society

  • Pandemic Urban areas have higher rates of infectious diseases. In the US, 80% of COVID-19 cases occurred in urban areas. Death rates were also higher in urban areas.

Extracted from [4]

Why urban?

Cities are also interesting to study because these challenges are pervasive across nations, transcending cultural, economic and political boundaries.

For example, income inequality is rising in cities across the world, from the US to China to Brazil.

Extracted from [4]

Why urban?

Spatial (residential) segregation happens in all major cities.

Johannesburg, Southafrica from Unequalscenes.com

Santa Fe, Mexico. From Unequalscenes.com

Why urban?

  • This universality across time, geography and cultures makes cities a great laboratory to study human behavior and urban phenomena.

  • Solutions found in a particular city can be applied to other cities, making computationa urban science a powerful tool for policy making and urban planning.

  • By studying them we also gain a better understanding of most of our society.

[COMPUTATIONAL]

Why computational?

  • Understanding urban challenge requires good understanding of human behavior.

  • Traditional methods (surveys, interviews) to do that are:

    • Limited in scope and scale.
    • Infrequent and expensive.
    • Biased and subjective.
    • Not scalable.
  • More importantly the behavior extracted from traditional methods is based on two key ideas:

    • Residential anchoring: “you are were you live”, emphasizing static residential patterns.
    • Proximity-based accessibility: your behavior is described by the resources at short distance from a person’s residence.
  • Incredibly those two ideas are the basis for most of the urban planning and policy making.

Why computational?

  • However, the explosion of data from smartphones, sensors, and online platforms has transformed our ability to study human behavior and its intertwined dependence with urban areas beyond our residence.

  • This complemented with advances in machine learning, causal inference, computer vision, and in data science, has opened new opportunities to study urban phenomena.

  • This data is:

    • Continuous: it is collected in real-time.
    • Comprehensive: it covers a wide range of activities.
    • Objective: it is not subject to recall bias.
    • Scalable: it can be collected from millions of people.
    • Dynamic: it captures changes in behavior over time.
  • But it also has many challenges.

Sources of large-scale data

  • Mobile phone data: Call Detail Records (CDRs) provide information on the location and communication patterns of millions of people. Location Based Services (LBS) from apps provide more detailed information on mobility patterns through GPS geolocation.

  • Social media data: Twitter, Facebook, Instagram, and other platforms provide information on social interactions, opinions, and activities but can also geolocalize those activities.

  • Transaction data: Credit card transactions, e-commerce, and other financial data provide information on consumption patterns and economic activity within urban areas.

  • Sensors: IoT devices, cameras, and other sensors provide information on environmental conditions, traffic, and other urban phenomena.

  • Satellite data: Remote sensing data provides information on urban use, vegetation, and other environmental factors.

  • Open data: Data from government agencies, companies, and other organizations provide information on urban infrastructure, services, and other aspects of urban life.

Sources of large-scale data

Sources of large-scale data have different temporal and spatial resolutions, and different levels of detail and coverage

Feature Census Social Media Bank CDR GPS Sensors & Cameras
High Accuracy ✔️ ✔️ ✔️ ✔️
Availability ✔️ ✔️[?]
Pop. Coverage ✔️ ✔️ ✔️
Real-time ✔️ ✔️ ✔️ ✔️ ✔️
Cost ✔️ ✔️ ✔️ ✔️
Privacy ✔️
  • Because of that, different datasets are used to study different aspects of urban life.

Mobile phone data to understand human behavior

  • Mobile phone data is one of the most important sources of data for studying human behavior in urban areas. Different sources and at different temporal and spatial resolutions.

Sources of geolocalized activity from mobile phones

Mobile phone data to understand human behavior

Example, Location Base Services (LBS) data.

Transactional data to understand human behavior in cities

  • Transactional data (credit card transactions, transport cards) is another important source of data for studying human behavior in urban areas.

  • It provides information on consumption patterns, mobility patterns, and other economic activities.

  • Example: Urban lifestyles extracted from credit card transactions. From [5]

Transactional data to understand human behavior in cities

  • Example: recovery of urban consumption patterns after COVID-19 lockdowns in Mexico. By analyzing credit card transactions, we can see how consumption patterns changed during the lockdown and how they recovered after the lockdown was lifted.

Urban consumption patterns before, during, and after COVID-19 lockdowns in Mexico. From Moro et al

Social media data to understand human behavior in cities

  • Social media data is/was another way to understand human behavior in urban areas.

  • Apart from geolocalization, it also provider information about social interactions, opinions, and activities.

  • However, since 2015 and for privacy reasons, many social media platforms started to deprecate the use of geolocation of activities in their APIs.

  • Still some social media platforms provide some geographical information at aggregated (zip code) level.

Social media data to understand human behavior in cities

Example: using geographical mobility, activity, and content analysis from Twitter to understand patterns of unemployment in Spain [6]

Geographical Mobility in Spain using tweets. From [6]

Imagery datasets to understand urban dynamics

Advances in satellite technology and aerial and street imaging have provided new opportunities to study urban areas.

For example, Low-Earth orbit (LEO) constellations capture detailed (up to 1-meter resolution) and frequent (daily/hourly intervals) urban environments.

Low-Earth Orbit satellites visualization from LeoLabs

Super-resolution (15cm) imagery from the WorldView-3 Maxar mission

Imagery datasets to understand urban dynamics

Most of the satellite high-resolution imagery is commercial, but some of them are availble for scientific use like

European’s Union Sentinel Browser

Imagery datasets to understand urban dynamics

Example: Wealth estimation from satellite imagery in African cities. From [7]

Wealth estimation from satellite imagery in Nigeria cities. From [7]

Imagery datasets to understand urban dynamics

Example: change in industrial and economic areas, nights at light to estimate GDP. Offical GDP vs estimated.

Change in industrial and economic areas, nights at light to estimate GPD. From The Economist

Imagery datasets to understand urban dynamics

Advances in computer vision and machine learning have allowed the automatic extraction of information from street imagery. That allows tranforming street imagery or video into structured data about urban areas and period’s behavior.

AI Visual recognition of a street image

Imagery datasets to understand urban dynamics

For example, large-scale datasets from Google Street View have been used to estimate hidden neighborhood characteristics like socioeconomic status, crime rates, or even political preferences [8] [9]

Detecting cars to detect political preferences. From [8]

Feature extraction from Google Street view. From [9]

Sensor datasets

Finally, the rise of sensors and the “smart city” revolution has provided new sources of continuous data streams about city operations and resident activities. Embedded in infrastructure like traffic lights, public transportation systems, waste management facilities, and utility grids, these sensors provide real-time information about urban phenomena.

For example we have:

  • Sensors about Air Quality in our cities, like official ones from the EPA or community science efforts like PurpleAir in the US.

  • Noise measurements like the Noise-Map project.

  • Traffic sensors like the INRIX project.

Air Quality Data from Purpleair

Open datasets

Apart from the Census, there are many open datasets that can be used to study urban phenomena. In particular, many city authorities, and government agencies provide open data about urban infrastructure, services, and other aspects of urban life.

For example, the Open Data Portal of New York City or the Analyze Boston Portal provide data on a wide range of topics, from crime rates to building permits to restaurant inspections.

Canopy Change Assesment Project in the Boston Open data Portal

AI and Machine Learning for Urban Science

At the same time large-datasets become available advances in AI and machine learning have revolutionized the way we analyze and extract information from those datasets.

  • Traditional methods fall short to handle the scale, variety, and dynamic of large-scale datasets. We need advance methods to extract information from them.

  • For example

    • Deep-learning computer vision can easily classify and extract information from street imagery [9].
    • Modern causal inference methods like propensity matching, synthetic control can help find relationships between urban interventions and outcomes [10].
    • Network science can help us understand the structure and dynamics of urban dependency networks in urban areas [11].
    • Advance machine learning can help us predict urban phenomena like pandemic breakouts [12].

AI and Machine Learning for Urban Science

  • Spatial or geographical information is “different”. Its structure is different than tabular data or imagery datasets. In general spatial data is characterized by:

    • Spatial autocorrelation: nearby locations are more similar than distant locations.
    • Spatial heterogeneity: different locations have different characteristics.
    • Spatial dependence: the value of a location depends on the values of nearby locations.
  • Thus, algorithms need to be adapted to handle that data structure, like for example spatial regressions, spatial clustering, spatial machine learning, spatial causal inference or spatial networks. Failure to use those methods can lead to biased results.

Spatial Network of frienships between cities

Advantages of large-scale data in Urban Science

Why large-scale data has revolutionize Urban Science?

  1. Granularity
  2. Addressing the causal question
  3. Detecting small effects
  4. Uncovering complex urban interdependencies

Advantages of large-scale data in Urban Science

  1. Granularity: Large-scale data provides more detailed temporal and spatial information about individual behavior and urban phenomena. For example analyzing mobile phone data during the COVID-19 pandemic allowed us to track the spread of the virus and the effectiveness of interventions.

Average distance travelled by residents in NYC before and after social distance measures. From mobile phone mobility data

Advantages of large-scale data in Urban Science

  1. Addressing the causal question: Large-scale data is the appropriate data for causal inference because:
  • Causal inference requires more statistical power and complicated analysis designs than descriptive methods
  • It might contain a wide range of natural experiments or individual experiments that can be used to identify causal relationships.
  • Large-scale data is more likely to contain the necessary variation to identify different groups of people or areas to help in the design of quasi-experiments using conterfactual machine learning techniques like Synthetic Control or propensity matching.

Advantages of large-scale data in Urban Science

  1. Addressing the causal question:
  • Example: causal effect of food environments on the consumption of fast food using individual experiments –change of work context– within large-scale datasets. From [13]

Change in visits to Fast Food Outlets after changing workplaces and their food environments

Advantages of large-scale data in Urban Science

  1. Detecting small effects:
  • Surveys often lack the statistical power to detect small effects (e.g., NYC DOT mobility survey: ~3,000 participants).
  • Small impacts, like a bike lane increasing cycling rates, may go unnoticed in surveys but can be captured through large-scale mobile phone data.
  • Unlike large-scale behavior changes (e.g., COVID-19 lockdowns or natural disasters), most urban interventions produce small effects that are harder to detect.
  • Example: During the 2014 London Underground strikes, travel card data revealed that only 5% of commuters permanently adopted better routes post-strike ([14]).

Advantages of large-scale data in Urban Science

  1. Uncovering complex urban interdependencies:
  • Unlike traditional data collection methods, which often operate in silos, large-scale datasets (especially those coming from human activity) reflect how disparate systems in urban areas –like public transit, housing affordability, and air quality- are intricately linked.
  • For example, high-resolution mobility data can show how changes in public transportation routes influence ridership first, but also experience segregation, housing demand, or local air pollution levels.
  • Cities are complex systems, with interconnected components—people, infrastructure, businesses, and the environment—interacting across spatial and temporal scales.
  • Understanding these interactions through large-scale data is essential for addressing urban challenges holistically.

Challenges of large-scale data in Urban Science

However, large-scale data also has many challenges:

  1. Large scale Data \(\neq\) Truth.
  2. Data Accessibility.
  3. Privacy and Ethical Concerns.
  4. Complexity.

Challenges of large-scale data in Urban Science

  1. Large scale Data \(\neq\) Truth.
  • No matter how large the dataset, it is shaped by the method and biases inherent in its collection, pre-processing, and interpretation.
  • Designed data (census) vs. repurposed data (mobile phone data). Repurposed data is not collected and designed for research purposes.
  • Most critical challenges of repurposed data (see [15]):
    • Bias / representativity: not all people, activity, areas are included in the data. Demographic, temporal, and geographical biases can distort our understanding of urban challenges.
    • Drifting: the data might change over time because technological, legal, or social changes.
    • Algorithmically counfounded: the data might be biased because of the algorithms used to collect it.
    • Dirty: the data might be noisy, incomplete, or have errors.

Challenges of large-scale data in Urban Science

  1. Large scale Data \(\neq\) Truth
  • Example of algorithmically confounded:
    • Very recent large-scale study on Facebook found that its algorithm plays little role in driving political polarization [16].
    • However, some researchers argue the paper is flawed because, during the study, Facebook instituted a series of emergency changes to its algorithm to reduce the spread of misinformation. However, some researchers argue the paper is flawed because, during the study, Facebook instituted a series of emergency changes to its algorithm to reduce the spread of misinformation
  • Example of dirty large-scale datasets:
    • A German performance artist carried a cart full of smartphones in different streets of Berlin to trick Google Maps into indicating that traffic was terrible in those areas.

Challenges of large-scale data in Urban Science

  1. Large scale Data \(\neq\) Truth

Important

How can we adress these problems:

  • To alleviate biases, we can use techniques like pre- or post- stratification, weighting, or matching to make the data more representative of the population.
  • Always try to compare large-scale data or aggregates of them to other datasets.
  • Implement robustness methods to check how results depend on these alleviation methods.
  • If your study is longitudinal, implement these alleviations methods in time to control for potential drifting.
  • Learn how the data is collected, the original purpose of the data, the algorithms used to collect it, and even the technologies used to collect it and process it. :::

Challenges of large-scale data in Urban Science

  1. Data Accessibility.
  • Many large-scale datasets are controlled by private corporations or require significant financial resources
  • This can limit the ability of researchers to access and analyze the data, and can exclude some low-resource communities and groups to participate in the analysis and potential value of the data.
  • Regulated access or data public releases can help alleviate this problem.
  • Initatives like data challenges, data-for-good programs, or data collaboratives exemplify these efforts.

Challenges of large-scale data in Urban Science

  1. Privacy and Ethical Concerns.
  • Large-scale data often contains sensitive information about individuals, such as their location, social interactions, or financial transactions.
  • This raises concerns about privacy, consent, and data security.
  • Given the complexity of large-scale data, privacy might be even harder. For example, it is possible to uniquely identify individuals in highly anonymized mobility datasets by only knowing a small set of places visited by them [17].
  • Researchers must take steps to protect the privacy of individuals in the data, such as aggregating or anonymizing the data, obtaining informed consent, and following ethical guidelines.

Challenges of large-scale data in Urban Science

  1. Complexity.
  • Large-scale data is often complex, noisy, and unstructured, making it difficult to analyze and interpret.
  • Using and understanding large-scale data requires advanced computational and statistical methods, as well as domain-specific knowledge.
  • This can be a barrier for researchers without the necessary expertise or resources.
  • Misinterpretations of large-scale data can lead to incorrect conclusions and misguided policy recommendations.
  • For example, because we are dealing with massive amounts of data, there is always the possibility of finding spurious correlations that are not meaningful or do not reflect causal relationships, like the famous Google Flu Trends disaster [18] or correlations in explaining economic growth [19].

[SCIENCE]

Urban Science vs Computational Urban Science

  • Urban Science is the study of cities and urban areas, focusing on their structure, dynamics, and challenges.

  • Integrates disciplines like sociology, geography, economics, urban planning, and computer science.

  • Computational Urban Science can be viewed as an extension of Urban Science, studying old problems with:

    • More emphasis on the use of large-scale datasets and computational models to study urban challenges.
    • More focus on the “why” rather than the “what” of urban phenomena.
    • More emphasis on the use of AI, machine learning, and other computational methods to analyze urban data.

Urban Science vs Computational Urban Science

But the use of large-scale data and computational methods has permitted to study new problems that were not possible before and transformed the way we study old problems

Let’s see some examples:

  • From descriptive observations to predictive and prescriptive models.
  • Cities as a complex systems
  • Unveiling the universal laws of human behiavor in cities.

From descriptive observations to predictive and prescriptive models.

  • One crucial characteristic of Computational Urban Science is the different dimensions and nature of the problem under study.

  • Here is a list of typical problems under study in Urban Science

Note

  • Urban Planning
  • Transportation
  • Urbanization
  • Housing
  • Environmental sustainability
  • Public health
  • Resilience
  • Inequality and Segregation
  • Governance
  • Epidemics
  • Economic Growth

Most of this problems involve the study of the relationship between human behavior, urban environment and societal outcomes.

From descriptive observations to predictive and prescriptive models.

But are different in the level of predictability of human behavior.

Hard problems

  • For example, transportation or environmental sustainability problems can be well-framed using computational optimization tools, atmospheric models, and simultation models because they are based on physical laws and highly predictible human behavior.

  • In transportation, for example, poeple try to optimize their routes and thus, we can predict the traffic congestion in a city using the number of cars, the road network, and the traffic lights.

  • Hard problems in urban areas are highly predictable and can be solved using computational models.

From descriptive observations to predictive and prescriptive models.

Example: traffic simulation using agent-based-simulations in Berlin using the MATSim open source software.

From descriptive observations to predictive and prescriptive models.

But are different in the level of predictability of human behavior.

Soft problems

  • But other problems like urban planning, housing, inequality, or governance are based on less predictable and more complex human behavior subject to highly variable social, economic, and political factors.

  • Even the same problem can have both hard and soft components. For example, predicting what individuals are going to do at lunch time is very easy (hard problem), but the place they choose might be more difficult to predict.

  • Large-scale data is changing also the nature of some problems. For example, decades ago, it was very hard to predict store patronage, the segregation of people in their daily moves, the risk of getting infected by attending an event, or the impact of transportation policies on business

  • Computational Urban Science is transforming these problems from descriptive observations to predictive and prescriptive models.

From descriptive observations to predictive and prescriptive models.

Detecting income segregation at individual places in urban areas [20]

Cities as complex systems

Another characteristic of Computational Urban Science is the study of cities as complex systems, challenging the prevailing urban science paradigm that the “City is a Tree” [21]

  • Traditional studies in urban science were based on the hierarchical “Environment-Behavior-Outcome” paradigm (solid lines)

Traditional (solid) versus current (solid + dashed) view of urban challenges

  • For example: new bus route (environment) \(\rightarrow\) more people using the bus (behavior) \(\rightarrow\) less traffic congestion (outcome).

Cities as complex systems

This view is being challenged by the “City is a Network” paradigm (solid + dashed lines) specially by the works of Jane Jacobs [22], Christopher Alexander [21] and more recently Mike Batty [23].

  • Cities are complex networks (or systems of systems) where environments, individual and collective behaviors, and outcomes are interconnected in intricate ways (solid + dashed lines )

Traditional (solid) versus current (solid + dashed) view of urban challenges

  • Small surveys cannot map or detect these complex interdependencies, but large-scale data can. This is the core of Computational Urban Science: the study of urban challenges as complex system of systems.

Cities as complex systems

For example. Here is the manually annotated causal map of how environmental health socio and economic systems were intertwined during the COVID-19 pandemic

A preliminary causal diagram demonstrating the complexity of the COVID-19 pandemic environmental–health–socio–economic system. From

Unveiling the universal laws of human behavior in cities.

More granular data about people’s behavior has permitted establishing a set of scientific rules of laws that define the core principles of human behavior in cities.

Before Computational Urban Science, some of these principles were known but not quantified or generalized. For example, here are the “Three Laws of Geography” [24] [25]

  • Tobler’s First Law: “Everything is related to everything else, but near things are more related than distant things.”
  • Second Law: “Geographical variables can vary in an uncontrolled way.”
  • Third Law: Geographical variables of similar geographical configurations and more similar.

or the “Law of Gravity” [26] of aggregated flow of movements \(T_{ij}\) between places \(i\) and \(j\) \[ T_{ij} = \frac{P_i O_j}{d_{ij}^\beta} \] where \(O_j\) is the attractiveness of area \(j\) and \(P_i\) is the population of place \(i\).

Unveiling the universal laws of human behavior in cities.

Computational Urban Science has permitted to quantify and generalize these laws, and even to find new ones. For example, here are some of the most recent ones:

  • Radiation model: attractiveness of areas can be modified by the presence of other areas, which builds on the concept of “intervening opportunities” [27]

Universal distance-frequency distribution of population flows, from [28]

Unveiling the universal laws of human behavior in cities.

Computational Urban Science has permitted to quantify and generalize these laws, and even to find new ones. For example, here are some of the most recent ones:

  • Universal visitation Law of human mobility, which generalizes the gravity law by including also the and frequency of visits to a place [28]

Universal distance-frequency distribution of population flows, from [28]

Unveiling the universal laws of human behavior in cities.

Computational Urban Science has permitted to quantify and generalize these laws, and even to find new ones. For example, here are some of the most recent ones:

  • The universal law of urban scaling, which stablishes that many urban quantities scale with the population of the city with a power-law exponent [29]

Scaling of urban infrastructure and socio-economic output, from [29]

Unveiling the universal laws of human behavior in cities.

Computational Urban Science has allowed also to understand the emergence of those laws by finding the microscopic rules that generate them.

  • For example, the radiation model can be derived from the concept of “intervening opportunities” and the “universal vistitation law” from the concept of “preferential attachment” in the Exploration and Preferential mobility models.

  • This “emergence” of macroscopic universal laws from very simple microscopic description of human behavior might explain whay those laws are universal across different geographical, cultural, and societal context.

  • This is also at the core of Computational Urban Science: how local interactions between individuals, environments, and societal outcomes create universal laws to understand, predict, and address urban challenges globally.

Computational Urban Science

In summary, we can define Computational Urban Science as:

  • Urban: focused on Urban Challenges
  • Computational: Large-scale datasets + computational methods
  • Science: Cities as highly predictable and prescriptive complex emergent systems.

More reading:

  • Bit by Bit, by Salganik [15]
  • The City is not a tree [21]
  • The City is not a static tree [30]
  • Understanding Urban Economies, Land Use, and Social Dynamics in the City: Big Data and Measurement [31]

References

[1]
M. Arvidsson, N. Lovsjö, and M. Keuschnigg, “Urban scaling laws arise from within-city inequalities,” Nature Human Behaviour, vol. 7, no. 3, pp. 365–374, Mar. 2023, doi: 10.1038/s41562-022-01509-1.
[2]
D. Moran, K. Kanemoto, M. Jiborn, R. Wood, J. Többen, and K. C. Seto, “Carbon footprints of 13 000 cities,” Environmental Research Letters, vol. 13, no. 6, p. 064041, Jun. 2018, doi: 10.1088/1748-9326/aac72a.
[3]
S. C. Curtin and M. R. Spencer, “Trends in death rates in urban and rural areas: United states, 1999–2019,” 1921.
[4]
S. C. Curtin and M. Heron, “COVID-19 death rates in urban and rural areas: United states, 2020,” 2022.
[5]
R. Di Clemente, M. Luengo-Oroz, M. Travizano, S. Xu, B. Vaitla, and M. C. González, “Sequences of purchases in credit card data reveal lifestyles in urban populations,” Nature Communications, vol. 9, no. 1, pp. 1–8, Aug. 2018, doi: 10.1038/s41467-018-05690-8.
[6]
A. Llorente, M. Garcia-Herranz, M. Cebrian, and E. Moro, “Social Media Fingerprints of Unemployment,” PLOS ONE, vol. 10, no. 5, p. e0128692, May 2015, doi: 10.1371/journal.pone.0128692.
[7]
J. Blumenstock, J. Lain, I. Smythe, and T. Vishwanath, “Using big data and machine learning to locate the poor in nigeria,” The World Bank website. Blog, vol. 22, 2021.
[8]
T. Gebru et al., “Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States,” Proceedings of the National Academy of Sciences, vol. 114, no. 50, pp. 13108–13113, Dec. 2017, doi: 10.1073/pnas.1700035114.
[9]
Z. Fan, F. Zhang, B. P. Y. Loo, and C. Ratti, “Urban visual intelligence: Uncovering hidden city profiles with street view images,” Proceedings of the National Academy of Sciences, vol. 120, no. 27, p. e2220417120, Jul. 2023, doi: 10.1073/pnas.2220417120.
[10]
A. Abadie and M. D. Cattaneo, “Econometric Methods for Program Evaluation,” Annual Review of Economics, vol. 10, no. 1, pp. 465–503, Aug. 2018, doi: 10.1146/annurev-economics-080217-053402.
[11]
T. Yabe, B. García Bulle Bueno, M. R. Frank, A. Pentland, and E. Moro, “Behaviour-based dependency networks between places shape urban economic resilience,” Nature Human Behaviour, pp. 1–11, Dec. 2024, doi: 10.1038/s41562-024-02072-7.
[12]
L. M. Stolerman et al., “Using digital traces to build prospective and real-time county-level early warning systems to anticipate COVID-19 outbreaks in the United States,” Science Advances, vol. 9, no. 3, p. eabq0199, Jan. 2023, doi: 10.1126/sciadv.abq0199.
[13]
B. García Bulle Bueno et al., “Effect of mobile food environments on fast food visits,” Nature Communications, vol. 15, no. 1, p. 2291, Mar. 2024, doi: 10.1038/s41467-024-46425-2.
[14]
S. Larcom, F. Rauch, and T. Willems, “The Benefits of Forced Experimentation: Striking Evidence from the London Underground Network*,” The Quarterly Journal of Economics, vol. 132, no. 4, pp. 2019–2055, May 2017, doi: 10.1093/qje/qjx020.
[15]
M. J. Salganik, Bit by bit: Social research in the digital age. Princeton University Press, 2019.
[16]
A. M. Guess et al., “How do social media feed algorithms affect attitudes and behavior in an election campaign?” Science, vol. 381, no. 6656, pp. 398–404, Jul. 2023, doi: 10.1126/science.abp9364.
[17]
Y.-A. de Montjoye, C. A. Hidalgo, M. Verleysen, and V. D. Blondel, “Unique in the Crowd: The privacy bounds of human mobility,” Scientific Reports, vol. 3, no. 1, Mar. 2013, doi: 10.1038/srep01376.
[18]
D. Lazer, R. Kennedy, G. King, and A. Vespignani, “The parable of google flu: Traps in big data analysis,” science, vol. 343, no. 6176, pp. 1203–1205, 2014.
[19]
X. X. Sala-I-Martin, “I Just Ran Two Million Regressions,” The American Economic Review, vol. 87, no. 2, pp. 178–183, 1997, Accessed: Feb. 17, 2025. [Online]. Available: https://www.jstor.org/stable/2950909
[20]
E. Moro, D. Calacci, X. Dong, and A. Pentland, “Mobility patterns are associated with experienced income segregation in large US cities,” Nature Communications, vol. 12, no. 1, pp. 1–10, Jul. 2021, doi: 10.1038/s41467-021-24899-8.
[21]
C. Alexander, A city is not a tree. Sustasis Press/Off The Common Books, 2017.
[22]
J. Jacobs, The death and life of great American cities, Modern Library ed. New York: Modern Library, 1993.
[23]
M. Batty, “Cities as complex systems: Scaling, interaction, networks, dynamics and urban morphologies.” 2009.
[24]
A.-X. Zhu, “Laws of Geography?” International Encyclopedia of Geography, pp. 1–8, Mar. 2024, doi: 10.1002/9781118786352.wbieg2192.
[25]
M. Leitner, P. Glasner, and O. Kounadi, “Laws of geography,” in Oxford research encyclopedia of criminology and criminal justice, 2018.
[26]
G. K. Zipf, “The p 1 p 2 d hypothesis: On the intercity movement of persons,” American Sociological Review, vol. 11, no. 6, p. 677, Dec. 1946, doi: 10.2307/2087063.
[27]
F. Simini, M. C. González, A. Maritan, and A.-L. Barabási, “A universal model for mobility and migration patterns,” Nature, vol. 484, no. 7392, pp. 96–100, Apr. 2012, doi: 10.1038/nature10856.
[28]
M. Schläpfer et al., “The universal visitation law of human mobility,” Nature, vol. 593, no. 7860, pp. 522–527, May 2021, doi: 10.1038/s41586-021-03480-9.
[29]
L. M. A. Bettencourt, “The Origins of Scaling in Cities,” Science, vol. 340, no. 6139, pp. 1438–1441, Jun. 2013, doi: 10.1126/science.1235823.
[30]
E. Moro et al., “A city is not a static tree: Understanding urban areas through the lens of real-time behavioral data,” ZARCH. Journal of interdisciplinary studies in Architecture and Urbanism, no. 19, pp. 28–39, 2022, doi: 10.26754/ojs_zarch/zarch.2022197407.
[31]
A. Saiz and A. Salazar Miranda, “Understanding Urban Economies, Land Use, and Social Dynamics in the City: Big Data and Measurement.” Rochester, NY, Oct. 2023. doi: 10.2139/ssrn.4594995.