Network Science Institute | Northeastern University
NETS 7983 Computational Urban Science
2025-04-15
This week:
Analysis of Policy Interventions in Computational Urban Science
Data -> Methods -> Models -> Applications.
Cities are the largest laboratories of policy innovation: transport, housing, climate, equity.
Policies need to be:
The critical part is “Evaluation”, i.e., knowing what is the effect of the policy in different groups/areas/times so we can re-design or calibrate our policy for a more efficient implementation.
Thus, we need an accurate policy evaluation framework based on data collection, causal inference, and prediction models.
Ideally, that policy evaluation framework should be a Randomized Control Trial. However, ethical, economic, justice, or legal restrictions prevent that possibility. Most of the times we have only observational data about the impact of policies.
This comes with key challenges:
Because of their complexity, policy evaluation is typically done using very simple techniques:
Because of their complexity, policy evaluation is typically done using very simple techniques:
For those reasons, typical policy evaluation
Overestimates the effect of the intervention, because of selection bias, externalities, confounders, etc.
Does not scale, because evaluations focus only on the direct, local impact without considering systemic feedbacks or spillovers.
Misses unintended consequences, such as gentrification, traffic rerouting, or increased inequities — especially when only immediate or positive effects are measured.
Fails to capture dynamic and spatial effects, assuming impact is contained and static. Example: Retail policies in one neighborhood may shift customer flows rather than grow them.
Leads to misleading narratives, reinforcing decisions based on anecdotal evidence or correlation rather than robust evidence.
Example: using before-after comparisons to evaluate the impact of transportation policies:
The use of large-scale data and computational tools allows for more accurate, scalable, and nuanced policy evaluation by:
Key challenges in using CUS or policy evaluation:
Roadmap:
Data: check the representativity of the data. Areas, people, temporal coverage. Try to
Quasi-experimental design:
Estimation of the impact:
NYC launched a congestion pricing program aimed at reducing traffic congestion in Manhattan below 60th St.
First initial data: https://www.congestion-pricing-tracker.com (from a student in Northeastern!). Using Google Maps, commuting times for different routes in and out of Manhattan.
They found a small effect on the average commuting time on routes affected by the policy
The main effect is on the commuting times at different times of the day
Results from other data companies (INRIX, MTA) show a small impact on the average speed (around 20%) within the Central Business District (CBD), Manhattan. This might signal that the impact was mostly on private vehicles, not Ubers + delivery.
Most of these evaluations:
Use simple counterfactuals:
Do not control for potential spill-overs
Do not implement a systematic causal inference framework
In a recent paper, [1], Cook et al. took a more causal analysis of congestion pricing. They
Average daily speed in the CBD (red) compared with a handful of other cities
Even though only NYC implemented congestion pricing, average speeds in most cities increased in January and February. To get a better counterfactual, they used a synthetic control formed from other cities. They found a significant ATT (15%) on daily and hourly average speed.
The effect is mainly within and to CBD, although small increases in speed of 4% can be seen on trips to other areas that go through the CBD.
Using CATE, they also investigate the effect across different groups. They found similar effects across the income distributions.
Summary:
In the early 2000s, Paris began to promote sustainable transport models. The Paris Pedestrian Initiative implemented a street-sharing program to promote alternative modes of transportation (bikers, pedestrians). A key component of this program was the Zone 30 initiative to create slow zones.
To do that they fixed the speed limit of 30km/h, created more space for bikes, sidewalks, and parking space was converted to terraces in some streets.
There were 141 slow zones introduce in Paris, located within the central area of Paris and implemented across ten years.
One of the key components of this policy is its implementation: it was done at the level of the district, which means that streets outside of those slow zones were not affected by the policy. This was used by Salazar-Miranda et al. [2] to evaluate the impact of policy using causal methods.
They define treated streets as those within slow zones and control as those 100m or less from the boundary of slow zones.
Their choice of treated and control groups was made to maximize their similarity across all other potential confounders like length, proximity to the city center, transit stops, parks, etc., that might affect the outcome of the policy. They only found differences in segment length, thus they control for it later.
Outcome: do slow zones affect human activity on streets?
First: does human activity vary across the boundaries of slow zones? There appears to be a discontinuity in the regressions of activities across the boundaries. This justifies their choice of control and treatment groups.
They used a similar framework as regression discontinuity to measure the ATE by investigating the average effect of segments of a street that are at different sides of the slow zone boundary:
\[ Outcome_{i,t} = \beta\ SlowZone_{i,t} + \gamma\ X_i + \delta_t + \alpha_{n(i)} + \theta_{s(i)} + \epsilon_{i,t} \]
where
They claim that, since control and treated segments of the same street at different sides of the boundary are significantly different in terms of characteristics (apart from the street length), then \(\beta\) can be interpreted as the causal impact. Here are the results for different outcomes (44% increase in the number of tweets!)
However, they found that the effect is only significant for early cohorts (2010) of slow zones. Probably because of under-powered statistical analysis
Summary:
Electric Vehicles (EVs) offer a possibility to alleviate the problem of pollution in our cities. Some policies, like the Infrastructure Investment and Jobs Act (IIJA) in 2021 ($7.5B) promote the creation of public EV charger stations (EVCS) across the nation.
However, the importance of public EVCSs extends beyond their primary function: they can influence the economic outcomes of surrounding businesses. Does the foot traffic around EVCS boost or harm those businesses?
In a recent paper [3], Zheng et al. investigate this through causal methods
Their data includes when a charging station is opened. They considered that all POI locations within a 500m during the study period are part of the treatment group. Control are POI locations outside that 500m boundary.
One limitation of that approach is that ECVS provides might place them strategically to maximize the benefits of them, in areas that could have more growth, for example. To address this endogeneity, the authors use Propensity Score Matching to pair same-category POIs in the treatment and control group.
This is done using logistic regression. Let’s assume that \(D_i\) is the treatment for POI \(i\) and \(X_i\) are observable covariates that might influence the probability that \(i\) is treated
\[ logit P(D_i = 1|X) = \beta_0 + \beta_1 X_1 + \cdots +\beta_k X_k \] \(P(D_i = 1 | X)\) is called the propensity score. The matching is very simple: for each treated unit, we found a control unit with a similar score.
Once this is done, typically, we should observe that the distribution of covariates between treated and control groups should be similar
Using that matched sample of control and treated POIs, they look at different outcomes, like customer count, spending, etc.
To get the ATE, the use a DiD:
\[ \ln Y_{it} = \alpha + \beta D_{it} \times PC_{it} + u_i + \phi_c \times \omega_t + \epsilon_{it} \] where
\(\beta\) is then the marginal effect of adding a new charging port to an EVCS.
They found that adding a new charging port resulted in a 0.21% increase in customer count and a 0.25% in spending in 2019 and 0.14% and 0.16% in 2021-2023.
The effect is very small (!!) and has some demographic, temporal, and spatial heterogeneity. For example, they found that for underprivileged regions, the effect was 0.17%/0.29% in 2019 and that the effect diminishes with distance to EVCS and does not increase with time.
They also found some differences across categories. Only Restaurants and Grocery/clothing stores seem to be affect (and only for fast chargers)
Summary:
CUS 2025, ©SUNLab group socialurban.net/CUS