Optimal Audit Targeting with Machine Learning: Evidence from Pakistan (Job Market Paper)

Dec 10, 2025

Nicholas Lacoste

Zehra Farooq


Abstract

This paper bridges welfare economics and machine learning econometrics to develop empirically implementable algorithms for optimal audit targeting. We derive a sufficient-statistic-based targeting algorithm that depends on three individualized quantities: the immediate revenue recovered from an audit, the causal effect of an audit on long-run tax revenue, and the marginal administrative cost of an audit. We estimate these quantities with a variety of machine learners, comparing causal forests, LASSO, gradient-boosted trees, and neural networks, using the universe of Pakistani income tax returns and exploiting years in which audits were assigned completely at random. We implement our targeting algorithms in out-of-bag years and compare them to the real-world policy, under which audits were partially or entirely targeted. We show that the real-world audit program in Pakistan lost almost 173,000 Rs (about $1,700) in net revenue per audit, while our optimal policy generates 285,000 Rs (about $2,800) in expected net revenue per audit. We also find that, in this setting, targeting audits on immediate revenue recovery performs worse than targeting on long-run deterrence. More broadly, our framework offers a general approach to empirical welfare maximization using machine learning in resource-constrained policy settings.
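To make the targeting idea concrete, here is a minimal sketch of a rule built from the three quantities named in the abstract. It is not the paper's algorithm, which the abstract does not spell out. It assumes that the predicted net value of an audit is simply immediate revenue plus long-run revenue effect minus marginal cost, and that the budget is a fixed number of audits. The function name `target_audits` and all numbers are hypothetical.

```python
import numpy as np

def target_audits(immediate, long_run, cost, budget):
    """Select up to `budget` taxpayers with the highest predicted net revenue.

    Assumption (not from the paper): net value is additive,
    net = immediate + long_run - cost, and audits with non-positive
    predicted net value are never worth conducting.
    """
    immediate = np.asarray(immediate, dtype=float)
    long_run = np.asarray(long_run, dtype=float)
    cost = np.asarray(cost, dtype=float)

    net = immediate + long_run - cost
    # Sort taxpayers from highest to lowest predicted net revenue.
    order = np.argsort(-net, kind="stable")
    top = order[:budget]
    # Drop any selected audit that is predicted to lose money.
    chosen = top[net[top] > 0]
    return chosen, net

# Hypothetical predictions for four taxpayers (arbitrary units).
chosen, net = target_audits(
    immediate=[100, 50, 10, 0],
    long_run=[20, 200, -5, 30],
    cost=[40, 40, 40, 40],
    budget=3,
)
print(chosen.tolist())     # indices of taxpayers selected for audit
print(net[chosen].mean())  # predicted net revenue per audit conducted
```

In this toy example the second taxpayer ranks first because of a large long-run effect despite modest immediate recovery, which is the kind of case where ranking on immediate revenue alone would choose differently.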

Type

Preprint

Last updated on Dec 10, 2025

Welfare-Optimal Audit Programs, Machine Learning, Sufficient Statistics

Authors

Nicholas Lacoste

Ph.D. Candidate in Economics
