KD-trees for best-case linear time implementation of Abadie & Diamond's statistical matching procedure for causal inference.
This project implements an optimized version of the matching procedure described in Abadie & Diamond's work on statistical matching for causal inference. By leveraging KD-tree data structures, the implementation achieves best-case linear time complexity, making it suitable for large-scale causal inference applications.
Statistical matching is a fundamental technique in causal inference used to estimate treatment effects when randomized controlled trials are not feasible. The method matches treated and control units based on their observed covariates to reduce selection bias.
Traditional matching algorithms can be computationally expensive for large datasets. This implementation uses spatial data structures (KD-trees) to dramatically improve performance, enabling matching on larger datasets and with more covariates than previously practical.