Curse of Dimensionality
The curse of dimensionality is a cluster of phenomena that make geometric, statistical, and search algorithms degrade as the number of features grows. Volume grows exponentially, samples become sparse, and pairwise distances concentrate, so nearest-neighbor and density-based methods lose discriminative power.
The curse of dimensionality, a term coined by Richard Bellman in 1961, refers to a family of effects that make geometric, statistical, and indexing methods break down as the dimensionality of the input grows. The unit cube's volume grows exponentially with dimension, so any fixed-size sample becomes vanishingly sparse, and the volume concentrates near the surface rather than the center. For nearest neighbor search the most damaging effect is distance concentration: under common metrics the ratio of the farthest to the nearest pairwise distance tends to 1 as dimensionality grows, so 'closest' and 'farthest' lose their intuitive meaning. This breaks KD-tree, ball tree, and other recursive space-partitioning indexes, which can no longer prune branches and degrade toward a linear scan. Related phenomena include the hubness problem, in which a few points appear in disproportionately many k-NN lists, and the manifold effect, where real data lives on a much lower-dimensional manifold than its ambient space suggests. The practical response is to either reduce dimensionality before searching — via PCA, autoencoders, or learned embeddings — or to use approximate nearest neighbor methods such as HNSW, locality-sensitive hashing, and product quantization that exploit the low intrinsic dimensionality of real datasets while accepting bounded recall loss. The curse also affects statistical learning more broadly: density estimation, clustering, and parametric models all require sample sizes that grow exponentially with dimension to keep error rates fixed.