Share
Economics·44 views·9 min read·Research

Regression to the Mean

Regression to the mean is the statistical principle that extreme values — high or low — tend to move back toward the long-run average over time. In real estate, it describes why unusually strong rent growth, compressed cap rates, and inflated price-to-income ratios are likely to normalize rather than persist indefinitely.

Also known asMean RegressionReversion to Average
Published Jan 19, 2025Updated Mar 28, 2026

Why It Matters

You're using regression to the mean every time you ask whether a market's performance is sustainable or just noise. When rents in a metro climbed 28% in two years, regression to the mean tells you that kind of run is an outlier, not a new normal — and your underwriting should plan for normalization, not extrapolation. The same logic applies in reverse: a market hammered by oversupply or a local employer exit is sitting below its long-run average, which is a signal worth paying attention to.

The analytical frame here is different from mean reversion, which describes the market force. Regression to the mean is the statistical lens — the discipline of measuring how far a current reading sits from its historical distribution and what that gap implies for future performance.

At a Glance

  • What it is: A statistical principle stating that extreme values tend to drift back toward their historical average over time
  • Distinguished from mean reversion: Mean reversion describes the market force; regression to the mean is the analytical measurement framework
  • Key metrics to track: Cap rates, price-to-rent ratios, vacancy rates, rent-to-income ratios, real home price indices
  • Applies to: Any time-series metric with a stable long-run average and a current reading far from that average
  • Investor use: Calibrating underwriting assumptions, avoiding extrapolation of outlier performance, spotting undervalued markets
  • Related concepts: Real estate cycle phases, equilibrium, black swan

How It Works

The statistical foundation is straightforward. Every real estate market has a distribution of outcomes for key metrics across time. Cap rates in a given metro have traded between, say, 4.8% and 7.2% over 20 years, clustering near a median. When a current reading sits two standard deviations from that median — either compressed to 4.1% or expanded to 8.3% — the statistical expectation is that future readings will drift back toward the center of the distribution. That drift is regression to the mean.

The measurement process has three steps. First, establish the historical baseline — ideally 15 to 20 years of data for the specific metro and property type. Shorter windows introduce recency bias; using the 2018–2022 period as a "normal" baseline for rent growth, for example, would badly miscalibrate expectations. Second, calculate the current deviation from that baseline. A price-to-rent ratio 22% above its 20-year median is a significant outlier. A cap rate 80 basis points below its median is a moderate one. Third, incorporate the size of the deviation into your underwriting — not as a guaranteed timing call, but as a directional pressure on future performance.

Regression applies across multiple metrics simultaneously. This is where the analytical value compounds. A market with cap rates below median, price-to-rent ratios above median, vacancy below median, and rent growth well above historical averages is exhibiting regression pressure across four dimensions at once. Each metric independently suggests a drift back toward normal; taken together, they describe a market where multiple favorable tailwinds for sellers have built up — and where regression pressure is strong.

The relationship with hyper-supply and the real estate cycle phases is direct. Hyper-supply phases typically emerge after extended periods of above-mean rent growth and below-mean vacancy. The excess development that defines hyper-supply is itself triggered by the very outperformance that regression predicts will reverse. Tracking regression signals early can give you a read on where a market sits in its cycle before supply data confirms it.

What regression to the mean does not do. It does not identify when normalization will occur. A market can sustain above-mean performance for years if structural supply constraints limit new development — think coastal metros with geographic or regulatory barriers. A black swan event, a major employer arrival, or a demographic shift can permanently reset the mean itself. And regression is a tendency, not a law: individual assets within a market can outperform or underperform their market's regression trajectory based on property-specific factors.

Real-World Example

Aisha is evaluating a 12-unit multifamily purchase in a mid-size Sun Belt metro. She runs a regression-to-the-mean analysis before underwriting her rent growth assumptions.

Her research shows the metro's effective rent growth averaged 2.9% annually over the prior 20 years. The prior three years produced growth of 14.1%, 9.3%, and 6.2% — each year above the long-run mean. The current vacancy rate is 4.1%, 310 basis points below the 20-year median of 7.2%. The current cap rate for comparable properties is 4.7%, against a 20-year median of 5.9%.

She runs two underwriting scenarios. Her seller's pro forma extrapolates 5% annual rent growth. Her regression-adjusted model uses 3.1% — slightly above the long-run mean to account for some structural demand from population growth, but well below recent performance. She also stress-tests with 1.8% growth, reflecting a full reversion that slightly undershoots the mean.

At 5% rent growth, the deal clears her return threshold easily. At 3.1%, it still works but with thinner margins. At 1.8%, it fails her cash-on-cash minimum in year two. Aisha passes on the deal at the asking price — not because she thinks a crash is coming, but because the regression analysis reveals she is being asked to pay for continued outlier performance rather than a return to equilibrium. She offers 7% below ask, reflecting a cap rate closer to the 20-year median. The seller declines. Six months later, vacancy in the metro climbs to 6.4% and asking rents flatten. Aisha's regression framework did not predict the timeline — but it correctly identified the direction.

Pros & Cons

Advantages
  • Grounds underwriting in statistical reality rather than recent momentum — prevents extrapolating outlier performance into long-term projections
  • Works across multiple metrics simultaneously — analyzing cap rates, vacancy, and rent growth together gives a richer picture of regression pressure than any single indicator
  • Applies symmetrically — identifies undervalued markets as clearly as overvalued ones, creating buy signals alongside caution flags
  • Long data series available for most major metros — FRED, CoStar, and local assessor records support robust baseline construction
  • Complements cycle analysis — regression signals often lead cycle phase transitions, providing earlier read on market direction
Drawbacks
  • Offers no timing signal — a market can remain in regression territory for years before normalizing, making patience a prerequisite
  • Historical means shift with structural changes — demographic shifts, remote work trends, and interest rate regime changes can permanently alter the baseline, making old averages misleading
  • Data quality varies by market — smaller metros have thinner historical data, producing less reliable regression baselines
  • Can cause premature exit from strong markets — investors relying heavily on regression signals may exit during extended outperformance cycles and miss meaningful appreciation
  • Does not account for recession-phase overshoot — values can fall well below the long-run mean during severe downturns before recovering, creating false buy signals mid-cycle

Watch Out

Don't use the wrong baseline period. If your historical window includes an anomalous peak — such as 2020–2022 rent spikes in Sun Belt markets — your "mean" will be artificially elevated and your regression signals muted. Use a baseline that spans multiple full cycles, at least 15 years, and ideally includes at least one downturn.

Property type and location matter at every level. Regression to the mean for Class A multifamily in Dallas is not the same distribution as Class B multifamily in Fort Worth. Mixing metro-wide averages with submarket data or different asset classes produces meaningless comparisons. Always match the baseline to the specific segment you are evaluating.

Regression signals are inputs, not decisions. A market exhibiting strong regression pressure toward the mean is a data point — one of many you should weigh alongside employment trends, supply pipeline, financing conditions, and property-specific factors. Investors who treat regression as a mechanical sell signal miss deals in structurally improving markets.

Do not confuse regression to the mean with simple price decline. Regression is about normalization of metrics relative to their historical distribution. A market where cap rates are below median is not necessarily "overpriced" — it may reflect a legitimate change in investor demand or risk perception. The regression signal flags the deviation and asks whether the new level is structural or cyclical.

Ask an Investor

The Takeaway

Regression to the mean is the statistical discipline that keeps your underwriting honest. It forces you to ask whether current metrics are an outlier or a new normal — and it defaults to "outlier" until structural evidence says otherwise. The analytical value is not in predicting when normalization happens, but in refusing to price assets as if the outlier will last forever. When you see a market where multiple metrics are simultaneously stretched above historical norms, regression to the mean is telling you that gravity exists even if the clock is uncertain. Build your models accordingly.

Was this helpful?