Regime assignments use canonical definitions established 2026-03-08 (K-Means k=8, Greenspan Era). View WCSS analysis.
Macro Economic Regime Clustering (1987–2008)GREENSPAN ERA
Published 2026-03-08
This analysis identifies distinct macroeconomic regimes by applying unsupervised K-Means clustering to 17 weekly-aggregated macro indicators spanning 1987 to 2008 (1,148 weeks of data). The algorithm discovered 8 distinct regimes, each characterized by a unique combination of growth, inflation, credit, and risk conditions.
Identified Regimes
Each row summarizes one regime: its auto-generated label, the number of weeks assigned to it, the span of dates it covers, and the three indicator values (with z-scores) that most distinguish it from the others.
| ID | Label | Weeks | Date Span | Top Distinguishing Features (centroid) |
|---|---|---|---|---|
| 0 | Expanding / Inflationary / Low 3m10y Spread | 163 | 2004-12-17 – 2008-01-25 | M2 Money Supply: 6919.18 (z=+1.50); GDP: 16393.45 (z=+1.43); CPI: 201.52 (z=+1.36) |
| 1 | Deteriorating / Low Industrial Production / Low GDP | 73 | 1987-01-02 – 1991-10-25 | Initial Jobless Claims: 424561.64 (z=+1.43); Industrial Production: 60.65 (z=-1.31); Unemployment Rate: 6.59 (z=+1.22) |
| 2 | Low Capacity Utilization / Low 3M Treasury Yield / Low Fed Funds Rate | 193 | 2001-04-06 – 2004-12-10 | Capacity Utilization: 76.30 (z=-1.64); 3M Treasury Yield: 1.64 (z=-1.39); Fed Funds Rate: 1.76 (z=-1.37) |
| 3 | Tightening / Low CPI / Low Industrial Production | 179 | 1987-04-03 – 1990-08-31 | 10Y Treasury Yield: 8.68 (z=+1.59); 2Y Treasury Yield: 8.22 (z=+1.52); Fed Funds Rate: 8.01 (z=+1.50) |
| 4 | Recessionary / Tight Conditions / Credit Stress | 48 | 2008-02-01 – 2008-12-26 | Recession Probability: 84.76 (z=+4.03); NFCI: 1.31 (z=+2.85); BAA Spread: 3.85 (z=+2.70) |
| 5 | Deteriorating / Steepening / Low Industrial Production | 131 | 1991-11-01 – 1994-04-29 | Unemployment Rate: 7.10 (z=+1.78); Short-10Y Spread: 1.96 (z=+1.27); 3m10y Spread: 3.17 (z=+1.25) |
| 6 | Expanding / Low BAA Spread / Low NFCI | 179 | 1994-05-06 – 1997-10-03 | Capacity Utilization: 83.77 (z=+1.01); BAA Spread: 1.56 (z=-0.86); NFCI: -0.69 (z=-0.64) |
| 7 | Low Unemployment Rate / Optimistic / Low 3m10y Spread | 182 | 1997-10-10 – 2001-03-30 | Unemployment Rate: 4.26 (z=-1.35); Consumer Sentiment: 104.84 (z=+1.33); 3m10y Spread: 0.56 (z=-1.00) |
Visualization
Top panel — PCA Feature Space: Each dot represents one week projected into two principal components (capturing 73.4% of total variance). Color indicates the assigned regime. Dots that form tight, well-separated clusters indicate regimes with distinctive macro fingerprints.
Bottom panel — Regime Timeline: The same regime assignments plotted chronologically, so you can see when each macro regime was in force. The legend below the timeline panel identifies each color.
Regime Fingerprints — Feature Z-Scores
The table below shows the standardized z-score of each feature's cluster centroid. A value of +2.0 means that regime has a feature value roughly 2 standard deviations above the historical average; −2.0 means 2 standard deviations below. Dark red = strongly above average Dark blue = strongly below average
| Feature | Regime 0 Expanding / Inflationary / Low 3m10y Spread | Regime 1 Deteriorating / Low Industrial Production / Low GDP | Regime 2 Low Capacity Utilization / Low 3M Treasury Yield / Low Fed Funds Rate | Regime 3 Tightening / Low CPI / Low Industrial Production | Regime 4 Recessionary / Tight Conditions / Credit Stress | Regime 5 Deteriorating / Steepening / Low Industrial Production | Regime 6 Expanding / Low BAA Spread / Low NFCI | Regime 7 Low Unemployment Rate / Optimistic / Low 3m10y Spread |
|---|---|---|---|---|---|---|---|---|
| GDP | +1.43 | -1.19 | +0.79 | -1.26 | +1.58 | -0.90 | -0.45 | +0.28 |
| Unemployment Rate | -0.77 | +1.22 | +0.13 | +0.04 | +0.41 | +1.78 | -0.02 | -1.35 |
| CPI | +1.36 | -1.11 | +0.71 | -1.45 | +1.86 | -0.70 | -0.27 | +0.17 |
| Fed Funds Rate | -0.18 | +0.77 | -1.37 | +1.50 | -1.37 | -0.63 | +0.29 | +0.36 |
| Initial Jobless Claims | -0.63 | +1.43 | +0.76 | -0.51 | +1.36 | +0.50 | -0.16 | -0.87 |
| Consumer Sentiment | -0.28 | -1.04 | +0.02 | +0.15 | -2.50 | -0.83 | +0.43 | +1.33 |
| Recession Probability | -0.26 | +0.78 | -0.07 | -0.27 | +4.03 | -0.33 | -0.34 | -0.22 |
| Industrial Production | +1.31 | -1.31 | +0.76 | -1.27 | +1.22 | -1.05 | -0.39 | +0.61 |
| 3M Treasury Yield | -0.18 | +0.80 | -1.39 | +1.41 | -1.58 | -0.54 | +0.38 | +0.36 |
| 2Y Treasury Yield | -0.45 | +0.85 | -1.34 | +1.52 | -1.60 | -0.32 | +0.47 | +0.18 |
| 10Y Treasury Yield | -0.94 | +1.15 | -1.00 | +1.59 | -1.47 | +0.25 | +0.37 | -0.28 |
| M2 Money Supply | +1.50 | -0.93 | +0.77 | -1.08 | +2.09 | -0.79 | -0.63 | -0.08 |
| Capacity Utilization | -0.13 | -0.27 | -1.64 | +0.82 | -1.22 | +0.10 | +1.01 | +0.41 |
| NFCI | -0.26 | +0.38 | -0.44 | +1.07 | +2.85 | -0.96 | -0.64 | +0.08 |
| BAA Spread | -0.49 | -0.10 | +0.99 | -0.40 | +2.70 | -0.21 | -0.86 | +0.11 |
| 3m10y Spread | -1.03 | +0.28 | +0.94 | -0.13 | +0.59 | +1.25 | -0.11 | -1.00 |
| Short-10Y Spread | -0.77 | +0.21 | +1.28 | -0.54 | +0.96 | +1.27 | -0.40 | -1.00 |
Methodology
Data Sources
Indicators are sourced entirely from the FRED (St. Louis Fed) API and cover 1987 to 2008. The following series are used:
- FRED Macro Series — Real GDP, Unemployment Rate, CPI (All Urban), Federal Funds Rate, Initial Jobless Claims, Consumer Sentiment (UMich), Smoothed US Recession Probability, Industrial Production Index, 3-Month Treasury Bill, 10-Year Treasury, 2-Year Treasury (bridged with 1-Year Treasury prior to June 1976), M2 Money Supply, Total Capacity Utilization, Chicago Fed NFCI, Moody's Baa Corporate Bond Yield
- S&P 500 (FMP API) — Daily prices used solely to compute realized volatility (21-week rolling annualized std of weekly log-returns)
- Derived Features — BAA Spread (Baa − 10Y Treasury), 3m10y Spread (10Y − 3M Treasury), Short-10Y Spread (10Y − spliced 1Y/2Y Treasury)
VIX, HY OAS, IG OAS, 5-Year Breakeven Inflation, Fed Balance Sheet, retail sales, and sector performance data are not available for the full historical period and are excluded.
Weekly Aggregation
All daily series are averaged to week-ending-Friday frequency. Monthly and quarterly series (GDP, CPI, unemployment, etc.) are forward-filled up to 92 days before resampling so that every week carries the most recently released value. Weeks with fewer than 50 % of features populated are dropped. Any remaining individual NaNs are filled with the column median before clustering.
Feature Preprocessing
All 17 features are standardized to zero mean and unit variance
(sklearn.preprocessing.StandardScaler) so that indicators with
vastly different magnitudes (VIX in 10–80, GDP in $ trillions) contribute
equally to the K-Means distance metric.
Clustering Algorithm
K-Means clustering (sklearn.cluster.KMeans,
n_init=20, random_state=42) with
k = 8 clusters. The canonical
k=8 is fixed and stored as canonical regime definitions in S3.
Regime assignments use nearest-centroid (Euclidean distance) in standardized
feature space.
Cluster Labeling
Each cluster is automatically labeled using the three features whose standardized centroid values deviate most from the global mean (largest absolute z-score). A positive deviation uses the feature's descriptive direction tag (e.g. a VIX centroid well above average → "Stressed"); a negative deviation uses "Low <feature name>".
Visualization
Principal Component Analysis (PCA) projects all 17 features into two dimensions for the scatter plot. PC1 and PC2 together capture 73.4% of total variance (47.1% + 26.4%).
Comments