CrashTestYourStrategy Methodology

Verifiability

Falsifiable model claims for the synthetic stress dataset.

Synthetic stress scenarios are evaluated against their intended statistical properties. The snapshot below shows where model behavior aligns with — and where it deviates from — the operational claims published in the methodology.

32
datasets evaluated
50 total
falsifiable claims
11
off-spec (transparent)
License
CC-BY 4.0
Snapshot generated
2026-05-05
Replicas per dataset
50

Aggregated p95 and tail metrics are sample-sensitive at n=50; treat them as indicative rather than precise.

Off-band calibrations & methodology limits

The following profile-asset combinations sit outside their methodology-expected conformance bands as of the most recent snapshot. We list them here rather than quietly retrying calibration loops, because the deviations carry methodological information rather than being pure simulator deficiencies.

Two distinct phenomena, addressed honestly

Our operational gating definitions (SHARP_CRASH: ≥20% rolling-30d DD AND ≥1.5× crash-window vol; VOL_EXPANSION: median ≥1.5× baseline AND ≥2 distinct windows ≥1.5×) are intentionally asset-uniform. The off-band findings reveal two distinct phenomena that we report separately rather than collapsing into a single “asymmetry” narrative:

  1. Identifiability limits. For BTC, the methodology thresholds are not separating in practice — even the 4 historical BTC stress anchors (Crypto Winter 2018, Luna 2022, FTX 2022, ATH 2017) do not satisfy VOL_EXPANSION empirically (see verification table below). The relative-baseline definition, applied to an asset whose baseline is already 0.80 annualised, fails to single out a separable regime: real BTC stress events cluster around 1.0× baseline in 6-month statistics. This is a definition-level limit, not a simulator calibration failure.
  2. Conditional asymmetry. For other off-band combinations (e.g. WTI low_vol_grind), the simulator produces conditional distributions that do meet the gating definition partially, but at lower-than-expected frequency under fixed methodology thresholds. This is a calibration trade-off rather than a definitional limit.

Why we keep the thresholds uniform. Industry risk frameworks vary in approach: Sharpe / Sortino normalise by asset vol; Basel uses asset-class risk weights; RiskMetrics applies asset-specific EWMA decays. Asset-relative thresholds for our failure-mode classifier would give us higher conformance rates per asset — but at the cost of definitional invariance. We prioritise definitional invariance over calibration optimality so that the FM-bucket assignment of any individual replica is independent of the asset on which it was generated. The trade-off is conscious and disclosed; it is not an unavoidable property of the methodology.

A complementary diagnostic on the roadmap: we plan to add a percentile-of-own-vol annotation alongside the uniform classification (e.g. “Crypto Winter realized vol = 80th percentile of historical BTC 6-month windows”), so users can see both the asset-uniform classification and the asset-relative severity. This is not a redefinition of the score — it is a second column of information.

Empirical identifiability verification (BTC)

The strongest test of an FM definition is whether known historical stress events themselves satisfy it. Below: realized vol of BTC's 4 empirical stress anchors, under the same metric pipeline used for synthetic replicas.

BTC empirical caseTotal returnMax DDRealized vol× baselineVOL_EXPANSION-met
BTC ATH 2017+216%−35%0.921.14×no
Crypto Winter 2018−53%−66%0.831.03×no
Luna 2022−51%−55%0.600.74×no
FTX 2022+28%−26%0.450.56×no

Reading. All four BTC stress anchors fall below the 1.5× baseline threshold (1.20 absolute) required for VOL_EXPANSION classification, even though their absolute realized vols (0.45–0.92 annualised) and drawdowns (−26% to −66%) are unambiguously stress-grade. The methodology definition simply does not separate “BTC stress” from “BTC normal” given an 0.80 baseline. Our synthetic VOL_EXPANSION × BTC probe at 0% conformance is therefore a downstream symptom of the definitional limit, not a primary failure of the simulator.

Structural off-band combinations (5)

Conformance falls outside the expected band even when accounting for sampling uncertainty (Wilson 95% CI strictly below band). These are not closed by further calibration loops at fixed methodology thresholds.

Profile × AssetPrimary FMConformance (k/n)Wilson 95% CIExpectedSource of mismatch
vol_expansion_setup_synthetic × BTCVOL_EXPANSION0% (0/50)[0.00, 0.07]60–90%Definitional limit (see verification table); BTC empirical anchors themselves do not satisfy the gating
sharp_crash_setup_synthetic × BTCSHARP_CRASH0% (0/50)[0.00, 0.07]20–50%Same identifiability issue: only 1 of 4 BTC empirical anchors meets gating; threshold is not separating
liquidity_stress_setup_synthetic × BTCLIQUIDITY_STRESS0% (0/50)[0.00, 0.07]60–90%Identifiability — 0 of 4 BTC empirical anchors meets the conjunction of vol+DD+window criteria
low_vol_grind × WTIVOL_COMPRESSION10% (5/50)[0.04, 0.21]60–90%Calibration limit (not identifiability): WTI carries structural regime-shift risk; sustained 12-mo low-vol persistence is rare in any conditioning
low_vol_grind × QQQVOL_COMPRESSION46% (23/50)[0.33, 0.60]60–90%Calibration limit: QQQ tech-vol baseline is volatile; sustained 0.4–0.7×-baseline regime difficult to enforce. Wilson upper bound just touches band

Implication for robustness scores: These BTC and WTI/QQQ synthetic stress probes contribute to the spectrum-coverage of their failure-mode buckets but with reduced weight (per-bucket shrinkage when n_in_bucket < 10). The empirical historical anchors (Luna 2022, FTX 2022, BTC Crypto Winter 2018; WTI Saudi-War 2014, Negative-Pricing 2020) carry the primary information for these failure modes on these assets — the synthetic probes serve as auxiliary spectrum-fillers, not as primary evidence. For BTC specifically, this is forced by definitional limits of the FM gating (above), not by simulator deficiency.

Sampling-marginal off-band combinations (4)

For these combinations the Wilson 95% CI overlaps the expected band — we cannot statistically distinguish them from in-band conformance at n=50. Note that replicas within a profile-asset combination are not strictly i.i.d. (they share antecedent-phase parameters, calibration template, and observer biases), so the effective sample size is somewhat below 50 and the true CI is likely wider than reported. The Wilson bound here is therefore a lower bound on uncertainty.

Profile × AssetPrimary FMConformance (k/n)Wilson 95% CIExpectedCI & band
vol_expansion_setup_synthetic × QQQVOL_EXPANSION58% (29/50)[0.44, 0.71]60–90%CI overlaps band
whipsaw_synthetic × BTCWHIPSAW46% (23/50)[0.33, 0.60]50–80%CI overlaps band
whipsaw_synthetic × ETHWHIPSAW44% (22/50)[0.31, 0.58]50–80%CI overlaps band
demand_destruction × WTITREND_DOWN32% (16/50)[0.21, 0.46]40–80%CI overlaps band

Resolution path

  • BTC × 3 setup-profiles (definitional limit): we do not retry calibration. The empirical-identifiability table above shows the FM gating is not separating for BTC at the uniform threshold. For BTC the trust-layer relies primarily on empirical historical anchors (Crypto Winter, Luna, FTX); synthetic probes provide auxiliary spectrum-coverage with reduced weight via shrinkage.
  • WTI low_vol_grind + QQQ low_vol_grind (calibration limit): no further re-tuning at fixed thresholds. These probes contribute partial spectrum-coverage; empirical low-vol periods (SPY 2017, GOLD 2014) carry the primary VOL_COMPRESSION information.
  • 4 sampling-marginal cases: re-evaluation on a future n=200 generation pass. If the Wilson CI tightens and no longer overlaps the expected band, we will reclassify as structural.
  • Methodology revisions: only after anchor validation against historical stress events. Asset-relative threshold tuning is rejected because it would break definitional invariance, not because it would break comparability per se. The complementary percentile-of-own-vol diagnostic is on the roadmap as an additive disclosure, not a redefinition.

Edge case: unclassified replicas

In our current per-FM-bucket score (V2), replicas that fail every FM gating definition (sub-threshold) are excluded from the composite. This means the composite is conditioned on “identifiable regime” rather than on the full outcome distribution. We disclose this as a known bias toward stress-relevant outcomes; the planned correction is to assign sub-threshold replicas to a BASELINE bucket so they enter the composite at neutral weight. This is tracked alongside the V2 production activation work.

Methodology limitations & open issues

  • Identifiability per asset. The FM gating definitions are not guaranteed to be separating across all assets (BTC is the demonstrated case). Identifiability is verified case-by-case via the empirical-anchor table.
  • Power of test at n=50. With p̂ ≈ 0.5, the Wilson 95% CI half-width is ~14pp. Small effect sizes (≤14pp from band) are not statistically separable from sampling noise. Larger n is needed to resolve marginal cases definitively.
  • Multiple testing. 32 datasets × 1–3 FM tags ≈ 60–80 implicit hypothesis tests. We do not apply multiplicity correction to the conformance bands. At α=0.05 nominal one would expect 3–4 false-positive off-band findings under no-effect; we report 9 off-band, of which 5 are structural and 4 sampling-marginal — roughly consistent with the expected false-positive rate among the marginal subset.
  • Window-mixing in 6-month case slices. Stress events concentrated within a sub-window of the case (e.g. Volmageddon 2018 was a 1-day VIX spike inside a 6-month case window) get diluted in aggregate statistics. We chose 6-month windows for trading-strategy realism, not for FM identifiability — this is a conscious trade-off.
  • CoT calibration drift. Per-asset observer weights and biases are derived from CFTC Commitment-of-Traders reports over 2006–2026. Calibration is point-in-time; we do not currently track whether the simulator's behaviour drifts as we add later CoT data.
  • Anchor selection bias. The 31 empirical historical anchors were selected for stress severity and event-type coverage, not by random sampling of all 6-month windows. Selection favours canonical events (Lehman, Dotcom, COVID) over the full distribution of stress regimes that history actually produced.
  • No out-of-sample walk-forward validation. Robustness scores are computed in-sample relative to the curated case catalog. Strategies are not validated on held-out post-catalog periods. This is on the roadmap but not yet implemented.
  • Replica non-independence. Within a profile-asset combination, the 50 replicas share antecedent-phase parameters and calibration templates. Effective sample size is below 50 in any analysis that assumes i.i.d. replicas. Wilson CIs reported here are lower bounds on true uncertainty.
  • Score definition-dependence. Any robustness score that aggregates over FM-classified replicas inherits a structural dependence on the FM definitions themselves: refining a definition (e.g. splitting SHARP_CRASH into mild and deep variants) can shift a strategy's score even if its trading behaviour is unchanged. This is a closed-loop risk for any FM-bucketed metric. The roadmap addresses it via two changes: (i) a two-step aggregation that conserves run-level mass before projecting onto FM-buckets, and (ii) a definition-independent score component (raw drawdown distribution across all replicas, no FM-bucketing) that anchors the composite to a non-classifier-derived quantity. Currently the production composite uses the v1 path (no FM-bucketing in composite), which sidesteps the issue at the cost of not exposing per-FM-bucket detail in the score itself; the per-FM-bucket detail is reported separately in the regime-performance breakdown.

Per-dataset claim validation

Each card lists the synthetic profile, the failure-mode it claims to represent, and the result of comparing the claimed operational properties against the median across 50 Monte Carlo replicas. Expand a card for full claim text, measured values, and aggregated metric distributions.

Demand DestructionWTITREND_DOWN + VOL_EXPANSION
2 of 2 claims off-spec·50 replicas·126d stress + 250d pre-period

Imposed conditions for sustained downward trajectory with vol expansion (oil-specific).

Claim validation
Total return over case (TREND_DOWN aspect)off-spec
Claim: deep decline, methodology −25% to −60%
Measured median: -0.1071
Measured median (-0.1071) is above the claimed upper bound (-0.2000). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation).
Realized volatility (VOL_EXPANSION aspect)off-spec
Claim: ≥ 1.5× WTI baseline (~0.45); methodology gating
Measured median: 0.2821
Measured median (0.2821) is below the claimed lower bound (0.4500). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation).
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.3638-0.2664-0.10710.02710.1907
realized_vol_annualized0.24840.26760.28210.31140.3574
max_drawdown-0.4131-0.3498-0.2299-0.1827-0.1228
autocorrelation_lag1-0.1587-0.06680.02080.06510.1460
kurtosis-0.5368-0.3600-0.17690.13960.5214
skewness-0.2876-0.1341-0.01880.11470.2986
tail_p1-0.0542-0.0463-0.0414-0.0376-0.0332
tail_p990.03100.03330.03860.04350.0510
sign_change_frequency0.42740.46770.50000.52420.5573
vol_of_vol0.02540.03610.04200.05060.0617
avg_run_length1.78351.89391.98412.11862.3148
rolling_30d_max_dd-0.2610-0.2182-0.1821-0.1465-0.1160
crash_window_vol0.21060.25450.28530.32420.3823
retracement_from_trough0.00000.03170.15400.44671.1719
sign_changes_5pct_count4.00006.25009.000010.000012.0000
Source file: demand_destruction_wti.json
Pre-period regime: demand_weakness
Sample seeds (first 5): 7308, 7309, 7310, 7311, 7312
Hyperinflation BoostGOLDTREND_UP + VOL_EXPANSION
1 of 2 claim off-spec·50 replicas·126d stress + 250d pre-period

Imposed conditions for persistent upward advance with elevated volatility (gold-specific).

Claim validation
Total return over case (TREND_UP aspect)in-band
Claim: sustained advance, methodology +8% to +25% (annualized 4-6mo)
Measured median: 0.1376
within claimed range
Realized volatility (VOL_EXPANSION aspect)off-spec
Claim: ≥ 1.5× GOLD baseline (~0.20); methodology gating
Measured median: 0.1449
Measured median (0.1449) is below the claimed lower bound (0.1950). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation).
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.02810.05490.13760.21960.3741
realized_vol_annualized0.12530.13540.14490.15750.1752
max_drawdown-0.1421-0.0935-0.0726-0.0568-0.0365
autocorrelation_lag1-0.1761-0.0852-0.02940.03070.1078
kurtosis-0.5930-0.3644-0.09400.21220.8336
skewness-0.3112-0.13560.01430.19490.3683
tail_p1-0.0250-0.0215-0.0191-0.0168-0.0140
tail_p990.01740.01950.02100.02300.0254
sign_change_frequency0.41940.47580.52020.54030.5976
vol_of_vol0.01360.01720.02030.02450.0283
avg_run_length1.66471.83821.90852.08332.3585
rolling_30d_max_dd-0.1111-0.0830-0.0670-0.0566-0.0365
crash_window_vol0.11570.13000.14100.16690.1885
retracement_from_trough0.01090.36801.07312.36184.3028
sign_changes_5pct_count1.00002.00002.50003.00005.0000
Source file: hyperinflation_boost_gold.json
Pre-period regime: range
Sample seeds (first 5): 6092, 6093, 6094, 6095, 6096
Liquidity Stress Setup SyntheticBTC
all claims in-band·50 replicas·126d stress + 250d pre-period

Claim validation
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.4457-0.21350.07650.27470.6941
realized_vol_annualized0.34630.36990.41830.45900.5187
max_drawdown-0.5649-0.3470-0.2403-0.1785-0.1267
autocorrelation_lag1-0.1435-0.0663-0.00560.05370.1960
kurtosis-0.6070-0.26160.01750.20460.8475
skewness-0.3902-0.13290.00230.14550.4234
tail_p1-0.0758-0.0620-0.0574-0.0517-0.0443
tail_p990.04570.05090.05830.06550.0746
sign_change_frequency0.41940.45970.49600.52420.5815
vol_of_vol0.03770.04650.05570.06600.0827
avg_run_length1.71031.89392.00012.15522.3585
rolling_30d_max_dd-0.3826-0.2596-0.2131-0.1752-0.1267
crash_window_vol0.31870.36280.39970.46600.5181
retracement_from_trough0.00000.07180.45980.89692.7221
sign_changes_5pct_count8.450011.000013.000015.000019.1000
Source file: liquidity_stress_setup_synthetic_btc.json
Pre-period regime: high_vol_sideways
Sample seeds (first 5): 20800, 20801, 20802, 20803, 20804
Liquidity Stress Setup SyntheticSPY
all claims in-band·50 replicas·126d stress + 250d pre-period

Claim validation
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.4537-0.2169-0.01700.11490.7288
realized_vol_annualized0.33930.37850.42160.45290.4975
max_drawdown-0.5101-0.3765-0.2697-0.1981-0.1468
autocorrelation_lag1-0.1324-0.0690-0.01880.03920.1063
kurtosis-0.4738-0.3443-0.02240.19950.5106
skewness-0.3453-0.1723-0.09010.07400.1730
tail_p1-0.0726-0.0652-0.0588-0.0524-0.0461
tail_p990.04090.04930.05610.06290.0704
sign_change_frequency0.43550.47780.50400.53230.5770
vol_of_vol0.03620.04630.05680.06440.0815
avg_run_length1.72301.86571.96862.07482.2727
rolling_30d_max_dd-0.3223-0.2847-0.2367-0.1851-0.1465
crash_window_vol0.31950.35400.40880.46120.5454
retracement_from_trough0.00060.09010.38390.74812.7293
sign_changes_5pct_count7.000012.000013.000015.000018.5500
Source file: liquidity_stress_setup_synthetic_spy.json
Pre-period regime: bull_trend
Sample seeds (first 5): 20700, 20701, 20702, 20703, 20704
Liquidity Stress Setup SyntheticWTI
all claims in-band·50 replicas·126d stress + 250d pre-period

Claim validation
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.4856-0.3045-0.14960.18410.6619
realized_vol_annualized0.44810.50110.54180.59360.6440
max_drawdown-0.5764-0.4572-0.3729-0.2658-0.1819
autocorrelation_lag1-0.1613-0.1107-0.02870.04770.1291
kurtosis-0.5205-0.2694-0.10250.22440.8505
skewness-0.4631-0.1165-0.00450.11690.3580
tail_p1-0.0979-0.0861-0.0760-0.0679-0.0600
tail_p990.05630.06550.07430.08670.0988
sign_change_frequency0.42740.47580.50810.54030.6048
vol_of_vol0.05090.06070.07460.08970.1112
avg_run_length1.64471.83821.95362.08332.3148
rolling_30d_max_dd-0.4249-0.3471-0.2941-0.2413-0.1819
crash_window_vol0.37350.47430.53550.58490.6485
retracement_from_trough0.00000.08660.23920.78001.4794
sign_changes_5pct_count10.900016.000019.000021.000025.0000
Source file: liquidity_stress_setup_synthetic_wti.json
Pre-period regime: range_high_vol
Sample seeds (first 5): 22200, 22201, 22202, 22203, 22204
Low Vol GrindGOLDVOL_COMPRESSION
all claims in-band·50 replicas·252d stress + 250d pre-period

Sustained suppression of realized volatility over multiple months.

Claim validation
Realized volatility (annualized)in-band
Claim: 0.4–0.7× GOLD baseline (~0.05–0.09)
Measured median: 0.0709
within claimed range
Maximum drawdown (median)in-band
Claim: shallow drawdowns < 8% (non-VIX) per methodology
Measured median: -0.0508
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.05040.01370.09630.12980.1969
realized_vol_annualized0.05830.06470.07090.07630.0846
max_drawdown-0.1040-0.0659-0.0508-0.0408-0.0300
autocorrelation_lag1-0.0898-0.04040.00880.05340.0976
kurtosis-0.3890-0.2088-0.08240.06380.3299
skewness-0.2809-0.1231-0.00030.08130.1564
tail_p1-0.0123-0.0107-0.0100-0.0089-0.0080
tail_p990.00830.00940.01020.01110.0127
sign_change_frequency0.43780.46500.49200.51500.5400
vol_of_vol0.00730.00890.01050.01210.0140
avg_run_length1.84561.93452.02422.14082.2726
rolling_30d_max_dd-0.0684-0.0491-0.0429-0.0359-0.0284
crash_window_vol0.05100.06250.07120.07950.0870
retracement_from_trough0.07590.26291.08172.03603.2660
sign_changes_5pct_count0.00001.00001.00002.00003.0000
Source file: low_vol_grind_gold.json
Pre-period regime: range
Sample seeds (first 5): 1884, 1885, 1886, 1887, 1888
Low Vol GrindQQQVOL_COMPRESSION
all claims in-band·50 replicas·252d stress + 250d pre-period

Sustained suppression of realized volatility over multiple months.

Claim validation
Realized volatility (annualized)in-band
Claim: 0.4–0.7× QQQ baseline (~0.08–0.15)
Measured median: 0.0954
within claimed range
Maximum drawdown (median)in-band
Claim: shallow drawdowns < 8% (non-VIX) per methodology
Measured median: -0.0731
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.1133-0.00530.09570.20440.2793
realized_vol_annualized0.08020.08660.09540.10280.1085
max_drawdown-0.1488-0.1037-0.0731-0.0488-0.0379
autocorrelation_lag1-0.1286-0.0463-0.01280.01350.1088
kurtosis-0.4538-0.2102-0.02540.10770.5295
skewness-0.2475-0.10460.00430.10780.2091
tail_p1-0.0157-0.0149-0.0129-0.0120-0.0103
tail_p990.01130.01280.01380.01510.0172
sign_change_frequency0.45600.47600.50000.52300.5502
vol_of_vol0.00910.01240.01380.01590.0182
avg_run_length1.81161.90511.99212.09172.1826
rolling_30d_max_dd-0.0901-0.0766-0.0575-0.0469-0.0357
crash_window_vol0.06610.08270.09580.10530.1163
retracement_from_trough0.03190.17550.78922.08974.6784
sign_changes_5pct_count1.00001.00002.00003.00005.0000
Source file: low_vol_grind_qqq.json
Pre-period regime: bull_trend
Sample seeds (first 5): 21700, 21701, 21702, 21703, 21704
Low Vol GrindSPYVOL_COMPRESSION
all claims in-band·50 replicas·252d stress + 250d pre-period

Sustained suppression of realized volatility over multiple months.

Claim validation
Realized volatility (annualized)in-band
Claim: 0.4–0.7× SPY baseline (~0.06–0.11)
Measured median: 0.0724
within claimed range
Maximum drawdown (median)in-band
Claim: shallow drawdowns < 8% (non-VIX) per methodology
Measured median: -0.0476
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.05510.06010.14890.21590.3480
realized_vol_annualized0.05890.06670.07240.07980.0851
max_drawdown-0.1120-0.0635-0.0476-0.0360-0.0229
autocorrelation_lag1-0.1053-0.0474-0.01690.01610.0807
kurtosis-0.4547-0.2024-0.03850.14920.7953
skewness-0.2613-0.1237-0.02520.10230.2461
tail_p1-0.0119-0.0109-0.0097-0.0086-0.0078
tail_p990.00800.00970.01120.01210.0130
sign_change_frequency0.46980.47700.50400.52800.5524
vol_of_vol0.00810.00980.01060.01190.0145
avg_run_length1.80461.88721.97642.08732.1191
rolling_30d_max_dd-0.0701-0.0547-0.0421-0.0329-0.0229
crash_window_vol0.05270.06240.07480.08270.0938
retracement_from_trough0.02530.29501.54353.609411.1908
sign_changes_5pct_count1.00001.00001.00002.00003.0000
Source file: low_vol_grind_spy.json
Pre-period regime: bull_trend
Sample seeds (first 5): 1805, 1806, 1807, 1808, 1809
Low Vol GrindWTIVOL_COMPRESSION
1 of 2 claim off-spec·50 replicas·252d stress + 250d pre-period

Sustained suppression of realized volatility over multiple months.

Claim validation
Realized volatility (annualized)in-band
Claim: 0.4–0.7× WTI baseline (~0.12–0.21)
Measured median: 0.1209
within claimed range
Maximum drawdown (median)off-spec
Claim: shallow drawdowns < 8% (non-VIX) per methodology
Measured median: -0.0932
Measured median (-0.0932) is below the claimed lower bound (-0.0800). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation).
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.15670.03160.14420.22620.4273
realized_vol_annualized0.10150.10900.12090.13310.1457
max_drawdown-0.2070-0.1256-0.0932-0.0753-0.0499
autocorrelation_lag1-0.0906-0.04370.00220.04740.0985
kurtosis-0.3226-0.1610-0.05390.20600.5123
skewness-0.2224-0.08600.01380.10500.2223
tail_p1-0.0208-0.0192-0.0168-0.0153-0.0140
tail_p990.01370.01560.01790.01980.0226
sign_change_frequency0.45380.48000.50000.52000.5480
vol_of_vol0.01280.01580.01800.02030.0242
avg_run_length1.81881.91601.99212.07442.1931
rolling_30d_max_dd-0.1053-0.0883-0.0798-0.0665-0.0471
crash_window_vol0.08510.10270.11850.14190.1529
retracement_from_trough0.02730.26380.96852.00364.8065
sign_changes_5pct_count1.00003.00005.00005.00006.0000
Source file: low_vol_grind_wti.json
Pre-period regime: demand_weakness
Sample seeds (first 5): 21800, 21801, 21802, 21803, 21804
Sharp Crash Setup SyntheticBTCSHARP_CRASH (conditional)
1 of 2 claim off-spec·50 replicas·126d stress + 250d pre-period

Imposed conditions for an institutional risk-off shock — agent-based simulator produces a spectrum of outcomes; per-replica conformance to SHARP_CRASH gating (≥20% rolling-30d DD AND ≥1.5× crash-window vol) varies in the 20-50% band per methodology expectations.

Claim validation
Realized volatility over case (descriptive)off-spec
Claim: elevated case-wide vol; aggregate proxy for crash-window vol-spike
Measured median: 0.2949
Measured median (0.2949) is below the claimed lower bound (0.8000). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation). (crypto-asset; relative-to-baseline criterion)
Excess kurtosis (tail-modality descriptive)in-band
Claim: fat-tailed return distribution typical of SHARP_CRASH events; ≥ 1.5 indicates tail-intensified sub-classification
Measured median: 5.9362
within claimed range (crypto-asset; relative-to-baseline criterion)
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.6619-0.5041-0.01090.16270.4066
realized_vol_annualized0.22480.26000.29490.32160.3859
max_drawdown-0.6877-0.5595-0.2641-0.1426-0.0921
autocorrelation_lag1-0.2024-0.0936-0.01520.12730.1994
kurtosis2.58544.41525.93627.904110.3334
skewness-1.3412-0.8534-0.22750.90281.6424
tail_p1-0.0906-0.0699-0.0578-0.0488-0.0381
tail_p990.03400.03940.05560.06760.0885
sign_change_frequency0.31810.38100.44350.50000.5403
vol_of_vol0.11770.14550.17880.20190.2643
avg_run_length1.83821.98412.23212.59093.0907
rolling_30d_max_dd-0.4272-0.3282-0.2458-0.1426-0.0921
crash_window_vol0.17530.34620.45130.54580.6103
retracement_from_trough0.00000.00000.24540.79193.2848
sign_changes_5pct_count2.00004.00005.00007.00008.5500
Source file: sharp_crash_setup_synthetic_btc.json
Pre-period regime: parabolic_bull
Sample seeds (first 5): 20200, 20201, 20202, 20203, 20204
Sharp Crash Setup SyntheticQQQSHARP_CRASH (conditional)
all claims in-band·50 replicas·126d stress + 250d pre-period

Imposed conditions for an institutional risk-off shock — agent-based simulator produces a spectrum of outcomes; per-replica conformance to SHARP_CRASH gating (≥20% rolling-30d DD AND ≥1.5× crash-window vol) varies in the 20-50% band per methodology expectations.

Claim validation
Realized volatility over case (descriptive)in-band
Claim: elevated case-wide vol; aggregate proxy for crash-window vol-spike
Measured median: 0.3133
within claimed range
Excess kurtosis (tail-modality descriptive)in-band
Claim: fat-tailed return distribution typical of SHARP_CRASH events; ≥ 1.5 indicates tail-intensified sub-classification
Measured median: 6.2548
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.6899-0.4724-0.17760.20330.5351
realized_vol_annualized0.24180.28000.31330.34920.3967
max_drawdown-0.7075-0.4974-0.2775-0.1414-0.0766
autocorrelation_lag1-0.2526-0.0697-0.01190.09760.2633
kurtosis3.30194.97956.25487.274010.2887
skewness-1.8674-1.0467-0.22100.47571.4983
tail_p1-0.0914-0.0750-0.0647-0.0544-0.0318
tail_p990.02640.04680.05430.06670.0825
sign_change_frequency0.32620.38100.45160.50810.5484
vol_of_vol0.13750.16740.18830.21930.2594
avg_run_length1.81161.95312.19302.59093.0161
rolling_30d_max_dd-0.4632-0.3198-0.2396-0.1386-0.0727
crash_window_vol0.12510.32610.47600.56370.6755
retracement_from_trough0.00000.00000.17230.71102.5357
sign_changes_5pct_count3.00005.00006.00008.750011.0000
Source file: sharp_crash_setup_synthetic_qqq.json
Pre-period regime: sideways_low_vol
Sample seeds (first 5): 20100, 20101, 20102, 20103, 20104
Sharp Crash Setup SyntheticSPYSHARP_CRASH (conditional)
all claims in-band·50 replicas·126d stress + 250d pre-period

Imposed conditions for an institutional risk-off shock — agent-based simulator produces a spectrum of outcomes; per-replica conformance to SHARP_CRASH gating (≥20% rolling-30d DD AND ≥1.5× crash-window vol) varies in the 20-50% band per methodology expectations.

Claim validation
Realized volatility over case (descriptive)in-band
Claim: elevated case-wide vol; aggregate proxy for crash-window vol-spike
Measured median: 0.2878
within claimed range
Excess kurtosis (tail-modality descriptive)in-band
Claim: fat-tailed return distribution typical of SHARP_CRASH events; ≥ 1.5 indicates tail-intensified sub-classification
Measured median: 5.7450
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.5786-0.32130.04870.22600.4576
realized_vol_annualized0.21510.26410.28780.31950.3709
max_drawdown-0.5994-0.4070-0.1733-0.1287-0.0758
autocorrelation_lag1-0.2870-0.09410.04700.15660.2519
kurtosis2.92154.20955.74507.263611.9259
skewness-1.4953-0.5312-0.01150.65291.7288
tail_p1-0.0810-0.0643-0.0532-0.0455-0.0320
tail_p990.03540.04620.05630.06100.0738
sign_change_frequency0.31450.42940.47580.51410.5565
vol_of_vol0.11990.14440.17940.20220.2409
avg_run_length1.78571.93062.08332.30433.1250
rolling_30d_max_dd-0.4193-0.2842-0.1716-0.1278-0.0758
crash_window_vol0.13740.29770.40820.50530.6430
retracement_from_trough0.00000.01820.42041.14173.5336
sign_changes_5pct_count3.00005.00005.00007.00009.0000
Source file: sharp_crash_setup_synthetic_spy.json
Pre-period regime: bull_trend
Sample seeds (first 5): 20000, 20001, 20002, 20003, 20004
Slow Crash No Recovery SyntheticGOLDSLOW_BEAR + TREND_DOWN (adversarial)
2 of 2 claims off-spec·50 replicas·126d stress + 250d pre-period

Adversarial probe: amplified bear drift + sustained dip-buyer suppression vetoes the rebounds present in real historical slow bears. Tests strategies under conditions strict-historically rare. Expected SLOW_BEAR-conformance 40-80%.

Claim validation
Total return over caseoff-spec
Claim: persistent decline with no rebound, methodology −25% to −60%
Measured median: -0.2158
Measured median (-0.2158) is above the claimed upper bound (-0.2500). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation).
Maximum drawdown (median)off-spec
Claim: drawdowns deepen monotonically; methodology −25% to −70%
Measured median: -0.2231
Measured median (-0.2231) is above the claimed upper bound (-0.2500). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation).
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.4264-0.3362-0.2158-0.12130.0026
realized_vol_annualized0.13850.14900.15730.16260.1815
max_drawdown-0.4384-0.3493-0.2231-0.1803-0.1002
autocorrelation_lag1-0.1772-0.0737-0.02320.05380.1158
kurtosis-0.6404-0.2935-0.07810.09340.4456
skewness-0.2781-0.11150.01080.14260.2619
tail_p1-0.0287-0.0252-0.0236-0.0220-0.0198
tail_p990.01550.01760.01980.02150.0240
sign_change_frequency0.39880.46770.49190.52020.5565
vol_of_vol0.01260.01910.02250.02660.0306
avg_run_length1.78571.90872.01612.11862.4779
rolling_30d_max_dd-0.2112-0.1752-0.1384-0.1131-0.0779
crash_window_vol0.11840.13960.15270.16470.1920
retracement_from_trough0.00000.00690.02180.09130.3853
sign_changes_5pct_count1.00001.00002.50003.00005.0000
Source file: slow_crash_no_recovery_synthetic_gold.json
Pre-period regime: inflationary_upcycle
Sample seeds (first 5): 21600, 21601, 21602, 21603, 21604
Slow Crash No Recovery SyntheticQQQSLOW_BEAR + TREND_DOWN (adversarial)
all claims in-band·50 replicas·126d stress + 250d pre-period

Adversarial probe: amplified bear drift + sustained dip-buyer suppression vetoes the rebounds present in real historical slow bears. Tests strategies under conditions strict-historically rare. Expected SLOW_BEAR-conformance 40-80%.

Claim validation
Total return over casein-band
Claim: persistent decline with no rebound, methodology −25% to −60%
Measured median: -0.2469
within claimed range
Maximum drawdown (median)in-band
Claim: drawdowns deepen monotonically; methodology −25% to −70%
Measured median: -0.2634
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.4081-0.3323-0.2469-0.1530-0.0052
realized_vol_annualized0.13380.14880.15890.16750.1789
max_drawdown-0.4236-0.3393-0.2634-0.2067-0.1214
autocorrelation_lag1-0.1658-0.06700.00920.06360.1257
kurtosis-0.6537-0.3536-0.09360.15080.6021
skewness-0.3088-0.08320.04940.22170.3738
tail_p1-0.0280-0.0257-0.0237-0.0210-0.0194
tail_p990.01360.01780.01940.02310.0255
sign_change_frequency0.38710.45160.48390.50600.5565
vol_of_vol0.01260.01760.02040.02640.0294
avg_run_length1.78571.96092.04922.19302.5510
rolling_30d_max_dd-0.2044-0.1699-0.1389-0.1209-0.0933
crash_window_vol0.11770.14330.15910.16990.1838
retracement_from_trough0.00000.00350.04470.09230.7401
sign_changes_5pct_count1.00001.00003.00003.00005.0000
Source file: slow_crash_no_recovery_synthetic_qqq.json
Pre-period regime: sideways_low_vol
Sample seeds (first 5): 21500, 21501, 21502, 21503, 21504
Slow Crash No Recovery SyntheticSPYSLOW_BEAR + TREND_DOWN (adversarial)
1 of 2 claim off-spec·50 replicas·126d stress + 250d pre-period

Adversarial probe: amplified bear drift + sustained dip-buyer suppression vetoes the rebounds present in real historical slow bears. Tests strategies under conditions strict-historically rare. Expected SLOW_BEAR-conformance 40-80%.

Claim validation
Total return over caseoff-spec
Claim: persistent decline with no rebound, methodology −25% to −60%
Measured median: -0.2113
Measured median (-0.2113) is above the claimed upper bound (-0.2500). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation).
Maximum drawdown (median)in-band
Claim: drawdowns deepen monotonically; methodology −25% to −70%
Measured median: -0.2518
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.4951-0.3244-0.2113-0.1576-0.0381
realized_vol_annualized0.13790.14640.15880.17140.1826
max_drawdown-0.5008-0.3601-0.2518-0.1928-0.0934
autocorrelation_lag1-0.2217-0.1024-0.00680.05420.1254
kurtosis-0.5522-0.3238-0.16020.07450.5375
skewness-0.3151-0.1329-0.00500.15190.4396
tail_p1-0.0298-0.0251-0.0234-0.0221-0.0196
tail_p990.01600.01720.01920.02100.0243
sign_change_frequency0.38710.43150.47580.53020.5726
vol_of_vol0.01300.01870.02040.02660.0323
avg_run_length1.73611.87272.08332.29422.5510
rolling_30d_max_dd-0.2215-0.1814-0.1411-0.1198-0.0771
crash_window_vol0.12670.14290.15620.17370.1976
retracement_from_trough0.00000.00000.02650.10430.4142
sign_changes_5pct_count1.00001.00003.00003.00005.0000
Source file: slow_crash_no_recovery_synthetic_spy.json
Pre-period regime: bull_trend
Sample seeds (first 5): 21400, 21401, 21402, 21403, 21404
Slow Decline With Partial RecoveryGOLDSIDEWAYS-with-bearish-bias (mixed FM ex-post)
all claims in-band·50 replicas·126d stress + 250d pre-period

Imposed conditions for moderate bear regime where mid-period rebounds are permitted. Per-replica ex-post FM distribution: primarily SIDEWAYS (42-64%) + WHIPSAW (22-38%) + scattered SLOW_BEAR/TREND_DOWN. The mild bearish drift produces sideways-with-bear-bias rather than full SLOW_BEAR — empirical characterization, not a profile failure.

Claim validation
Total return over casein-band
Claim: moderate decline with mid-period rebounds permitted, typically −5% to −20%
Measured median: -0.1107
within claimed range
Maximum drawdown (median)in-band
Claim: moderate drawdowns, typically −10% to −25%
Measured median: -0.1447
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.2390-0.1663-0.1107-0.05460.0623
realized_vol_annualized0.11770.12520.13940.14940.1648
max_drawdown-0.2564-0.1861-0.1447-0.1060-0.0631
autocorrelation_lag1-0.1353-0.0590-0.02070.02670.1264
kurtosis-0.5161-0.3360-0.09480.24110.7237
skewness-0.3890-0.16960.00370.16970.2720
tail_p1-0.0247-0.0216-0.0205-0.0182-0.0159
tail_p990.01340.01710.01830.01980.0239
sign_change_frequency0.40680.45970.49190.51610.5609
vol_of_vol0.01220.01550.01980.02410.0304
avg_run_length1.77191.92312.01662.15522.4298
rolling_30d_max_dd-0.1502-0.1132-0.0946-0.0820-0.0567
crash_window_vol0.09590.11260.13330.15480.1824
retracement_from_trough0.00000.00000.07330.21251.3921
sign_changes_5pct_count1.00001.00003.00003.00004.5500
Source file: slow_decline_with_partial_recovery_gold.json
Pre-period regime: demand_weakness
Sample seeds (first 5): 2526, 2527, 2528, 2529, 2530
Slow Decline With Partial RecoveryQQQSIDEWAYS-with-bearish-bias (mixed FM ex-post)
all claims in-band·50 replicas·126d stress + 250d pre-period

Imposed conditions for moderate bear regime where mid-period rebounds are permitted. Per-replica ex-post FM distribution: primarily SIDEWAYS (42-64%) + WHIPSAW (22-38%) + scattered SLOW_BEAR/TREND_DOWN. The mild bearish drift produces sideways-with-bear-bias rather than full SLOW_BEAR — empirical characterization, not a profile failure.

Claim validation
Total return over casein-band
Claim: moderate decline with mid-period rebounds permitted, typically −5% to −20%
Measured median: -0.0790
within claimed range
Maximum drawdown (median)in-band
Claim: moderate drawdowns, typically −10% to −25%
Measured median: -0.1326
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.2068-0.1428-0.07900.00550.0961
realized_vol_annualized0.11920.12680.13810.14480.1560
max_drawdown-0.2267-0.1862-0.1326-0.0825-0.0651
autocorrelation_lag1-0.1394-0.0526-0.00150.04000.1528
kurtosis-0.5297-0.2771-0.03000.30430.7323
skewness-0.2455-0.0917-0.00160.15190.2956
tail_p1-0.0238-0.0205-0.0193-0.0176-0.0157
tail_p990.01500.01680.01810.01960.0239
sign_change_frequency0.43910.46770.50000.53230.5887
vol_of_vol0.01110.01500.01900.02240.0260
avg_run_length1.68921.86571.98412.11862.2545
rolling_30d_max_dd-0.1420-0.1128-0.0954-0.0727-0.0617
crash_window_vol0.11090.12240.13810.15240.1701
retracement_from_trough0.00000.04360.15480.40531.3773
sign_changes_5pct_count1.00002.00002.00003.00005.0000
Source file: slow_decline_with_partial_recovery_qqq.json
Pre-period regime: sideways_low_vol
Sample seeds (first 5): 3099, 3100, 3101, 3102, 3103
Slow Decline With Partial RecoverySPYSIDEWAYS-with-bearish-bias (mixed FM ex-post)
all claims in-band·50 replicas·126d stress + 250d pre-period

Imposed conditions for moderate bear regime where mid-period rebounds are permitted. Per-replica ex-post FM distribution: primarily SIDEWAYS (42-64%) + WHIPSAW (22-38%) + scattered SLOW_BEAR/TREND_DOWN. The mild bearish drift produces sideways-with-bear-bias rather than full SLOW_BEAR — empirical characterization, not a profile failure.

Claim validation
Total return over casein-band
Claim: moderate decline with mid-period rebounds permitted, typically −5% to −20%
Measured median: -0.0732
within claimed range
Maximum drawdown (median)in-band
Claim: moderate drawdowns, typically −10% to −25%
Measured median: -0.1266
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.2000-0.1273-0.0732-0.01970.0582
realized_vol_annualized0.12360.13130.13920.15490.1633
max_drawdown-0.2268-0.1644-0.1266-0.0976-0.0695
autocorrelation_lag1-0.1567-0.0916-0.00480.03480.1067
kurtosis-0.6124-0.2530-0.05950.13350.5513
skewness-0.3220-0.12670.08690.22820.3996
tail_p1-0.0254-0.0215-0.0197-0.0179-0.0163
tail_p990.01470.01750.01960.02070.0228
sign_change_frequency0.42740.47580.50000.54640.5806
vol_of_vol0.01260.01490.01810.02090.0281
avg_run_length1.71231.81821.98412.08332.3148
rolling_30d_max_dd-0.1320-0.1154-0.0967-0.0841-0.0565
crash_window_vol0.10590.12400.13690.14850.1776
retracement_from_trough0.00000.07670.23110.41310.9960
sign_changes_5pct_count1.00002.00003.00004.00005.5500
Source file: slow_decline_with_partial_recovery_spy.json
Pre-period regime: weak_bear
Sample seeds (first 5): 2447, 2448, 2449, 2450, 2451
Slow StagflationQQQSLOW_BEAR + VOL_EXPANSION
1 of 2 claim off-spec·50 replicas·126d stress + 250d pre-period

Imposed conditions for persistent decline combined with elevated realized volatility.

Claim validation
Total return over case (SLOW_BEAR aspect)off-spec
Claim: moderate-to-deep decline, methodology −25% to −60%
Measured median: -0.1522
Measured median (-0.1522) is above the claimed upper bound (-0.2000). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation).
Realized volatility (VOL_EXPANSION aspect)in-band
Claim: ≥ 1.5× QQQ baseline (~0.32); methodology gating
Measured median: 0.3671
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.4128-0.2403-0.15220.04180.2273
realized_vol_annualized0.29400.33750.36710.39140.4126
max_drawdown-0.4523-0.3794-0.2951-0.2113-0.1541
autocorrelation_lag1-0.1473-0.05440.00690.07190.1179
kurtosis-0.5811-0.2699-0.02840.16530.6724
skewness-0.3003-0.13530.09710.29550.4440
tail_p1-0.0627-0.0570-0.0502-0.0456-0.0366
tail_p990.03720.04420.04820.05580.0643
sign_change_frequency0.43020.45970.48390.52420.5734
vol_of_vol0.03180.03780.05130.06150.0773
avg_run_length1.73401.89392.04922.15522.3016
rolling_30d_max_dd-0.3388-0.2564-0.2298-0.1895-0.1485
crash_window_vol0.26740.30970.35600.40040.4372
retracement_from_trough0.00000.05130.15440.43081.8010
sign_changes_5pct_count6.00009.250011.500013.000015.5500
Source file: slow_stagflation_qqq.json
Pre-period regime: sideways_low_vol
Sample seeds (first 5): 9334, 9335, 9336, 9337, 9338
Slow StagflationSPYSLOW_BEAR + VOL_EXPANSION
1 of 2 claim off-spec·50 replicas·126d stress + 250d pre-period

Imposed conditions for persistent decline combined with elevated realized volatility.

Claim validation
Total return over case (SLOW_BEAR aspect)off-spec
Claim: moderate-to-deep decline, methodology −25% to −60%
Measured median: -0.0588
Measured median (-0.0588) is above the claimed upper bound (-0.2000). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation).
Realized volatility (VOL_EXPANSION aspect)in-band
Claim: ≥ 1.5× SPY baseline (~0.24); methodology gating
Measured median: 0.2776
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.2968-0.2082-0.05880.12260.2725
realized_vol_annualized0.20810.25450.27760.29770.3206
max_drawdown-0.3656-0.2832-0.2119-0.1472-0.1004
autocorrelation_lag1-0.1156-0.0555-0.01260.04820.1211
kurtosis-0.6120-0.4353-0.10540.13220.4153
skewness-0.3063-0.10600.01940.12700.2362
tail_p1-0.0477-0.0425-0.0378-0.0343-0.0265
tail_p990.02740.03360.03640.04150.0459
sign_change_frequency0.42300.45970.50000.53230.5689
vol_of_vol0.02050.02910.03590.04020.0538
avg_run_length1.74711.86571.98412.15522.3388
rolling_30d_max_dd-0.2700-0.2143-0.1713-0.1279-0.1004
crash_window_vol0.19470.24050.26570.30040.3437
retracement_from_trough0.00000.11310.21070.74351.9871
sign_changes_5pct_count4.00006.00007.000010.000011.0000
Source file: slow_stagflation_spy.json
Pre-period regime: sideways_low_vol
Sample seeds (first 5): 22000, 22001, 22002, 22003, 22004
V Recovery Setup SyntheticBTC
all claims in-band·50 replicas·126d stress + 250d pre-period

Claim validation
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.5557-0.37110.18450.37900.5878
realized_vol_annualized0.16990.18880.20680.22930.2600
max_drawdown-0.5596-0.4274-0.1458-0.0961-0.0533
autocorrelation_lag1-0.2268-0.1045-0.00490.08560.2376
kurtosis0.78052.25803.25844.86916.7696
skewness-1.3674-0.7778-0.21140.15441.0544
tail_p1-0.0587-0.0458-0.0399-0.0323-0.0224
tail_p990.01760.02850.03380.04210.0584
sign_change_frequency0.35480.39520.44350.48390.5484
vol_of_vol0.05750.08080.10210.12670.1521
avg_run_length1.81162.04922.23212.50002.7778
rolling_30d_max_dd-0.3192-0.2373-0.1440-0.0950-0.0533
crash_window_vol0.12170.24510.29500.36420.4370
retracement_from_trough0.00000.00301.38363.36467.8972
sign_changes_5pct_count1.00002.00003.00005.00006.5500
Source file: v_recovery_setup_synthetic_btc.json
Pre-period regime: parabolic_bull
Sample seeds (first 5): 21200, 21201, 21202, 21203, 21204
V Recovery Setup SyntheticGOLD
all claims in-band·50 replicas·126d stress + 250d pre-period

Claim validation
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.5619-0.12880.12830.27940.6204
realized_vol_annualized0.16340.20040.22130.24430.2766
max_drawdown-0.5789-0.2796-0.1550-0.1006-0.0605
autocorrelation_lag1-0.2069-0.05410.01900.14100.2051
kurtosis1.63012.89553.70895.48118.5521
skewness-1.6644-0.8615-0.34090.24220.8821
tail_p1-0.0635-0.0485-0.0408-0.0331-0.0265
tail_p990.01900.02920.03520.04540.0542
sign_change_frequency0.35040.41130.46770.50000.5250
vol_of_vol0.06330.08880.11150.13410.1616
avg_run_length1.89151.98412.11862.40382.8125
rolling_30d_max_dd-0.3516-0.2645-0.1429-0.1006-0.0605
crash_window_vol0.14960.26350.30200.36710.4475
retracement_from_trough0.00000.25781.62572.46246.6188
sign_changes_5pct_count1.00003.00004.00005.00007.5500
Source file: v_recovery_setup_synthetic_gold.json
Pre-period regime: inflationary_upcycle
Sample seeds (first 5): 21300, 21301, 21302, 21303, 21304
V Recovery Setup SyntheticQQQ
all claims in-band·50 replicas·126d stress + 250d pre-period

Claim validation
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.5071-0.42920.09170.24100.6022
realized_vol_annualized0.16340.18830.21560.24160.2673
max_drawdown-0.5310-0.4513-0.1638-0.1048-0.0640
autocorrelation_lag1-0.1680-0.03840.01530.09930.2214
kurtosis1.50962.72703.94815.91409.0464
skewness-1.5279-0.6057-0.18820.35811.1029
tail_p1-0.0573-0.0454-0.0376-0.0338-0.0247
tail_p990.02010.02740.03770.04240.0499
sign_change_frequency0.33350.39520.45160.50810.5448
vol_of_vol0.06530.08640.10870.12970.1442
avg_run_length1.82361.95312.19302.50002.9552
rolling_30d_max_dd-0.2969-0.2642-0.1586-0.0993-0.0617
crash_window_vol0.11630.22930.29890.35160.4481
retracement_from_trough0.00000.00410.96322.36854.8157
sign_changes_5pct_count1.00003.00003.00005.00006.0000
Source file: v_recovery_setup_synthetic_qqq.json
Pre-period regime: bull_trend
Sample seeds (first 5): 21100, 21101, 21102, 21103, 21104
V Recovery Setup SyntheticSPY
all claims in-band·50 replicas·126d stress + 250d pre-period

Claim validation
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.5756-0.46960.04090.20600.4113
realized_vol_annualized0.17510.20710.23100.24820.2891
max_drawdown-0.5997-0.4849-0.1776-0.1094-0.0586
autocorrelation_lag1-0.2204-0.06580.02290.09740.2472
kurtosis1.37452.83373.75824.76677.3036
skewness-1.4454-0.7465-0.27140.14800.7863
tail_p1-0.0630-0.0507-0.0442-0.0328-0.0238
tail_p990.02220.03010.03620.04280.0548
sign_change_frequency0.34600.41130.45970.49190.5565
vol_of_vol0.07370.09940.11630.13070.1691
avg_run_length1.78572.01612.15522.40382.8488
rolling_30d_max_dd-0.3622-0.2534-0.1709-0.0935-0.0586
crash_window_vol0.11530.24870.31260.38640.4715
retracement_from_trough0.00000.00000.55602.06225.1363
sign_changes_5pct_count1.45003.00004.00005.75007.0000
Source file: v_recovery_setup_synthetic_spy.json
Pre-period regime: weak_bear
Sample seeds (first 5): 21000, 21001, 21002, 21003, 21004
Vol Expansion Setup SyntheticBTCVOL_EXPANSION (conditional)
1 of 2 claim off-spec·50 replicas·252d stress + 250d pre-period

Imposed conditions for sustained vol elevation via persistent HFT withdrawal and vol-trader amplification — per-replica conformance to VOL_EXPANSION gating (median ≥1.5× baseline AND ≥2 distinct windows ≥1.5×) varies in the 60-90% band per methodology expectations.

Claim validation
Realized volatility (VOL_EXPANSION median gating)off-spec
Claim: ≥ 1.5× BTC baseline (~1.20); methodology gating
Measured median: 0.3176
Measured median (0.3176) is below the claimed lower bound (1.2000). The simulator does not produce regime characteristics within the methodology operational band for this profile-asset combination — calibration limit. Resolution path: re-tune the profile (preferred) or revise the regime definition (only after anchor validation). (crypto-asset; relative-to-baseline criterion)
Total return over case (descriptive)in-band
Claim: direction-neutral by design; ±20% is acceptable spread
Measured median: 0.1198
within claimed range (crypto-asset; relative-to-baseline criterion)
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.3613-0.03840.11980.28920.6930
realized_vol_annualized0.21310.26900.31760.38090.4310
max_drawdown-0.4499-0.3387-0.2644-0.2150-0.1525
autocorrelation_lag1-0.0920-0.04740.00080.03130.1016
kurtosis-0.3665-0.2411-0.09590.10800.4589
skewness-0.1605-0.06220.02610.12110.2114
tail_p1-0.0610-0.0535-0.0437-0.0372-0.0291
tail_p990.03050.03770.04650.05440.0630
sign_change_frequency0.45780.47200.49600.51500.5360
vol_of_vol0.03090.03740.04480.05520.0677
avg_run_length1.85931.93452.00802.10922.1741
rolling_30d_max_dd-0.3185-0.2447-0.1979-0.1726-0.1369
crash_window_vol0.20800.25780.30840.37760.4289
retracement_from_trough0.06180.26580.52510.92122.4241
sign_changes_5pct_count8.450013.250018.000024.000029.1000
Source file: vol_expansion_setup_synthetic_btc.json
Pre-period regime: unstable_decay
Sample seeds (first 5): 20500, 20501, 20502, 20503, 20504
Vol Expansion Setup SyntheticGOLDVOL_EXPANSION (conditional)
all claims in-band·50 replicas·252d stress + 250d pre-period

Imposed conditions for sustained vol elevation via persistent HFT withdrawal and vol-trader amplification — per-replica conformance to VOL_EXPANSION gating (median ≥1.5× baseline AND ≥2 distinct windows ≥1.5×) varies in the 60-90% band per methodology expectations.

Claim validation
Realized volatility (VOL_EXPANSION median gating)in-band
Claim: ≥ 1.5× GOLD baseline (~0.20); methodology gating
Measured median: 0.2450
within claimed range
Total return over case (descriptive)in-band
Claim: direction-neutral by design; ±20% is acceptable spread
Measured median: -0.0019
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.3317-0.1405-0.00190.15650.4506
realized_vol_annualized0.15380.20020.24500.29210.3321
max_drawdown-0.4378-0.3006-0.2395-0.1854-0.1170
autocorrelation_lag1-0.1028-0.0550-0.01190.03100.0995
kurtosis-0.4785-0.2409-0.08260.14320.4571
skewness-0.2258-0.1074-0.00380.11140.1947
tail_p1-0.0527-0.0419-0.0335-0.0283-0.0212
tail_p990.02120.02790.03490.04230.0462
sign_change_frequency0.43160.48400.50000.52400.5600
vol_of_vol0.02110.02790.03310.04390.0580
avg_run_length1.78011.90151.99212.05742.3051
rolling_30d_max_dd-0.2578-0.2040-0.1598-0.1384-0.0917
crash_window_vol0.14120.19270.21930.27250.3356
retracement_from_trough0.00000.15810.35380.69662.0628
sign_changes_5pct_count4.45008.000013.000017.000020.5500
Source file: vol_expansion_setup_synthetic_gold.json
Pre-period regime: range
Sample seeds (first 5): 22100, 22101, 22102, 22103, 22104
Vol Expansion Setup SyntheticQQQVOL_EXPANSION (conditional)
all claims in-band·50 replicas·252d stress + 250d pre-period

Imposed conditions for sustained vol elevation via persistent HFT withdrawal and vol-trader amplification — per-replica conformance to VOL_EXPANSION gating (median ≥1.5× baseline AND ≥2 distinct windows ≥1.5×) varies in the 60-90% band per methodology expectations.

Claim validation
Realized volatility (VOL_EXPANSION median gating)in-band
Claim: ≥ 1.5× QQQ baseline (~0.32); methodology gating
Measured median: 0.3486
within claimed range
Total return over case (descriptive)in-band
Claim: direction-neutral by design; ±20% is acceptable spread
Measured median: 0.0390
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.3287-0.13640.03900.24351.0154
realized_vol_annualized0.23090.27510.34860.37930.4186
max_drawdown-0.5432-0.3749-0.2590-0.2144-0.1526
autocorrelation_lag1-0.1135-0.0475-0.00730.05960.1043
kurtosis-0.4245-0.22190.02100.14490.4823
skewness-0.2105-0.11620.05060.11860.2045
tail_p1-0.0612-0.0530-0.0476-0.0394-0.0325
tail_p990.03220.04090.04850.05500.0642
sign_change_frequency0.44360.47700.50400.52000.5462
vol_of_vol0.03440.04030.04950.05590.0637
avg_run_length1.82481.91601.97642.08732.2433
rolling_30d_max_dd-0.3320-0.2592-0.2036-0.1749-0.1297
crash_window_vol0.20990.27460.31510.36670.4042
retracement_from_trough0.01930.12150.32241.06883.2534
sign_changes_5pct_count9.450015.000019.000024.000029.5500
Source file: vol_expansion_setup_synthetic_qqq.json
Pre-period regime: weak_bear
Sample seeds (first 5): 20400, 20401, 20402, 20403, 20404
Vol Expansion Setup SyntheticSPYVOL_EXPANSION (conditional)
all claims in-band·50 replicas·252d stress + 250d pre-period

Imposed conditions for sustained vol elevation via persistent HFT withdrawal and vol-trader amplification — per-replica conformance to VOL_EXPANSION gating (median ≥1.5× baseline AND ≥2 distinct windows ≥1.5×) varies in the 60-90% band per methodology expectations.

Claim validation
Realized volatility (VOL_EXPANSION median gating)in-band
Claim: ≥ 1.5× SPY baseline (~0.24); methodology gating
Measured median: 0.3166
within claimed range
Total return over case (descriptive)in-band
Claim: direction-neutral by design; ±20% is acceptable spread
Measured median: 0.1018
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.4036-0.13180.10180.34360.7392
realized_vol_annualized0.22000.28990.31660.35460.4131
max_drawdown-0.5270-0.3277-0.2431-0.1888-0.1407
autocorrelation_lag1-0.0915-0.0580-0.01110.02750.0822
kurtosis-0.3909-0.12290.00740.23070.5636
skewness-0.2965-0.13600.02080.10480.2847
tail_p1-0.0619-0.0536-0.0449-0.0405-0.0284
tail_p990.03280.04070.04500.05180.0605
sign_change_frequency0.46180.48000.50200.51900.5560
vol_of_vol0.03210.03950.04780.05630.0654
avg_run_length1.79291.91971.98422.07442.1555
rolling_30d_max_dd-0.2914-0.2391-0.1966-0.1627-0.1134
crash_window_vol0.21570.26170.31000.36820.4411
retracement_from_trough0.00150.21570.49300.98543.7619
sign_changes_5pct_count8.450013.500019.000022.000027.5500
Source file: vol_expansion_setup_synthetic_spy.json
Pre-period regime: bull_trend
Sample seeds (first 5): 20300, 20301, 20302, 20303, 20304
Whipsaw SyntheticBTCWHIPSAW + SIDEWAYS
all claims in-band·50 replicas·126d stress + 250d pre-period

Repeated directional reversals with bounded range and no decisive resolution.

Claim validation
Sign-change frequency (descriptive proxy)in-band
Claim: elevated frequency of return-direction reversals (≥ 40% of bars; true ≥3-5 sign-changes >5% magnitude criterion is per-replica — Phase 17)
Measured median: 0.5081
within claimed range (crypto-asset; relative-to-baseline criterion)
Total return over casein-band
Claim: near-zero net direction; methodology ±10%
Measured median: 0.0183
within claimed range (crypto-asset; relative-to-baseline criterion)
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.1714-0.07300.01830.07910.2134
realized_vol_annualized0.12620.13550.14720.16750.1884
max_drawdown-0.2062-0.1442-0.0993-0.0751-0.0553
autocorrelation_lag1-0.1304-0.08470.00000.05210.0993
kurtosis-0.5816-0.3570-0.11550.11970.5667
skewness-0.3690-0.1387-0.04490.02860.3156
tail_p1-0.0296-0.0236-0.0204-0.0182-0.0164
tail_p990.01540.01810.02050.02350.0265
sign_change_frequency0.43100.48390.50810.52420.5726
vol_of_vol0.01240.01620.02040.02370.0309
avg_run_length1.73611.89391.95312.04922.2959
rolling_30d_max_dd-0.1318-0.1027-0.0852-0.0687-0.0547
crash_window_vol0.11630.13550.15270.17340.1910
retracement_from_trough0.00000.11670.39540.91902.8287
sign_changes_5pct_count1.00002.00003.00004.00006.0000
Source file: whipsaw_synthetic_btc.json
Pre-period regime: unstable_decay
Sample seeds (first 5): 9442, 9443, 9444, 9445, 9446
Whipsaw SyntheticETHWHIPSAW + SIDEWAYS
all claims in-band·50 replicas·126d stress + 250d pre-period

Repeated directional reversals with bounded range and no decisive resolution.

Claim validation
Sign-change frequency (descriptive proxy)in-band
Claim: elevated frequency of return-direction reversals (≥ 40% of bars; true ≥3-5 sign-changes >5% magnitude criterion is per-replica — Phase 17)
Measured median: 0.5000
within claimed range (crypto-asset; relative-to-baseline criterion)
Total return over casein-band
Claim: near-zero net direction; methodology ±10%
Measured median: 0.0680
within claimed range (crypto-asset; relative-to-baseline criterion)
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.0734-0.03810.06800.14300.2737
realized_vol_annualized0.12490.13790.15440.16410.1786
max_drawdown-0.1611-0.1361-0.0969-0.0700-0.0446
autocorrelation_lag1-0.1642-0.0820-0.00860.07020.1486
kurtosis-0.6546-0.2653-0.12350.12190.4866
skewness-0.3891-0.1559-0.02240.15220.2944
tail_p1-0.0259-0.0228-0.0208-0.0189-0.0171
tail_p990.01650.01920.02110.02420.0281
sign_change_frequency0.41490.47580.50000.52420.5609
vol_of_vol0.01290.01740.02020.02400.0310
avg_run_length1.77191.89391.98412.08332.3834
rolling_30d_max_dd-0.1376-0.1081-0.0811-0.0683-0.0423
crash_window_vol0.10910.13320.14120.16770.2056
retracement_from_trough0.00720.18250.47681.23964.3610
sign_changes_5pct_count1.00002.00003.00004.00005.5500
Source file: whipsaw_synthetic_eth.json
Pre-period regime: unstable_decay
Sample seeds (first 5): 7169, 7170, 7171, 7172, 7173
Whipsaw SyntheticQQQWHIPSAW + SIDEWAYS
all claims in-band·50 replicas·126d stress + 250d pre-period

Repeated directional reversals with bounded range and no decisive resolution.

Claim validation
Sign-change frequency (descriptive proxy)in-band
Claim: elevated frequency of return-direction reversals (≥ 40% of bars; true ≥3-5 sign-changes >5% magnitude criterion is per-replica — Phase 17)
Measured median: 0.5081
within claimed range
Total return over casein-band
Claim: near-zero net direction; methodology ±10%
Measured median: -0.0008
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.1468-0.0533-0.00080.06080.2332
realized_vol_annualized0.12820.13970.14820.16520.1768
max_drawdown-0.1926-0.1309-0.1009-0.0830-0.0576
autocorrelation_lag1-0.1464-0.05860.00650.04810.1453
kurtosis-0.5444-0.2918-0.17820.11590.9083
skewness-0.2575-0.08370.00590.11910.2150
tail_p1-0.0257-0.0230-0.0208-0.0192-0.0168
tail_p990.01690.01900.02080.02270.0265
sign_change_frequency0.43830.47780.50810.54030.5895
vol_of_vol0.01290.01630.02100.02580.0299
avg_run_length1.68721.83821.95312.07482.2600
rolling_30d_max_dd-0.1381-0.1088-0.0873-0.0716-0.0576
crash_window_vol0.11460.12910.14600.16390.1832
retracement_from_trough0.00000.17920.41580.76252.6864
sign_changes_5pct_count1.45003.00004.00004.00005.5500
Source file: whipsaw_synthetic_qqq.json
Pre-period regime: sideways_low_vol
Sample seeds (first 5): 7721, 7722, 7723, 7724, 7725
Whipsaw SyntheticSPYWHIPSAW + SIDEWAYS
all claims in-band·50 replicas·126d stress + 250d pre-period

Repeated directional reversals with bounded range and no decisive resolution.

Claim validation
Sign-change frequency (descriptive proxy)in-band
Claim: elevated frequency of return-direction reversals (≥ 40% of bars; true ≥3-5 sign-changes >5% magnitude criterion is per-replica — Phase 17)
Measured median: 0.4919
within claimed range
Total return over casein-band
Claim: near-zero net direction; methodology ±10%
Measured median: 0.0254
within claimed range
Aggregated metrics across 50 replicas
Metricp5p25medianp75p95
total_return-0.1676-0.04990.02540.09600.2642
realized_vol_annualized0.13230.14140.15300.16790.1764
max_drawdown-0.1975-0.1406-0.0998-0.0776-0.0567
autocorrelation_lag1-0.1175-0.0412-0.00010.08220.1491
kurtosis-0.5544-0.3363-0.10510.23820.7455
skewness-0.5647-0.2063-0.02610.07040.2056
tail_p1-0.0265-0.0234-0.0211-0.0199-0.0170
tail_p990.01610.01850.01990.02400.0257
sign_change_frequency0.41490.45970.49190.52220.5492
vol_of_vol0.01320.01780.02160.02520.0302
avg_run_length1.80941.90122.01612.15522.3834
rolling_30d_max_dd-0.1289-0.1130-0.0886-0.0756-0.0567
crash_window_vol0.13290.14340.15520.17550.1895
retracement_from_trough0.05480.22990.61351.23223.5027
sign_changes_5pct_count1.00002.25003.00004.00006.0000
Source file: whipsaw_synthetic_spy.json
Pre-period regime: bull_trend
Sample seeds (first 5): 8941, 8942, 8943, 8944, 8945

Metric definitions

total_return
Cumulative log-return over the stress-period window (warmup excluded).
realized_vol_annualized
Standard deviation of daily log-returns × √252.
max_drawdown
Peak-to-trough drawdown of the cumulative-equity curve (negative value).
autocorrelation_lag1
Pearson correlation of log-returns at lag 1.
kurtosis
Excess kurtosis of log-returns (zero = Gaussian).
skewness
Pearson skewness of log-returns.
tail_p1
1st-percentile single-bar log-return (left-tail proxy).
tail_p99
99th-percentile single-bar log-return (right-tail proxy).
sign_change_frequency
Fraction of bars where the return changes sign vs. the previous bar.
vol_of_vol
Standard deviation of rolling 20-day realized volatility.
avg_run_length
Average length of consecutive same-sign return runs (regime-persistence proxy).

Disclaimer: Results are based on model-driven simulations under simplified assumptions. Does not imply future performance. Does not constitute investment advice, a recommendation, or a forecast.