# DOLPHIN NG β€” Performance Registry > Canonical benchmark tiers. Update this file whenever a new result becomes the production target. > `GOLD` = what the system must beat or match. `SILVER` = previous gold. `BRONZE` = regression floor. --- ## πŸ₯‡ D_LIQ_GOLD β€” Active Production Candidate (2026-03-15) | Metric | Value | vs prev GOLD | vs BRONZE | |--------|-------|-------------|-----------| | **ROI** | **181.81%** | +85.26 pp | +93.26 pp | | **DD** | **17.65%** | +3.33 pp | +2.60 pp | | **Calmar** | **10.30** | vs 6.74 | vs 5.88 | | **Trades** | **2155** | identical | identical | | avg_leverage | 4.09x | β€” | β€” | | liquidation_stops | 1 (0.05%) | β€” | β€” | **Engine:** `LiquidationGuardEngine(soft=8x, hard=9x, mc_ref=5x, margin_buffer=0.95, adaptive_beta=True)` **Factory:** `create_d_liq_engine(**engine_kwargs)` β€” also `create_boost_engine()` default **Module:** `nautilus_dolphin/nautilus/proxy_boost_engine.py` **Config key:** `engine.boost_mode = "d_liq"` (now `DEFAULT_BOOST_MODE`) **Mechanism:** - Inherits `adaptive_beta` scale_boost from AdaptiveBoostEngine (GOLD) - Leverage ceiling raised to 8x soft / 9x hard (from 5x/6x) - MC-Forewarner assessed at 5x reference (decoupled) β†’ 0 RED/ORANGE/halted days - Liquidation floor stop at 10.6% adverse move (= 1/9 Γ— 0.95) β€” prevents exchange force-close - DD plateau: each +1x above 7x costs only +0.12pp DD (vs +2.6pp for 5β†’6x) **Validation (exp9b, 2026-03-15):** - All 4 leverage configs compared vs unguarded (exp9): B/C/D all improved ROI + reduced DD - E (9/10x): 5 liquidation stops β†’ cascade β†’ dead; D (8/9x) is the sweet spot - `pytest -m slow tests/test_proxy_boost_production.py` β†’ **9/9 PASSED (2026-03-15)** - MC completely silent: 0 RED, 0 ORANGE, 0 halted across 56 days at 8/9x - Trade count identical to Silver (2155) β€” no entry/exit timing change **Compounding ($25k, 56-day periods):** | Periods | ~Time | Value | |---------|-------|-------| | 3 | ~5 mo | $559,493 | | 6 | ~1 yr | $12,521,315 | | 12 | ~2 yr | $6,271,333,381 | --- ## πŸ₯ˆ GOLD (prev) β€” Former Production (demoted 2026-03-15) | Metric | Value | |--------|-------| | **ROI** | **96.55%** | | **DD** | **14.32%** | | **Calmar** | **6.74** | | **Trades** | **2155** | | scale_mean | 1.088 | | alpha_eff_mean | 1.429 | **Engine:** `AdaptiveBoostEngine(threshold=0.35, alpha=1.0, adaptive_beta=True)` **Factory:** `create_boost_engine(mode='adaptive_beta')` β€” non-default, opt-in for conservative/quiet-regime use **Validation:** `pytest -m slow tests/test_proxy_boost_production.py` β†’ 7/7 PASSED 2026-03-15 --- ## πŸ₯‰ BRONZE β€” Regression Floor (former silver, 2026-03-15) | Metric | Value | |--------|-------| | **ROI** | **88.55%** | | **PF** | **1.215** | | **DD** | **15.05%** | | **Sharpe** | **4.38** | | **Trades** | **2155** | **Engine:** `NDAlphaEngine` (no proxy_B boost) **Equivalent factory call:** `create_boost_engine(mode='none', ...)` **Validation script:** `test_pf_dynamic_beta_validate.py` > Bronze is the absolute regression floor. Falling below Bronze on both ROI and DD is a failure. --- ## All Boost Modes (exp8 results, 2026-03-14) | mode | ROI% | DD% | Ξ”DD | Ξ”ROI | Notes | |------|------|-----|-----|------|-------| | `none` (Bronze) | 88.55 | 15.05 | β€” | β€” | Baseline | | `fixed` | 93.61 | 14.51 | βˆ’0.54 | +5.06 | thr=0.35, a=1.0 | | `adaptive_alpha` | 93.40 | 14.51 | βˆ’0.54 | +4.86 | alphaΓ—boost | | `adaptive_thr` | 94.13 | 14.51 | βˆ’0.54 | +5.58 | thrΓ·boost | | `adaptive_both` | 94.11 | 14.51 | βˆ’0.54 | +5.57 | both combined | | **`adaptive_beta`** ⭐ | **96.55** | **14.32** | **βˆ’0.72** | **+8.00** | alphaΓ—(1+day_beta) β€” prev GOLD | ## Extended Leverage Configs (exp9b results, 2026-03-15) | Config | ROI% | DD% | Calmar | liq_stops | Notes | |--------|------|-----|--------|-----------|-------| | GOLD (5/6x) | 96.55 | 14.32 | 6.74 | 0 | adaptive_beta baseline | | B_liq (6/7x) | 124.01 | 15.97 | 7.77 | 1 | improved vs unguarded | | C_liq (7/8x) | 155.60 | 17.18 | 9.05 | 1 | improved vs unguarded | | **D_liq (8/9x)** | **181.81** | **17.65** | **10.30** | **1** | **D_LIQ_GOLD** | | E_liq (9/10x) | 155.88 | 31.79 | 4.90 | 5 | cascade β€” dead | --- ## Test Suite ```bash # Fast unit tests only (no data needed, ~5 seconds) pytest tests/test_proxy_boost_production.py -m "not slow" -v # Full e2e regression (55-day backtests, ~60 minutes) pytest tests/test_proxy_boost_production.py -m slow -v ``` Unit tests: ~40 (factory, engine, extended leverage, liquidation guard, actor import) E2E tests: 9 (baseline + 5 boost modes + winner-beats-baseline + D_liq repro + MC silent) Last full run: **2026-03-15 β€” 9/9 PASSED, exit code 0 (50:20)** --- ## Promotion Checklist To promote a new result to D_LIQ_GOLD (production): 1. [x] Beats prev GOLD on ROI (+85pp); DD increased +3.33pp but Calmar +53% β€” acceptable 2. [x] Trade count identical (2155) β€” no re-entry cascade 3. [x] MC completely silent at mc_ref=5.0 β€” 0 RED/ORANGE/halted 4. [x] liquidation_stops=1 (0.05%) β€” negligible, no cascade 5. [x] `pytest -m slow` passes β€” **9/9 PASSED (2026-03-15, 50:20)** 6. [x] Updated Registry.md, memory/benchmarks.md, memory/MEMORY.md 7. [x] `create_d_liq_engine()` and classes added to proxy_boost_engine.py 8. [ ] Wire `create_d_liq_engine` into DolphinActor as configurable option