# DOLPHIN NG — Performance Registry

> Canonical benchmark tiers. Update this file whenever a new result becomes the production target.
> `GOLD` = what the system must beat or match. `SILVER` = previous gold. `BRONZE` = regression floor.

---

## 🥇 D_LIQ_GOLD — Active Production Candidate (2026-03-15)

| Metric | Value | vs prev GOLD | vs BRONZE |
|--------|-------|-------------|-----------|
| **ROI** | **181.81%** | +85.26 pp | +93.26 pp |
| **DD** | **17.65%** | +3.33 pp | +2.60 pp |
| **Calmar** | **10.30** | vs 6.74 | vs 5.88 |
| **Trades** | **2155** | identical | identical |
| avg_leverage | 4.09x | — | — |
| liquidation_stops | 1 (0.05%) | — | — |

**Engine:** `LiquidationGuardEngine(soft=8x, hard=9x, mc_ref=5x, margin_buffer=0.95, adaptive_beta=True)`
**Factory:** `create_d_liq_engine(**engine_kwargs)` — also `create_boost_engine()` default
**Module:** `nautilus_dolphin/nautilus/proxy_boost_engine.py`
**Config key:** `engine.boost_mode = "d_liq"` (now `DEFAULT_BOOST_MODE`)

**Mechanism:**
- Inherits `adaptive_beta` scale_boost from AdaptiveBoostEngine (GOLD)
- Leverage ceiling raised to 8x soft / 9x hard (from 5x/6x)
- MC-Forewarner assessed at 5x reference (decoupled) → 0 RED/ORANGE/halted days
- Liquidation floor stop at 10.6% adverse move (= 1/9 × 0.95) — prevents exchange force-close
- DD plateau: each +1x above 7x costs only +0.12pp DD (vs +2.6pp for 5→6x)

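
The liquidation floor figure is plain arithmetic: ignoring fees and maintenance margin, a position at leverage L loses its full margin after an adverse move of 1/L, and the stop sits at `margin_buffer` times that distance. A minimal sketch of the calculation, assuming this reading of the `1/9 × 0.95` formula; the function name `liquidation_stop_pct` is hypothetical and not taken from `proxy_boost_engine.py`:

```python
def liquidation_stop_pct(hard_leverage: float, margin_buffer: float = 0.95) -> float:
    """Adverse price move (as a fraction) at which the floor stop would fire.

    Assumption: the stop is placed at margin_buffer * (1 / hard_leverage),
    i.e. slightly before the move that would exhaust the margin.
    """
    if hard_leverage <= 0:
        raise ValueError("hard_leverage must be positive")
    if not 0 < margin_buffer <= 1:
        raise ValueError("margin_buffer must be in (0, 1]")
    return margin_buffer / hard_leverage


# Hard caps of the configurations compared in exp9b (6x to 10x).
for hard in (6, 7, 8, 9, 10):
    stop = liquidation_stop_pct(hard)
    print(f"{hard:>2}x hard cap -> stop at {stop:.2%} adverse move")
```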
**Validation (exp9b, 2026-03-15):**
- All 4 leverage configs compared vs unguarded (exp9): B/C/D all improved ROI + reduced DD
- E (9/10x): 5 liquidation stops → cascade → dead; D (8/9x) is the sweet spot
- `pytest -m slow tests/test_proxy_boost_production.py` → **9/9 PASSED (2026-03-15)**
- MC completely silent: 0 RED, 0 ORANGE, 0 halted across 56 days at 8/9x
- Trade count identical to Silver, i.e. prev GOLD (2155) — no entry/exit timing change

**Compounding ($25k, 56-day periods):**

| Periods | ~Time | Value |
|---------|-------|-------|
| 3 | ~5 mo | $559,493 |
| 6 | ~1 yr | $12,521,315 |
| 12 | ~2 yr | $6,271,333,381 |

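
The table values are consistent with compounding the 181.81% ROI once per 56-day period from a $25,000 start. A minimal sketch of that arithmetic, assuming every period repeats the backtest ROI exactly; the helper name `compound` is illustrative, and small differences from the table come from rounding of the ROI figure:

```python
def compound(start: float, roi_pct: float, periods: int) -> float:
    """Value after `periods` back-to-back periods, each returning roi_pct percent."""
    return start * (1 + roi_pct / 100) ** periods


START_CAPITAL = 25_000   # from the table heading
ROI_PCT = 181.81         # D_LIQ_GOLD ROI per 56-day period
PERIOD_DAYS = 56

for n in (3, 6, 12):
    value = compound(START_CAPITAL, ROI_PCT, n)
    print(f"{n:>2} periods (~{n * PERIOD_DAYS} days): ${value:,.0f}")
```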
---

## 🥈 GOLD (prev) — Former Production (demoted 2026-03-15)

| Metric | Value |
|--------|-------|
| **ROI** | **96.55%** |
| **DD** | **14.32%** |
| **Calmar** | **6.74** |
| **Trades** | **2155** |
| scale_mean | 1.088 |
| alpha_eff_mean | 1.429 |

**Engine:** `AdaptiveBoostEngine(threshold=0.35, alpha=1.0, adaptive_beta=True)`
**Factory:** `create_boost_engine(mode='adaptive_beta')` — non-default, opt-in for conservative/quiet-regime use
**Validation:** `pytest -m slow tests/test_proxy_boost_production.py` → 7/7 PASSED 2026-03-15

---

## 🥉 BRONZE — Regression Floor (former silver, 2026-03-15)

| Metric | Value |
|--------|-------|
| **ROI** | **88.55%** |
| **PF** | **1.215** |
| **DD** | **15.05%** |
| **Sharpe** | **4.38** |
| **Trades** | **2155** |

**Engine:** `NDAlphaEngine` (no proxy_B boost)
**Equivalent factory call:** `create_boost_engine(mode='none', ...)`
**Validation script:** `test_pf_dynamic_beta_validate.py`

> Bronze is the absolute regression floor. A result that is worse than Bronze on both ROI (lower) and DD (higher) is a failure.

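
The floor rule can be written as a small check. This is a sketch under one reading of the rule (a failure means lower ROI and higher DD than Bronze at the same time); `Result`, `BRONZE`, and `breaches_floor` are illustrative names, not part of the test suite:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    roi_pct: float  # return on investment over the backtest, in percent
    dd_pct: float   # maximum drawdown over the backtest, in percent


# Bronze values from the table above.
BRONZE = Result(roi_pct=88.55, dd_pct=15.05)


def breaches_floor(candidate: Result, floor: Result = BRONZE) -> bool:
    """True when the candidate is worse than the floor on both ROI and DD."""
    return candidate.roi_pct < floor.roi_pct and candidate.dd_pct > floor.dd_pct


print(breaches_floor(Result(96.55, 14.32)))    # prev GOLD
print(breaches_floor(Result(181.81, 17.65)))   # D_LIQ_GOLD: higher DD, but higher ROI
print(breaches_floor(Result(80.00, 16.00)))    # hypothetical regression
```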
---

## All Boost Modes (exp8 results, 2026-03-14)

| mode | ROI% | DD% | ΔDD | ΔROI | Notes |
|------|------|-----|-----|------|-------|
| `none` (Bronze) | 88.55 | 15.05 | — | — | Baseline |
| `fixed` | 93.61 | 14.51 | −0.54 | +5.06 | thr=0.35, a=1.0 |
| `adaptive_alpha` | 93.40 | 14.51 | −0.54 | +4.86 | alpha×boost |
| `adaptive_thr` | 94.13 | 14.51 | −0.54 | +5.58 | thr÷boost |
| `adaptive_both` | 94.11 | 14.51 | −0.54 | +5.57 | both combined |
| **`adaptive_beta`** ⭐ | **96.55** | **14.32** | **−0.72** | **+8.00** | alpha×(1+day_beta) — prev GOLD |

## Extended Leverage Configs (exp9b results, 2026-03-15)

| Config | ROI% | DD% | Calmar | liq_stops | Notes |
|--------|------|-----|--------|-----------|-------|
| GOLD (5/6x) | 96.55 | 14.32 | 6.74 | 0 | adaptive_beta baseline |
| B_liq (6/7x) | 124.01 | 15.97 | 7.77 | 1 | improved vs unguarded |
| C_liq (7/8x) | 155.60 | 17.18 | 9.05 | 1 | improved vs unguarded |
| **D_liq (8/9x)** | **181.81** | **17.65** | **10.30** | **1** | **D_LIQ_GOLD** |
| E_liq (9/10x) | 155.88 | 31.79 | 4.90 | 5 | cascade — dead |

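
The Calmar column matches ROI divided by DD over the backtest window to within about 0.01, which makes the ranking easy to reproduce. A minimal sketch using the table values; the dictionary layout and the `calmar` helper are illustrative and not taken from the exp9b code:

```python
# (ROI %, DD %) per configuration, copied from the exp9b table.
CONFIGS = {
    "GOLD (5/6x)": (96.55, 14.32),
    "B_liq (6/7x)": (124.01, 15.97),
    "C_liq (7/8x)": (155.60, 17.18),
    "D_liq (8/9x)": (181.81, 17.65),
    "E_liq (9/10x)": (155.88, 31.79),
}


def calmar(roi_pct: float, dd_pct: float) -> float:
    """Return per unit of maximum drawdown (assumed definition: ROI / DD)."""
    return roi_pct / dd_pct


# Rank configurations from best to worst risk-adjusted return.
ranked = sorted(CONFIGS, key=lambda name: calmar(*CONFIGS[name]), reverse=True)

for name in ranked:
    roi, dd = CONFIGS[name]
    print(f"{name:<14} ROI={roi:7.2f}%  DD={dd:5.2f}%  Calmar={calmar(roi, dd):5.2f}")
```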
---

## Test Suite

```bash
# Fast unit tests only (no data needed, ~5 seconds)
pytest tests/test_proxy_boost_production.py -m "not slow" -v

# Full e2e regression (55-day backtests, ~60 minutes)
pytest tests/test_proxy_boost_production.py -m slow -v
```

Unit tests: ~40 (factory, engine, extended leverage, liquidation guard, actor import)
E2E tests: 9 (baseline + 5 boost modes + winner-beats-baseline + D_liq repro + MC silent)

Last full run: **2026-03-15 — 9/9 PASSED, exit code 0 (50:20)**

---

## Promotion Checklist

To promote a new result to D_LIQ_GOLD (production):
1. [x] Beats prev GOLD on ROI (+85pp); DD increased +3.33pp but Calmar +53% — acceptable
2. [x] Trade count identical (2155) — no re-entry cascade
3. [x] MC completely silent at mc_ref=5.0 — 0 RED/ORANGE/halted
4. [x] liquidation_stops=1 (0.05%) — negligible, no cascade
5. [x] `pytest -m slow` passes — **9/9 PASSED (2026-03-15, 50:20)**
6. [x] Updated Registry.md, memory/benchmarks.md, memory/MEMORY.md
7. [x] `create_d_liq_engine()` and classes added to proxy_boost_engine.py
8. [ ] Wire `create_d_liq_engine` into DolphinActor as configurable option
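
The quantitative items in the checklist (1 to 4) could be gated automatically. A rough sketch, assuming a run is summarised by the fields below; `RunSummary`, `promotion_blockers`, and the 0.1% liquidation-stop tolerance are hypothetical choices for illustration, not existing project code or agreed thresholds:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunSummary:
    roi_pct: float
    dd_pct: float
    trades: int
    liquidation_stops: int = 0
    mc_alert_days: int = 0  # RED + ORANGE + halted days; only read for the candidate

    @property
    def calmar(self) -> float:
        return self.roi_pct / self.dd_pct


def promotion_blockers(candidate: RunSummary, incumbent: RunSummary,
                       max_liq_rate: float = 0.001) -> list[str]:
    """Return the reasons a candidate should not be promoted (empty list = pass)."""
    blockers = []
    if candidate.roi_pct <= incumbent.roi_pct:
        blockers.append("ROI does not beat incumbent")
    if candidate.calmar <= incumbent.calmar:
        blockers.append("Calmar does not beat incumbent")
    if candidate.trades != incumbent.trades:
        blockers.append("trade count changed")
    if candidate.mc_alert_days != 0:
        blockers.append("MC-Forewarner not silent")
    if candidate.liquidation_stops / candidate.trades > max_liq_rate:
        blockers.append("liquidation stop rate too high")
    return blockers


# Figures from the registry tables above.
prev_gold = RunSummary(roi_pct=96.55, dd_pct=14.32, trades=2155)
d_liq = RunSummary(roi_pct=181.81, dd_pct=17.65, trades=2155,
                   liquidation_stops=1, mc_alert_days=0)

print(promotion_blockers(d_liq, prev_gold) or "no blockers")
```

Returning a list of reasons rather than a single boolean keeps the output readable when several criteria fail at once.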