Files
DOLPHIN/prod/docs/SYSTEM_BIBLE_v4.md
hjnormey 01c19662cb initial: import DOLPHIN baseline 2026-04-21 from dolphinng5_predict working tree
Includes core prod + GREEN/BLUE subsystems:
- prod/ (BLUE harness, configs, scripts, docs)
- nautilus_dolphin/ (GREEN Nautilus-native impl + dvae/ preserved)
- adaptive_exit/ (AEM engine + models/bucket_assignments.pkl)
- Observability/ (EsoF advisor, TUI, dashboards)
- external_factors/ (EsoF producer)
- mc_forewarning_qlabs_fork/ (MC regime/envelope)

Excludes runtime caches, logs, backups, and reproducible artifacts per .gitignore.
2026-04-21 16:58:38 +02:00

70 KiB
Executable File
Raw Permalink Blame History

DOLPHIN-NAUTILUS SYSTEM BIBLE

Doctrinal Reference — As Running 2026-03-24

Version: v4 — Clean Architecture + Hazelcast DataFeed Previous version: SYSTEM_BIBLE_v3.md (forked 2026-03-24) CI gate (Nautilus): 46/46 tests green Execution: Binance Futures (USDT-M) verified via verify_testnet_creds.py and binance_test.py Status: Paper trading active on Phoenix-01. Clean architecture operational.

What changed since v3 (2026-03-22)

Area Change
Clean Architecture NEW hexagonal architecture in prod/clean_arch/ — Ports, Adapters, Core separation. Adapter-agnostic business logic.
Hazelcast DataFeed NEW HazelcastDataFeed adapter implementing DataFeedPort — reads from DolphinNG6 via Hazelcast (single source of truth).
Scan Bridge Service NEW scan_bridge_service.py — Linux Arrow file watcher that pushes to Hazelcast. Uses file mtime (not scan #) to handle NG6 restarts.
Paper Trading Engine NEW paper_trade.py — Clean architecture trading CLI with 23 round-trip trades executed in testing.
Market Data Live data flowing: 50 assets, BTC @ $71,281.03, velocity divergence signals active.

What changed since v2 (2026-03-22)

Area Change
Binance Futures Switched system focus from Spot to Perpetuals; updated API endpoints (fapi.binance.com); added recvWindow for signature stability.
Friction Management SP Bypass Logic: Alpha engines now support disabling internal fees/slippage to allow Nautilus to handle costs natively. Prevents double-counting.
Paper Trading NEW launch_paper_portfolio.py — uses Sandbox matching with live Binance data; includes realistic Tier 0 friction (0.02/0.05).
Session Logging NEW TradeLoggerActor — independent CSV/JSON audit trails for every session.
Area Change
DolphinActor Refactored to step_bar() API (incremental, not batch); threading.Lock on ACB; _GateSnap stale-state detection; replay vs live mode; bar_idx tracking
OBF Subsystem Sprint 1 hardening complete: circuit breaker, stall watchdog, crossed-book guard, dark streak, first flush 60s, fire-and-forget HZ pushes, dynamic asset discovery
nautilus_prefect_flow.py NEW — Prefect-supervised BacktestEngine daily flow; champion SHA256 hash check; HZ heartbeats; capital continuity; HIBERNATE guard
Test suite +35 DolphinActor tests (test_dolphin_actor.py); total 46 Nautilus + ~120 OBF
prod/docs/ All prod .md files consolidated; SYSTEM_FILE_MAP.md; NAUTILUS_DOLPHIN_SPEC.md added
0.1s resolution Assessed: BLOCKED by 3 hard blockers (see §22)
Capital Sync NEW — DolphinActor now syncs initial_capital with Nautilus Portfolio balance on_start.
Verification NEW — TODO_CHECK_SIGNAL_PATHS.md systematic test spec for local agents.
MC-Forewarner Now wired in DolphinActor.on_start() — both flows run full gold-performance stack; _MC_BASE_CFG + _MC_MODELS_DIR_DEFAULT as frozen module constants; empty-parquet early-return bug fixed in on_bar replay path

TABLE OF CONTENTS

  1. System Philosophy
  2. Physical Architecture 2a. Clean Architecture Layer (NEW v4)
  3. Data Layer
  4. Signal Layer — vel_div & DC
  5. Asset Selection — IRP
  6. Position Sizing — AlphaBetSizer
  7. Exit Management
  8. Fee & Slippage Model
  9. OB Intelligence Layer
  10. ACB v6 — Adaptive Circuit Breaker
  11. Survival Stack — Posture Control
  12. MC-Forewarner Envelope Gate
  13. NDAlphaEngine — Full Bar Loop
  14. DolphinActor — Nautilus Integration
  15. Hazelcast — Full IMap Schema
  16. Production Daemon Topology & HZ Bridge
  17. Prefect Orchestration Layer
  18. CI Test Suite
  19. Parameter Reference
  20. OBF Sprint 1 Hardening
  21. Known Research TODOs
  22. 0.1s Resolution — Readiness Assessment
  23. Signal Path Verification Specification

1. SYSTEM PHILOSOPHY

DOLPHIN-NAUTILUS is a SHORT-only (champion configuration) systematic trading engine targeting crypto perpetual futures on Binance.

Core thesis: When crypto market correlation matrices show accelerating eigenvalue-velocity divergence (vel_div < -0.02), the market is entering an instability regime. Shorting during early instability onset and exiting at fixed take-profit captures the mean-reversion from panic to normalization.

Design constraints:

  • Zero signal re-implementation in the Nautilus layer. All alpha logic lives in NDAlphaEngine.
  • 512-bit arithmetic for correlation matrix processing (separate NG3 pipeline; not in hot path of this engine).
  • Champion parameters are FROZEN. They were validated via exhaustive VBT backtest on dolphin_vbt_real.py.
  • The Nautilus actor is a thin wire, not a strategy. It routes parquet data → NDAlphaEngine → HZ result.

Champion performance (ACBv6 + IRP + DC + OB, full-stack 55-day Dec31Feb25):

  • ROI: +54.67% | PF: 1.141 | Sharpe: 2.84 | Max DD: 15.80% | WR: 49.5% | Trades: 2145
  • Log: run_logs/summary_20260307_163401.json

Data correction note (2026-03-07): An earlier reference showed ROI=+57.18%, PF=1.149, Sharpe=3.00. Those figures came from a stale vbt_cache/2026-02-25.parquet that was built mid-day — missing 435 scans and carrying corrupt vel_div on 492 rows for the final day of the window. ALGO-3 parity testing caught the mismatch (max_diff=1.22 vs tolerance 1e-10). The parquet was rebuilt from live NG3 JSON (build_parquet_cache(dates=['2026-02-25'], force=True)). The stale file is preserved as 2026-02-25.parquet.STALE_20260307 for replicability. The corrected numbers above are the canonical reference. The ~2.5pp ROI drop reflects real late-day trades on Feb 25 that the stale parquet had silently omitted.


2. PHYSICAL ARCHITECTURE

┌──────────────────────────────────────────────────────────────────────┐
│  DATA SOURCES                                                         │
│  NG3 Scanner (Win) → /mnt/ng6_data/eigenvalues/ (SMB DolphinNG6_Data)│
│  Binance WS → 5s OHLCV bars + live order book (48+ USDT perpetuals) │
│  VBT Cache  → vbt_cache_klines/*.parquet (DOLPHIN-local + /mnt/dolphin)│
└────────────────────────┬─────────────────────────────────────────────┘
                         │
┌────────────────────────▼─────────────────────────────────────────────┐
│  HAZELCAST IN-MEMORY GRID  (localhost:5701, cluster: "dolphin")       │
│  *** SYSTEM MEMORY — primary real-time data bus ***                  │
│  DOLPHIN_SAFETY          → posture + Rm (CP AtomicRef / IMap)        │
│  DOLPHIN_FEATURES        → acb_boost {boost,beta}, latest_eigen_scan │
│  DOLPHIN_PNL_BLUE/GREEN  → per-date trade results                    │
│  DOLPHIN_STATE_BLUE      → capital continuity (latest + per-run)     │
│  DOLPHIN_HEARTBEAT       → liveness pulses (nautilus_prefect_flow)   │
│  DOLPHIN_OB              → order book snapshots                       │
│  DOLPHIN_FEATURES_SHARD_00..09 → 400-asset OBF sharded store         │
└────────────────────────┬─────────────────────────────────────────────┘
                         │
┌────────────────────────▼─────────────────────────────────────────────┐
│  PREFECT ORCHESTRATION  (localhost:4200, work-pool: dolphin)          │
│  paper_trade_flow.py        00:05 UTC — NDAlphaEngine direct         │
│  nautilus_prefect_flow.py   00:10 UTC — BacktestEngine + DolphinActor│
│  obf_prefect_flow.py        Continuous ~500ms — OB ingestion         │
│  mc_forewarner_flow.py      Daily — MC gate prediction               │
│  exf_fetcher_flow.py        Periodic — ExF macro data fetch          │
└────────────────────────┬─────────────────────────────────────────────┘
                         │
┌────────────────────────▼─────────────────────────────────────────────┐
│  PRODUCTION DAEMONS (separate processes)                              │
│  acb_processor_service.py   — ACB daily boost + HZ write (CP lock)   │
│  system_watchdog_service.py — Survival Stack Rm → DOLPHIN_SAFETY     │
│  scan_hz_bridge.py          — FS Watchdog → pushes Arrow scans to HZ │
│  exf_fetcher_simple.py      — Live ExF daemon (funding/dvol/fng/taker)│
└────────────────────────┬─────────────────────────────────────────────┘
                         │
┌────────────────────────▼─────────────────────────────────────────────┐
│  NAUTILUS TRADING ENGINE  (siloqy-env, nautilus_trader 1.219.0)      │
│  BacktestEngine + DolphinActor(Strategy) → NDAlphaEngine             │
│  on_bar() fires per date tick; step_bar() iterates parquet rows      │
│  HZ ACB listener → pending-flag → applied at top of next on_bar()   │
│  TradingNode (launcher.py) → future live exchange connectivity        │
└──────────────────────────────────────────────────────────────────────┘

Key invariant v2: DolphinActor.on_bar() receives one synthetic bar per date in paper mode, which triggers engine.begin_day() then iterates through all parquet rows via step_bar(). In live mode, one real bar → one step_bar() call. The _processed_dates guard is replaced by date-boundary detection comparing current_date to the bar's timestamp date.


2a. CLEAN ARCHITECTURE LAYER (NEW v4)

2a.1 Overview

The Clean Architecture layer provides a hexagonal (ports & adapters) implementation for paper trading, ensuring core business logic is independent of infrastructure concerns.

┌─────────────────────────────────────────────────────────────────────────┐
│                     CLEAN ARCHITECTURE (prod/clean_arch/)               │
├─────────────────────────────────────────────────────────────────────────┤
│  PORTS (Interfaces)                                                      │
│  ├── DataFeedPort          → Abstract market data source                │
│  └── TradingPort           → Abstract order execution                   │
├─────────────────────────────────────────────────────────────────────────┤
│  ADAPTERS (Infrastructure)                                               │
│  ├── HazelcastDataFeed     → Reads from DOLPHIN_FEATURES map            │
│  └── PaperTradeExecutor    → Simulated execution (no real orders)       │
├─────────────────────────────────────────────────────────────────────────┤
│  CORE (Business Logic)                                                   │
│  ├── TradingEngine         → Position sizing, signal processing         │
│  ├── SignalProcessor       → Eigenvalue-based signal generation         │
│  └── PortfolioManager      → PnL tracking, position management          │
└─────────────────────────────────────────────────────────────────────────┘

2a.2 Key Design Principles

Dependency Rule: Dependencies only point inward. Core knows nothing about Hazelcast, Arrow files, or Binance.

Single Source of Truth: All data comes from Hazelcast DOLPHIN_FEATURES.latest_eigen_scan, written atomically by DolphinNG6.

File Timestamp vs Scan Number: The Scan Bridge uses file modification time (mtime) instead of scan numbers because DolphinNG6 resets counters on restarts.

2a.3 Components

Component File Purpose
DataFeedPort ports/data_feed.py Abstract interface for market data
HazelcastDataFeed adapters/hazelcast_feed.py Hz implementation of DataFeedPort
TradingEngine core/trading_engine.py Pure business logic
Scan Bridge ../scan_bridge_service.py Arrow → Hazelcast bridge
Paper Trader paper_trade.py CLI trading session

2a.4 Data Flow

DolphinNG6 → Arrow Files (/mnt/ng6_data/arrow_scans/) → Scan Bridge → Hazelcast → HazelcastDataFeed → TradingEngine
     (5s)                                               (watchdog)    (SSOT)      (Adapter)          (Core)

2a.5 MarketSnapshot Structure

MarketSnapshot(
    timestamp=datetime,
    symbol="BTCUSDT",
    price=71281.03,              # From asset_prices[0]
    eigenvalues=[...],           # From asset_loadings (50 values)
    velocity_divergence=-0.0058, # vel_div field
    scan_number=7315
)

2a.6 Current Status

  • Assets Tracked: 50 (BTC, ETH, BNB, etc.)
  • BTC Price: $71,281.03
  • Test Trades: 23 round-trip trades executed
  • Strategy: Mean reversion on velocity divergence
  • Data Latency: ~5 seconds (DolphinNG6 pulse)

3. DATA LAYER

3.1 vbt_cache_klines Parquet Schema

Location: C:\Users\Lenovo\Documents\- DOLPHIN NG HD HCM TSF Predict\vbt_cache_klines\YYYY-MM-DD.parquet

Column Type Description
vel_div float64 Eigenvalue velocity divergence: v50_vel v750_vel (primary signal)
v50_lambda_max_velocity float64 Short-window (50-bar) lambda_max rate of change
v150_lambda_max_velocity float64 150-bar window lambda velocity
v300_lambda_max_velocity float64 300-bar window lambda velocity
v750_lambda_max_velocity float64 Long-window (750-bar) macro eigenvalue velocity
instability_50 float64 General market instability index (50-bar)
instability_150 float64 General market instability index (150-bar)
BTCUSDTSTXUSDT float64 Per-asset close prices (48 assets in current dataset)

Each file: 1,439 rows (1 per 5-second bar over 24h), 57 columns.

3.2 NG3 Eigenvalue Data

Location: C:\Users\Lenovo\Documents\- Dolphin NG HD (NG3)\correlation_arb512\

eigenvalues/
  YYYY-MM-DD/
    scan_NNNNNN__Indicators.npz   ← ACBv6 external factors (funding, dvol, fng, taker)
    scan_NNNNNN__scan_global.npz  ← lambda_vel_w750 for dynamic beta
matrices/
  YYYY-MM-DD/
    scan_NNNNNN_w50_HHMMSS.arb512.pkl.zst  ← 512-bit correlation matrix (unused in hot path)

NPZ files loaded by AdaptiveCircuitBreaker._load_external_factors() (max 10 scans per date, median-aggregated).


4. SIGNAL LAYER

4.1 Primary Signal: vel_div Threshold Gate

Source: alpha_signal_generator.py, AlphaSignalGenerator.generate()

SHORT signal condition:

vel_div < VEL_DIV_THRESHOLD (-0.02)

LONG signal condition (green posture, not champion):

vel_div > LONG_THRESHOLD (0.01)

Confidence calculation (SHORT path):

ratio = clamp((threshold - vel_div) / (threshold - extreme), 0, 1)
      = clamp((-0.02 - vel_div) / (-0.02 - (-0.05)), 0, 1)
      = clamp((-0.02 - vel_div) / 0.03, 0, 1)
confidence = 0.50 + ratio * 0.40   # range: [0.50, 0.90]

is_extreme = (vel_div <= -0.05)

4.2 Direction Confirmation (DC) — Layer 6

Source: alpha_signal_generator.py, check_dc_nb() (numba JIT)

# Looks back dc_lookback_bars (default 7) bars on the selected asset price
p0 = price[n - lookback - 1]
p1 = price[n - 1]
chg_bps = (p1 - p0) / p0 * 10000.0

if chg_bps < -min_magnitude_bps (-0.75):  return CONFIRM   # falling price → SHORT OK
if chg_bps >  min_magnitude_bps (+0.75):  return CONTRADICT
else:                                       return NEUTRAL

dc_skip_contradicts = True (champion): CONTRADICT returns null signal (skip entry).
Effect on leverage: DC has dc_leverage_boost=1.0 (no boost in champion). CONTRADICT kills entry.

4.3 OB Sub-2: Per-Asset Imbalance Confirmation

When ob_engine is wired in (use_ob_edge=True):

eff_imb = -ob_signal.imbalance_ma5  # For SHORT: sell pressure = positive eff_imb

if eff_imb > 0.10:                              # OB confirms → confidence boost ≤+15%
    ob_adj = 1 + min(0.15, eff_imb * persistence * 0.5)
    confidence *= ob_adj
elif eff_imb < -0.15 and persistence > 0.60:   # Strong persistent OB contradiction → HARD SKIP
    return null_signal
elif eff_imb < -0.10:                           # Moderate → soft dampen confidence
    ob_adj = max(0.85, 1 - |eff_imb| * persistence * 0.4)
    confidence *= ob_adj

5. ASSET SELECTION — IRP

5.1 Overview

Source: alpha_asset_selector.py, AlphaAssetSelector.rank_assets() + numba kernels

IRP = Impulse Response Profiling. Ranks all available assets by historical behavior over the last 50 bars in the regime direction. Selects the asset with the highest ARS (Asset Ranking Score) that passes all filters.

Enabled by: use_asset_selection=True (production default).

5.2 Numba Kernel: compute_irp_nb

# Input: price_segment (last 50 prices), direction (-1 or +1)
dir_returns[i] = (price[i+1] - price[i]) * direction   # directional returns

cumulative = cumsum(dir_returns)
mfe = max(cumulative)          # Maximum Favorable Excursion
mae = abs(min(cumulative, 0))  # Maximum Adverse Excursion
efficiency  = mfe / (mae + 1e-6)
alignment   = count(dir_returns > 0) / n_ret
noise       = variance(dir_returns)
latency     = bars_to_reach_10pct_of_mfe   # (default: 50 if mfe==0)

5.3 Numba Kernel: compute_ars_nb

ARS = 0.5 * log1p(efficiency) + 0.35 * alignment - 0.15 * noise * 1000

5.4 Numba Kernel: rank_assets_irp_nb

For each asset:

  1. Compute IRP in DIRECT direction (regime_direction)
  2. Compute IRP in INVERSE direction (-regime_direction)
  3. Take whichever gives higher ARS (allows inverse selection)
  4. Apply filter gates:
    • noise > 500 → skip
    • latency > 20 → skip (must reach 10% MFE within 20 bars)
    • alignment < 0.20 → skip
  5. Bubble-sort by ARS descending (numba nopython)

5.5 AlphaAssetSelector Python Wrapper

# Build 2D array (max_len × n_assets), right-aligned
valid = rank_assets_irp_nb(prices_2d, idx=max_len, regime_direction, ...)
# Walk ranked list:
for r in rankings:
    if min_irp_alignment > 0 and r.metrics.alignment < min_irp_alignment:
        continue          # alignment gate (default 0.45)
    if r.action != expected_action:
        continue          # direction gate
    if ob_engine and ob_placement.depth_quality < 0.20:
        continue          # OB depth gate (try next asset)
    trade_asset = r.asset
    break
# No match → return None (no fallback to BTCUSDT when IRP enabled)

OB Sub-1: ARS adjusted ±5%/10% by per-asset OB depth quality before sorting.


6. POSITION SIZING

6.1 Signal Strength Score

Source: alpha_bet_sizer.py, compute_sizing_nb() (numba JIT)

# SHORT path (vel_div < threshold):
if vel_div <= extreme (-0.05):
    strength_score = 1.0
else:
    strength_score = (threshold - vel_div) / (threshold - extreme)
    # = (-0.02 - vel_div) / 0.03
    strength_score = clamp(strength_score, 0.0, 1.0)

6.2 Dynamic Leverage (Cubic Convex)

scaled_score  = strength_score ** leverage_convexity (3.0)
eff_leverage  = min_leverage + scaled_score * (max_leverage - min_leverage)
              = 0.5 + scaled_score³ * 4.5   # range: [0.5, 5.0] before ACB

6.3 Signal Bucket Classification

# Numba: get_signal_bucket_nb(vel_div, threshold=-0.02, extreme=-0.05)
if vel_div <= extreme * 1.5 (-0.075):  bucket = 0  # "extreme"
elif vel_div <= extreme (-0.05):        bucket = 1  # "strong"
elif vel_div <= (threshold+extreme)/2:  bucket = 2  # "moderate"  (-0.035)
else:                                   bucket = 3  # "weak"

6.4 Alpha Layers (Layer 7)

Bucket Boost — adaptive win-rate feedback:

# get_bucket_boost_nb: per-bucket win rate → multiplier
wr > 0.60  1.3x  |  wr > 0.55  1.1x  |  wr < 0.40  0.7x  |  wr < 0.45  0.85x

Streak Multiplier — recent 5-trade loss streak:

# get_streak_mult_nb
losses_in_last_5 >= 4  0.5x  |  >= 3  0.7x  |  <= 1  1.1x

Trend Multiplier — vel_div acceleration:

# get_trend_mult_nb(vd_trend = vel_div_history[-1] - vel_div_history[-10])
vd_trend < -0.01  1.3x (deepening instability)
vd_trend >  0.01  0.7x (recovering)

Effective Fraction computation:

confidence = 0.70 if is_extreme else 0.55
conf_mult  = confidence / 0.95
extreme_boost = 2.0 if is_extreme else 1.0

base_frac   = 0.02 + strength_score * (base_fraction - 0.02)
eff_fraction = base_frac * conf_mult * extreme_boost * trend_mult * bucket_boost * streak_mult
eff_fraction = clamp(eff_fraction, 0.02, base_fraction=0.20)

Final notional:

notional = capital * eff_fraction * final_leverage

6.5 ACB + MC Size Multiplier

# regime_size_mult is recomputed every bar via _update_regime_size_mult(vel_div)
if day_beta > 0:
    strength_cubic = clamp((threshold - vel_div) / (threshold - extreme), 0, 1) ** convexity
    regime_size_mult = day_base_boost * (1.0 + day_beta * strength_cubic) * day_mc_scale
else:
    regime_size_mult = day_base_boost * day_mc_scale

# Applied to leverage ceiling:
clamped_max_leverage = min(base_max_leverage * regime_size_mult * market_ob_mult, abs_max_leverage=6.0)
raw_leverage = size_result["leverage"] * dc_lev_mult * regime_size_mult * market_ob_mult

# STALKER posture hard cap:
if posture == 'STALKER': clamped_max_leverage = min(clamped_max_leverage, 2.0)

final_leverage = clamp(raw_leverage, min_leverage=0.5, clamped_max_leverage)

7. EXIT MANAGEMENT

7.1 Exit Priority Order (champion)

Source: alpha_exit_manager.py, AlphaExitManager.evaluate()

  1. FIXED_TP: pnl_pct >= 0.0095 (95 basis points)
  2. STOP_LOSS: pnl_pct <= -1.0 (DISABLED in practice — 100% loss never triggers before TP/max_hold)
  3. OB DURESS exits (when ob_engine != None):
    • Cascade Detection: cascade_count > 0 → widen TP ×1.40, halve max_hold
    • Liquidity Withdrawal: regime_signal == 1 → hard SL 10%, TP ×0.60
  4. vel_div adverse-turn exits (vd_enabled=False by default — disabled pending calibration)
  5. MAX_HOLD: bars_held >= 120 (= 600 seconds)

7.2 OB Dynamic Exit Parameter Adjustment

if cascade_count > 0:
    dynamic_tp_pct  *= 1.40
    dynamic_max_hold = int(max_hold_bars * 0.50)    # take profit fast before snap-back

elif regime_signal == 1:   # LIQUIDITY WITHDRAWAL STRESS
    dynamic_sl_pct  = 0.10                          # hard 10% stop (tail protection)
    if pnl_pct > 0.0:
        dynamic_tp_pct *= 0.60                      # take profit sooner under stress
    if eff_imb < -0.10:                             # OB actively opposing
        dynamic_max_hold = int(max_hold_bars * 0.40)

elif regime_signal == -1 and eff_imb > 0.15:        # CALM + FAVORABLE
    dynamic_max_hold = int(max_hold_bars * 1.50)    # let winners run

# Per-asset withdrawal (micro-level):
if withdrawal_velocity < -0.20 and not in cascade/stress:
    dynamic_max_hold = min(dynamic_max_hold, int(max_hold_bars * 0.40))
    if pnl_pct > 0.0: dynamic_tp_pct *= 0.75

7.3 Sub-day ACB Force Exit

When HZ listener fires an ACB update mid-day:

# In update_acb_boost(boost, beta):
if old_boost >= 1.25 and boost < 1.10:
    evaluate_subday_exits()   # → _execute_exit("SUBDAY_ACB_NORMALIZATION", ...)

Threshold is ARBITRARY (not backtested). Marked research TODO. Safe under pending-flag pattern (fires on next bar, not mid-loop).

7.4 Slippage on Exit

# SHORT position exit:
exit_price = current_price * (1.0 + slip)   # slippage against us when covering short
# STOP_LOSS:   slip = 0.0005 (5 bps — market order fill)
# FIXED_TP:   slip = 0.0002 (2 bps — likely limit fill)
# All others: slip = 0.0002

8. FEE & SLIPPAGE MODEL

8.1 SmartPlacer Fee Model (Layer 3)

Source: esf_alpha_orchestrator.py, _execute_exit()

Blended taker/maker fee rates based on historical SP fill statistics. IMPORTANT: In production/paper sessions using Nautilus friction, these MUST be disabled via use_sp_fees=False.

# Entry fee (ONLY applied if use_sp_fees=True):
entry_fee = (0.0002 * sp_maker_entry_rate + 0.0005 * (1 - sp_maker_entry_rate)) * notional
          = (0.0002 * 0.62 + 0.0005 * 0.38) * notional
          = (0.0001240 + 0.0001900) * notional
          = 0.000314 * notional   (31.4 bps)

8.2 SP Slippage Refund (Layer 3)

Also disabled when use_sp_slippage=False is passed to the engine. These were used to "re-approximate" fills in low-fidelity simulations. In paper/live trading, the matching engine provides the fill price directly.

8.3 Production-Grade Native Friction (Nautilus)

In launch_paper_portfolio.py and live production flows:

  1. Engine Bypass: use_sp_fees = False, use_sp_slippage = False.
  2. Nautilus Node Side: Commissions are applied by the kernel via CommissionConfig.
  3. Execution: Slippage is realized via the spread in the Nautilus Sandbox (Paper) or on-chain (Live).

8.4 Independent Session Logging

Every high-fidelity session now deploys a TradeLoggerActor that independently captures:

  • logs/paper_trading/settings_<ts>.json: Full configuration metadata.
  • logs/paper_trading/trades_<ts>.csv: Every execution event.

8.3 OB Edge (Layer 4)

# With real OB engine:
if ob_placement.depth_quality > 0.5:
    pnl_pct_raw += ob_placement.fill_probability * ob_edge_bps * 1e-4

# Without OB engine (legacy Monte Carlo fallback):
if rng.random() < ob_confirm_rate (0.40):
    pnl_pct_raw += ob_edge_bps * 1e-4   # default: +5 bps

Net PnL:

gross_pnl = pnl_pct_raw * notional
net_pnl   = gross_pnl - entry_fee - exit_fee
capital  += net_pnl

9. OB INTELLIGENCE LAYER

Source: ob_features.py, ob_provider.py, hz_ob_provider.py

The OB layer is wired in via engine.set_ob_engine(ob_engine) which propagates to signal_gen, asset_selector, and exit_manager. It is OPTIONAL — the engine degrades gracefully to legacy Monte Carlo when ob_engine=None.

9.1 OB Signals Per Asset

ob_signal = ob_engine.get_signal(asset, timestamp)
# Fields:
#   imbalance_ma5        — 5-bar MA of bid/ask size imbalance ([-1, +1])
#   imbalance_persistence — fraction of last N bars sustaining sign
#   withdrawal_velocity  — rate of depth decay (negative = book thinning)

9.2 OB Macro (Market-Wide)

ob_macro = ob_engine.get_macro()
# Fields:
#   cascade_count    — number of assets in liquidation cascade
#   regime_signal    — (-1=calm/trending, 0=neutral, +1=withdrawal stress)

9.3 OB Placement Quality

ob_placement = ob_engine.get_placement(asset, timestamp)
# Fields:
#   depth_quality    — book depth score ([0, 2+]; >1 = deep book)
#   fill_probability — probability of maker fill at entry price

9.4 OB Sub-Systems Summary

Sub Location Effect
OB-1 AlphaAssetSelector ARS adjusted ±5%/10% by depth quality
OB-2 AlphaSignalGenerator Confidence boost/dampen by imbalance; hard skip if persistent contradiction
OB-3 NDAlphaEngine._try_entry Market-wide imbalance multiplier on final leverage (±20%/15%)
OB-4 AdaptiveCircuitBreaker Macro withdrawal stress modulates ACBv6 dynamic beta (+25% max)
OB-5 AlphaExitManager Dynamic TP/SL/max_hold based on cascade/withdrawal/calm regime

10. ACB v6 — ADAPTIVE CIRCUIT BREAKER

10.1 Architecture (3-Scale Confirmation)

Source: adaptive_circuit_breaker.py, AdaptiveCircuitBreaker

Scale 1 (Daily):  External macro factors → log_0.5 base_boost
Scale 2 (Per-bar): vel_div signal strength → meta-boost multiplier
Scale 3 (Macro):  w750 eigenvalue velocity → dynamic beta switch

10.2 Scale 1 — External Factor Signals

Loaded from NG3 scan_*__Indicators.npz files (median of first 10 scans per date):

Factor Source Bearish Threshold Signal Weight
funding_btc Funding rate < -0.0001 (very) / < 0.0 (mild) 1.0 / 0.5
dvol_btc BTC implied vol > 80 (extreme) / > 55 (elevated) 1.0 / 0.5
fng Fear & Greed Index < 25 (extreme fear) / < 40 (fear) 1.0 / 0.5 (requires confirmation)
taker Taker buy ratio < 0.80 (selling) / < 0.90 (mild) 1.0 / 0.5

Signal counting:

signals = sum(individual_signal_weights)   # float, e.g. 2.5

Base boost formula:

if signals >= 1.0:
    base_boost = 1.0 + 0.5 * log1p(signals)
    # signals=1 → 1.347x | signals=2 → 1.549x | signals=3 → 1.693x
else:
    base_boost = 1.0

10.3 Scale 3 — Dynamic Beta

# Preloaded: w750_threshold = np.percentile(all_w750_vels, 60)
if w750_vel >= w750_threshold:
    beta = BETA_HIGH = 0.8    # aggressive meta-boost during macro acceleration
else:
    beta = BETA_LOW  = 0.2    # conservative during calm macro

10.4 Scale 2 — Per-Bar Meta-Boost

# Computed every bar inside _update_regime_size_mult(vel_div):
strength_cubic = clamp((threshold - vel_div) / (threshold - extreme), 0, 1) ** 3
# leverage_convexity = 3 → cubic

if day_beta > 0:
    regime_size_mult = base_boost * (1.0 + beta * strength_cubic) * mc_scale
else:
    regime_size_mult = base_boost * mc_scale

10.5 Sub-Day ACB Update (HZ Listener)

The acb_processor_service.py re-runs ACB computation mid-day when new NG3 scan data arrives and writes {boost, beta} to DOLPHIN_FEATURES IMap.

_on_acb_event() in DolphinActor stores the payload in self._pending_acb (GIL-safe dict write). Applied at start of next on_bar() iteration:

# In on_bar() — BEFORE processing:
if _pending_acb is not None and engine is not None:
    engine.update_acb_boost(pending_acb['boost'], pending_acb['beta'])
    _pending_acb = None

11. SURVIVAL STACK — POSTURE CONTROL

11.1 Overview

Source: survival_stack.py, SurvivalStack

Computes a continuous Risk Multiplier Rm ∈ [0, 1] from 5 sensor categories. Maps to discrete posture {APEX, STALKER, TURTLE, HIBERNATE}.

11.2 Five Sensor Categories

Cat1 — Binary Invariant (kill switch):

if hz_nodes < 1 or heartbeat_age_s > 30:
    return 0.0   # Total system failure → HIBERNATE immediately
return 1.0

Cat2 — Structural (MC-Forewarner + data staleness):

base = {OK: 1.0, ORANGE: 0.5, RED: 0.1}[mc_status]
decay = exp(-max(0, staleness_hours - 6) / 3)
f_structural = base * decay   # Exponential decay after 6h stale

Cat3 — Microstructure (OB depth/fill quality):

if ob_stale:
    return 0.5
score = min(depth_quality, fill_prob)
return clamp(0.3 + 0.7 * score, 0.3, 1.0)

Cat4 — Environmental (DVOL spike impulse):

if dvol_spike and t_since_spike_min == 0:
    return 0.3   # Instant degradation at spike
return 0.3 + 0.7 * (1 - exp(-t_since_spike_min / 60))  # 60-min recovery tau

Cat5 — Capital (sigmoid drawdown constraint):

# Rm5 ≈ 1.0 at DD<5%, ≈ 0.5 at DD=12%, ≈ 0.1 at DD=20%
return 1 / (1 + exp(30 * (drawdown - 0.12)))

11.3 Hierarchical Combination

f_environment = min(f_structural, f_ext)     # worst of Cat2/Cat4
f_execution   = f_micro                       # Cat3
r_target      = Cat1 * Cat5 * f_environment * f_execution

# Correlated sensor collapse penalty:
degraded = count([f_structural < 0.8, f_micro < 0.8, f_ext < 0.8])
if degraded >= 2:
    r_target *= 0.5

11.4 Bounded Recovery Dynamics

# Fast attack (instant degradation), slow recovery (5%/minute max):
if r_target < last_r_total:
    r_final = r_target          # immediate drop
else:
    alpha = min(1.0, 0.02 * dt_min)
    step  = min(alpha * (r_target - last_r_total), 0.05 * dt_min)
    r_final = last_r_total + step

11.5 Posture Mapping

NOTE: Thresholds are deliberately TIGHTER than mathematical spec (safety buffer).

if Rm >= 0.90: APEX       # Full trading, no constraints
if Rm >= 0.75: STALKER    # Max leverage capped at 2.0x
if Rm >= 0.50: TURTLE     # regime_dd_halt = True (no new entries)
else:          HIBERNATE  # Force-close open positions, no new entries

11.6 Hysteresis

# Down: requires hysteresis_down=2 consecutive bars at lower level
# Up:   requires hysteresis_up=5 consecutive bars at higher level
# Prevents flip-flopping around thresholds

11.7 Posture → Engine Effect

Posture Engine Effect
APEX No constraint (max leverage = abs_max=6.0 × regime_size_mult)
STALKER clamped_max_leverage = min(..., 2.0) in _try_entry
TURTLE regime_dd_halt = Trueprocess_bar skips entry block
HIBERNATE _manage_position forces EXIT("HIBERNATE_HALT"), regime_dd_halt = True

12. MC-FOREWARNER ENVELOPE GATE

Source: Called via engine.set_mc_forewarner(forewarner, mc_base_cfg)

Runs daily at start of process_day():

mc_cfg = {**mc_base_cfg, 'max_leverage': base_max_leverage * day_base_boost}
mc_report = forewarner.assess_config_dict(mc_cfg)

mc_red    = mc_report.catastrophic_probability > 0.25 or mc_report.envelope_score < -1.0
mc_orange = (not mc_red) and (envelope_score < 0 or catastrophic_probability > 0.10)

day_mc_status = 'RED' | 'ORANGE' | 'OK'
day_mc_scale  = 0.5 if ORANGE else 1.0

if mc_red or posture in ['TURTLE', 'HIBERNATE']:
    regime_dd_halt = True   # No new entries for the entire day

13. NDAlphaEngine — FULL BAR LOOP

13.1 process_day() Flow

def process_day(date_str, df, asset_columns, vol_regime_ok, direction, posture):
    # --- Per-day initialization ---
    regime_direction = direction or -1     # -1=SHORT champion
    regime_dd_halt   = False
    day_posture      = posture
    
    # --- ACB: Scale 1 + Scale 3 (once per day) ---
    if acb:
        info = acb.get_dynamic_boost_for_date(date_str, ob_engine)
        day_base_boost = info['boost']    # log_0.5 curve
        day_beta       = info['beta']     # dynamic beta (w750 percentile switch)
    
    # --- MC-Forewarner gate (once per day) ---
    if forewarner:
        mc_report = forewarner.assess_config_dict(...)
        if mc_red or posture in [TURTLE, HIBERNATE]:
            regime_dd_halt = True   # kill new entries for day
    
    # === BAR LOOP ===
    for ri, row in df.iterrows():
        vd       = row['vel_div']            # primary signal
        v50_vel  = row['v50_lambda_max_velocity']
        v750_vel = row['v750_lambda_max_velocity']
        prices   = {col: row[col] for col in asset_columns if row[col] > 0}
        
        # Append to price_histories (capped at 200; sliding window)
        for ac, p in prices.items():
            price_histories[ac].append(p)
            if len(price_histories[ac]) > 500:
                price_histories[ac] = price_histories[ac][-200:]
        
        vol_ok = vol_regime_ok[ri] if vol_regime_ok else (bid >= 100)
        
        # ACB Scale 2: regime_size_mult updated every bar
        _update_regime_size_mult(vd)
        
        process_bar(global_bar_idx, vd, prices, vol_ok, price_histories, v50_vel, v750_vel)
        global_bar_idx += 1
    
    return {date, pnl, capital, boost, beta, mc_status, trades}

13.2 process_bar() Flow

def process_bar(bar_idx, vel_div, prices, vol_regime_ok, price_histories, v50_vel, v750_vel):
    bar_count += 1
    vel_div_history.append(vel_div)   # trimmed to 200
    
    # === EXIT MANAGEMENT (always first) ===
    if position is not None:
        exit_info = _manage_position(bar_idx, prices, vel_div, v50_vel, v750_vel)
        # → AlphaExitManager.evaluate() → if EXIT: _execute_exit()
    
    # === ENTRY (only when no position) ===
    if position is None AND bar_idx > last_exit_bar AND NOT regime_dd_halt:
        if bar_count >= lookback (100) AND vol_regime_ok:
            entry_info = _try_entry(bar_idx, vel_div, prices, price_histories, v50_vel, v750_vel)

13.3 _try_entry() Flow

def _try_entry(bar_idx, vel_div, prices, price_histories, v50_vel, v750_vel):
    if capital <= 0: return None
    
    # 1. IRP Asset Selection (Layer 2)
    if use_asset_selection:
        market_data = {a: history[-50:] for a, history in price_histories if len >= 50}
        rankings = asset_selector.rank_assets(market_data, regime_direction)
        trade_asset = first_asset_passing_all_gates(rankings)
        if trade_asset is None: return None   # strict: no fallback
    else:
        trade_asset = "BTCUSDT"   # fallback when IRP disabled
    
    # 2. Signal Generation + DC (Layer 6)
    signal = signal_gen.generate(vel_div, vel_div_history,
                                  price_histories[trade_asset],
                                  regime_direction, trade_asset)
    if not signal.is_valid: return None   # vel_div or DC killed it
    
    # 3. Position Sizing (Layers 7-8)
    size = bet_sizer.calculate_size(capital, vel_div, signal.vel_div_trend, regime_direction)
    
    # 4. OB Sub-3: Cross-asset market multiplier
    market_ob_mult = ob_engine.get_market_multiplier(...)  # ±20%
    
    # 5. ACB leverage ceiling enforcement
    clamped_max = min(base_max_leverage * regime_size_mult * market_ob_mult, abs_max_leverage=6.0)
    if posture == STALKER: clamped_max = min(clamped_max, 2.0)
    final_leverage = clamp(size.leverage * regime_size_mult * market_ob_mult, min_lev, clamped_max)
    
    # 6. Notional and entry
    notional = capital * size.fraction * final_leverage
    entry_price = prices[trade_asset]
    
    # 7. Create position
    position = NDPosition(trade_asset, regime_direction, entry_price,
                          notional, final_leverage, ...)
    exit_manager.setup_position(trade_id, entry_price, direction, bar_idx, v50_vel, v750_vel)

14. DOLPHIN ACTOR — NAUTILUS INTEGRATION

Source: nautilus_dolphin/nautilus_dolphin/nautilus/dolphin_actor.py Base: nautilus_trader.trading.strategy.Strategy (Rust/Cython core) Lines: 338

14.1 Lifecycle (v2 — step_bar API)

__init__:
    dolphin_config, engine=None, hz_client=None
    current_date=None, posture='APEX', _processed_dates=set()
    _pending_acb: dict|None = None
    _acb_lock = threading.Lock()           ← v2: explicit lock (not GIL reliance)
    _stale_state_events = 0
    _day_data = None, _bar_idx_today = 0

on_start():
    1. _connect_hz() → HazelcastClient(cluster="dolphin", members=["localhost:5701"])
    2. _read_posture() → DOLPHIN_SAFETY (CP AtomicRef, map fallback)
    3. _setup_acb_listener() → add_entry_listener(DOLPHIN_FEATURES["acb_boost"])
    4. create_boost_engine(mode=boost_mode, **engine_kwargs) → NDAlphaEngine
    5. MC-Forewarner injection (gold-performance stack — always active):
           mc_models_dir = config.get('mc_models_dir', _MC_MODELS_DIR_DEFAULT)
           if Path(mc_models_dir).exists():
               forewarner = DolphinForewarner(models_dir=mc_models_dir)
               engine.set_mc_forewarner(forewarner, _MC_BASE_CFG)
           ← graceful degradation: logs warning + continues if models missing
           ← disable explicitly: set mc_models_dir=None/'' in config

on_bar(bar):
    ① Drain ACB under _acb_lock:
        pending = _pending_acb; _pending_acb = None  ← atomic swap
        if pending: engine.update_acb_boost(boost, beta)

    ② Date boundary:
        date_str = datetime.fromtimestamp(bar.ts_event/1e9, UTC).strftime('%Y-%m-%d')
        if current_date != date_str:
            if current_date: engine.end_day()
            current_date = date_str
            posture = _read_posture()
            _bar_idx_today = 0
            engine.begin_day(date_str, posture=posture, direction=±1)
            if not live_mode: _load_parquet_data(date_str) → _day_data

    ③ HIBERNATE guard: if posture=='HIBERNATE': return  ← hard skip, no step_bar

    ④ Feature extraction:
        live_mode=False → if _day_data empty: return  ← early exit, no step_bar with zeros
                          elif _bar_idx_today >= len(df): return  ← end-of-day
                          else: row = df.iloc[_bar_idx_today], vol_regime_ok = (idx>=100)
        live_mode=True  → _get_latest_hz_scan(), staleness check (>10s → warning),
                          dedup on scan_number

    ⑤ _GateSnap BEFORE: (acb_boost, acb_beta, posture, mc_gate_open)

    ⑥ engine.pre_bar_proxy_update(inst50, v750_vel)  ← if ProxyBoostEngine

    ⑦ result = engine.step_bar(bar_idx, vel_div, prices, v50_vel, v750_vel, vol_regime_ok)
       _bar_idx_today += 1

    ⑧ _GateSnap AFTER: compare → if changed: stale_state_events++, result['stale_state']=True

    ⑨ _write_result_to_hz(date_str, result)

on_stop():
    _processed_dates.clear()
    _stale_state_events = 0
    if hz_client: hz_client.shutdown()

14.2 Thread Safety: ACB Pending-Flag Pattern (v2)

CRITICAL: HZ entry listeners run on HZ client pool threads, NOT the Nautilus event loop.

# HZ listener thread — parse outside lock, assign inside lock:
def _on_acb_event(event):
    try:
        val = event.value
        if val:
            parsed = json.loads(val)          # CPU work OUTSIDE lock
            with self._acb_lock:
                self._pending_acb = parsed    # atomic write under lock
    except Exception as e:
        self.log.error(f"ACB event parse error: {e}")

# Nautilus event loop — drain under lock, apply outside lock:
def on_bar(bar):
    with self._acb_lock:
        pending = self._pending_acb
        self._pending_acb = None              # atomic consume under lock
    if pending is not None and self.engine is not None:
        boost = float(pending.get('boost', 1.0))
        beta  = float(pending.get('beta',  0.0))
        self.engine.update_acb_boost(boost, beta)

v2 vs v1: v1 relied on GIL for safety (bare dict assignment). v2 uses explicit threading.Lock — correct even if GIL is removed in future Python versions. Lock hold time is minimized to a single pointer swap.

14.3 _GateSnap — Stale-State Detection

New in v2. Detects when ACB boost, posture, or MC gate changes between the pre-step and post-step snapshot:

_GateSnap = namedtuple('_GateSnap', ['acb_boost', 'acb_beta', 'posture', 'mc_gate_open'])

before = _GateSnap(engine._day_base_boost, engine._day_beta, posture, engine._mc_gate_open)
result = engine.step_bar(...)
after  = _GateSnap(engine._day_base_boost, engine._day_beta, _read_posture(), engine._mc_gate_open)

if before != after:
    self._stale_state_events += 1
    self.log.warning(f"[STALE_STATE] gate changed mid-eval: {changed_fields}")
    result['stale_state'] = True   # flagged in HZ write — DO NOT use for live orders

14.4 Replay vs Live Mode

Replay Mode (live_mode=False) Live Mode (live_mode=True)
Data source vbt_cache_klines/YYYY-MM-DD.parquet DOLPHIN_FEATURES["latest_eigen_scan"] (HZ)
Per-bar iteration df.iloc[_bar_idx_today] One bar = one HZ scan fetch
vol_regime_ok bar_idx >= 100 (warmup) From scan dict
Stale guard abs(now_ns - scan_ts_ns) > 10s → warning
Dedup scan_num == last_scan_number → skip

14.5 Data Loading (Replay)

def _load_parquet_data(date_str):
    path = HCM_DIR / "vbt_cache_klines" / f"{date_str}.parquet"
    df = pd.read_parquet(path)
    meta_cols = {vel_div, scan_number, v50_..., v750_..., instability_50, instability_150}
    asset_columns = [c for c in df.columns if c not in meta_cols]
    return df, asset_columns, None   # vol_regime_ok deferred to on_bar warmup check

14.6 Posture Reading

Primary: HZ CP Subsystem AtomicReference('DOLPHIN_SAFETY') — linearizable. Fallback: HZ IMap('DOLPHIN_SAFETY').get('latest') — eventually consistent. Default when HZ unavailable: 'APEX' (non-fatal degradation).

14.7 Result Writing

def _write_result_to_hz(date_str, result):
    if not self.hz_client: return   # silent noop
    imap_pnl = hz_client.get_map('DOLPHIN_PNL_BLUE').blocking()
    imap_pnl.put(date_str, json.dumps(result))
    if result.get('stale_state'):
        self.log.error("[STALE_STATE] DO NOT use for live order submission")
    # result: {date, pnl, capital, boost, beta, mc_status, trades, stale_state?}

14.8 Important Notes for Callers

  • actor.log is read-only (Rust-backed Cython property). Never try to assign actor.log = MagicMock() in tests — use the real Nautilus logger instead.
  • actor.posture is a regular Python attribute (writable in tests).
  • actor.engine is set in on_start(). Tests can set directly after __init__.

15. HAZELCAST — FULL IMAP SCHEMA

Hazelcast is the system memory. All subsystem state flows through it. Every consumer must treat HZ maps as authoritative real-time sources.

Infrastructure: Hazelcast 5.3, Docker (prod/docker-compose.yml), localhost:5701, cluster "dolphin". CP Subsystem: Enabled — required for ACB atomic operations. Management Center: http://localhost:8080. Python client: hazelcast-python-client 5.6.0 (siloqy-env).

15.1 Complete IMap Reference

Map Key Value Writer Reader(s) Notes
DOLPHIN_SAFETY "latest" JSON {posture, Rm, sensors, ...} system_watchdog_service.py DolphinActor, paper_trade_flow, nautilus_prefect_flow CP AtomicRef preferred; IMap fallback
DOLPHIN_FEATURES "acb_boost" JSON {boost, beta} acb_processor_service.py DolphinActor (HZ entry listener) Triggers _on_acb_event
DOLPHIN_FEATURES "latest_eigen_scan" JSON {vel_div, scan_number, asset_prices, timestamp_ns, w50_velocity, w750_velocity, instability_50} Eigenvalue scanner bridge DolphinActor (live mode) Dedup on scan_number
DOLPHIN_PNL_BLUE "YYYY-MM-DD" JSON daily result {pnl, capital, trades, boost, beta, mc_status, posture, stale_state?} paper_trade_flow, DolphinActor._write_result_to_hz, nautilus_prefect_flow Analytics stale_state=True means DO NOT use for live orders
DOLPHIN_PNL_GREEN "YYYY-MM-DD" JSON daily result paper_trade_flow (green) Analytics GREEN config only
DOLPHIN_STATE_BLUE "latest" JSON {strategy, capital, date, pnl, trades, peak_capital, drawdown, engine_state, updated_at} paper_trade_flow paper_trade_flow (capital restore) Full engine_state for position continuity
DOLPHIN_STATE_BLUE "latest_nautilus" JSON {strategy, capital, date, pnl, trades, posture, param_hash, engine, updated_at} nautilus_prefect_flow nautilus_prefect_flow (capital restore) param_hash = champion SHA256[:16]
DOLPHIN_STATE_BLUE "state_{strategy}_{date}" JSON per-run snapshot paper_trade_flow Recovery Full historical per-run snapshots
DOLPHIN_HEARTBEAT "nautilus_flow_heartbeat" JSON {ts, iso, run_date, phase, flow} nautilus_prefect_flow (heartbeat_task) External monitoring Written at flow_start, engine_start, flow_end
DOLPHIN_HEARTBEAT "probe_ts" Timestamp string nautilus_prefect_flow (hz_probe_task) Liveness check Written at HZ probe time
DOLPHIN_OB per-asset key JSON OB snapshot obf_prefect_flow HZOBProvider Raw OB map
DOLPHIN_FEATURES_SHARD_00 symbol JSON OB feature dict {imbalance, fill_probability, depth_quality, regime_signal, ...} obf_prefect_flow HZOBProvider shard routing (see §15.2)
DOLPHIN_FEATURES_SHARD_01..09 symbol Same schema obf_prefect_flow HZOBProvider
DOLPHIN_SIGNALS signal key Signal distribution signal_bridge.py Strategy consumers

15.2 OBF Shard Routing

SHARD_COUNT = 10
shard_idx = sum(ord(c) for c in symbol) % SHARD_COUNT
imap_name = f"DOLPHIN_FEATURES_SHARD_{shard_idx:02d}"   # ..._00 through ..._09

Routing is stable (sum-of-ord, not hash()) — deterministic across Python versions and process restarts. 400+ assets distribute evenly across 10 shards.

15.3 ShardedFeatureStore API

Source: hz_sharded_feature_store.py, ShardedFeatureStore

store = ShardedFeatureStore(hz_client)
store.put('BTCUSDT', 'vel_div', -0.03)   # routes to shard based on symbol hash
val  = store.get('BTCUSDT', 'vel_div')
store.delete('BTCUSDT', 'vel_div')
# Internal key format: "vel_div_BTCUSDT"

Near cache config: TTL=300s, invalidate_on_change=True, LRU eviction, max_size=5000 per shard.

15.4 HZOBProvider — Dynamic Asset Discovery

# On connect (lazy), discovers which assets are present in any shard:
for shard_idx in range(SHARD_COUNT):
    key_set = client.get_map(f"DOLPHIN_FEATURES_SHARD_{shard_idx:02d}").blocking().key_set()
    discovered_assets.update(key_set)

No static asset list required — adapts automatically as OBF flow adds/removes assets.

15.5 CP Subsystem (ACB Processor)

acb_processor_service.py uses HZ CP FencedLock to prevent simultaneous ACB writes from multiple instances. CP Subsystem must be enabled in docker-compose.yml. All writers must use the same CP lock name to get protection.

15.6 OBF Circuit Breaker (HZ Push)

After 5 consecutive HZ push failures, OBF flow opens a circuit breaker and switches to file-only mode (ob_cache/latest_ob_features.json). Consumers should prefer the JSON file during HZ outages.


16. PRODUCTION DAEMON TOPOLOGY

16.1 ACB Processor Service (acb_processor_service.py)

Purpose: Computes ACBv6 daily boost + dynamic beta from NG3 NPZ files → writes to HZ DOLPHIN_FEATURES["acb_boost"]. Schedule: Once at market open (08:00 UTC), re-runs on new NG3 scan batch arrival. HZ: Uses CP FencedLock to prevent simultaneous writes. Output: DOLPHIN_FEATURES.put('acb_boost', json.dumps({boost, beta}))

16.2 OBF Hot Loop (obf_prefect_flow.py)

Purpose: Binance WS order book ingestion → 4-subsystem feature computation → HZ push + JSON file cache. Cadence: ~500ms per cycle (hardened Sprint 1). Resilience: Circuit breaker (5 failures → file-only mode), stall watchdog, crossed-book guard, dark streak detection, first flush at 60s. HZ: Fire-and-forget per-asset async push to DOLPHIN_FEATURES_SHARD_*. File fallback: /mnt/dolphinng5_predict/ob_cache/latest_ob_features.json (atomic write).

16.3 System Watchdog Service (system_watchdog_service.py)

Purpose: 5-sensor Rm computation → writes posture to DOLPHIN_SAFETY. Cadence: ~60s per cycle. Output: DOLPHIN_SAFETY AtomicReference (primary) or IMap (fallback).

16.4 ExF Daemon (exf_fetcher_simple.py)

Purpose: Live external factors daemon — fetches funding rate, DVOL, Fear&Greed, taker ratio → pushes to HZ for ACBv6 Scale 1.

16.5 MC-Forewarner Flow (mc_forewarner_flow.py)

Purpose: Prefect-orchestrated daily ML assessment. Outcome: OK / ORANGE / RED → HZ. Effect: ORANGE → day_mc_scale=0.5. RED → regime_dd_halt=True.

16.6 paper_trade_flow.py (Primary — 00:05 UTC)

Purpose: Daily NDAlphaEngine run. Loads klines, wires ACB+OB+MC, runs begin_day/step_bar/end_day, persists PnL + state to HZ. Direction: direction = -1 (SHORT, blue). GREEN (LONG) is separate config.

16.7 Daemon Start Sequence

1. docker-compose up -d          ← Hazelcast 5701, ManCenter 8080, Prefect 4200
2. acb_processor_service.py      ← writes initial ACB boost before market open
3. obf_prefect_flow.py           ← start OBF WS ingestion (Prefect worker)
4. exf_fetcher_simple.py         ← ExF live daemon
5. system_watchdog_service.py    ← begin posture computation
6. mc_forewarner_flow.py         ← Prefect deployment (daily)
7. paper_trade_flow.py           ← 00:05 UTC (Prefect deployment)
8. nautilus_prefect_flow.py      ← 00:10 UTC (Prefect deployment)
   ↑ Start actor flows LAST — they read everything above from HZ

16.8 Monitoring Endpoints

Service URL / Command
Hazelcast Management Center http://localhost:8080
Prefect UI http://localhost:4200
Prefect UI (Tailscale external) http://100.105.170.6:4200
Daily PnL HZ IMap DOLPHIN_PNL_BLUE[YYYY-MM-DD]
Posture HZ AtomicRef DOLPHIN_SAFETY
ACB State HZ IMap DOLPHIN_FEATURES["acb_boost"]
Nautilus heartbeat HZ IMap DOLPHIN_HEARTBEAT["nautilus_flow_heartbeat"]

17. PREFECT ORCHESTRATION LAYER

Version: Prefect 3.6.22 (siloqy-env) Server: http://localhost:4200/api Work pool: dolphin (process type) Worker command: prefect worker start --pool dolphin --type process

17.1 Registered Deployments

Deployment Flow Schedule Config
dolphin-paper-blue paper_trade_flow.py 0 0 * * * (00:05 UTC) configs/blue.yml
dolphin-paper-green paper_trade_flow.py 0 0 * * * (00:05 UTC) configs/green.yml
dolphin-nautilus-blue nautilus_prefect_flow.py 10 0 * * * (00:10 UTC) configs/blue.yml

17.2 nautilus_prefect_flow.py — Nautilus BacktestEngine Supervisor

New in v2. Tasks in execution order:

hz_probe_task             retries=3  timeout=30s   — verify HZ reachable; abort on failure
validate_champion_params  retries=0  timeout=10s   — SHA256 hash vs FROZEN params; ValueError on drift
load_bar_data_task        retries=2  timeout=120s  — load vbt_cache_klines parquet; validate vel_div col
read_posture_task         retries=2  timeout=20s   — read DOLPHIN_SAFETY
restore_capital_task      retries=2  timeout=20s   — restore capital from DOLPHIN_STATE_BLUE
  → HIBERNATE? skip engine, write result, heartbeat, return
run_nautilus_backtest_task retries=0 timeout=600s  — BacktestEngine + DolphinActor full cycle
write_hz_result_task      retries=3  timeout=30s   — DOLPHIN_PNL_BLUE + DOLPHIN_STATE_BLUE write
heartbeat_task            retries=0  timeout=15s   — phase=flow_end

Champion integrity: _CHAMPION_HASH = sha256(json.dumps(_CHAMPION_PARAMS, sort_keys=True))[:16]. Computed at import time. Any config drift triggers ValueError before engine starts.

Capital continuity: Restores from DOLPHIN_STATE_BLUE["latest_nautilus"]. Falls back to initial_capital (25,000 USDT) if absent.

17.3 paper_trade_flow.py — Task Reference

Task Retries Purpose
load_config 0 YAML config load
load_day_scans 2 Parquet (preferred) or JSON fallback; vel_div validation
run_engine_day 0 begin_day/step_bar×N/end_day; returns daily stats
write_hz_state 3 DOLPHIN_STATE_BLUE + DOLPHIN_PNL_BLUE persist
log_pnl 0 Disk JSONL append (paper_logs/{color}/)

17.4 Registration Commands

source /home/dolphin/siloqy_env/bin/activate
PREFECT_API_URL=http://localhost:4200/api

python prod/paper_trade_flow.py --register        # blue + green paper deployments
python prod/nautilus_prefect_flow.py --register   # nautilus blue deployment

17.5 Manual Run

# Paper trade:
python prod/paper_trade_flow.py --config prod/configs/blue.yml --date 2026-03-21

# Nautilus supervisor:
python prod/nautilus_prefect_flow.py --date 2026-03-21

# Dry-run (data + param validation, no engine):
python prod/nautilus_prefect_flow.py --date 2026-03-21 --dry-run

18. CI TEST SUITE

18.1 Test Suites Overview

Suite Location Runner Gate
Nautilus bootstrap nautilus_dolphin/tests/test_0_nautilus_bootstrap.py pytest nautilus_dolphin/tests/test_0_nautilus_bootstrap.py -v 11/11
DolphinActor nautilus_dolphin/tests/test_dolphin_actor.py pytest nautilus_dolphin/tests/test_dolphin_actor.py -v 35/35
OBF unit tests tests/test_obf_unit.py pytest tests/test_obf_unit.py -v ~120/~120
Legacy CI ci/ directory pytest ci/ -v 14/14

Total: 46 Nautilus tests + ~120 OBF unit tests + 14 legacy CI = ~180 tests green.

18.2 Nautilus Bootstrap Tests (11 tests)

test_0_nautilus_bootstrap.py — foundation sanity checks:

  • Nautilus import, catalog construction, Bar/BarType creation
  • DolphinActor instantiation without full kernel (uses __new__ + __init__ pattern)
  • Champion config loading from blue.yml
  • HZ connectivity probe (skip if HZ unavailable)
  • BacktestEngine construction with DolphinActor registered

18.3 DolphinActor Tests (35 tests, 8 classes)

test_dolphin_actor.py — full behavioral coverage:

Class Tests What It Covers
TestChampionParamInvariants 6 Config loading, SHA256 hash stability, frozen param values, blue.yml parity
TestACBPendingFlagThreadSafety 5 Lock acquisition, JSON parse outside lock, dict assign inside lock, concurrent event safety
TestHibernatePostureGuard 3 HIBERNATE skips engine entirely, APEX/STALKER/TURTLE pass through, posture gate logic
TestDateChangeHandling 5 Date rollover triggers end_day/begin_day, once-per-date guard, bar_idx reset
TestHZUnavailableDegradation 4 HZ down → engine continues with stale OB features; heartbeat errors silenced; file fallback
TestReplayModeBarTracking 3 bar_idx increments per step_bar call; total_bars_processed correct; replay vs live mode flag
TestOnStopCleanup 4 on_stop writes final HZ result; HZ down on stop is non-fatal; engine state serialized
TestStaleStateGuard 5 _GateSnap detects mid-eval posture/acb changes; snap mismatch triggers abort; re-eval on next bar

Critical implementation note: actor.log is a Cython/Rust-backed read-only property on Actor. Do NOT attempt actor.log = MagicMock() — raises AttributeError: attribute 'log' of ... objects is not writable. The real Nautilus logger is initialized by super().__init__() and works in test context.

18.4 Legacy CI Tests (14 tests)

Location: ci/ directory. Runner: pytest ci/ -v

File Tests What It Covers
test_13_nautilus_integration.py 6 Actor import, instantiation, on_bar, HIBERNATE posture, once-per-day guard, ACB thread safety
test_14_long_system.py 3 Multi-day run, capital persistence, trade count
test_15_acb_reactive.py 1 ACB boost update applied correctly mid-day
test_16_scaling.py 4 Memory footprint <4GB (50 assets), shard routing (400 symbols), 400-asset no-crash, 400-asset with IRP

18.5 Key Test Patterns

ACB pending-flag pattern (ThreadSafety test):

# JSON parse OUTSIDE lock, dict assign INSIDE lock
with patch.object(actor.engine, 'update_acb_boost') as mock_update:
    actor._on_acb_event(event)
    assert actor._pending_acb['boost'] == 1.35
    mock_update.assert_not_called()  # engine NOT called from listener thread

Date rollover pattern (DateChange test):

# Fires 3 bars on same date → assert begin_day.call_count == 1
# Fires 1 bar on next date  → assert begin_day.call_count == 2, end_day.call_count == 1

_GateSnap stale-state detection:

# Snap taken at start of step_bar; posture changes mid-eval → abort, retry next bar
snap = actor._gate_snap  # namedtuple(acb_boost, acb_beta, posture, mc_gate_open)

19. PARAMETER REFERENCE

19.1 Champion Parameters (Frozen)

Parameter Value Layer Notes
vel_div_threshold -0.02 Signal gate PRIMARY entry threshold
vel_div_extreme -0.05 Signal/sizing Extreme regime: full size
min_leverage 0.5 Sizing Floor leverage
max_leverage 5.0 Sizing Base ceiling (before ACB)
abs_max_leverage 6.0 Sizing Hard geometric ceiling
leverage_convexity 3.0 Sizing Cubic convex curve
fraction 0.20 Sizing Max capital fraction per trade
fixed_tp_pct 0.0095 Exit 95 bps take-profit
stop_pct 1.0 Exit Effectively disabled
max_hold_bars 120 Exit 600 seconds
dc_lookback_bars 7 DC 35 seconds price momentum
dc_min_magnitude_bps 0.75 DC Minimum BTC momentum
dc_skip_contradicts True DC Hard skip on contradiction
min_irp_alignment 0.45 IRP Alignment gate
sp_maker_entry_rate 0.62 Fees 62% maker fill at entry
sp_maker_exit_rate 0.50 Fees 50% maker fill at exit
ob_edge_bps 5.0 OB Legacy MC OB edge
ob_confirm_rate 0.40 OB Legacy MC confirmation rate
lookback 100 Warmup Bars before first entry allowed
seed 42 RNG Deterministic numpy RandomState

19.2 ACBv6 Parameters (Frozen — Validated)

Parameter Value Notes
BETA_HIGH 0.8 w750 above p60 threshold
BETA_LOW 0.2 w750 below p60 threshold
W750_THRESHOLD_PCT 60 Percentile switch point
FUNDING_VERY_BEARISH -0.0001 1.0 signal
DVOL_EXTREME 80 1.0 signal
FNG_EXTREME_FEAR 25 1.0 signal (needs confirmation)
TAKER_SELLING 0.8 1.0 signal

19.3 Survival Stack Thresholds (Deliberately Tight)

Posture Rm Threshold vs. Math Spec
APEX ≥ 0.90 Tighter — spec was 0.85
STALKER ≥ 0.75 Tighter — spec was 0.70
TURTLE ≥ 0.50 Tighter — spec was 0.45
HIBERNATE < 0.50

Do NOT loosen these without quantitative justification.


20. OBF SPRINT 1 HARDENING

Completed: 2026-03-22. All 25 items in AGENT_TODO_PRIORITY_FIXES_AND_TODOS.md addressed.

20.1 P0/P1/P2 Hardening (Production Safety)

Item Change Severity
Circuit breaker 5 consecutive HZ push failures → exponential backoff + file-only fallback P0
Crossed-book guard Ask ≤ bid on incoming feed → discard snapshot, log warning, continue P0
Dark streak detector N consecutive zero-volume bars → emit STALE_DATA warning P1
First flush delay No OB features published until 60s after startup (warmup) P1
Stall watchdog No new bar for STALL_TIMEOUT seconds → alert + optional restart P1
Fire-and-forget HZ push HZ write moved to background thread; hot loop never blocks on HZ P2
Dynamic asset discovery hzobprovider discovers active symbols from HZ at runtime; no hardcoded list P2
Per-timestamp macro map latest_macro_at_ts keyed by bar timestamp; resolves stale-read race on fast replays P2

20.2 P3 Infrastructure Items

Item Status
scripts/verify_parquet_archive.py — validates all daily parquet files for schema and row count DONE
ob_cache/SCHEMA.md — authoritative JSON schema for latest_ob_features.json DONE
P3-1 / P3-5 / P3-6 — out of scope for sprint 1, deferred SKIPPED

20.3 OBF Architecture Post-Sprint

Binance WS feed
    ↓
obf_prefect_flow.py (hot loop, ~100ms cadence)
    ├── Crossed-book guard → discard if ask ≤ bid
    ├── Dark streak detector → N zero-vol bars
    ├── First flush delay → 60s warmup
    ├── Feature compute (depth imbalance, spread, vwap, pressure ratio)
    ├── Per-timestamp macro map update
    ├── Fire-and-forget HZ push (background thread)
    │       └── Circuit breaker (5 failures → file-only)
    └── ob_cache/latest_ob_features.json (local fallback)

20.4 Test Coverage

tests/test_obf_unit.py — ~120 unit tests covering all hardening items:

  • Circuit breaker state machine (CLOSED → OPEN → HALF-OPEN)
  • Crossed-book guard triggers on malformed data
  • Dark streak threshold detection
  • Warmup period gating
  • Background thread non-blocking behavior
  • Asset discovery via HZ key scan

21. KNOWN RESEARCH TODOs

ID Description Priority
TODO-1 Calibrate vd_enabled adverse-turn exits (currently disabled). Requires analysis of trade vel_div distribution at entry vs. subsequent bars. True invalidation threshold likely ~+0.02 sustained for N=3 bars. MEDIUM
TODO-2 Validate SUBDAY_ACB force-exit threshold (old_boost >= 1.25 and boost < 1.10). Currently ARBITRARY — agent-chosen, not backtest-derived. MEDIUM
TODO-3 MIG8: Binance live adapter (real order execution). OUT OF SCOPE until after 30-day paper trading validation. LOW
TODO-4 48-hour chaos test with all daemons running simultaneously. Watch for: KeyError, stale-read anomalies, concurrent HZ writer collisions. HIGH (before live capital)
TODO-5 Memory profiler with IRP enabled at 400 assets (current 71 MB measurement was without IRP). Projected ~600 MB — verify. LOW
TODO-6 TF-spread recovery exits (tf_enabled=False). Requires sweep of tf_exhaust_ratio and tf_flip_ratio vs. champion backtest. LOW
TODO-7 GREEN (LONG) posture paper validation. LONG thresholds (long_threshold=0.01, long_extreme=0.04) not yet production-validated. MEDIUM
TODO-8 ML-MC Forewarner injection into nautilus_prefect_flow.py. DONE 2026-03-22 — wired in DolphinActor.on_start() for both flows. CLOSED
TODO-9 Live TradingNode integration (launcher.py exists; Binance adapter config incomplete). Requires 30-day clean paper run first. LOW

22. 0.1S RESOLUTION — READINESS ASSESSMENT

Assessment date: 2026-03-22. Status: BLOCKED — 3 hard blockers.

The current system processes 5s OHLCV bars. Upgrading to 0.1s tick resolution requires resolving all three blockers below before any code changes.

22.1 Blocker 1 — Async HZ Push

Problem: The OBF hot loop fires at ~100ms cadence. At 0.1s resolution, the per-bar HZ write latency (currently synchronous in feature compute path, despite fire-and-forget for the push itself) would exceed bar cadence, causing HZ write queue growth and eventual OOM.

Required: Full async HZ client (hazelcast-python-client async API or aiohazelcast). Currently all HZ operations are synchronous blocking calls. Estimated effort: 23 days of refactor + regression testing.

22.2 Blocker 2 — get_depth Timeout

Problem: get_depth() in HZOBProvider issues a synchronous HZ IMap.get() call with a 500ms timeout. At 0.1s resolution, each bar would wait up to 500ms for OB depth data — 5× the bar cadence. This makes 0.1s resolution impossible without an in-process depth cache.

Required: Pre-fetched depth cache (e.g., local dict refreshed by a background subscriber), making get_depth() a pure in-process read with <1µs latency. Estimated effort: 12 days.

22.3 Blocker 3 — Lookback Recalibration

Problem: All champion parameters that reference "bars" were validated against 5s bars:

  • lookback=100 (100 × 5s = 500s warmup)
  • max_hold_bars=120 (120 × 5s = 600s max hold)
  • dc_lookback_bars=7 (7 × 5s = 35s DC window)

At 0.1s resolution, the same bar counts would mean 10s warmup, 12s max hold, 0.7s DC window — completely invalidating champion params. All params must be re-validated from scratch via VBT backtest at 0.1s resolution.

Required: Full backtest sweep at 0.1s. Estimated effort: 12 weeks of compute + validation time. This is a research milestone, not an engineering task.

22.4 Assessment Summary

Blocker Effort Dependency
Async HZ push 23 days engineering None — can start now
get_depth cache 12 days engineering None — can start now
Lookback recalibration 12 weeks research Requires blockers 1+2 resolved first

Recommendation: Do NOT attempt 0.1s resolution until after 30-day paper trading validation at 5s. The engineering blockers can be prototyped in parallel, but champion params cannot be certified until post-paper-run stability is confirmed.

23. SIGNAL PATH VERIFICATION SPECIFICATION

Testing the asynchronous, multi-scale signal path requires systematic validation of the data bridge and cross-layer trigger logic.

23.1 Verification Flow

A local agent (Prefect or standalone) should verify:

  1. Micro Ingestion: 100ms OB features sharded across 10 HZ maps.
  2. Regime Bridge: NG5 Arrow scan detection by scan_hz_bridge.py and push to latest_eigen_scan.
  3. Strategy Reactivity: DolphinActor.on_bar (5s) pulling HZ data and verifying scan_number idempotency.
  4. Macro Safety: Survival Stack Rm-computation pushing APEX/STALKER/HIBERNATE posture to DOLPHIN_SAFETY.

23.2 Reference Document

Full test instructions, triggers, and expected values are defined in: TODO_CHECK_SIGNAL_PATHS.md (Project Root)


End of DOLPHIN-NAUTILUS System Bible v3.0 — 2026-03-23 Champion: SHORT only (APEX posture, blue configuration) Automation: Prefect-supervised paper trading active. Status: Capital Sync enabled; Friction SP-bypass active; TradeLogger running. Do NOT deploy real capital until 30-day paper run is clean.