DOLPHIN/prod/extract_spec.py
hjnormey 01c19662cb initial: import DOLPHIN baseline 2026-04-21 from dolphinng5_predict working tree
Includes core prod + GREEN/BLUE subsystems:
- prod/ (BLUE harness, configs, scripts, docs)
- nautilus_dolphin/ (GREEN Nautilus-native impl + dvae/ preserved)
- adaptive_exit/ (AEM engine + models/bucket_assignments.pkl)
- Observability/ (EsoF advisor, TUI, dashboards)
- external_factors/ (EsoF producer)
- mc_forewarning_qlabs_fork/ (MC regime/envelope)

Excludes runtime caches, logs, backups, and reproducible artifacts per .gitignore.
2026-04-21 16:58:38 +02:00

18 lines
520 B
Python
Executable File

import os

from pypdf import PdfReader

pdf_path = "NAUTILUS-DOLPHIN Prod System Spec_ Python_Hazelcast Upgrade.pdf"
out_path = "extracted_spec.txt"

if not os.path.exists(pdf_path):
    print(f"Error: {pdf_path} not found.")
else:
    reader = PdfReader(pdf_path)
    # Collect the extracted text of each page in document order.
    text = []
    for page in reader.pages:
        text.append(page.extract_text())
    # Write all pages to one UTF-8 file, separated by a blank line.
    with open(out_path, "w", encoding="utf-8") as f:
        f.write("\n\n".join(text))
    print(f"Successfully extracted {len(reader.pages)} pages to {out_path}.")
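
The script above could also be wrapped as a reusable function. The following is a minimal sketch under stated assumptions, not code from the repository. The names `join_page_texts` and `extract_pdf_to_text` are hypothetical, and `pypdf` is assumed to be installed. Unlike the script, it raises `FileNotFoundError` instead of printing an error, returns the page count instead of printing it, and defensively treats a `None` page result as an empty string (pypdf normally returns a string, so this is only a precaution).

```python
from pathlib import Path
from typing import Iterable, Optional


def join_page_texts(texts: Iterable[Optional[str]], sep: str = "\n\n") -> str:
    """Join per-page text with `sep`, treating a None entry as an empty page."""
    return sep.join(t or "" for t in texts)


def extract_pdf_to_text(pdf_path: str, out_path: str) -> int:
    """Write the text of every page in pdf_path to out_path; return the page count.

    Hypothetical helper: raises FileNotFoundError if pdf_path does not exist.
    """
    if not Path(pdf_path).exists():
        raise FileNotFoundError(pdf_path)

    # Imported lazily so join_page_texts stays usable without pypdf installed.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    text = join_page_texts(page.extract_text() for page in reader.pages)
    Path(out_path).write_text(text, encoding="utf-8")
    return len(reader.pages)
```

A caller might use it as `n = extract_pdf_to_text(pdf_path, out_path)` and decide for itself how to report a missing file, which keeps the extraction logic separate from the console output.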