LevelUp isn't pre-authored content. REACTOR — our 9-agent pipeline — designs each challenge, a deterministic validator vets it, you work it in a real Docker sandbox, and the whole platform recalibrates nightly.
The Designer agent drafts a challenge brief — a vulnerable web app, an intrusion scenario, a malicious binary, a smart contract with a subtle bug. Static Analysis catches obvious generator slop before build. Validator and Calibrator iterate until the brief hits the target ELO band.
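A rough sketch of that Validator and Calibrator loop, for the curious. Everything named here is hypothetical: `Brief`, its single `difficulty` knob, the `estimate_elo` callback, and the step size are illustrative assumptions, not REACTOR's actual interfaces.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass(frozen=True)
class Brief:
    """Hypothetical stand-in for a challenge brief."""
    title: str
    difficulty: float  # assumed single knob the pipeline can turn


def calibrate(
    brief: Brief,
    estimate_elo: Callable[[Brief], float],  # assumed rating estimator
    band: Tuple[float, float],
    max_rounds: int = 8,
    step: float = 0.1,
) -> Optional[Brief]:
    """Nudge the brief until its estimated rating lands inside the band.

    Returns the calibrated brief, or None if the band was not reached
    within max_rounds (the draft would then be dropped, not shipped).
    """
    low, high = band
    for _ in range(max_rounds):
        rating = estimate_elo(brief)
        if low <= rating <= high:
            return brief
        # Too easy: raise difficulty. Too hard: lower it.
        delta = step if rating < low else -step
        brief = Brief(brief.title, brief.difficulty + delta)
    return None
```

A real calibrator would adjust far more than one number, but the shape is the same: estimate, compare against the band, revise, repeat, and give up after a bounded number of rounds.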
Or paste a breach report URL — REACTOR reconstructs it as a multi-stage scenario, not a single flag.
LEVELUP{...}
Every draft runs through the real state machine in orchestrator.py. We don't ship a broken sandbox. We don't ship an unsolvable one. And we definitely don't ship one an LLM can write up from public data.
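To make "state machine" concrete, here is a minimal sketch. It is not the contents of orchestrator.py: the stage names are inferred from the steps described on this page, and `Stage`, `NEXT_STAGE`, and `run_pipeline` are hypothetical names chosen for illustration.

```python
from enum import Enum, auto
from typing import Callable, Dict, List


class Stage(Enum):
    # Assumed stages, inferred from the pipeline described above.
    DRAFTED = auto()
    STATIC_ANALYSIS = auto()
    BUILD = auto()
    VALIDATION = auto()
    CALIBRATION = auto()
    PUBLISHED = auto()
    REJECTED = auto()


# Each non-terminal stage has exactly one successor on success.
NEXT_STAGE: Dict[Stage, Stage] = {
    Stage.DRAFTED: Stage.STATIC_ANALYSIS,
    Stage.STATIC_ANALYSIS: Stage.BUILD,
    Stage.BUILD: Stage.VALIDATION,
    Stage.VALIDATION: Stage.CALIBRATION,
    Stage.CALIBRATION: Stage.PUBLISHED,
}


def run_pipeline(
    draft: dict,
    checks: Dict[Stage, Callable[[dict], bool]],
) -> List[Stage]:
    """Walk a draft through the stages and return the path it took.

    A failed check at any stage sends the draft to REJECTED; a stage
    with no registered check simply advances.
    """
    stage = Stage.DRAFTED
    history = [stage]
    while stage in NEXT_STAGE:
        check = checks.get(stage)
        if check is not None and not check(draft):
            stage = Stage.REJECTED
        else:
            stage = NEXT_STAGE[stage]
        history.append(stage)
    return history
```

The point of the explicit transition table is that there is no path to PUBLISHED that skips a gate: a sandbox that fails to build or a challenge that fails validation can only end in REJECTED.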
Not a multiple-choice quiz. Not a text-adventure sim. A live Docker container with a full analyst toolkit, a terminal, and a flag that requires real tradecraft to capture.
Win, lose, or time out — your 12-axis ELO updates. A matchmaker queues the next challenge in your growth zone: hard enough to stretch, not so hard you bounce off.
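For a feel of the arithmetic, here is a sketch using the textbook Elo formula. Whether LevelUp uses exactly this formula is an assumption, as are the K-factor, the axis names, treating a timeout as a loss, and the 60% target success rate used to stand in for the "growth zone".

```python
from typing import Dict, List


def expected_score(player: float, challenge: float) -> float:
    """Standard Elo expectation: probability the player beats the challenge."""
    return 1.0 / (1.0 + 10 ** ((challenge - player) / 400.0))


def update_axes(
    ratings: Dict[str, float],
    challenge_axes: Dict[str, float],
    solved: bool,
    k: float = 24.0,  # assumed K-factor
) -> Dict[str, float]:
    """Update only the axes the challenge exercises.

    Assumption: a loss and a timeout both score 0.0.
    """
    score = 1.0 if solved else 0.0
    updated = dict(ratings)
    for axis, challenge_rating in challenge_axes.items():
        current = ratings[axis]
        updated[axis] = current + k * (score - expected_score(current, challenge_rating))
    return updated


def pick_next(
    ratings: Dict[str, float],
    catalogue: List[dict],
    target: float = 0.6,  # assumed "growth zone" success probability
) -> dict:
    """Pick the challenge whose mean expected score is closest to the target."""
    def gap(challenge: dict) -> float:
        probs = [expected_score(ratings[a], r) for a, r in challenge["axes"].items()]
        return abs(sum(probs) / len(probs) - target)

    return min(catalogue, key=gap)
```

With equal ratings the expected score is 0.5, so a solve moves that axis up by half the K-factor and a miss moves it down by the same amount. Axes the challenge never touched stay where they were.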
The evolution worker runs the nightly cron. Challenges that everyone solves too fast get mutated. Prompts that produce boring briefs get retired. Gaps in the catalogue get filled from real solve-rate data.
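One way the "solved too fast" check could look, as a sketch only. The stats layout, the `nightly_triage` name, and every threshold (minimum attempts, solve-rate ceiling, what counts as fast) are assumptions made for illustration; the real evolution worker is not shown on this page.

```python
from statistics import median
from typing import Dict, List


def nightly_triage(
    stats: Dict[str, dict],
    min_attempts: int = 20,       # assumed: ignore challenges with too little data
    max_solve_rate: float = 0.9,  # assumed: "everyone solves it" ceiling
    fast_fraction: float = 0.5,   # assumed: fast = under half the target time
) -> List[str]:
    """Return IDs of challenges that are both widely solved and solved quickly.

    Each stats entry is assumed to hold:
      attempts        total attempts recorded
      solve_minutes   one duration per successful solve
      target_minutes  the intended solve time for the challenge
    """
    to_mutate = []
    for challenge_id, entry in stats.items():
        attempts = entry["attempts"]
        if attempts < min_attempts:
            continue  # not enough data to judge
        solve_times = entry["solve_minutes"]
        solve_rate = len(solve_times) / attempts
        if solve_rate < max_solve_rate:
            continue  # still stopping enough people
        if median(solve_times) <= fast_fraction * entry["target_minutes"]:
            to_mutate.append(challenge_id)
    return to_mutate
```

Requiring both a high solve rate and a low median time keeps a challenge that is popular but still time-consuming out of the mutation queue.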