About Fournex
A production-grade runtime for reinforcement learning agents
Fournex is the layer where trained RL policies are loaded, executed, and managed as living systems, not just called like static models.
Fournex is not a dashboard or an experiment tool.
It is an execution engine.
What Problem It Solves
Most RL teams can train agents, but once training is done they have no solid way to:
- Run agents continuously in real systems
- Maintain state across steps and episodes
- Serve policies reliably under load
- Debug what actually happened in production
- Integrate agents cleanly into robotics, simulation, or service backends
They end up building fragile custom runtimes.
Fournex replaces those one-off runtimes with a standard, high-performance agent execution layer.
What "Runtime" Actually Means
A runtime is responsible for the hard stuff around execution:
- Policy execution: load policy artifact (SB3, TorchScript, ONNX), run inference
- Stateful sessions: maintain per-agent state (RNN hidden state, filters, normalization stats, last obs/action, episode context)
- Environment loop support: optionally run step loops server-side (agent ↔ env) with consistent timing
- Determinism & replay: capture what happened and reproduce it (seed, version, config, obs/actions/rewards)
- Performance: low overhead per step, batching, efficient memory layout, pinned cores, etc.
- Isolation: crashes or bad policies shouldn't take down the node; resource caps and timeouts
- Observability: metrics, traces, logs; minimal but essential signals
- Integration surface: stable API/SDK that robotics / sims / games can embed against
Two Modes of Runtime
Stateless Inference
action = policy(obs)For low-latency calls from external systems.
Stateful Agent Sessions
session.step(obs) → actionMaintains continuity and supports control loops, RNNs, etc.
Our Product Vision
"The best session engine + policy runner you can embed or deploy."
Why Teams Adopt This
Teams don't adopt runtimes because they love infra—they adopt because it immediately solves:
"We can't run this policy reliably at 50–200Hz."
"We need deterministic replay for debugging."
"We need RNN state / episode continuity."
"We need to deploy 5 versions and compare."
"Our inference loop overhead is killing us."
Fournex nails stability, determinism, and ease of integration more than dashboards.
What Fournex Does
✓Package a policy
✓Run it locally and in Docker
✓Call it via API or SDK
✓Use stateful sessions
✓Capture traces for replay/debugging
✓Run it fast and reliably under sustained load
That's the smallest "runtime" that feels real.