Train Agents
That Trade

A metaclass-driven, registry-first RL framework. Hash-locked RLExperimentSpec, RLRuntime, and Iceberg trajectory store for deterministic, replayable training and evaluation.

Request Demo Read RL-Ops Guide

Sanitized reinforcement learning lab with experiment hashes, trajectory review, and PRUDEX-style scorecard.

RL-Ops

RL Lab

RL experiments become reviewable operating evidence, not opaque model artifacts.

RLRuntime

Hash-locked RLExperimentSpec ensures reproducibility. Every experiment is versioned in PostgreSQL. Trajectories are stored in Iceberg for forensic replay.

RLRuntime Architecture Diagram

Pre-Built RL Agents

Choose from 12+ industry-standard RL architectures including EIIE, DeepTrader, Investor-Imitator, ETEO, and OPD. Optimized for low-latency financial environments.

Agent Architecture Grid

Market Dynamics Modeling

Slice-and-merge regime labeling. Train agents that are aware of market conditions and can adapt their strategy to changing regimes.

Market Regime Detection Visualization

PRUDEX Evaluation

17 independent measures across profitability, risk, diversity, execution, and explainability. Moving beyond Sharpe ratio for professional trading.

PRUDEX Compass Visualization

Weight-Centric Pipeline

FinRL-X inspired 4-stage pipeline: Feature selection -> Alpha generation -> Target weights -> Risk overlay. Robust portfolio-level decision making.

Weight-Centric Pipeline Diagram

The PRUDEX Evaluation Framework

Moving beyond Sharpe ratio. Our Phase 9 PRUDEX-Compass framework provides 17 independent measures and 5 advanced visualizations to truly understand agent behavior before deploying capital.

Iceberg Trajectory Store

Every step, observation, and reward is persisted to an Iceberg warehouse for forensic analysis and replay.

Weight-Centric Pipeline

FinRL-X inspired pipeline (f_S → f_A → f_T → f_R) for robust portfolio-level decision making.

Pre-built Agents

EIIE (Ensemble of Identical Independent Experts)
DeepTrader (Asset scoring + risk control)
Investor Imitator (Inverse RL)
DeepScalper (HFT execution)
PPO In-house (Optimized for low-latency)
FinAgent (LLM-hybrid adapter)

Train AgentsThat Trade

RL Lab

RLRuntime

Pre-Built RL Agents

Market Dynamics Modeling

PRUDEX Evaluation

Weight-Centric Pipeline

The PRUDEX Evaluation Framework

Iceberg Trajectory Store

Weight-Centric Pipeline

Pre-built Agents

Train Agents
That Trade