Quantum Signals

1. Executive Summary

Quantum Signals provides pre-trained intraday AI signals for US equity index futures (ES, NQ). Our models predict short-horizon mid-price trends (up / stable / down) using finance-native neural networks trained on Level II limit order book (LOB) and microstructure features.‍

Objective. Review the performance of pre-trained AI signals on ES and NQ futures and understand how the technology performs and generalizes.

‍Bigger picture. Pre-trained signals are a baseline that can be used for validation. We do not expect all our customers to trade using these signals. The goal is to help you design custom signals tailored to your symbols, horizons, entry cadence, and risk rules.

2. Review the Basics

2.1 Data

‍We use Level II LOB (10 levels, prices & volumes) from CME Globex with ~3 years’ worth of history for training, plus the real-time LOB feed for near-real-time (sub-second) predictions.2.2 PredictionsFor the two pre-trained signals we are reviewing in this document we are predicting the following:

Target variable: mid-price trend (Up / Down / Stable) between two averaging windows:
- Start window: [now, now + 5 min] (average over next 5-minutes)
- End window: end of the day (average over 15:55–16:00 ET)‍
Stable: ±2 bps band around 0‍
Prediction cadence: every 1 minute on CME Trading Days for Equities, 09:30–16:00 ET

‍2.3 AI Model

‍We utilize a Temporal Fusion Transformer (TFT) architecture optimized for market microstructure. This is not an LLM, we use transformer components tailored for finance. Here are some of the key considerations the model takes into account:

multi-scale time features (LOB events, seconds, minutes, days)
probability calibration (post-processing of the output using class probabilities)
cost-sensitive loss function aligned to PnL

Technical paper on TFT: “Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting”, B. Lim et al (available at https://arxiv.org/abs/1912.09363)

3. Benchmarks

3.1 Training, Validation, and Test periods

‍We are retraining our models every 3 months, and evaluating performance out-of-sample.

Training: Several months of historical data that we use to train the model. Every 3 months we append another 3 months of data to the end of this data set.
Validation: A 3-month period (rolling forward every 3 months), not seen during training, that we use to pick the best performing model.
Test: A 3-month period (rolling forward every 3 months), not seen during training or validation, that we use to report performance.

	Reported Performance for Q4 2024	Reported Performance for Q1 2025	Reported Performance for Q2 2025
Training	Start: Q1 2022 End: Q2 2024	Start: Q1 2021 End: Q3 2024	Start: Q1 2021 End: Q4 2024
Validation	Q3 2024	Q4 2024	Q1 2025
Test	Q4 2024	Q1 2025	Q2 2025

‍

3.2 Trading Strategy Used for Benchmarking

‍We are retraining our models every 3 months, and evaluating performance out-of-sample.

Every 15 minutes (between 09:45 and 15:30 ET) we decide to take a long, short or no position using 1/24 of our starting portfolio for the day.
Each long/short position is then split into 15 parts and executed on each minute for the next 15 minutes following the decision. There is no sizing adjustment.
We exit all positions at the end of the day (15:55–16:00 ET).
Costs: 1 bp round-turn assumption (ES: 1bp ~ 2 ticks = $25.0, NQ: 1bp ~ 8 ticks = $40.0).
Exchange/clearing fees not included.
Contract series & roll: Front-month continuous. Switch at the open T–2 trading days before expiration-stop trading the expiring contract and start trading the next.

3.3 Performance Metrics

Quarter	Symbol	Sharpe	Calmar	Win %	Ann. Return %	Ann. Vol %	MDD %
Q4 2024	NQ	1.5067	5.5566	52.0251	12.2237	8.1128	-2.1999
Q4 2024	ES	1.1914	4.5486	49.8931	8.0632	6.7677	-1.7727
Q1 2025	NQ	1.6054	3.3688	59.6708	13.5947	8.4683	-4.0355
Q1 2025	ES	1.8636	5.4055	57.4246	13.9768	7.5000	-2.5856
Q2 2025	NQ	2.1431	4.3566	54.7915	12.2438	5.7132	-2.8104
Q2 2025	ES	1.7423	4.2824	55.5556	11.1363	6.3918	-2.6005
Overall	NQ	1.5553	2.6820	55.4478	15.2756	9.8214	-5.6957
Overall	ES	1.6115	3.0239	54.3098	15.5737	9.6641	-5.1502

Notes:‍

Annualization: daily conversion to annual using √252.
Sharpe uses daily returns.
Ann. Vol: Annualized Volatility.
MDD: Max Drawdown.
Rebasing: curves rebased to 100 at period start.

4. Prediction Data

The predictions generated by the model are available in the following files for download to anyone interested in using them in their own test harness. There is one file per model.

ES Backtest Predictions Q4 2024 - Q2 2025

NQ Backtest Predictions Q4 2024 - Q2 2025

Here are the column names and meaning in those files:

date_time: Timestamp of each observation (mostly 1-minute cadence) | (datetime string)
predictions: Model-predicted class (0 for Down, 1 for Stable, 2 for Up) | (int)
actual_labels: Realized (actual) class label for the outcome (0/1/2) | (int

5. Next steps

Get on a short call to ask more questions and/or ask for a detailed table/CSV of past predictions for your test harness.
Review out-of-sample performance by consuming the predictions in real-time using our API. This requires signing-up for our “Professional” tier which includes a 1-month free trial.
Go beyond pre-trained signals by tailoring a signal to your specific needs. Customize by symbol, target variable, time horizon, neutral band, entry cadence and risk rules.

6. Contact

Yianni Gamvros
Co-founder & CEO
info[at]quantumsignals.ai

Disclaimers:

Futures trading involves substantial risk of loss and is not suitable for all investors.
Hypothetical/simulated results do not represent actual trading, may under- or over-state market impacts (e.g., liquidity), and reflect hindsight.
No representation that any account will achieve similar results.
Past performance is not necessarily indicative of future results.
Quantum Signals is not an NFA member.

Share with your network.

In this article

Executive Summary

US Futures (ES and NQ): Performance Benchmark Q2 2025