US Futures (ES and NQ): Performance Benchmark Q2 2025

October 8, 2025

.

10 min read

1. Executive Summary

Quantum Signals provides pre-trained intraday AI signals for US equity index futures (ES, NQ). Our models predict short-horizon mid-price trends (up / stable / down) using finance-native neural networks trained on Level II limit order book (LOB) and microstructure features.

Objective. Review the performance of pre-trained AI signals on ES and NQ futures and understand how the technology performs and generalizes.

Bigger picture. Pre-trained signals are a baseline that can be used for validation. We do not expect all our customers to trade using these signals. The goal is to help you design custom signals tailored to your symbols, horizons, entry cadence, and risk rules.

2. Review the Basics

2.1 Data

We use Level II LOB (10 levels, prices & volumes) from CME Globex with ~3 years’ worth of history for training, plus the real-time LOB feed for near-real-time (sub-second) predictions.2.2 PredictionsFor the two pre-trained signals we are reviewing in this document we are predicting the following:

  • Target variable: mid-price trend (Up / Down / Stable) between two averaging windows:
    • Start window: [now, now + 5 min] (average over next 5-minutes)
    • End window: end of the day (average over 15:55–16:00 ET)
  • Stable: ±2 bps band around 0
  • Prediction cadence: every 1 minute on CME Trading Days for Equities, 09:30–16:00 ET

2.3 AI Model

We utilize a Temporal Fusion Transformer (TFT) architecture optimized for market microstructure. This is not an LLM, we use transformer components tailored for finance. Here are some of the key considerations the model takes into account:

  • multi-scale time features (LOB events, seconds, minutes, days)
  • probability calibration (post-processing of the output using class probabilities)
  • cost-sensitive loss function aligned to PnL

Technical paper on TFT: “Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting”, B. Lim et al (available at https://arxiv.org/abs/1912.09363)

3. Benchmarks

3.1 Training, Validation, and Test periods

We are retraining our models every 3 months, and evaluating performance out-of-sample.

  • Training: Several months of historical data that we use to train the model. Every 3 months we append another 3 months of data to the end of this data set.
  • Validation: A 3-month period (rolling forward every 3 months), not seen during training, that we use to pick the best performing model.
  • Test: A 3-month period (rolling forward every 3 months), not seen during training or validation, that we use to report performance.
Reported Performance for Q4 2024 Reported Performance for Q1 2025 Reported Performance for Q2 2025
Training Start: Q1 2022
End: Q2 2024
Start: Q1 2021
End: Q3 2024
Start: Q1 2021
End: Q4 2024
Validation Q3 2024 Q4 2024 Q1 2025
Test Q4 2024 Q1 2025 Q2 2025

3.2 Trading Strategy Used for Benchmarking

We are retraining our models every 3 months, and evaluating performance out-of-sample.

  • Every 15 minutes (between 09:45 and 15:30 ET) we decide to take a long, short or no position using 1/24 of our starting portfolio for the day.
  • Each long/short position is then split into 15 parts and executed on each minute for the next 15 minutes following the decision. There is no sizing adjustment.
  • We exit all positions at the end of the day (15:55–16:00 ET).
  • Costs: 1 bp round-turn assumption (ES: 1bp ~ 2 ticks = $25.0, NQ: 1bp ~ 8 ticks = $40.0).
  • Exchange/clearing fees not included.
  • Contract series & roll: Front-month continuous. Switch at the open T–2 trading days before expiration-stop trading the expiring contract and start trading the next.

3.3 Performance Metrics

Quarter Symbol Sharpe Calmar Win % Ann. Return % Ann. Vol % MDD %
Q4 2024 NQ 1.5067 5.5566 52.0251 12.2237 8.1128 -2.1999
Q4 2024 ES 1.1914 4.5486 49.8931 8.0632 6.7677 -1.7727
Q1 2025 NQ 1.6054 3.3688 59.6708 13.5947 8.4683 -4.0355
Q1 2025 ES 1.8636 5.4055 57.4246 13.9768 7.5000 -2.5856
Q2 2025 NQ 2.1431 4.3566 54.7915 12.2438 5.7132 -2.8104
Q2 2025 ES 1.7423 4.2824 55.5556 11.1363 6.3918 -2.6005
Overall NQ 1.5553 2.6820 55.4478 15.2756 9.8214 -5.6957
Overall ES 1.6115 3.0239 54.3098 15.5737 9.6641 -5.1502
Figure 2: E-mini Nasdaq-100 (NQ)

Notes:

  • Annualization: daily conversion to annual using √252.
  • Sharpe uses daily returns.
  • Ann. Vol: Annualized Volatility.
  • MDD: Max Drawdown.
  • Rebasing: curves rebased to 100 at period start.

4. Prediction Data

The predictions generated by the model are available in the following files for download to anyone interested in using them in their own test harness. There is one file per model.

Here are the column names and meaning in those files:

  • date_time: Timestamp of each observation (mostly 1-minute cadence) | (datetime string)
  • predictions: Model-predicted class (0 for Down, 1 for Stable, 2 for Up) | (int)
  • actual_labels: Realized (actual) class label for the outcome (0/1/2) | (int

5. Next steps

  1. Get on a short call to ask more questions and/or ask for a detailed table/CSV of past predictions for your test harness.
  2. Review out-of-sample performance by consuming the predictions in real-time using our API. This requires signing-up for our “Professional” tier which includes a 1-month free trial.
  3. Go beyond pre-trained signals by tailoring a signal to your specific needs. Customize by symbol, target variable, time horizon, neutral band, entry cadence and risk rules.

6. Contact

Yianni Gamvros
Co-founder & CEO
info[at]quantumsignals.ai

Disclaimers:

  • Futures trading involves substantial risk of loss and is not suitable for all investors.
  • Hypothetical/simulated results do not represent actual trading, may under- or over-state market impacts (e.g., liquidity), and reflect hindsight.
  • No representation that any account will achieve similar results.
  • Past performance is not necessarily indicative of future results.
  • Quantum Signals is not an NFA member.
Share with your network.
In this article

Executive Summary