
1. Headline numbers

F2_dom AUC: 0.826
Fold Std Dev: ±0.015
Stack Hit Rate: 70.6%
Profit Factor: 1.35

Past results do not guarantee future performance. The figures above are backtest and walk-forward validation results on the v131 stack. Nothing on this page is a live account statement.

2. F2_dom walk-forward AUC

The microstructure head is the most heavily validated component of the stack. The table below reports AUC across five purged folds with a 10-minute embargo, on roughly 1.45M labelled samples drawn from the full MBP-10 history:

| Fold | Samples | AUC | Notes |
| --- | --- | --- | --- |
| 1 (earliest) | ~290k | 0.841 | Highest spread vol in window |
| 2 | ~290k | 0.819 | Quiet regime, feature importance shifts |
| 3 | ~290k | 0.832 | Mixed regime |
| 4 | ~290k | 0.821 | Event-heavy (CPI, NFP) |
| 5 (latest) | ~290k | 0.817 | Most recent, closest to live |
| Mean ± SD | ~1.45M | 0.826 ± 0.015 | Purged K=5, 10-min embargo |
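
The summary row can be cross-checked from the per-fold values. This is a minimal sketch, assuming the rounded AUCs listed in the table and a sample (n - 1) standard deviation; the page does not state which estimator or precision the published spread uses.

```python
from statistics import mean, stdev

# Per-fold AUC values copied from the table (already rounded to 3 decimals).
fold_auc = [0.841, 0.819, 0.832, 0.821, 0.817]

auc_mean = mean(fold_auc)
auc_sd = stdev(fold_auc)   # sample standard deviation, n - 1 denominator (an assumption)
latest = fold_auc[-1]      # fold 5, the fold closest to live

print(f"mean={auc_mean:.3f} sample_sd={auc_sd:.3f} latest={latest:.3f}")
```

On the rounded values this gives a mean of 0.826 and a sample standard deviation of about 0.010, which is lower than the published ±0.015, so the published spread appears to rest on inputs or an estimator not shown here.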

Reading this honestly: fold 5 is the most recent and the closest to live, and it has the lowest AUC in the set (0.817). We treat that figure, not the mean, as the realistic upper bound for deployed performance.
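
For readers unfamiliar with purged folds and embargoes, the idea can be sketched as follows. This is an illustrative splitter, not the production code: the function name `purged_kfold`, the contiguous fold layout, and the per-sample label-resolution times are all assumptions. Training samples are dropped if their label window overlaps the test span (purging) or if they start within the embargo period after it.

```python
from datetime import datetime, timedelta

def purged_kfold(t_start, t_end, k=5, embargo=timedelta(minutes=10)):
    """Yield (train_idx, test_idx) for time-ordered samples.

    t_start[i] is when sample i is observed; t_end[i] is when its label resolves.
    """
    n = len(t_start)
    bounds = [n * j // k for j in range(k + 1)]  # contiguous, time-ordered folds
    for j in range(k):
        lo, hi = bounds[j], bounds[j + 1]
        test_begin = t_start[lo]
        test_finish = max(t_end[lo:hi])
        train = []
        for i in range(n):
            if lo <= i < hi:
                continue  # sample belongs to the test fold
            # Purge: the sample's label window overlaps the test span.
            overlaps = t_start[i] <= test_finish and t_end[i] >= test_begin
            # Embargo: the sample starts shortly after the test span ends.
            embargoed = test_finish < t_start[i] <= test_finish + embargo
            if not (overlaps or embargoed):
                train.append(i)
        yield train, list(range(lo, hi))

# Hypothetical data: 50 samples one minute apart, labels resolving 3 minutes later.
base = datetime(2026, 1, 28, 9, 30)
t_start = [base + timedelta(minutes=i) for i in range(50)]
t_end = [t + timedelta(minutes=3) for t in t_start]
splits = list(purged_kfold(t_start, t_end))
```

In this toy setup the second fold tests on samples 10 to 19 and trains only on samples 0 to 6 and 33 to 49; everything in between is purged or embargoed.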

3. Full-stack backtest

The full believe stack (tick ML + XGB 5m + F2_dom) is run against the 78-day live tick capture from 2026-01-28 onward, with commission, marketable-limit entry slippage rules, and random 1-2 tick stop slippage applied. All three heads run at qty=1 with independent OCA brackets.

| Configuration | Hit Rate | Profit Factor | Result |
| --- | --- | --- | --- |
| Stack, F2_dom disabled | 67.7% | 1.02 | Near-breakeven pre-commission |
| Stack, F2_dom enabled | 70.6% | 1.35 | Walk-forward consistent |

The lift from the F2_dom head is real, small in hit-rate terms (67.7% to 70.6%, with profit factor moving from 1.02 to 1.35), and consistent across folds. This is what a microstructure signal is supposed to do: tilt the edge, not replace it.
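
Hit rate and profit factor measure different things, which is why a hit rate near 70% can sit next to a profit factor close to 1. A minimal sketch of both, assuming the usual definitions (share of trades with positive net P&L, and gross profit divided by gross loss); the trade list is hypothetical and is not data from the backtest.

```python
def hit_rate(pnl):
    """Share of trades with strictly positive net P&L."""
    return sum(1 for p in pnl if p > 0) / len(pnl)

def profit_factor(pnl):
    """Gross profit divided by gross loss, both taken as positive numbers."""
    gross_profit = sum(p for p in pnl if p > 0)
    gross_loss = -sum(p for p in pnl if p < 0)
    return float("inf") if gross_loss == 0 else gross_profit / gross_loss

# Hypothetical net P&L per trade in ticks, after costs (illustrative only).
trades = [4, 4, -8, 4, 4, -6, 4, 4, 4, -8]

print(f"hit rate = {hit_rate(trades):.1%}, profit factor = {profit_factor(trades):.2f}")
```

Here seven of ten trades win, yet the profit factor is only about 1.27, because the average loss is larger than the average win.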

4. Triple-barrier label distribution

Before trusting any classification metric, you should see the label distribution. Below is the distribution of the three barrier outcomes across the 1.45M F2_dom training samples:

| Outcome | Share | Interpretation |
| --- | --- | --- |
| Upper barrier touched (+12 ticks) | ~41% | Take-profit realised |
| Lower barrier touched (-8 ticks) | ~44% | Stop-loss realised |
| Vertical barrier (time-out) | ~15% | Neither touched; exit at expiry |

The label set is close to balanced and not dominated by time-outs, which is a precondition for the AUC number above to be meaningful.
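
To make the three outcomes concrete, here is a minimal triple-barrier labeller using the +12 / -8 tick barriers from the table. It is a sketch under assumptions: the function name, the representation of the path as tick offsets from entry, and the use of the end of the list as the vertical barrier are illustrative choices, since the page does not specify the horizon or the price series used.

```python
from collections import Counter

def triple_barrier_label(path_ticks, upper=12, lower=-8):
    """Label one sample from its price path, given in ticks relative to entry.

    Returns +1 if the upper barrier is touched first, -1 if the lower barrier
    is touched first, and 0 if the path ends (vertical barrier) with neither.
    """
    for move in path_ticks:
        if move >= upper:
            return 1
        if move <= lower:
            return -1
    return 0

# Hypothetical paths, one per sample (illustrative only).
paths = [
    [3, 7, 12],      # reaches +12: take-profit
    [-2, -9, 15],    # reaches -8 first: stop-loss
    [1, -3, 5],      # neither touched: time-out
]
labels = [triple_barrier_label(p) for p in paths]
distribution = Counter(labels)
print(labels, dict(distribution))
```

Counting the labels this way over the full sample set is what produces a share table like the one above.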

5. Feature importance (F2_dom v131)

Importance is gain-based, averaged over the five purged folds. We track the top 16 each retrain and alert on large rank shifts. The table below is the v131 snapshot; absolute gain values are withheld because they are retrain-specific and not decision-useful off the training host.

| Rank | Feature | Family |
| --- | --- | --- |
| 1 | book_imb | Aggregate imbalance |
| 2 | tob_ratio | Top of book |
| 3 | top3_imb | Near-touch imbalance |
| 4 | mid_mom | Microprice drift |
| 5 | imb_std | Rolling imbalance vol |
| 6 | bid_grad_2 | Bid gradient L2 |
| 7 | ask_grad_2 | Ask gradient L2 |
| 8 | spread_ticks | Spread |
| 9 | depth_ratio | Depth skew |
| 10 | queue_age | Price-level staleness |
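
Averaging gain across folds and alerting on rank shifts could look roughly like the sketch below. The helper names, the `max_shift` threshold, and the gain values are all hypothetical (actual gains are withheld on this page); the feature names are taken from the table, and every fold is assumed to report the same feature set.

```python
def mean_gain_ranks(fold_gains):
    """Average gain per feature across folds; return {feature: rank}, 1 = highest."""
    features = fold_gains[0].keys()  # assumes every fold reports the same features
    avg = {f: sum(g[f] for g in fold_gains) / len(fold_gains) for f in features}
    ordered = sorted(avg, key=avg.get, reverse=True)
    return {f: rank for rank, f in enumerate(ordered, start=1)}

def rank_shift_alerts(prev_ranks, new_ranks, max_shift=3):
    """Features whose rank moved by more than max_shift places between retrains."""
    return {
        f: (prev_ranks[f], new_ranks[f])
        for f in new_ranks
        if f in prev_ranks and abs(new_ranks[f] - prev_ranks[f]) > max_shift
    }

# Placeholder gains for two folds (illustrative numbers only).
fold_gains = [
    {"book_imb": 30.0, "tob_ratio": 20.0, "top3_imb": 10.0},
    {"book_imb": 26.0, "tob_ratio": 22.0, "top3_imb": 12.0},
]
new_ranks = mean_gain_ranks(fold_gains)

# A hypothetical previous retrain in which the ordering was reversed.
prev_ranks = {"book_imb": 3, "tob_ratio": 2, "top3_imb": 1}
alerts = rank_shift_alerts(prev_ranks, new_ranks, max_shift=1)
print(new_ranks, alerts)
```

Comparing ranks rather than raw gains keeps the check stable across retrains, which fits the point above that absolute gain values are retrain-specific.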

6. What we deliberately do not publish

7. What we monitor day to day

Nothing on this page is an offer, solicitation, or investment advice. Past walk-forward and backtest results do not guarantee future live performance. Commission, exchange fees, slippage, and regime changes can materially affect results. BHF Capital is an informational brand of Rare Bird Holdings LLC.