Site & Recruitment Intelligence

Site Performance Prediction Models

Pick sites on what they will do, not what they say.

How AI-based site performance prediction is reshaping clinical trial site selection — historical recruitment, eligible-population mapping and operational signals.

Decision angle

"Which sites will actually deliver — and which look good on paper but underperform?"

TL;DR

Site selection used to rely on PI relationships. AI prediction models combine historical recruitment, EHR-derived eligible population and operational signals to select sites that recruit on time.

Site selection is moving from relationship-driven to data-driven. Models that combine historical recruitment, EHR eligible counts and operational signals consistently pick faster-recruiting sites.

Key insights

What we’re seeing in the data.

Historical recruitment is the strongest signal

Past performance predicts ~60% of variance.

EHR-derived eligible counts beat self-reports

Sites overestimate eligible pools by 30–60%.

Operational variables matter

IRB speed, contract turnaround, staff turnover.

~60%

Variance from history

Predictive

30–60%

Self-report inflation

Sites

10–30%

Performance lift

AI selection

EHR

Eligible-pool source

Best

Decision framework

How to think about it.

01
Pull historical site performance

Recruitment, retention, query rate.
02
Map eligible population

EHR-based, indication-specific.
03
Score operational variables

IRB speed, contract turnaround.
04
Build composite prediction

Weighted model.

Considerations

What separates a good answer from a defensible one.

✓

Data privacy

EHR-based estimates require compliance.

✓

Site self-report bias

Cross-validate.

✓

Indication specificity

Onc≠cardio.

Sources & tools

Where the signal comes from.

Site selection AI platforms EHR aggregation services CTMS historical data Cortellis site intelligence

FAQ

Common questions.

Are PI relationships still relevant?

Yes — for engagement, not for selection. Use prediction for selection, relationship for execution.

What ROI is realistic?

10–30% timeline improvement when used disciplined.

Keep going.

Read next

Want this answered on your data?

We build decision systems on top of analyses like this — so the next question takes minutes, not weeks.

Talk to a strategist

Site Performance Prediction Models

What we’re seeing in the data.

Historical recruitment is the strongest signal

EHR-derived eligible counts beat self-reports

Operational variables matter

How to think about it.

Pull historical site performance

Map eligible population

Score operational variables

Build composite prediction

What separates a good answer from a defensible one.

Where the signal comes from.

Common questions.

Are PI relationships still relevant?

What ROI is realistic?

Keep going.

Adaptive Trial Design Playbook

AI for Protocol Design and Feasibility

AI Reducing Clinical Trial Timelines

Best Countries for Clinical Trials, by Indication

Want this answered on your data?