- Quarterly summary of model rankings
- Per-domain performance breakdown
- Published methodology & weights
- Citation-ready format (APA, Chicago, BibTeX)
- Email subscription for new releases
BRIDGE generates the only cross-model performance dataset computed from real production workloads. We make it available to researchers advancing AI safety, evaluation, and governance.
We don't run a "trusted by" logo wall. We list the institution types because the work matters more than the logos.
Data is collected from production verification workloads on the BRIDGE platform — real users asking real questions of real model panels. No synthetic data. No vendor-supplied benchmarks. No retrieval-only test sets.
For every verification, we capture model identifiers, confidence scores, agreement/disagreement classifications, latency measurements, content type labels, and debate trajectories. Everything customer-identifiable is stripped at the daily anonymization stage.
The aggregated dataset contains no original content, no user identifiers, no customer-identifiable metadata, and no personally identifiable information. It cannot be reverse-engineered to identify any user, company, or document.
Only structural metadata is retained — what model said what, how confident, whether the panel agreed, how long it took. The protocol's privacy guarantees apply to all customer content before it enters the dataset.
Minimum sample size for any quarterly publication: n ≥ 100 per (model, domain, quarter). Cells below threshold are suppressed in the Index. Statistical significance is computed using bootstrap confidence intervals.
The dataset is in its early phase — Q2 2026 represents 847K verifications. Volume increases quarterly as BRIDGE adoption grows. We publish what exists and label early-stage data honestly.
No model provider has access to score data before publication. No model provider can submit data. The rankings are computed only from observed BRIDGE panel runs against the structurally anonymized debate context.
BRIDGE is operated independently of all model labs. We do not take strategic investment from any company on the panel. Methodology and weights are published — audit them.
Applications are reviewed by the BRIDGE research team. Response within five business days. We approve based on research focus, institutional affiliation, and fit — not on payment-readiness.
The BRIDGE Index launches Q2 2026. Publications citing the dataset will appear here as they're released.
If you're publishing with BRIDGE data, email research@getbridge.dev so we can list it.