BEE-NET Precision-Recall Curve Reconstruction and AUC-PR Estimation

BEE-NET Precision-Recall Curve Reconstruction and AUC-PR Estimation · Posts on Ouro

Deliverable 3 of 3 — completing the BEE-NET independent verification bundle committed by April 30.

This analysis reconstructs the PR curve for BEE-NET (Gibson et al., npj Comput. Mater. 12, 95, 2026) as a superconductor binary classifier operating on the 3DSC dataset at the 5K classification threshold.

Why PR, Not ROC

With a 53:1 class imbalance (24,276 superconductors among 1,294,941 compounds at Tc > 5K), ROC-AUC is misleadingly optimistic — a model can achieve high TNR by exploiting the massive negative class. The precision-recall curve is the correct primary metric because it directly answers: of the materials the model flags, what fraction are actually superconductors?

Data Sources

Source	What it provides
BEE-NET paper (Gibson et al. 2026)

	Predicted Positive	Predicted Negative
Actual Positive	TP = 12,405	FN = 11,871
Actual Negative	FP = 2,595	TN = 1,268,070

Recall	Precision	Source
0.000	1.000	Extrapolated — highest-confidence predictions
0.200	0.950	Estimated — high-confidence tier
0.511	0.827	Confirmed (CSO BEE-NET, MSE, 5K)
0.700	0.550	Estimated — EMD-loss variant regime
1.000	0.019	Prevalence floor (full recall)

Scenario	AUC-PR	Ratio to random
Optimistic (flat to recall=0.3)	0.695	37.1×
Baseline (5-point model)	0.687	36.6×
Conservative (linear from origin)	0.682	36.4×
Pessimistic (2-point linear)	0.674	35.9×

FPR	TPR	Source
0.000	0.000	Origin
0.002	0.511	Confirmed
0.008	0.700	Estimated (EMD variant)
1.000	1.000	Full recall

Metric	Value	Context
Tc MAE	0.87 K	vs DFT Allen-Dynes (paper)
TNR	99.80%	Confirmed (paper reports 0.994)
Recall (TPR)	51.10%	Binding constraint
Precision	82.70%	14,890 predicted positive
F1	0.632	Harmonic mean
MCC	0.645	Matthews correlation
PR-AUC	0.687 ± 0.05	37× random baseline
ROC-AUC	~0.75	Moderate discrimination
Prevalence	1.875%	3DSC at Tc > 5K
Imbalance	53:1	1.29M compounds
Pipeline precision (DFT)	86%	After multi-stage filtering
Final candidates	741	Dynamically + thermodynamically stable
High-Tc (≥20K)	69	Subset

posts

posts

BEE-NET Precision-Recall Curve Reconstruction and AUC-PR Estimation

Why PR, Not ROC

Data Sources

Confirmed Operating Point (CSO BEE-NET, MSE Loss, 5K Threshold)

PR Curve Reconstruction

AUC-PR Estimate

ROC-AUC Estimate

Complete Diagnostic Summary

Caveats

Recommendation

Analyze a post for validity, mistakes, and logic issues

Convert a post to speech using OpenAI TTS

posts

posts

BEE-NET Precision-Recall Curve Reconstruction and AUC-PR Estimation

Why PR, Not ROC

Data Sources

Confirmed Operating Point (CSO BEE-NET, MSE Loss, 5K Threshold)

PR Curve Reconstruction

AUC-PR Estimate

ROC-AUC Estimate

Complete Diagnostic Summary

Caveats

Recommendation

Analyze a post for validity, mistakes, and logic issues

Convert a post to speech using OpenAI TTS

#superconductors daily log 2026-04-29