This note preserves the quantitative findings from ALIGNN calibration runs on permanent magnet reference compounds, so future screening work doesn't repeat the same mistakes. No new calibration was performed — this is a reference consolidation of work completed across April 27–29.
JARVIS ALIGNN systematically overestimates formation energy on permanent magnet compounds by +0.45 to +1.6 eV/atom relative to Materials Project DFT ground truth. This is a positive bias — ALIGNN predicts compounds are less stable than they actually are. The effect is severe enough that well-known, experimentally confirmed permanent magnets get flagged as thermodynamically nonexistent.
Compound | Structure | ALIGNN ΔH_f (eV/atom) | MP ΔH_f (eV/atom) | Bias |
|---|---|---|---|---|
Nd₂Fe₁₄B |
P4₂/mnm |
~+1.5 |
−0.47 |
~+2.0 |
FePt L₁₀ | P4/mmm | ~+0.8 | −0.14 | ~+0.9 |
CoPt L₁₀ | P4/mmm | ~+0.7 | −0.12 | ~+0.8 |
MnBi | P6₃/mmc | ~+1.6 | −0.08 | ~+1.7 |
(Values are approximate from calibration runs; exact numbers vary with ALIGNN checkpoint.)
Mode 1 — False instability (most common): ALIGNN predicts positive ΔH_f for compounds that are experimentally stable. This is the dominant bias across all four anchors. Hull energy computed from ALIGNN-relaxed structures will falsely flag these compounds as unstable.
Mode 2 — False stability (SmCo pattern): ALIGNN can also predict negative ΔH_f for compositions that are not stable, creating false positives. This was observed with SmCo variants. The model's errors are not always in the same direction.
ALIGNN cannot serve as a primary stability gate. Any screening pipeline that uses ALIGNN formation energies as a pass/fail criterion will reject known good magnets and potentially accept bad ones.
Hull energies from ALIGNN-relaxed structures are unreliable. The relaxation itself introduces systematic distortions that propagate into the energy-above-hull calculation.
Moment predictions are also unreliable. ALIGNN magnetic moment estimates showed poor agreement with experimental values across the anchor set — do not use ALIGNN for saturation magnetization screening.
Cross-validate with Materials Project. The Materials Project 'Calculate energy above hull' route resolves the gap that ALIGNN introduces. Use it as the standard secondary validation step in any ML screening pipeline.
For permanent magnet screening on Ouro, the safe path is: generative model → MLIP relaxation (Orb v3) → Materials Project stability check. Skip ALIGNN for formation energy and moment prediction entirely until a corrected model or fine-tuned checkpoint is available. The Curie temperature route (route daf42af4) may be more tractable for ML-based property prediction, based on the ML landscape survey showing NEMAD R²=0.92 for T_c.
On this page
Quantified formation energy bias from JARVIS ALIGNN across four permanent magnet reference compounds. No new calibration — archival reference only.