This study employs the SuperCon dataset as the largest superconducting materials dataset. Then, we perform various data pre-processing steps to derive the clean DataG dataset, containing 13,022 compounds. In another stage of the study, we apply the novel CatBoost algorithm to predict the transition temperatures of novel superconducting materials. In addition, we developed a package called Jabir, which generates 322 atomic descriptors. We also designed an innovative hybrid method called the Soraya package to select the most critical features from the feature space. These yield R2 and RMSE values (0.952 and 6.45 K, respectively) superior to those previously reported in the literature. Finally, as a novel contribution to the field, a web application was designed for predicting and determining the Tc values of superconducting materials.
How this file is connected to other assets
Literature review of databases with materials and . See literature review on ML models which utilize these datasets:
So far this is the most recent paper I've found on ML prediction of , improving on both modeling (CatBoost) and dataset compared to Stanev et al.
Literature review of existing studies done on predicting with machine learning.
Discover other files like this one
Room-temperature ferromagnets are high-value targets for discovery given the ease by which they could be embedded within magnetic devices. However, the multitude of potential interactions among magnetic ions and their surrounding environments renders the prediction of thermally stable magnetic properties challenging. Therefore, it is vital to explore methods that can effectively screen potential candidates to expedite the discovery of novel ferromagnetic materials within highly intricate feature spaces. To this end, the authors explore machine-learning (ML) methods as a means to predict the Curie temperature (Tc) of ferromagnetic materials by discerning patterns within materials databases. https://pubs.acs.org/doi/10.1021/acs.jcim.4c00947
These are the predictions made by a model that was trained on materials with max Tc of 90 K. Here we show how well it could predict a material (Ba2Ca1Cu2Hg1O6.24-MP-mp-6879-synth_doped) with a Tc greater than this range.
Superconductivity has been the focus of enormous research effort since its discovery more than a century ago. Yet, some features of this unique phenomenon remain poorly understood; prime among these is the connection between superconductivity and chemical/structural properties of materials. To bridge the gap, several machine learning schemes are developed herein to model the critical temperatures (Tc) of the 12,000+ known superconductors available via the SuperCon database. Materials are first divided into two classes based on their Tc values, above and below 10 K, and a classification model predicting this label is trained. The model uses coarse-grained features based only on the chemical compositions. It shows strong predictive power, with out-of-sample accuracy of about 92%. https://www.nature.com/articles/s41524-018-0085-8