001     1028902
005     20240725202008.0
024 7 _ |a 10.26165/JUELICH-DATA/TOBXWP
|2 doi
037 _ _ |a FZJ-2024-04862
041 _ _ |a English
100 1 _ |a Daniel, Davis Thomas
|0 P:(DE-Juel1)185897
|b 0
|e Corresponding author
|u fzj
245 _ _ |a Replication Data and Code for: 'Machine learning isotropic g values of radical polymers'
260 _ _ |c 2024
|b Jülich DATA
336 7 _ |a MISC
|2 BibTeX
336 7 _ |a Dataset
|b dataset
|m dataset
|0 PUB:(DE-HGF)32
|s 1721888742_4783
|2 PUB:(DE-HGF)
336 7 _ |a Chart or Table
|0 26
|2 EndNote
336 7 _ |a Dataset
|2 DataCite
336 7 _ |a DATA_SET
|2 ORCID
336 7 _ |a ResearchData
|2 DINI
520 _ _ |a This data repository contains the data sets and python scripts associated with the manuscript 'Machine learning isotropic g values of radical polymers '. Electron paramagnetic resonance measurements allow for obtaining experimental g values of radical polymers. Analogous to chemical shifts, g values give insight into the identity and environment of the paramagnetic center. In this work, Machine learning based prediction of g values is explored as a viable alternative to computationally expensive density functional theory (DFT) methods. Description of folder contents (switch to tree view): Datasets : Contains PTMA polymer structures from TR, TE-1, and TE-2 data sets transformed using a molecular descriptor (SOAP, MBTR or DAD) and corresponding DFT-calculated g values. Filenames contain 'PTMA_X' where X denotes the number of monomers which are radicals. Structure data sets have 'structure_data' in the title, DFT calculated g values have 'giso_DFT_data' in the title. The files are in .npy (NumPy) format. Models : ERT models trained on SOAP, MBTR and DAD feature vectors. Scripts : Contains scripts which can be used to predict g values from XYZ files of PTMA structures with 6 monomer units and varying radical density. The script 'prediction_functions.py' contains the functions which transform the XYZ coordinates into an appropriate feature vector which the trained model uses to predict. Description of individual functions are also given as docstrings (python documentation strings) in the code. The folder also contains additional files needed for the ERT-DAD model in .pkl format. XYZ_files : Contains atomic coordinates of PTMA structures in XYZ format. Two subfolders : WSD and TE-2 correspond to structures present in the whole structure data set and TE-2 test data set (see main text in the manuscript for details). Filenames in the folder 'XYZ_files/TE-2/PTMA-X/' are of the type 'chainlength_6ptma_Y'_Y''.xyz' where 'chainlength_6ptma' denotes the length of polymer chain (6 monomers), Y' denotes the proportion of monomers which are radicals (for instance, Y' = 50 means 3 out of 6 monomers are radicals) and Y'' denotes the order of the MD time frame. Actual time frame values of Y'' in ps is given in the manuscript. PTMA-ML.ipynb : Jupyter notebook detailing the workflow of generating the trained model. The file includes steps to load data sets, transform xyz files using molecular descriptors, optimise hyperparameters , train the model, cross validate using the training data set and evaluate the model. PTMA-ML.pdf : PTMA-ML.ipynb in PDF format. List of abbreviations : PTMA : poly(2,2,6,6-tetramethyl-1-piperidinyloxy-4-yl methacrylate) TR : Training data set TE-1 : Test data set 1 TE-2 : Test data set 2 ERT : Extremely randomized trees WSD : Whole structure data set SOAP : Smooth overlap of atomic orbitals MBTR : Many-body tensor representation DAD : Distances-Angles-Dihedrals
536 _ _ |a 1223 - Batteries in Application (POF4-122)
|0 G:(DE-HGF)POF4-1223
|c POF4-122
|f POF IV
|x 0
536 _ _ |a DFG project 422726248 - SPP 2248: Polymer-basierte Batterien (422726248)
|0 G:(GEPRIS)422726248
|c 422726248
|x 1
536 _ _ |a HITEC - Helmholtz Interdisciplinary Doctoral Training in Energy and Climate Research (HITEC) (HITEC-20170406)
|0 G:(DE-Juel1)HITEC-20170406
|c HITEC-20170406
|x 2
588 _ _ |a Dataset connected to DataCite
700 1 _ |a Mitra, Souvik
|0 0009-0005-9476-980X
|b 1
700 1 _ |a Eichel, Rüdiger-A.
|0 P:(DE-Juel1)156123
|b 2
|u fzj
700 1 _ |a Diddens, Diddo
|0 P:(DE-Juel1)169877
|b 3
700 1 _ |a Granwehr, Josef
|0 P:(DE-Juel1)162401
|b 4
|u fzj
773 _ _ |a 10.26165/JUELICH-DATA/TOBXWP
856 4 _ |u https://doi.org/10.26165/JUELICH-DATA/TOBXWP
909 C O |o oai:juser.fz-juelich.de:1028902
|p VDB
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 0
|6 P:(DE-Juel1)185897
910 1 _ |a RWTH Aachen
|0 I:(DE-588b)36225-6
|k RWTH
|b 0
|6 P:(DE-Juel1)185897
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 2
|6 P:(DE-Juel1)156123
910 1 _ |a RWTH Aachen
|0 I:(DE-588b)36225-6
|k RWTH
|b 2
|6 P:(DE-Juel1)156123
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 3
|6 P:(DE-Juel1)169877
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 4
|6 P:(DE-Juel1)162401
910 1 _ |a RWTH Aachen
|0 I:(DE-588b)36225-6
|k RWTH
|b 4
|6 P:(DE-Juel1)162401
913 1 _ |a DE-HGF
|b Forschungsbereich Energie
|l Materialien und Technologien für die Energiewende (MTET)
|1 G:(DE-HGF)POF4-120
|0 G:(DE-HGF)POF4-122
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-100
|4 G:(DE-HGF)POF
|v Elektrochemische Energiespeicherung
|9 G:(DE-HGF)POF4-1223
|x 0
914 1 _ |y 2024
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)IET-1-20110218
|k IET-1
|l Grundlagen der Elektrochemie
|x 0
980 _ _ |a dataset
980 _ _ |a VDB
980 _ _ |a I:(DE-Juel1)IET-1-20110218
980 _ _ |a UNRESTRICTED


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21