[
    {
        "id": "data:g1tyb-nje27",
        "collection": "data",
        "collection_id": "g1tyb-nje27",
        "cite_using_url": "https://data.caltech.edu/records/g1tyb-nje27",
        "type": "dataset",
        "title": "Discovery of complex oxides via automated experiments and data science",
        "author": [
            {
                "family_name": "Yang",
                "given_name": "Lusann",
                "orcid": "0000-0001-5071-2936"
            },
            {
                "family_name": "Haber",
                "given_name": "Joel",
                "orcid": "0000-0001-7847-5506"
            },
            {
                "family_name": "Armstrong",
                "given_name": "Zan"
            },
            {
                "family_name": "Yang",
                "given_name": "Samuel J.",
                "orcid": "0000-0003-2460-6456"
            },
            {
                "family_name": "Kan",
                "given_name": "Kevin"
            },
            {
                "family_name": "Zhou",
                "given_name": "Lan"
            },
            {
                "family_name": "Richter",
                "given_name": "Matthias H.",
                "orcid": "0000-0003-0091-2045"
            },
            {
                "family_name": "Roat",
                "given_name": "Christopher"
            },
            {
                "family_name": "Wagner",
                "given_name": "Nicholas"
            },
            {
                "family_name": "Coram",
                "given_name": "Marc"
            },
            {
                "family_name": "Berndl",
                "given_name": "Marc",
                "orcid": "0000-0001-6750-5861"
            },
            {
                "family_name": "Riley",
                "given_name": "Patrick"
            },
            {
                "family_name": "Gregoire",
                "given_name": "John M.",
                "orcid": "0000-0002-2863-5265"
            }
        ],
        "abstract": "<p>This dataset is licensed under the Creative Commons Attribution 4.0 license (CC-BY-4.0). See https://creativecommons.org/licenses/by/4.0/ for more information.</p><p>&nbsp;</p><p>If using this dataset, please cite https://doi.org/10.1073/pnas.2106042118</p><p>&nbsp;</p><p>We've released data from 6 print sessions, comprising 173 plates, 131 quaternary oxide systems, 6,918,024 individual composition samples, and 376,752 distinct compositions. While the tenfold reproductions within each plate are well controlled, uncontrolled variables (printhead age, etc.) may lead to poorer consistency between print sessions.</p><p>&nbsp;</p><p>The data exists in four directories and one metadata file. Each directory contains one type of data, with one *.csv file per printed plate.</p><p>&nbsp;</p><p>i. The data in ten_replicas/ consists of optical transmission data, with one row per printed patch on a plate. The column headers are:</p><p>ExpID: An integer experiment ID for the printed patch on the plate.</p><p>row, col: The row and the column coordinates of the printed patch in the microscope image.</p><p>signal_#: The measurement of \u0251, the optical transmission spectrum of the printed patch, at a given wavelength. # ranges from 0 to 8, inclusive, indicating transmission spectra at the following wavelengths: 375, 395, 455, 530, 590, 617, 660, 735, &amp; 850 nm.</p><p>plate: The integer plate identifier.</p><p>line: An integer identifier of the composition gradient that was printed.</p><p>line_experiment_id: An integer identifier of the composition sample along the composition gradient.</p><p>replica: An integer identifier of the replica # of the printed line.</p><p>metal: Each plate will have up to six metal column headers, where the possible metals include: ['Ce', 'Co', 'Cu', 'Fe', 'In', 'Mg', 'Ni', 'Sn', 'Ta', 'Y']. The metal columns sum to 1, indicating the ratios of metals printed.</p><p>&nbsp;</p><p>ii. 
The data in aggregated_replicas/ consists of optical transmission data, with one row per tenfold aggregated patch on a plate. The column headers are:</p><p>signal_#: The measurement of \u0251, the optical transmission spectrum of the printed patch, at a given wavelength. # ranges from 0 to 8, inclusive, indicating transmission spectra at the following wavelengths: 375, 395, 455, 530, 590, 617, 660, 735, &amp; 850 nm.</p><p>plate: The integer plate identifier.</p><p>line: An integer identifier of the composition gradient that was printed.</p><p>line_experiment_id: An integer identifier of the composition sample along the composition gradient.</p><p>metal: Each plate will have up to six metal column headers, where the possible metals include: ['Ce', 'Co', 'Cu', 'Fe', 'In', 'Mg', 'Ni', 'Sn', 'Ta', 'Y']. The metal columns sum to 1, indicating the ratios of metals printed.</p><p>&nbsp;</p><p>iii. The data in mixture/ represents the outcome of a probabilistic model that a given composition can be explained by a mixture of at most 3 binary signals. There is one row per composition. The column headers are:</p><p>log_prob: The log of the probability that this composition is explainable by at most 3 binary signals.</p><p>metal: Each plate will have up to six metal column headers, where the possible metals include: ['Ce', 'Co', 'Cu', 'Fe', 'In', 'Mg', 'Ni', 'Sn', 'Ta', 'Y']. The metal columns sum to 1, indicating the ratios of metals in the composition.</p><p>&nbsp;</p><p>iv. The data in phase_fits/ represents the outcome of a phase fitting model. There is one row per phase diagram. This data is meant to be read using the example colab. The column headers are:</p><p>residual: Float, the residual of the phase fit.</p><p>signal_type: This is either 'signal' or 'sigma', indicating the type of the phase fit (see paper).</p><p>discretization: The integer number of intervals we discretized the phase space into.</p><p>n_points: The number of internal points in the phase diagram. 
This is an integer between 1 and 5, inclusive.</p><p>metal_0, metal_1, metal_2: Three strings identifying the constituent metals of the phase diagram.</p><p>point_#_pos_0, point_#_pos_1: The coordinates of a phase point. # ranges between 0 and 7, inclusive. point_#_pos_0 gives the float amount of metal_0, and point_#_pos_1 gives the float amount of metal_1. The float amount of metal_2 can be inferred via 1 - (point_#_pos_0 + point_#_pos_1).</p><p>point_#_fitted_channel_X: The fitted optical absorption spectra of point_#. # is an integer between 0 and 7, inclusive. X is an integer between 0 and 8, inclusive, indicating the wavelength of the light absorbed.</p><p>&nbsp;</p><p>The files are publicly available for access via:</p><p>- the gsutil CLI tool at https://cloud.google.com/storage/docs/gsutil</p><p>- the tf.io.gfile APIs at https://www.tensorflow.org/api_docs/python/tf/io/gfile/GFile</p><p>- HTTP API: http://storage.googleapis.com/gresearch/metal-oxide-spectroscopy/path/to/file</p><p>&nbsp;</p><p>This file, the README, is available at:</p><p>http://storage.googleapis.com/gresearch/metal-oxide-spectroscopy/README.txt</p><p>&nbsp;</p><p>The metadata file, which lists all the plates available for download, is available at:</p><p>http://storage.googleapis.com/gresearch/metal-oxide-spectroscopy/metadata.csv</p><p>&nbsp;</p><p>The plate data for each of the four data types listed above can be found at:</p><p>http://storage.googleapis.com/gresearch/metal-oxide-spectroscopy/data_type_subdir/plate.csv</p>",
        "doi": "10.22002/g1tyb-nje27",
        "publisher": "CaltechDATA",
        "publication_date": "2021-09-10"
    },
    {
        "id": "data:p2bsg-7tb62",
        "collection": "data",
        "collection_id": "p2bsg-7tb62",
        "cite_using_url": "https://data.caltech.edu/records/p2bsg-7tb62",
        "type": "dataset",
        "title": "Raman spectroscopy dataset on Mn-Fe-Ni-Cu-Co-Zn oxides",
        "author": [
            {
                "family_name": "Gregoire",
                "given_name": "John",
                "orcid": "0000-0002-2863-5265"
            },
            {
                "family_name": "Boyd",
                "given_name": "David"
            },
            {
                "family_name": "Haber",
                "given_name": "Joel",
                "orcid": "0000-0001-7847-5506"
            }
        ],
        "abstract": "Raman spectra (Renishaw Inline) for a series of compositions synthesized by inkjet printing, with intended compositions including all 1-, 2-, 3-, and 4-cation combinations of Mn-Fe-Ni-Cu-Co-Zn at 10% intervals in each composition dimension. The file format is consistent with that described at udiscover.it.",
        "doi": "10.22002/D1.1179",
        "publisher": "CaltechDATA",
        "publication_date": "2019-02-21"
    }
]