DEEPEN Global Standardized Categorical Exploration Datasets for Magmatic Plays

DOI 10.15121/1995526

Publicly accessible License

DEEPEN stands for DE-risking Exploration of geothermal Plays in magmatic ENvironments.

As part of the development of the DEEPEN 3D play fairway analysis (PFA) methodology for magmatic plays (conventional hydrothermal, superhot EGS, and supercritical), weights needed to be developed for use in the weighted sum of the different favorability index models produced from geoscientific exploration datasets. This was done using two different approaches: one based on expert opinions, and one based on statistical learning. This GDR submission includes the datasets used to produce the statistical learning-based weights.
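The weighted sum mentioned above can be illustrated with a minimal sketch. It assumes each favorability index model is a NumPy array on a shared grid; the dataset names, grid values, and weights below are hypothetical and are not taken from the DEEPEN submission.

```python
import numpy as np

def weighted_favorability(models, weights):
    """Combine same-shaped favorability index models by a normalized weighted sum."""
    names = sorted(models)
    w = np.array([weights[n] for n in names], dtype=float)
    w = w / w.sum()  # normalize so the weights sum to 1
    stacked = np.stack([np.asarray(models[n], dtype=float) for n in names])
    # Contract the weight vector against the leading (model) axis.
    return np.tensordot(w, stacked, axes=1)

# Hypothetical 2x2 favorability grids for two exploration datasets.
models = {
    "resistivity": np.array([[0.2, 0.8], [0.5, 1.0]]),
    "seismicity": np.array([[0.6, 0.4], [0.5, 0.0]]),
}
# Hypothetical raw weights (expert-based or statistically derived).
weights = {"resistivity": 3.0, "seismicity": 1.0}

combined = weighted_favorability(models, weights)
print(combined)
```

Normalizing the weights keeps the combined favorability on the same scale as the inputs, whichever approach produced the weights.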

While expert opinions allow us to include more nuanced information in the weights, expert opinions are subject to human bias. Data-centric or statistical approaches help to overcome these potential human biases by focusing on and drawing conclusions from the data alone. The drawback is that, to apply these types of approaches, a dataset is needed. Therefore, we attempted to build comprehensive standardized datasets mapping anomalies in each exploration dataset to each component of each play. This data was gathered through a literature review focused on magmatic hydrothermal plays along with well-characterized areas where superhot or supercritical conditions are thought to exist. Datasets were assembled for all three play types, but the hydrothermal dataset is the least complete due to its relatively low priority.

For each known or assumed resource, the dataset states what anomaly in each exploration dataset is associated with each component of the system. The data is only semi-quantitative: values are either high, medium, or low relative to background levels. In addition, the dataset has significant gaps, as not every possible exploration dataset has been collected and analyzed at every known or suspected geothermal resource area, in the context of all possible play types. The following training sites were used to assemble this dataset:
- Conventional magmatic hydrothermal: Akutan (from AK PFA), Oregon Cascades PFA, Glass Buttes OR, Mauna Kea (from HI PFA), Lanai (from HI PFA), Mt St Helens Shear Zone (from WA PFA), Wind River Valley (from WA PFA), Mount Baker (from WA PFA).
- Superhot EGS: Newberry (EGS demonstration project), Coso (EGS demonstration project), Geysers (EGS demonstration project), Eastern Snake River Plain (EGS demonstration project), Utah FORGE, Larderello, Kakkonda, Taupo Volcanic Zone, Acoculco, Krafla.
- Supercritical: Coso, Geysers, Salton Sea, Larderello, Los Humeros, Taupo Volcanic Zone, Krafla, Reykjanes, Hengill.
Disclaimer: Treat the supercritical fluid anomalies with skepticism. They are based on assumptions due to the general lack of confirmed supercritical fluid encounters and samples at the sites included in this dataset, at the time of assembling the dataset. The main assumption was that the supercritical fluid in a given geothermal system has shared properties with the hydrothermal fluid, which may not be the case in reality.
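To work with the semi-quantitative high/medium/low values numerically, one option is an ordinal encoding that keeps gaps explicit. The sketch below is an assumption about how this could be done, not a description of the spreadsheet layout or the authors' processing: the site names, column order, and the 1/2/3 mapping are hypothetical.

```python
import numpy as np

# Hypothetical ordinal mapping for the categorical anomaly levels.
LEVELS = {"low": 1.0, "medium": 2.0, "high": 3.0}

def encode_anomalies(rows):
    """Map high/medium/low labels to numbers; anything else (gaps) becomes NaN."""
    encoded = [
        [LEVELS.get(str(value).strip().lower(), np.nan) for value in row]
        for row in rows
    ]
    return np.array(encoded, dtype=float)

# Hypothetical rows: one per site, one column per exploration dataset.
rows = [
    ["High", "low", None],
    ["medium", None, "High"],
    ["Low", "Medium", "medium"],
]
X = encode_anomalies(rows)

# Fraction of sites with a recorded value for each exploration dataset.
completeness = 1.0 - np.isnan(X).mean(axis=0)
print(X)
print(completeness)
```

Reporting completeness per column makes the gaps described above visible before any statistical method is applied.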

Once the datasets were assembled, principal component analysis (PCA) was applied to each. PCA is an unsupervised statistical learning technique that summarizes the directions of variance in the data; being unsupervised, it does not require labels on the data. This approach was chosen because our labels are not certain, i.e., we do not know with 100% confidence that superhot resources exist at all the assumed positive areas. We also do not have data for any known non-geothermal areas, meaning that it would be challenging to apply a supervised learning technique. To generate weights from the PCA, the PCA loading values were analyzed. PCA loading values represent how much each feature contributes to each principal component, and therefore to the overall variance in the data.
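As a rough illustration of how loading values could be turned into weights, the sketch below computes PCA with NumPy's SVD and scores each feature by its absolute loadings, weighted by each component's explained variance ratio. This scoring rule is an assumption made for illustration only; the authors' actual weighting methodology is described in the linked weights dataset. The input matrix is hypothetical and assumed to be complete (gaps would need to be handled before PCA).

```python
import numpy as np

def pca_loading_weights(X, n_components=2):
    """Derive illustrative feature weights from PCA loadings (assumed scheme)."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)  # center each feature
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    eigvals = s**2 / (X.shape[0] - 1)      # variance along each component
    explained = eigvals / eigvals.sum()    # explained variance ratio
    k = min(n_components, len(s))
    # Loadings: eigenvectors scaled by the square root of their eigenvalues.
    loadings = Vt[:k].T * np.sqrt(eigvals[:k])  # shape: features x components
    scores = np.abs(loadings) @ explained[:k]
    return scores / scores.sum()           # normalize to sum to 1

# Hypothetical encoded anomalies: rows are sites, columns are exploration datasets.
X = np.array([
    [3, 1, 2],
    [2, 1, 3],
    [1, 1, 1],
    [3, 1, 3],
    [1, 1, 2],
], dtype=float)

weights = pca_loading_weights(X)
print(weights)
```

In this toy input the second column never varies, so it contributes nothing to the variance and receives a weight of essentially zero.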

Download All Resources 1.03 MB — 3 files

DEEPEN 3D PFA Weights for Exploration Datasets in Magmatic Environments

Weights produced using these datasets, along with an explanation of the weighting methodology

EGS Standardized Exploration Dataset.xlsx

Standardized dataset mapping anomalies in each exploration dataset to each component of a superhot EGS play. This data was gathered through a literature review focused ... more

Download 106.2 kB

Hydrothermal Standardized Exploration Dataset.xlsx

Standardized dataset mapping anomalies in each exploration dataset to each component of a magmatic conventional hydrothermal play. This data was gathered through a lite... more

Download 862.41 kB

Supercritical Standardized Exploration Dataset.xlsx

Standardized dataset mapping anomalies in each exploration dataset to each component of a supercritical play. This data was gathered through a literature review focused... more

Download 89.41 kB

Citation Formats

TY  - DATA
AU  - Taverna, Nicole
A2  - Caliandro, Nils
A3  - King, Rachel
DB  - Geothermal Data Repository
DP  - Open EI | National Renewable Energy Laboratory
DO  - 10.15121/1995526
KW  - geothermal
KW  - energy
KW  - DEEPEN
KW  - superhot
KW  - supercritical
KW  - superhot EGS
KW  - magmatic
KW  - hydrothermal
KW  - weights
KW  - pfa
KW  - pca
KW  - unsupervised
KW  - standardized
KW  - exploration
KW  - characterization
KW  - global
LA  - English
DA  - 2023/06/30
PY  - 2023
PB  - National Renewable Energy Laboratory
T1  - DEEPEN Global Standardized Categorical Exploration Datasets for Magmatic Plays
UR  - https://doi.org/10.15121/1995526
ER  -

Taverna, Nicole, et al. DEEPEN Global Standardized Categorical Exploration Datasets for Magmatic Plays. National Renewable Energy Laboratory, 30 June, 2023, Geothermal Data Repository. https://doi.org/10.15121/1995526.

Taverna, N., Caliandro, N., & King, R. (2023). DEEPEN Global Standardized Categorical Exploration Datasets for Magmatic Plays. [Data set]. Geothermal Data Repository. National Renewable Energy Laboratory. https://doi.org/10.15121/1995526

Taverna, Nicole, Nils Caliandro, and Rachel King. DEEPEN Global Standardized Categorical Exploration Datasets for Magmatic Plays. National Renewable Energy Laboratory, June 30, 2023. Distributed by Geothermal Data Repository. https://doi.org/10.15121/1995526

@misc{GDR_Dataset_1509,
title = {DEEPEN Global Standardized Categorical Exploration Datasets for Magmatic Plays},
author = {Taverna, Nicole and Caliandro, Nils and King, Rachel},
url = {https://gdr.openei.org/submissions/1509},
year = {2023},
howpublished = {Geothermal Data Repository, National Renewable Energy Laboratory, https://doi.org/10.15121/1995526},
note = {Accessed: 2025-10-06},
doi = {10.15121/1995526}
}

https://dx.doi.org/10.15121/1995526

Details

Data from Jun 30, 2023

Last updated Sep 15, 2023

Submitted Jul 5, 2023

Organization

National Renewable Energy Laboratory

Contact

Nicole Taverna

Authors

Nicole Taverna

National Renewable Energy Laboratory

Nils Caliandro

National Renewable Energy Laboratory

Rachel King

National Renewable Energy Laboratory

Keywords

geothermal, energy, DEEPEN, superhot, supercritical, superhot EGS, magmatic, hydrothermal, weights, pfa, pca, unsupervised, standardized, exploration, characterization, global

DOE Project Details

Project Name DE-risking Exploration of geothermal Plays in magmatic ENvironments (DEEPEN)

Project Lead Lauren Boyd

Project Number 37178
