TY - JOUR
T1 - Development and Application of the Branched and Isoprenoid GDGT Machine Learning Classification Algorithm (BIGMaC) for Paleoenvironmental Reconstruction
AU - Martínez-Sosa, Pablo
AU - Tierney, Jessica E.
AU - Pérez-Angel, Lina C.
AU - Stefanescu, Ioana C.
AU - Guo, Jingjing
AU - Kirkels, Frédérique
AU - Sepúlveda, Julio
AU - Peterse, Francien
AU - Shuman, Bryan N.
AU - Reyes, Alberto V.
N1 - Publisher Copyright:
© 2023. American Geophysical Union. All Rights Reserved.
PY - 2023/7
Y1 - 2023/7
N2 - Glycerol dialkyl glycerol tetraethers (GDGTs), both archaeal isoprenoid GDGTs (isoGDGTs) and bacterial branched GDGTs (brGDGTs), have been used in paleoclimate studies to reconstruct environmental conditions. Since GDGTs are produced in many types of environments, their relative abundances also depend on the depositional setting. This suggests that the distribution of GDGTs also preserves useful information that can be used more broadly to infer these depositional environments in the geological past. Here, we combined existing iso- and brGDGT relative abundance data with newly analyzed samples to generate a database of 1,153 samples from several modern sedimentary settings. We observed a robust relationship between the depositional environment and the relative abundances of GDGTs in our samples. This data set was used to train and test the Branched and isoGDGT Machine learning Classification (BIGMaC) algorithm, which identifies the environment a sample comes from based on the distribution of GDGTs with high precision and recall (F1 = 0.95). We tested the model on the sedimentary record from the Giraffe kimberlite pipe, an Eocene maar in subantarctic Canada, and found that the BIGMaC reconstruction agrees with independent stratigraphic and palynological information, provides new information about the paleoenvironment of this site, and helps improve its paleotemperature reconstruction. In contrast, we also include an example from the PETM-aged Cobham lignite as a cautionary example that illustrates the limitations of the algorithm. We propose that in cases where paleoenvironments are unknown or are changing, BIGMaC can be applied in concert with other proxies to generate more refined paleoclimate records.
AB - Glycerol dialkyl glycerol tetraethers (GDGTs), both archaeal isoprenoid GDGTs (isoGDGTs) and bacterial branched GDGTs (brGDGTs), have been used in paleoclimate studies to reconstruct environmental conditions. Since GDGTs are produced in many types of environments, their relative abundances also depend on the depositional setting. This suggests that the distribution of GDGTs also preserves useful information that can be used more broadly to infer these depositional environments in the geological past. Here, we combined existing iso- and brGDGT relative abundance data with newly analyzed samples to generate a database of 1,153 samples from several modern sedimentary settings. We observed a robust relationship between the depositional environment and the relative abundances of GDGTs in our samples. This data set was used to train and test the Branched and isoGDGT Machine learning Classification (BIGMaC) algorithm, which identifies the environment a sample comes from based on the distribution of GDGTs with high precision and recall (F1 = 0.95). We tested the model on the sedimentary record from the Giraffe kimberlite pipe, an Eocene maar in subantarctic Canada, and found that the BIGMaC reconstruction agrees with independent stratigraphic and palynological information, provides new information about the paleoenvironment of this site, and helps improve its paleotemperature reconstruction. In contrast, we also include an example from the PETM-aged Cobham lignite as a cautionary example that illustrates the limitations of the algorithm. We propose that in cases where paleoenvironments are unknown or are changing, BIGMaC can be applied in concert with other proxies to generate more refined paleoclimate records.
KW - GDGTs
KW - machine learning
KW - paleoenvironment
UR - http://www.scopus.com/inward/record.url?scp=85165441315&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85165441315&partnerID=8YFLogxK
U2 - 10.1029/2023PA004611
DO - 10.1029/2023PA004611
M3 - Article
AN - SCOPUS:85165441315
SN - 2572-4517
VL - 38
JO - Paleoceanography and Paleoclimatology
JF - Paleoceanography and Paleoclimatology
IS - 7
M1 - e2023PA004611
ER -