Dataset

Universal Amplicon Sequences (mixed 16S/18S) from GEOTRACES Cruises GA03 and GP13

OBIS Secretariat

Dedicated sampling campaigns such as JGOFS, CLIVAR, and GEOTRACES have quantified critical oceanic biogeochemical processes on a global scale. Integrating these measurements with microbial community composition data is highly desirable because it would allow hypotheses about biogeographic distributions to be tested and could lead to the discovery of organisms responsible for a particular biogeochemical process. A promising strategy for generating such microbial community composition data is high-throughput sequencing of PCR amplicons generated with the 515Y/926R universal 16S/18S primer set. The two key advantages of the 515Y/926R primers are (1) their comprehensiveness, recovering amplicons from the entire cellular microbial community, and (2) their quantitative nature, recovering gene copy abundances as shown previously with microbial community standards. Compared to metagenomes, amplicons have the additional advantage of more easily detecting rare community members that may be biogeochemically significant (e.g. diazotrophs). In this study, we applied the 515Y/926R primers to DNA from the recently published bioGEOTRACES metagenomic dataset and used the results to describe microbial community composition across a longitudinal transect of the southern Pacific Ocean from Australia to Tahiti (GEOTRACES section GP13) and a longitudinal transect of the northern Atlantic from Massachusetts to the Canary Islands (GEOTRACES section GA03). In addition, we conducted intercomparisons with metagenomic taxa abundances and show that the two techniques correspond strongly to one another for most samples (average R^2 = 0.97).
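The intercomparison described above reports an R^2 between amplicon and metagenomic taxa abundances. The abstract does not state how that value was computed, so the following is only a minimal sketch of one common choice, the squared Pearson correlation between paired relative abundances for a single sample. The function name `r_squared` and the abundance values are hypothetical and are not taken from the dataset.

```python
def r_squared(x, y):
    """Squared Pearson correlation between two paired sequences of numbers."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("R^2 is undefined when one sequence is constant")
    return (sxy * sxy) / (sxx * syy)


# Hypothetical relative abundances of the same five taxa in one sample,
# estimated from amplicons and from the metagenome (illustrative values only).
amplicon = [0.40, 0.25, 0.20, 0.10, 0.05]
metagenome = [0.38, 0.27, 0.18, 0.12, 0.05]

sample_r2 = r_squared(amplicon, metagenome)
print(f"R^2 for this sample: {sample_r2:.3f}")
```

Under this reading, an average R^2 would be obtained by computing the value per sample and taking the mean across samples.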

Published: September 22, 2022 at 08:44

License: This work is licensed under a Creative Commons Attribution Non Commercial (CC-BY-NC) 4.0 License

URL: https://hosted-datasets.gbif.org/mgnify/MGYS00005710.zip

Contacts:

University of Southern California

42,338 occurrence records
593 taxa
8 species

Missing and invalid fields

Field Missing Invalid
coordinateUncertaintyInMeters 42,338 (100.0%)
decimalLatitude 295 (0.7%)
decimalLongitude 295 (0.7%)
eventDate 295 (0.7%)
maximumDepthInMeters 42,338 (100.0%)
minimumDepthInMeters 42,338 (100.0%)
occurrenceStatus 42,338 (100.0%)
scientificNameID 42,338 (100.0%)

Quality flags

The OBIS data quality flags are documented at https://github.com/iobis/obis-qc.

Flag Dropped Records
NO_DEPTH 42,338 (100.0%)
NO_MATCH 15,068 (35.6%)
MARINE_UNSURE 3,569 (8.4%)
NOT_MARINE 295 (0.7%)
NO_COORD 295 (0.7%)
WORMS_ANNOTATION_UNRESOLVABLE 218 (0.5%)
NO_ACCEPTED_NAME 214 (0.5%)
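
The percentages in the quality flag table are consistent with each flag's record count divided by the dataset total of 42,338 occurrence records, rounded to one decimal place. The short sketch below reproduces that arithmetic. The counts are copied from the table above, while the variable and function names are illustrative and not part of any OBIS tool.

```python
TOTAL_RECORDS = 42338  # occurrence records in this dataset

# Record counts per quality flag, copied from the table above.
flag_counts = {
    "NO_DEPTH": 42338,
    "NO_MATCH": 15068,
    "MARINE_UNSURE": 3569,
    "NOT_MARINE": 295,
    "NO_COORD": 295,
    "WORMS_ANNOTATION_UNRESOLVABLE": 218,
    "NO_ACCEPTED_NAME": 214,
}


def percent_of_total(count, total):
    """Share of `total` represented by `count`, as a percentage with one decimal."""
    return round(100 * count / total, 1)


flag_percentages = {
    flag: percent_of_total(count, TOTAL_RECORDS)
    for flag, count in flag_counts.items()
}

for flag, pct in flag_percentages.items():
    print(f"{flag}: {pct}%")
```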

Measurement types

DNA derived data