Dataset

EMOSE (2017) Inter-Comparison of Marine Plankton Metagenome Analysis Methods

OBIS Secretariat Open in mapper Explore occurrences

This study includes experiments from the following five sequencing strategies: 1) Shotgun DNA sequencing; 2) Amplicon sequencing after 18S amplification by PCR using 1391F/EukB primer set. Library were constructed according to Illumina Library protocol without any sizing; 3) Amplicon sequencing after 16S amplification by PCR using 515F/926R primer set. Library were constructed according to Illumina Library protocol without any sizing; 4) Amplicon sequencing after 16S amplification by PCR using 515F/926R primer set. Library were constructed according to Illumina Library protocol with a sizing step selecting the 450-650 bp fragments; and 5) Amplicon sequencing after 16S amplification by PCR using 515F/926R primer set. Library were constructed according to Illumina Library protocol with a sizing step selecting the 650-850 bp fragments.

Published: September 22, 2022 at 08:44

License: This work is licensed under a Creative Commons Attribution Non Commercial (CC-BY) 4.0 License

URL: https://hosted-datasets.gbif.org/mgnify/MGYS00001935.zip

Contacts:

The French National Sequencing Center (Genoscope)
The French National Sequencing Center (Genoscope)

160,521
occurrence records
2,385
taxa
345
species

Taxa

Missing and invalid fields

Field Missing Invalid
coordinateUncertaintyInMeters 160,521
100.0%
decimalLatitude 3,330
2.1%
decimalLongitude 3,330
2.1%
eventDate 3,330
2.1%
maximumDepthInMeters 3,330
2.1%
minimumDepthInMeters 3,330
2.1%
occurrenceStatus 160,521
100.0%
scientificNameID 160,521
100.0%

Quality flags

The OBIS data quality flags are documented at https://github.com/iobis/obis-qc.

Flag Dropped Records
NO_MATCH 50,915
31.7%
MARINE_UNSURE 15,358
9.6%
NO_COORD 3,330
2.1%
NO_DEPTH 3,330
2.1%
NO_ACCEPTED_NAME 1,064
0.7%
NOT_MARINE 520
0.3%
WORMS_ANNOTATION_UNRESOLVABLE 218
0.1%
WORMS_ANNOTATION_REJECT_AMBIGUOUS 92
0.1%
WORMS_ANNOTATION_RESOLVABLE 86
0.1%

Measurement types

DNA derived data