Dataset

Comparative study in validity of three regions of 18S-rRNA for eukaryote amplicon sequence analyses

In the present study, we performed a detailed investigation of the 18S-rDNA, covering the numbers of registered sequences, the frequencies of amplification success, and the amplicon sequence variability among three regions (V1-V3, V4-V5 and V7-V9), using in silico PCRs based on public databases, as well as the identification power of each region in NGS-based environmental surveys of the planktonic eukaryote community. Although the number of registered sequences for the V4-V5 region was remarkably higher than for the other regions, its identification power in NGS-based environmental surveys was the lowest, owing to its having the lowest sequence variability. The number of registered sequences for the V1-V3 region was ca. half that for the V7-V9 region, while the sequence variability in the V1-V3 region was significantly higher than that in the V7-V9 region. Accordingly, the identification power did not differ significantly between these two regions, implying that identification power is affected by the combination of the number of registered sequences and the sequence variability. We therefore believe that the V1-V3 region will be the best choice for NGS-based monitoring of the planktonic eukaryote community in the near future, as the number of sequences deposited in public databases increases.
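The in silico PCR step mentioned in the description can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the toy template, the toy primers and the mismatch tolerance are all hypothetical and were chosen only to show the idea of locating a forward primer and the reverse complement of a reverse primer on a reference sequence and extracting the region between them.

```python
# Minimal in silico PCR sketch (hypothetical names and toy sequences,
# not the primers or software used in the study).

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def count_mismatches(a, b):
    """Count positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def find_site(template, primer, max_mismatches, start=0):
    """Return the first index >= start where primer binds, or None."""
    for i in range(start, len(template) - len(primer) + 1):
        window = template[i:i + len(primer)]
        if count_mismatches(window, primer) <= max_mismatches:
            return i
    return None


def in_silico_pcr(template, fwd, rev, max_mismatches=0):
    """Return the predicted amplicon (primers included), or None."""
    f = find_site(template, fwd, max_mismatches)
    if f is None:
        return None
    # The reverse primer anneals to the opposite strand, so search for
    # its reverse complement downstream of the forward primer site.
    rev_rc = reverse_complement(rev)
    r = find_site(template, rev_rc, max_mismatches, start=f + len(fwd))
    if r is None:
        return None
    return template[f:r + len(rev_rc)]


# Toy example: a made-up template and made-up 6-base primers.
toy_template = "GGGACGTACTTTTGGGGCCGATTCAAAA"
toy_fwd = "ACGTAC"
toy_rev = "TGAATC"  # reverse complement of GATTCA on the template

amplicon = in_silico_pcr(toy_template, toy_fwd, toy_rev)
print(amplicon)
```

Running such a search over every reference sequence in a database, and recording how often an amplicon is returned, is one way an amplification success frequency of the kind described above could be estimated; real tools also account for degenerate bases and amplicon length limits, which this sketch omits.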

Published: September 20, 2022 at 12:42

License: This work is licensed under a Creative Commons Attribution Non Commercial (CC-BY) 4.0 License

URL: https://hosted-datasets.gbif.org/mgnify/MGYS00002476.zip

Contacts:

Fisheries Research Agency

4,739 occurrence records
806 taxa
109 species

Missing and invalid fields

Field Missing Invalid
coordinateUncertaintyInMeters 4,739 (100.0%)
eventDate 4,739 (100.0%)
occurrenceStatus 4,739 (100.0%)
scientificNameID 4,739 (100.0%)

Quality flags

The OBIS data quality flags are documented at https://github.com/iobis/obis-qc.

Flag Dropped Records
NO_MATCH 820 (17.3%)
MARINE_UNSURE 186 (3.9%)
NO_ACCEPTED_NAME 57 (1.2%)
WORMS_ANNOTATION_UNRESOLVABLE 18 (0.4%)
NOT_MARINE 16 (0.3%)
WORMS_ANNOTATION_REJECT_AMBIGUOUS 8 (0.2%)
WORMS_ANNOTATION_RESOLVABLE 6 (0.1%)
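
The percentages listed for the quality flags appear to be each flag's record count divided by the 4,739 occurrence records reported for the dataset. The short sketch below recomputes them under that assumption; the variable and function names are illustrative and are not part of any OBIS tool.

```python
# Recompute the quality-flag percentages, assuming each percentage is
# the flag count divided by the dataset's total occurrence records.

TOTAL_RECORDS = 4739  # occurrence records reported for this dataset

# Flag counts copied from the quality flags table.
flag_counts = {
    "NO_MATCH": 820,
    "MARINE_UNSURE": 186,
    "NO_ACCEPTED_NAME": 57,
    "WORMS_ANNOTATION_UNRESOLVABLE": 18,
    "NOT_MARINE": 16,
    "WORMS_ANNOTATION_REJECT_AMBIGUOUS": 8,
    "WORMS_ANNOTATION_RESOLVABLE": 6,
}


def percent_of_total(count, total=TOTAL_RECORDS):
    """Format count as a percentage of total with one decimal place."""
    return f"{100 * count / total:.1f}%"


flag_percentages = {
    flag: percent_of_total(count) for flag, count in flag_counts.items()
}

for flag, pct in flag_percentages.items():
    print(f"{flag}: {flag_counts[flag]} ({pct})")
```

Under this assumption the recomputed values agree with the listed ones, for example 820 / 4,739 ≈ 17.3% for NO_MATCH.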

Measurement types

DNA derived data