RNA metabarcoding to assess zooplankton community response to environmental influence in boreal lake experimental enclosures

Related items loading ...

Overview Research Site Status and Provenance Access and Downloads

Section 1: Overview

Name of Research Project

Related Project	Part
GWF-NGS: Next Generation Solutions for Healthy Water Resources

Program Affiliations

No entries

Related Research Project(s)

Related Project	Part
GWF-NGS: Next Generation Solutions for Healthy Water Resources

Dataset Title

RNA metabarcoding to assess zooplankton community response to environmental influence in boreal lake experimental enclosures

Additional Information

GeoNetwork record: www.gwfnet.net/geonetwork/srv/eng/catalog.search#/metadata/f6342595-4173-458f-b2cd-8016448ec6c1 Tracking ID under eDNA project: UofS-eDNA-dataset-metadata-8

Creators and Contributors

Name	Role	Email	Institution
John Giesy	Point of Contact, Author, Principal Investigator	john.giesy@usask.ca	University of Saskatchewan
Phillip Ankley	Author, Originator		University of Saskatchewan
Yuwei Xie	Author, Originator		University of Saskatchewan
Sonya Havens	Author, Collaborator		International Institute for Sustainable Development
Lisa Peters	Author, Collaborator		University of Manitoba
Lauren Timlick	Author, Collaborator		International Institute for Sustainable Development
Jose Luis Rodriguez-Gil	Author, Collaborator		International Institute for Sustainable Development
Vince Palace	Author, Principal Investigator		International Institute for Sustainable Development

Abstract

Zooplankton metabarcoding can provide a high-throughput method for characterizing community response to environmental stressors. There are issues, however, with inferring actual response to stressors due to PCR biases, DNA persistence, and the inability to capture live organisms. Here, we used normalized activity, namely RNA metabarcoding normalized by DNA metabarcoding, to characterize the zooplankton community response to environmental influence. DNA and RNA metabarcoding was also applied in the context of assessing response of the zooplankton community exposed to simulated spills of diluted bitumen (dilbit), with experimental remediation practices of enhanced monitored natural recovery and shoreline cleaner application. Zooplankton samples were collected via pump on day -3 and 11 and 38 days after the simulated dilbit spill. The zooplankton samples were co-extracted for DNA and RNA and were PCR amplified targeting the mitochondrial Cytochrome c Oxidase subunit I gene (CO1) region, with amplicon sequencing following. The dataset includes the demultiplexed sequencing output, the feature table with species-level taxonomic annotation, and the sample metadata used for hypothesis testing. This dataset contains data from rock habitat enclosures.

Purpose

The project provides metabarcoding data of multiple communities during a period of diluted bitumen exposure and subsequent remediation treatments in lake mesocosms. This project will use next generation techniques to understand the changes of lower trophic organisms (eg. phytoplankton, zooplankton) under variable remediation efforts following a simulated spill of diluted bitumen.

Plain Language Summary

Following a simulated spill of diluted bitumen in a lake mesocosm, clean-up methods were implemented to assess the efficiency of different approaches. Zooplankton tissue and water samples were taken pre-exposure and two time points after exposure to assess the effects the various clean-up methods may have on lower trophic communities. Overall, this study will be used to help aid in ecological risk assessment of a crude petroleum spill and what the best approach is to clean-up the contaminated site.

Keywords

Keyword
metabarcoding
zooplankton
ecogenomics
diluted bitumen
oil-spill
remediation
freshwater
boreal
Cytochrome c Oxidase subunit 1 (CO1)

Citations

Ankley, P., Xie, Y., Havens, S., Peters, L., Tomy, G., Timlick, L., Rodriguez-Gil, J., Giesy, J., Palace, V. (2021). RNA metabarcoding to assess zooplankton community response to environmental influence in boreal lake experimental enclosures [Dataset]. Federated Research Data Repository. https://doi.org/10.20383/102.0332 Ankley, P., Xie, Y., Havens, S., Peters, L., Timlick, L., Rodriguez-Gil, J., Giesy, J., Palace, V. (2021). RNA metabarcoding helps reveal zooplankton community response to environmental stressors. Environmental Pollution, 118446. https://doi.org/10.1016/j.envpol.2021.118446

Description of Specific Location	Latitude	Longitude
Lake 260, IISD - Experimental Lakes Area	49.6964	-93.76752

Section 3: Status and Provenance

Dataset Version

Dataset Creation Date

2021-03-18

Status of data collection/production

○ Planned

○ In Progress

○ Abandoned

◉ Complete

Dataset Completion or Abandonment Date

2021-03-18

Data Update Frequency

○ Continually

○ Daily

○ Weekly

○ Biweekly

○ Monthly

○ Anually

○ As needed

○ Irregular

◉ None planned

○ Unknown

Creation Software

Software	Version	File Formats
fastq-multx		https://github.com/brwnj/fastq-multx
usearch	11
unoise3
R

Primary Source of Data

◻ Unknown/Unspecified

◻ Census

▣ Field collected samples

▣ Field experiment

◻ Field observation

◻ Field survey

◻ Human biological samples

◻ Lab experiment

◻ Model simulation

◻ Previously collected

◻ Qualitative (from observations or interviews)

◻ Social survey

◻ Traditional knowledge

◻ Other Source of Data (Please specify in field below)

Other Source of Data (if applicable)

Data Lineage (if applicable). Please include versions (e.g., input and forcing data, models, and coupling modules; instrument measurements; surveys; sample collections; etc.)

1. Description of methods used for collection/generation of data: Zooplankton samples were collected from boreal lake mesocosms exposed to a simulated diluted bitumen spill and additional remediation practices. Remediation practices consisted of enhanced monitored natural recovery and shoreline cleaner application, with samples being collected day -3, day 11 and day 38 post-spill. All samples were collected following strict quality control methods, including decontaminated equipment, one-use items, and tools specific to treatment. Samples were immediately placed in LifeGuard Soil Preservation Solution Qiagen Inc., Mississauga, ON) on ice prior to being transferred to a -80C freezer for long-term storage. 16s amplicon sequencing Total genomic DNA and RNA was extracted from zooplankton samples using the AllPrep DNA/RNA Mini Kit (Qiagen Inc., Mississauga, ON). Concentrations were measured and checked for quality using Qubit 4 Fluorometer and NanoDrop Spectrophotometers, respectively (Thermo Fisher Scientific, USA). Complementary DNA (cDNA) was synthesized using SuperScript IV Reverse Transcriptase (Invitrogen, CA, USA) along with ezDNase to remove residual DNA. The COI gene was amplified using primers specified by Leray et al. 2013, with the forward primer (mICOIintF) (GGWACWGGWTGAACWGTWTAYCCYCC) and the reverse primer (jgHCO2198R) (TAAACTTCAGGGTGACCAAAAAATCA). Samples were dual indexed to increase throughput of sequencing (Fadrosh et al., 2014). Samples were amplified with a 50 μL PCR reaction including Platinum Taq Hot Start II High-Fidelity DNA Polymerase (Invitrogen, USA) using a SimpliAmp thermal cycler (ThermoFisher Scientific) under the following conditions: initial denaturation at 98°C for 30s, followed by 25 cycles of 98°C for 30s, 58°C for 30s, and 72°C for 30s, with a final extension at 72°C for 10 min. PCR products were assessed for size and specificity using electrophoresis on a 1.2% w/v agarose gel and purified using the Qiagen QIAquick PCR Purification Kit (Qiagen Inc.). All purified products were quantified with the Qubit dsDNA HS assay kit and concentrations were adjusted to 1 ng/ μL with molecular-grade water. Purified products were pooled, and libraries were constructed using the NEBNext® DNA Library Prep Master Mix Set for Illumina® (New England BioLabs Inc., Whitby, ON). Libraries were quantified prior to sequencing using the NEBNext® Library Quant Kit for Illumina®. Sequencing was performed on an Illumina® MiSeq instrument (Illumina, San Diego, CA) using a 2x300 base pair kit. 2. Methods for processing the data: Sequences were trimmed, cleaned, and demultiplexed using a combination of USEARCH v11 (Edgar 2016) and fastq-multx ( https://github.com/brwnj/fastq-multx ). Paired-end sequences were merged with USEARCH v11 (Edgar 2016). Merged reads were then filtered for high quality (ee > 1.0) and greater length (300 bp). Primer binding regions were then removed from the merged sequence, with 17 nucleotides stripped left and 20 nucleotides stripped right. Chimeric sequences were subsequently removed using unoise3, and ZOTUs (zero-radius OTUs) were compared to the BOLD database, an in-house curated database, and the NCBI database for taxonomic annotation. Statistical analyses were performed in R (R Core Team, 2013). All samples specified were collapsed down to unique Time, Nucleic Acid, and Enclosure for further data analysis (See Data Specific Information For: sample-annotation.xlsx and variable list: TimeNucleicAcidEnclosure). 3. Instrument- or software-specific information needed to interpret the data: User defined. These are fastq files. They can be analyzed in QIIME2 or R. 4. Environmental/experimental conditions: Zooplankton samples were collected from the boreal lake shoreline enclosures in the rock habitat (Lake 260) located at the IISD-ELA. The samples were collected on June 19th, July 3rd, and July 30th using a pump and 53 micrometer filter mesh, with 20 Liters of water being pumped per sample. 5. Describe any quality-assurance procedures performed on the data: Quality was visualized with fastqc. USEARCH was used to trim low-quality bases out of the dataset prior to demultiplexing and merge reads after demultiplexing. unoise3 then filters and denoises the merged sequences. unoise3 also removes chimeras from the data.

Section 4: Access and Downloads

Access to the Dataset

Creative Commons Attribution 4.0 International (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/

Does the data have access restrictions?

▣ No restriction (data is currently open to public)

◻ Limited (data is currently under embargo until publication)

◻ Limited (data involves intellectual property issues related to local or traditional knowledge)

◻ Limited (release of data may cause harm to the environment or to the public)

◻ Limited (pre-existing data has been used and is subject to access restrictions)

◻ Limited (data involves human subjects)

◻ Limited (data is supported by industry partnerships)

◻ Limited (data is supported by government partnerships)

Downloading and Characteristics of the Dataset

Download Links and Instructions

The data can be downloaded from https://doi.org/10.20383/102.0332

Total Size of all Dataset Files (GB)

0.4

File formats and online databases

◻ Link to online database or web services (e.g., WISKI, ECCC)

▣ Archive files (.zip, .rar, .7z, .tar, .tgz, .tar.gz, etc.)

◻ CSV files (.csv - comma or tab separated value files)

▣ Excel document files (.xlsx, .xls)

◻ Image files (e.g., .tiff, .jpeg, .png, .gif, etc.)

◻ NetCDF files (.netcdf, .nc)

◻ Text files (.txt)

◻ Word document files (.docx, .doc)

◻ Other (Please specify in field below)

Other Data Formats (if applicable)

List of Parameters and Variables

Parameter/Variable	Unit	Frequency	Source

T-2021-06-08-i1WJPA4Ag9k2MufVq2L9i3lA Dataset 1.2

Begin Date	End Date
2019-06-19	2019-07-30

West Boundary Longitude
East Boundary Longitude
North Boundary Latitude
South Boundary Latitude