Cloud-SPAN logo. Cloud-SPAN logo.
  • Home
  • Precourse Instructions
  • About
  1. Extras
  2. Data
  • Files and Directories
    • Understanding your file system
    • Logging onto the Cloud
    • Introducing the Shell
  • Using the Command Line
    • Navigating Files and Directories
    • Working with Files and Directories
    • Redirection
  • QC & RNA pre-processing
    • Introduction to Metatranscriptomics
    • Quality of Raw Reads
    • Ribosomal RNA Filtering
  • Taxonomic Annotation
    • Extracting a Community Profile
    • Visualising Community Structure
  • Functional Annotation
    • Extracting Functional Information
  • Extras
    • Data
    • Glossary
    • Workflow Reference

On this page

  • The Data
  • Edit this page
  • Report an issue
  1. Extras
  2. Data

Data

The Data

This course uses data from the Galaxy Training Network tutorial on metatranscriptomics, which utilizes a dataset hosted on Zenodo (Kunath et al., 2018, ISME J). The dataset comes from a time-series analysis of a microbial community inside a bioreactor. For the purposes of this course, we focus on a single time point (the first) and one biological replicate (replicate A). The data consist of paired-end RNA-Seq sequences in FastQ format, representing the expressed functional potential of the microbial community.

The samples originate from a cellulose-degrading microbial consortium (SEM1b) enriched from a thermophilic biogas reactor in Norway. The consortium is co-dominated by Clostridium thermocellum, a cellulolytic bacterium, and multiple strains of Coprothermobacter proteolyticus, which together degrade plant biomass under anaerobic, high-temperature conditions. This enrichment was performed to study the active expression of carbohydrate-degrading genes within the community.

Back to top
Extracting Functional Information
Glossary

Licensed under CC-BY 4.0 2021–24 by Cloud-SPAN

  • Edit this page
  • Report an issue
Cookie Preferences