Cloud-SPAN logo. Cloud-SPAN logo.
  • Home
  • Precourse Instructions
  • About
  1. Metatranscriptomics
  • Files and Directories
    • Understanding your file system
    • Logging onto the Cloud
    • Introducing the Shell
  • Using the Command Line
    • Navigating Files and Directories
    • Working with Files and Directories
    • Redirection
  • QC & RNA pre-processing
    • Introduction to Metatranscriptomics
    • Quality of Raw Reads
    • Ribosomal RNA Filtering
  • Taxonomic Annotation
    • Extracting a Community Profile
    • Visualising Community Structure
  • Functional Annotation
    • Extracting Functional Information
  • Extras
    • Data
    • Glossary
    • Workflow Reference

On this page

  • Prerequisites
  • Data
  • Course format
  • Course Overview
  • Edit this page
  • Report an issue

Metatranscriptomics

This hands-on, online course teaches data analysis for metatranscriptomics projects. It is aimed at those with little or no experience of using high performance computing (HPC) for data analysis. In the course we will cover:

  • navigating file directories and using the command line
  • logging into a remote cloud instance
  • using common commands and running analysis programs in the command line
  • what is metatranscriptomics?
  • following a metatranscriptomics analysis workflow including:
    • quality control on RNA-seq reads

    • adapter trimming and quality filtering

    • Removing rRNA sequences from RNA-seq data

    • preparing reads for downstream analyses

    • taxonomic profiling of metatranscriptomic data

    • functional interpretation of expressed genes

    • exploring results using interactive tools such as Krona

The course is taught as a mixture of live coding, online lectures, self-study and drop-in sessions.

Prerequisites

ImportantBiological concepts and software setup

This course assumes no prior experience with the tools covered in the workshop, but learners are expected to have a basic understanding of biological concepts, including gene expression, mRNA, and microbiomes. Participants should bring their own laptops and be prepared to engage actively in hands-on metatranscriptomics exercises.

To get started, follow the directions in the “Precourse Instructions” tab to get access to the required software and data for this workshop. Windows users need to install Git Bash in their laptop. Mac users may need to configure the terminal program in their laptop to use the Bash shell.

Data

TipAbout the data used in the course

This course uses data from the Galaxy Training Network tutorial on metatranscriptomics, which utilizes a dataset hosted on Zenodo (Kunath et al., 2018, ISME J). The dataset comes from a time-series analysis of a microbial community inside a bioreactor. For the purposes of this course, we focus on a single time point (the first) and one biological replicate (replicate A). The data consist of paired-end RNA-Seq sequences in FastQ format, representing the expressed functional potential of the microbial community.

The samples originate from a cellulose-degrading microbial consortium (SEM1b) enriched from a thermophilic biogas reactor in Norway. The consortium is co-dominated by Clostridium thermocellum, a cellulolytic bacterium, and multiple strains of Coprothermobacter proteolyticus, which together degrade plant biomass under anaerobic, high-temperature conditions. This enrichment was performed to study the active expression of carbohydrate-degrading genes within the community.

Course format

This workshop is designed to be run on pre-imaged Amazon Web Services (AWS) instances. All the software and data used in the workshop are hosted on an Amazon Machine Image (AMI). We will give you details as to how to access these instances after registration.

The course will take place over three weeks and will combine live coding and teaching with offline work. We will guide you through the analysis each session but some steps may need to completed offline due to the amount of time they take to complete. There will also be drop-in sessions to offer support and troubleshooting help, and a Slack workspace for questions.

Course Overview

Lesson Overview
Files and Directories Learn about files and directories, log onto a cloud instance and get a basic introduction to the shell.
Using the Command Line Learn more about using the shell to navigate directories, manipulate and search files, and combine existing commands to do new things.
QC & Sorting How to quality control and sort transcript reads.
Taxonomic Annotation How to use Metaphlan to identify taxonomy of transcript reads
Functional Annotation How to use Humann to identify functional genes from your sorted transcript reads.
Back to top

Licensed under CC-BY 4.0 2021–26 by Cloud-SPAN

  • Edit this page
  • Report an issue
Cookie Preferences