DEAGO

Introduction

DEAGO generates user-friendly quality control (QC) and analysis reports for RNA-Seq datasets. These interactive reports can be opened simply in a web browser and provide a simple way of sharing information about your analysis with collaborators.

DEAGO uses a Perl wrapper module (Bio-Deago) to generate a HTML report from markdown templates by calling functions from an R package (deago).

There are three main steps to each analysis:

  • Generating QC plots (ggplot2)
  • Identifying differentially expressed (DE) genes (DESeq2)
  • Performing gene ontology (GO) term enrichment analyses (topGO)

For more information, please see the user guide.

Learning outcomes

By the end of this tutorial you can expect to be able to:

  • Understand what input files are required for DEAGO and their formats
  • Generate a QC report for an RNA-seq dataset
  • Generate a DE report that can be used to identify differentially expressed genes
  • Generate a GO term enrichment analysis report

Tutorial sections

This tutorial comprises the following sections:

  1. Introducing the tutorial dataset
  2. Preparing your input data
  3. Running a quality control (QC) analysis
  4. Running a differential expression (DE) analysis
  5. Running a GO term enrichment analysis
  6. Troubleshooting

Authors

This tutorial was created by Victoria Offord.

Running the commands from this tutorial

You can run the commands in this tutorial either directly from the Jupyter notebook (if using Jupyter), or by typing the commands in your terminal window.

Running commands on Jupyter

If you are using Jupyter, command cells (like the one below) can be run by selecting the cell and clicking Cell -> Run from the menu above or using ctrl Enter to run the command. Let's give this a try by printing our working directory using the pwd command and listing the files within it. Run the commands in the two cells below.


In [ ]:
pwd

In [ ]:
ls -l

Running commands in the terminal

You can also follow this course by typing all the commands you see into a terminal window. This is similar to the "Command Prompt" window on MS Windows systems, which allows the user to type DOS commands to manage files.

To get started, select the cell below with the mouse and then either press control and enter or choose Cell -> Run in the menu at the top of the page.


In [ ]:
echo cd $PWD

Now open a new terminal on your computer and type the command that was output by the previous cell followed by the enter key. The command will look similar to this:

cd /home/manager/pathogen-informatics-training/Notebooks/DEAGO/

Now you can follow the instructions in the tutorial from here.

Prerequisites

This tutorial assumes that you have deago and Bio-Deago installed on your computer. For download and installation instructions, please see:

The following software versions were used when preparing this tutorial:

  • Bio-Deago 1.2.0
  • deago 1.1.2

Note: For Sanger pathogens users, these are already available on pcs5 and farm3. We also have a separate Sanger pathogen users page which will walk you through preparing your input data from the Sanger pathogen pipelines.

To check that you have installed the software correctly, you can run the following command:


In [ ]:
deago -h

Let's get started!

To get started with the tutorial, head to the first section: Introducing the tutorial dataset.