Prelab 5: Phylogenetics

First, read the lecture notes on phylogenetics and the documentation on PHYLIP and answer a few questions, then prepare the files you will need for next class.

Introduction to Multiple Sequence Alignment and Phylogenetics (GCB535_Phylogenetics.pdf on Canvas)

1) What do the branch lengths on trees indicate?

2) An evolutionary concept we will use in this module is that of gene homologs, or genes that are related by descent from a common ancestral gene. Homologous genes within a species that arose from a duplication event are called paralogs, and homologous genes that arose through speciation are called orthologs. Based on what you learned about phylogenetic trees in the lecture and these definitions, what are the relationships (paralogs or orthologs) between the the following gene pairs from the tree on the right?

a) Mouse geneA1 and mouse geneA2?

b) Mouse geneA2 and human geneA?

Get familiar with the PHYLIP package

The main PHYLIP web page, with extended documentation, is: http://evolution.genetics.washington.edu/phylip/phylip.html Read about the following programs in the PHYLIP package that we will use in the lab: dnapars, drawtree, retree, seqboot and consense.

3) What is the purpose of bootstrapping? Describe how it works.

Download and format your data file.

Task: Retrieve the mRNA sequences in FASTA format for the following genes: Hint: This may a useful place to start searching for your sequences: http://www.ncbi.nlm.nih.gov/nuccore

  • Twist [Fly] (NM_079092, splice form A)
  • Twist1 [Human] (NM_000474)
  • Twist1 [Mouse] (NM_011658)
  • Twist2 [Human] (NM_057179)
  • Twist2 [Mouse] (NM_007855)

Task: Create a single file called twist.fasta on your computer with all of the sequences using the following steps:

  • Copy and paste these sequences into one text file using any text editor (like gedit or nano). Please keep the same order as the above list (have fly listed first, then Human Twist1, etc.)
  • Change the header portion of each sequence entry to make your results more readable later on. Both ClustalW and PHYLIP will truncate the header information in their output after 9 characters. Something very short, like “HumanTw1” is a good idea here.
  • Save it as twist.fasta. If you made this file on your own computer, upload it into the the phylogenetics assignment directory.