<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5//EN" "http://cnx.rice.edu/cnxml/0.5/DTD/cnxml_plain.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" id="new0">
  <name>More Transcription - A Closer Look</name>
  <metadata>
  <md:version>1.2</md:version>
  <md:created>2003/06/29</md:created>
  <md:revised>2003/07/27 17:34:41.270 GMT-5</md:revised>
  <md:authorlist>
    <md:author id="ahughes">
      <md:firstname>Andrew</md:firstname>
      
      <md:surname>Hughes</md:surname>
      <md:email>ahughes@rice.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="ahughes">
      <md:firstname>Andrew</md:firstname>
      
      <md:surname>Hughes</md:surname>
      <md:email>ahughes@rice.edu</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  

  <md:abstract/>
</metadata>

  <content>

<section id="props">
  <name>Properties of Eukaryotic Transcription</name>
  <list id="ti">
  <item>Complex!</item>
  <item>RNA polymerase is the main protein that creates the RNA transcript</item>
  <item>Proteins called transcription factors help RNA polymerase and regulate it's function.</item>
  <item>2 types of transcription factors: basal tfs and modulatory tfs.</item>
  <item>Basal transcription factors - required for all transcription events.</item>
  <item>Modulatory transcription factors - regulate expression of genes.  Both positive and negative regulation.</item>
  <item>Transcription factors can act from relatively far away.  Up to 10kb!</item>
  <item>pre-mRNA transcripts are spliced to remove exons.</item>
  <item>Alternative splicing allows the cell to combine exons differently in the final mRNA transcript from the same pre-mRNA transcript.</item> 
  </list>
  </section>


    
  <section id="steps">
  <name>Transcription In Five Easy Steps</name>

  <list id="steps5" type="enumerated">
  <item>Transcription of DNA to pre-mRNA. DNA, RNA polymerase and transcription factors.</item>
  <item>Addition of 5' methyl guanosine cap to pre-mRNA transcript.</item>
  <item>Splicing of pre-mRNA transcript (yields mRNA proper)</item>
  <item>Addition of 3' poly(A) tail to mRNA transcript.</item>
  <item>mRNA is transported to the cytoplasm.</item>
  </list>

  <figure id="transcriptionoutline">
  <name>5 Steps of mRNA preparation</name>
  <media type="image/pjpg" src="transcription_outline.jpg"/>
  </figure>

  </section>

  <section id="polys">
  <name>RNA Polymerase - main protein that creates RNA transcript.</name>
  <para id="p">RNA polynmerase is the protein whose job it is to 'read' the genetic code and create a complimentary RNA transcript from that code.  Eukaryotes have three different types of RNA polymerases: RNA polymerase I, II, and III.  RNA polymerase II is the from of polymerase that transcribes most genes and is the form of polymerase with which we need to concern ourselves.
  </para>
  </section>

  <section id="transcriptioninitiation">
  <name>Initiation of Transcription</name>

  <para id="ti1">Transcription initiation in eukaryotes is complicated, and the details are not entirely understood (although we do have a good grasp on the basic mechanism).  The fully assembled eukaryotic transcription inititation complex contains more than 50 polypeptides.  RNA polymerase II has more than 10 polypeptide subunits by itself.  Keep in mind that the process outlined below is a generalization, and any given, specific transcription event will probably vary some in the details.  For example, although the TATA box is the most strongly conserved promoter sequence, it is by no means present in every eukaryotic promoter.  Also, there is some debate as the importance of the order of the binding of the proteins in the initiation complex: it was thought that a specific order was vital, now, however, new evidence suggests that reaching the end binding state may be what is really important, whatever the order.
  </para>

  <para id="ti2">
  <name>Transcription Factors - help/regulate RNA polymerase's function</name>
Transcription factors (tfs) are proteins involved with transcription - except for RNA polymerase.  Transcription factors can be broken down into two groups: basal tfs and modulatory tfs.  Eukaryotic RNA polymerases are not capable of initiating transcription alone, they require the assistance of a set of basal transcription factors.  Basal tfs assist RNA polymerase in the recognition of promoter sequences and unwiding the DNA double helix, among other functions.  Basal tfs are necessary for every transcription event.  Modulatory transcription factors regulate the expression of a gene, or a set of gene.  These tfs are important because they allow the body to differentially express genes at different times and different places in the body.  Modulatory tfs are vital for multicellular life.  They allow the body to create different cells, tissues, and organs.
  </para>

  <list id="mylist">
  <item>Learning the specific order of tf binding is not vital</item>
  <item>Understand that transcription initiation involves MANY polypeptides,
        each with a specific function.</item>
  <item>Functions include: recognizing promoter sequences, modulating amount of transcript production, modulating time/space of transcript production, and unwinding DNA</item>
  <item>Each tf/protein represents a functional unit</item>
  </list>

  <para id="fd">Transcription factors are denoted TFIIX, for Transcription Factor for polymerase II.  X is an identifying letter.
  </para>

  <figure id="initiation">
  <name>Transcription Initiation</name>
  <media type="image/pjpg" src="transcription_initiation.jpg"/>
  </figure>

  <para id="ti3">
  <name>Basal Transcription Factors - necessary for transcription</name>
  The first step in transcription initiation is TFIIA binds to the TATA box in the promoter region.  The TATA box is the most strongly conserved consensus sequence in the eukaryotic promoter; it has the consensus sequence TATAAAA and is usually located around position -30 (counting backwards from the transcriptional start site).  After TFIID binds to the TATA box, TFIIA joins the initiation complex, followed by TFIIB.  Separate from the initiation complex, TFIIF binds to RNA polymerase II.  One subunit of TFIIF has DNA-unwinding properties that presumably help RNA polymerase II unwind the DNA during transcription.  Once the TFIIF-RNA polymerase complex has been formed, the two proteins can then join the initiation complex.  After the addition of several more proteins to the initiation complex, transcripton is finally ready to begin.
  </para>

  <list id="ti3list" type="enumerated">
  <name>Steps of Transcription Initiation - Basal Transcription Factors</name>
  <item>TFIID binds to the TATA box in the promoter region.</item>
  <item>TFIIA joins initiation complex.</item>
  <item>TFIIB joins initiation complex.</item>
  <item>TFIIF binds RNA polymerase (separate from initiation complex).</item>
  <item>RNA poly - THIIF complex binds initiation complex on DNA.</item>
  <item>Transcription begins.</item>
  </list>

  <para id="regtfs">
  <name>Modulatory Transcription Factors - regulate transcription</name>
  Regulatory transcription factors are non-intrinsic transcription factors that modulate the expression of a particular transcript.  These are the transcription factors that regulate time/space differrential expression in the organism.  These factors can be enhancers or silencers, meaning that they can both increase and decrease the expression of a given gene.  Also, these transcription factors can act from several thousand base-pairs in either direction from the promoter.  Here it is important to remember that the eukaryotic genome has significant secondary structure; DNA wrapping around nucleosomes and  30nm fiber looping can bring distant sequence motifs proximal to promoter sites where they can have significant action.  It is important to remember that an enhancer/silencer sequence motif may be present without having an effect, because the specific transcription factor protein responsible for binding to that domain may not have been expressed, ie. may not be present.  The sequence on its own has no effect, the presence of the modulatory transcription factor is required.  This can complicate sequence analysis w/r/t expression because in some cases we will only have half of the contextual information (ie we are missing the protein context of the cell).
  </para>
  
  </section>

  <section id="mods">
  <name>Transcript Modifications</name>

  <section id="mods2">
  <name>5' Methyl Guanosine Cap</name>

  <list id="jjf">
  <item>Protects transcript from degredation.</item>
  <item>Helps with initiation of translation.</item>
  </list>

  <para id="para1">After the initial pre-mRNA transcript has been created the first major modification is the addition of a 5' methyl guanosine cap.  The addition of the 5' 7-MG cap is important for two reasons: the 5' caps are recognized by protein factors that initiate translation, and it also helps protect the transcript from nucleases.  Nucleases are very common in the cell and because of this unprotected RNA has a very short half-life inside the cell.  Nucleases are actually so common that working with RNA in the laboratory can be quite difficult because the samples have a tendency to disintegrate into useless bits.  
  </para>

  </section>

  <section id="mods3">
  <name>Splicing</name>

  <list id="kdf">
  <item>Splicing removes non-coding exons from transcript.</item>
  <item>Alternative splicing allows for different combinations of exons from same pre-mRNA transcript (gene).</item>
  <item>Some RNAs can self-splice.</item>
  </list>

  

  <para id="spl">Eukaryotic genes contain two types of transcribed regions: introns and exons.  Exons are the regions of the genome that contain actual coding information.  Introns are non-coding, meaning that intronic sequences are never translated to protein.  Introns are never included in the final processed mRNA transcript.   Splicing is the process of removing introns from the pre-mRNA transcript to produce an exon-only mRNA molecule, which is then shipped off for translation.  Generally, eukaryotic mRNAs are considered to monogenic.  Monogenic means that an RNA transcript contains exons from only one gene.  However, up to one fourth of the transcripts in C. elegans have been show to be multi-genic (i.e. they contain exons from multiple genes).</para>

  <para id="spl2">A further complication of the splicing process is that mRNA can undergo alternative splicing.  To illustrate this let's imagine a gene that has 3 exons and two introns.  From this gene, three different final transcripts are possible.  In all transcripts the two introns are going to be removed, but the cell can combine the exons however it wants as long as the original order is maintained.  This means that for this example the possible mRNA transcripts include: Exon1-Exon2, Exon1-Exon3, and Exon1-Exon2-Exon3; however, Exon3-Exon1 is not possible because the exons are out of order.</para> 

  <para id="spl3">An interesting side note is that some introns are capable of self-splicing, that is they can politely remove themselves without the intervention of any proteins.  This is significant mainly because it is a significant counter example to the idea that RNA is an inert transcript and action is soley the domain of proteins.  RNAs should really be viewed as having both enzymatic properties and abstract information-carrying ability.  Because of this many people believe that RNA was the original genetic molecule and that DNA and proteins evolved later in the game.</para>  

<figure id="altsplice">
  <media type="image/pjpg" src="alternative_splicing.jpg"/>
  </figure>

  <para id="spl4">Alternative splicing is a very important and powerful tool.  To understand the benefit alternative splicing gives the cell we need to understand something about proteins.  Proteins can be understood as containing modularized functional units.  These functional units can be active sites on enzymes, large structural motifs such as beta-sheets or alpha-helices, or motifs that direct the eventual destination of expressed proteins.  A good example of an alternatively spliced pre-mRNA transcript is the mouse IgM immuoglobulin transcript.  IgM exists in two forms: excreted and membrane bound.  These two forms of the protein differ in the only in the C-terminus: the secreted protein has a secreted terminus motif while the membrane-bound protein has a C-terminal membrane anchor region.  Both products come from the same pre-mRNA, but alternative splicing includes either the terminal exon that creates the excreted form of IgM or the membrane-bound form of IgM.</para>

  </section>

  <section id="polya">
  <name>3' poly-Adenylation</name>

  <list id="ffd">
  <item>Important for cellular transport.</item>
  <item>Helps stabilize the transcript</item>
  </list>

  <para id="fff">The poly(A) tails are formed in a two step process: an endonulcease cleaves around 1000-2000 non-coding bases from the 3' end of the pre-mRNA transcript and then poly(A) polymerase adds 20-200 AMP molecules to the 3' end of the transcript.  The poly(A) tail is important in the cellular transport of the mRNA transcript and, like the 5' cap, also helps to stabilize the mRNA transcript.  
  </para>
  </section>

  </section>

  

  
  </content>
  
</document>
