# Connexions

You are here: Home » Content » Bios 533 Bioinformatics » RNA Secondary Structure Prediction

### Lenses

What is a lens?

#### Definition of a lens

##### Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

##### What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

##### Who can create a lens?

Any individual member, a community, or a respected organization.

##### What are tags?

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

#### Affiliated with (What does "Affiliated with" mean?)

This content is either by members of the organizations listed or about topics related to the organizations listed. Click each link to see a list of all content affiliated with the organization.
• Rice Digital Scholarship

This collection is included in aLens by: Digital Scholarship at Rice University

Click the "Rice Digital Scholarship" link to see all content affiliated with them.

### Recently Viewed

This feature requires Javascript to be enabled.

Inside Collection (Course):

Course by: Susan Cates. E-mail the author

# RNA Secondary Structure Prediction

Module by: Susan Cates. E-mail the author

Summary: This module discusses the basic theory behind RNA Secondary Structure Prediction methodologies. In addition, several RNA tools and databases are introduced.

The study of RNA structure calls for a distinct set of computational tools designed expressly for RNA applications. Recall there are three major categories of RNA, messenger RNA (mRNA), transfer RNA (tRNA) and ribosomal RNA (rRNA). Ribosomal RNA and ribozymes have catalytic functions, like proteins, while messenger RNA has an information storage function, like DNA. A good resource for reviewing the major types of RNA and their functions can be found in the online RNA Structure Primer , available at the RNABase RNA Structure Database website RNABase The RNA Structure Database,

RNA is usually thought of as a single stranded linear molecule, however, in a biological system this is not the case. Frequently, different regions of the same RNA strand will fold together via base pair interactions to make intricate secondary and tertiary structures that are essential for correct biological function. Common secondary structure motifs include hairpin loops, stems, and bulges. Diagrams of these motifs can be viewed on the IMB JENA Nucleic Acid Nomenclature and Structure web page, under section 8 of the Table of Contents, secondary structural elements. To observe tertiary structure in an RNA molecule, view the structure of phenylalanine tRNA from the yeast crystal structure , available from the Nucleic Acid Database (NDB) Project, Rutgers.

Though RNA is usually single stranded, in some RNA virus genomes it will form a double stranded helix. However, unlike DNA, RNA forms an A-form double helix. The RNA double helix differs from that of the DNA double helix because of the presence of ribose, rather than deoxyribose, in the sugar phosphate backbone of the molecule. The addition of a hydroxyl group at the C2 postion in the ribose sugar is responsible for the A-form geometry in double stranded RNA. The A-form makes a right-handed helix, like the B-form double helix, but is a shorter, wider helix than the B-form, and the major groove is deep, but narrow, making it virtually inaccessible to proteins. It is in the major groove that the chemical groups are sequence-specific and dependent on base identity, and therefore, this is where proteins tend to bind a DNA double helix. Because the RNA A-form double helix contains a major groove that is too narrow and deep for proteins to access, the minor groove becomes more important for protein interactions with RNA helices. Also, proteins that interact with specific RNA sequences commonly bind single-stranded RNA segments. For an example of an RNA/protein complex, view the NDB entry for the Protein/Hepatitis Delta Virus Ribozyme Complex. When a strand of DNA forms a double helix with a strand of RNA, this will also result in an A-form helix. An example of a DNA/RNA complex can be found by viewing the NDB entry for the RNA-DNA complex formed by the 10-23 DNA enzyme.

RNA molecules sometimes contain the unusual nucleotides ribothymidine, dihydrouridine, pseudouridine and inosine. In addition, RNA somewhat commonly forms a G-U wobble base pair, and makes other non-canonical base pairs, a listing of which can be found on the George Fox group's Non-Canonical Base Pair Database web page, University of Houston.

## Exercise 1

List three non-canonical base pairs identified on the web page, besides a G-U wobble base pair.

It is clear to see that RNA molecules have many unique characteristics, distinct from the properties of DNA and protein structures. Many of these characteristics can be exploited to predict RNA structure from a nucleotide sequence.

One of the methodologies that is commonly used for RNA structure prediction is based on calculating free energy estimates for each possible fold, then choosing the fold that yields the lowest free energy. These free energy values are a combination of energy values calculated for each pair of adjacent base pairs, plus loop or bulge energies. The energy values are derived from melting studies of synthetically constructed oligoribonucleotides. For more information on the development of RNA free energy parameters, see the Development and References Page of the Zuker-Turner RNA folding package.

Compute the free energy of an RNA structure using the efn server, an RNA free energy web site authored by Michael Zuker, Rensselaer Polytechnic Institute. Copy and paste the following RNA sequence into the sequence query box.


G G C G C G G C A C C G U C C G C G G A A C A A A C G G


Just below the sequence query box, there is a box where the secondary structure can be defined by specifying base pairs. This is done using a triplet of numbers to define each stem region of consecutive base pairs. The first number in the triplet defines the sequence number of the first base in the pair from the 5' end of the sequence. The second number defines the sequence number of the opposing base in the pair. The third number defines how many consecutive bases are involved in the stem. In this example, use the following triplet:


10, 19, 3


The triplet in the above example means that the bases "C C G", number 10, 11 and 12, base pair with the bases "C G G", bases number 17, 18 and 19. Paste the above triplet into the secondary structure box and click on the box that says, "Send data for processing".

## Exercise 2

What is the computed free energy for this RNA structure?

## Exercise 3

Click on the link that says "png" to get a better picture of the structure. How many bases (non-paired) are in the loop in this structure?

## Exercise 4

Save the png file to disk and send a copy to the course instructor.

Now, use the same sequence, but specify a different secondary structure. This time, paste the following triplet into the secondary structure box and send for data processing:


3, 18, 5


## Exercise 5

What is the computed free energy for this RNA structure?

## Exercise 6

Click on the link that says "png" to get a better picture of the structure. How many bases (non-paired) are in the loop in this structure?

## Exercise 7

Save the png file to disk and send a copy to the course instructor.

## Exercise 8

Which of these two structures is more likely to exist under physiological circumstances, given no additional constraints?

A second approach to RNA secondary structure prediction is to look for conserved stem regions in related sequences. This method involves looking for regions within sequences where stems have been conserved, even when the bases have mutated. For this to happen, it would require that if a G mutated to an A, then the opposing C in the base pair would mutate to a U. These regions are found by aligning related RNA sequences, and applying an algorithm that looks for these sorts of paired mutations in predicted stem regions. Align the RNA sequences of the following tRNAs using ClustalW.


>1ASY:S ASPARTYL TRNA SYNTHETASE (ASPRS)
UCCGUGAUAGUUXAAXGGXCAGAAUGGGCGCXUGUCXCGUGCCAGAUXGGGGTXCAAUUC
CCCGUCGCGGAGCCA

>1EIY:C TRNA(PHE)
GCCGAGGUAGCUCAGUUGGUAGAGCAUGCGACUGAAAAUCGCAGUGUCCGCGGUUCGAUU
CCGCGCCUCGGCACCA

>1EFW:C ASPARTYL-TRNA
GGAGCGGXAGUUCAGXCGGXXAGAAUACCUGCCUXUCXCGCAGGGGXUCGCGGGXXCGAG
UCCCGXCCGUUCC

>1EHZ:A TRANSFER RNA (PHE)
GCGGAUUUAXCUCAGXXGGGAGAGCXCCAGAXUXAAXAXXUGGAGXUCXUGUGXXCGXUC
CACAGAAUUCGCACCA



IMPORTANT: After ClustalW alignment, the program puts asterisk below conserved residues. These must be removed before submitting the alignment to the RNA secondary structures prediction server.

Copy the multiple alignment and paste it into the query box at the RNA secondary structure prediction server, Moscow State University. Click "submit query", and the results should appear within about 3 minutes. Scroll down the page and view the section where the stem regions were identified, and their free energies were computed.

## Exercise 9

How many stems are predicted?

## Exercise 10

List each of their computed free energy values.

## Exercise 11

Continue to scroll down the page and look at the predicted structure diagram. What is the total free energy of the structure?

## Exercise 12

Does this structure that has been predicted from sequences agree well with the known structure of tRNAs?

RNA structure has some distinct differences from DNA structure that can be exploited to yield secondary structure predictions that are usually reasonably accurate. In addition, there are many on-line tools and databases that are specific to RNA. Here, the use of a few of these tools has been illustrated, but take some time to view more of the links that are available on the RNA World Website, Institut fur Molekulare Biotechnologie, Jena, Germany.

## References

1. V. L. Murthy. (Copyright 2000-2002). RNABase, The RNA Structure Database. http://www.rnabase.org.
2. J.L. Sussman, S.R. Holbrook, R.W. Warrant, G.M. Church, S.-H. Kim. (1978). Crystal Structure of Yeast Phenylalanine T-RNA. J. Mol. Biol., 123, 607-630.
3. A.R. Ferre-D'Amare, K. Zhou, J.A. Doudna. (1998). Crystal Structure of a Hepatitis Delta Virus Ribozyme. Nature, 395, 567-574.
4. J. Nowakowski, P.S. Shim, G.S. Prasad, C.D. Stout, G.F. Joyce. (1999). Crystal Structure of an 82-Nucleotide RNA-DNA Complex Formed by the 10-23 DNA Enzyme. Nat. Struct. Biol., 6, 151-156.

## Content actions

PDF | EPUB (?)

### What is an EPUB file?

EPUB is an electronic book format that can be read on a variety of mobile devices.

PDF | EPUB (?)

### What is an EPUB file?

EPUB is an electronic book format that can be read on a variety of mobile devices.

#### Collection to:

My Favorites (?)

'My Favorites' is a special kind of lens which you can use to bookmark modules and collections. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need an account to use 'My Favorites'.

| A lens I own (?)

#### Definition of a lens

##### Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

##### What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

##### Who can create a lens?

Any individual member, a community, or a respected organization.

##### What are tags?

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

| External bookmarks

#### Module to:

My Favorites (?)

'My Favorites' is a special kind of lens which you can use to bookmark modules and collections. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need an account to use 'My Favorites'.

| A lens I own (?)

#### Definition of a lens

##### Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

##### What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

##### Who can create a lens?

Any individual member, a community, or a respected organization.

##### What are tags?

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

| External bookmarks