5' -- pppNpNpN.....--3'
Note that two of the phosphates come from the mRNA (blue) and one comes from the GTP (red).
A large complex carries out cleavage and poly-A addition. The protein components required are:
1) endonucleases &endash; these carry out the actual mRNA cleavage.; there are two of them, CF1 and CF2. They cleave the mRNA to prepare a 3' end for poly-A addition.
2) stimulatory factor (CTSF) &endash; this binds to a G/U rich sequence downstream (3') to the cleavage site. It is involved in assembly of the complex and activates other components.
3) specificity factor (CPSF) &endash; this recognizes the 5'--AAUAAA --3' sequence in the mRNA. It requires CTSF to function.
4) poly-A polymerase (PAP) &endash; This synthesizes the poly-A tail in a two-step process. The first ten adenosines added are dependent on the 5'--AAUAAA --3' sequence and the specificity factor. In the second stage, after adenosine number ten, these requirements are no longer present.
5) poly-A binding protein (PABP) &endash; This is a non-sequence specific RNA binding protein. It binds stoichiometrically to runs of A's and coats the poly-A tail.
There are three kinds of sequences that are important for splicing:
1) 5' and 3' splice sites. There are conserved sequences at the 5' (GU) and 3' (AG) ends of the intron. The 5' GU is also called the splice donor site while the 3' AG is also called the splice acceptor site. The strength of the 5' splice site is also be determined by the last nucleotides in the exon.
2) Branchpoint sequences. These are located approximately between 20 and 40 nucleotides from the 3' end of the intron. The branchpoint sequences are not as conserved as the 5' and 3' splice site sequences. Only the second to last nucleotide in the seven nucleotide consensus sequences is conserved - it is always an A. This A is important in the creation of unusual 5' - 2' bond which results in the formation of a lariat structure.
3) A pryrimidine rich sequence between the branch site and the 3' AG site. (see steps in splicing section)
In genes with multiple introns, the question arose as to whether they are spliced out in an ordered manner. splicing could theoretically proceed 5' --> 3', 3'--> 5', or randomly.
Experimental data indicates that none of these possibilities are actually true. introns are removed in an ordered manner that is dependent on the secondary and tertiary structure of the RNA. Prediction of the order is very difficult. An experiment involving a Northern blot of steady-state levels of RNA in an ovomucoid gene demonstrates these findings (see Fig. 22.5).
After the 5' end of the intron is cut, the 2' hydroxyl of the conserved A in the branchpoint sequence makes a 2' --> 5' bond with the 5' end of the intron. This breaking of this 5' phosphodiester bond is somewhat unusual because it does not require ATP. It makes use of transesterifcation reaction instead. In a transesterification reaction, the breaking of one bond is coupled with the creation of another &endash; the energy is recycled in a way. The 2' hydroxyl of the A attacks the phosphodiester bond between exon and intron.. Although the transesterification reactions do not require the hydrolysis of ATP, but the process of splicing as a whole does (i.e. other steps of splicing are ATP requiring).
The U1 snRNA is very important in splicing. Its 5' end is especially critical because the 5' end of U1 RNA has a 3-'CAUUCAU-5' sequence which recognizes the complementary 5' splice sequence in an intron. If you mutate the intron sequence, you destroy splicing. If you make a compensatory mutation in the 5' end of the U1 snRNA, splicing is restored. This indicates that pre-mRNA/ U1 snRNA base pairing is necessary for splicing.
1) U1 snRNP binds to the 5' splice set.
U2AF binds to the pyrimidine rich tract.
ASF / SF2 bind. These are general splicing factors and SR proteins. (SR = serine and arginine rich.)
2) U2 binds the branch site. Note that U1 and U2AF are required for U2 binding.
U2 base-pairs with a branchpoint consensus in yeast.
The binding of U2 requires ATP hydrolysis.
3) U5-U4/U6 trimer binds
U5 binds the exon at the 5' splice site while U6 binds U2.
All of this requires bending of the RNA which is important to get the RNA in the correct orientation to carry out the transesterification reactions.
4) U1 is released and leaves the complex
U5 moves from the exon portion of the 5' splice site and replaces U1 at the intron portion of the 5' splice site.
U6 also binds the 5' splice site.
5) U4 is released (an energy dependent process)
U6 base pairs with U2. This forms the catalytic center.
The first transesterification reaction takes place &endash; the 5' splice site is cleaved and the lariat forms.
6) U5 contacts both exons and the second transesterification reaction occurs.
The 3' splice site is cleaved and the exons are ligated as the intron is released in lariat form. This step requires ATP. The lariat is later degraded.
U6 is important for activation of the catalytic center. It arrives with the U6 snRNA bound to the U4 snRNA. When bound in this form it is inactive.
When ATP is hydrolyzed to release U4, U6 then base pairs with U2 and becomes active.
Recall that U2 was initially bound to the branchpoint. At this stage it is bound to the branchpoint and U6. So U6 can base pair with either U4 (inactive) or U2 (active).
Overall, 3 ATP's are hydrolyzed for one intron removal.
1) intron definition - In this method, the machinery scans through the pre-mRNA to find 3' and 5' splice sites of a single intron. This scanning process bridges the intron, hence the name intron definition.
2) exon definition - This involves scanning and recognizing the 5' end of one intron and the branchpoint of the next intron upstream. This scan therefore spans an exon and is called exon definition. This is used more often in mammals because the exons are so much smaller than the introns that it improves scanning efficiency. Exon definition probably does not occur in yeast since more than one intron is required for the process and most yeast genes that have introns have only one.
Recall that plants are the intermediate case &endash; some experiments addressing this issue in plants make use of the AP3 gene from Arabidopsis (in the end no time to talk about this!).
1) an endonuclease cuts the RNA
2) terminal uridytyl transferase (TUTase) adds the U's.
3) RNA ligase reseals the DNA.