|
You have been given an unknown genomic sequence to analyze during this term. Your job will be to tell me everything that you can about this piece of DNA - what gene(s) does it contain? what is the intron/exon organization? can you say anything about a promoter? If you have more than one gene, are they related? etc. You should start out by trying the various "gene finding" programs (see links on our home page). You should also explore the web for any kind of analysis you might find useful. Try some of the basic analyses in Gene Inspector as well. You probably want to start by running GeneMachine and examining the results in Sequin. During the term you should stop in regularly to talk with me about your progress and to discuss what to do next -- this should be a collaborative effort between you and me! Since these sequences have never been characterized before, they might contain anywhere from 0 to 10 genes... we will not know until you have done the analyses. You might discover something entirely new that might make an interesting publication. At the end of the term, you will hand in a report detailing what you have learned about your sequence. It would be very advisable to write down notes as you do the research. It is very easy to forget what you have done if you look back several weeks. The report should include specifics on the methods/algorithms you used, figures and tables detailing what the results of each analysis were and what conclusions you were able to draw about your sequence from each analysis. Since we are dealing with uncharacterized sequences, you may not be able to draw any conclusions about function -- but that is how research works. It's OK not to be able to define a function for your DNA/gene(s) as long as you document the analyses you have done and why you conclude that it is not possible to define a function at this time. What to Hand In You are to hand in a report that details your unknown DNA and how you went about doing the analysis. As you work with your sequence you will essentially be solving a mystery. Of course, each of you will be solving a different mystery. The questions below are meant as a guide to how you should be thinking. There will be many different twists and turns in each project, so please keep in touch with me as you uncover new information about your DNA. The report should identify (graphically) the organization of the genes in your DNA and should number them. This numbering scheme should be used when describing the products those genes. The report should include a description of the steps you take along the way and the reasoning behind the steps. You might start out describing the "gene finding" that you did. You should use a Microsoft Word document called YourName_termProject to contain the bulk of your analysis results - both the text and the graphics. If you had conflicts in predictions from various programs, what did you do to resolve those conflicts? State the number of genes you identified and then go on to analyze each of the genes you found. For each gene, show the intron-exon structure (using a graphic generated in Gene Inspector, Gene Construction Kit, GenScan or any other source -- and then pasted into the word document) and the protein sequence(s) that is encoded. Hand in the protein sequences as a single Gene Inspector file named YourName_TP_proteins. Then, for each gene, describe the analyses you did on the protein and the DNA. What were the results of database searches and the various protein analyses you did? Can you predict a likely 3-D structure for the proteins or segments of the proteins? Can you state what the gene (or gene product) does? If not, does it resemble any other known genes? Are adjacent genes in any way similar? Is there any kind of correlation between the repeat structures or base distributions and your gene structure? Look at the regions on your DNA between genes... is there any sequence that is recognizable by database searches? Can you identify a cluster of genes similar to yours in another species? If so, are there any conserved DNA sequences that are non-coding? Could they be control regions? Antisense RNAs? tRNAs? Suggested Format: For writing up the term project, a good format would be the following. Your main Word document should describe what you have done during your analysis. It should include important information and figures supporting your conclusions and the logic that you used in carrying out the analyses. You might organize the body of the report into sections corresponding to each predicted gene. You should put all the detailed data into appendices so it does not disrupt the flow of your report. I'd suggest using one appendix for each gene which would include things like BLAST results or other lengthy data/results. You should write the report at a level such that others in the class can understand it. The term project is due as defined on the syllabus page. Place all the documents you generate into a single folder called lastname_firstname.tp, stuff/zip that folder and hand it in using BLACKBOARD as for all previous homework. In 2009W we are working with the genome of an organism called a lancelet (Branchiostoma floridae), sometimes called amphioxus. The unknown sequences are approximately 85 KB is size and were obtained from UC Santa Cruz. This genome provides important information about the evolution of vertebrates through two rounds of gene duplication (news release, comment). The genome paper was published in Nature in 2008. Further analyses were published in Genome Research (1, 2, 3), also in 2008. This species is at the root of the vertebrate tree and is an ancient ancestor of ours. It has a genome quadruplication compared to earlier species and it is believed that this quadrupling of the genome allowed more advanced species to evolve. We can still detect much of the quadruplication in our genomes today, although new functions have emerged for many of the genes while other genes have been lost. |
| Student | Unknown Sequence |
|
Barrett Elizabeth A |
unknown01 |
|
Bogan Katrina L |
unknown02 |
|
Gray Elizabeth C |
unknown03 |
|
Gruber Joann F |
unknown04 |
|
Guo Lan |
unknown05 |
|
Sulovari Arvis |
unknown06 |
|
Tse Julia |
unknown07 |
|
Zhou Pei |
unknown08 |
|
Chang Li-Ju |
unknown09 |
|
J. Doe |
unknown10 |
|
J. Doe |
unknown11 |
|
J. Doe |
unknown12 |
This page was last modified on