DNA sequence-specific recognition by transcriptional factors

Thumbnail Image
Date
2003-01-01
Authors
Xu, Wu
Major Professor
Advisor
Committee Member
Journal Title
Journal ISSN
Volume Title
Publisher
Altmetrics
Abstract

The whole genome sequences from a wide variety of species including 599 viruses, and viroids, 205 naturally occurring plasmids, 185 organelles, 31 eubacteria, seven archaea, one fungus, two animals and one plant are available. 2000-3000 transcription factors out of approximate 30,000-40,000 genes in human genome, which play the central role in controlling cell development, cell growth and differentiation. Abnormal activity of transcriptional factors often leads to diseases. Elucidating the transcriptional regulatory network will be the next challenge of the post-genomic era. Gene regulation initiates from the selective binding of transcription factors to a particular DNA site out of a vast number of potential sites in the genome. It is unclear how transcription factors could specifically recognize the correct sites out of hundreds or thousands potential sites in the genome. We investigated the DNA recognition sites functionally mapped by biochemical and biophysically approaches and also transcription factor-DNA complexes solved by X-ray or NMR from Protein Data Bank. The purpose of this study is to find whether there is a simple code for transcription factor-DNA recognition. Our analyses show that (i) the length for DNA recognition sequences is typically from 4-10 bases; (ii) there is no GC or AT preference for our studied sequences; (iii) positively charged amino acids-Arg and Lys are found to be the majority of contacts with base and phosphate; (iv) some favored interaction pairs, Arg-G, Lys-G and Glu-C, are observed from our studies. However, no simple code for transcription factor-DNA recognition is obtained from our study. A relational database for storing and retrieving collected data is generated as an example to demonstrate the importance of database in computational biology.

Series Number
Journal Issue
Is Version Of
Versions
Series
Academic or Administrative Unit
Theses & dissertations (Interdisciplinary)
Type
thesis
Comments
Rights Statement
Copyright
Wed Jan 01 00:00:00 UTC 2003
Funding
Supplemental Resources
Source