Abstract:
:Pseudogenes are copies of genes that cannot produce a protein. They can be detected from disruptions to their apparent coding sequence, caused by frameshifts and premature stop codons. They are classed as either processed pseudogenes (made by reverse transcription from an mRNA) or duplicated pseudogenes, arising from duplication in the genomic DNA and subsequent disablement. Historically, there is anecdotal evidence that the fruit fly (Drosophila melanogaster) has few pseudogenes. Investigators have linked this to a high deletion rate of genomic DNA, for which there is evidence from genetic experiments on genome size. Here, we apply a homology-based pipeline that was developed previously to identify pseudogenes in other eukaryotic genomes, to the fruit fly, so as to derive the first complete survey of its pseudogene population. We find approximately 100 pseudogenes, with at least a sixth of these as candidate processed pseudogenes. This gives a much lower proportion of pseudogenes (compared with the size of the proteome) than in the genomes of other eukaryotes for which data are available (human, nematode and budding yeast). Closest matching proteins to Drosophila pseudogenes are significantly longer than the average protein in its proteome (up to approximately 60% more than the average protein's length), in contrast to the situation in the three other eukaryotic genomes. This may be due to the persistence of fragments of longer genes. In the fly pseudogene population, we found most pseudogenes for serine proteases (which are more abundant in the Drosophila lineage compared with the other eukaryotes), immunoglobulin-motif-containing proteins and cytochromes P450. Data on the sequences and positions of the putative pseudogenes are available at: http://www.pseudogene.org/fly. The detection of a small number of pseudogenes in the Drosophila genome and the higher mean length for the closest matching proteins to pseudogenes (possibly because remnants of genes encoding longer proteins are more likely to persist) are further evidence for a high deletion rate of genomic DNA in the fruit fly. The data are useful for molecular evolution study in Drosophila.
journal_name
Nucleic Acids Resjournal_title
Nucleic acids researchauthors
Harrison PM,Milburn D,Zhang Z,Bertone P,Gerstein Mdoi
10.1093/nar/gkg169keywords:
subject
Has Abstractpub_date
2003-02-01 00:00:00pages
1033-7issue
3eissn
0305-1048issn
1362-4962journal_volume
31pub_type
杂志文章abstract::The expression of Herpes Simplex Virus 1 (HSV-1) glycoprotein C (gC), a well defined herpesvirus late gene, was studied by linking the promoter-regulatory region of this gene to the coding sequences for the bacterial enzyme, beta-galactosidase (beta-gal). A chimeric gene, containing the beta-gal gene under the control...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/16.21.10267
更新日期:1988-11-11 00:00:00
abstract::The rat alpha- and bovine alpha s1-casein genes have been isolated and their 5' sequences determined. The rat alpha-, beta-, gamma- and bovine alpha s1-casein genes contain similar 5' exon arrangements in which the 5' noncoding, signal peptide and casein kinase phosphorylation sequences are each encoded by separate ex...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/14.4.1883
更新日期:1986-02-25 00:00:00
abstract::ModBase (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by ModPipe, an automated modeling pipeline that relies primarily on Modeller for fold assignment, sequence-structure alignment, model building and model assessment (http://salilab.org/modelle...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkq1091
更新日期:2011-01-01 00:00:00
abstract::E2A is a member of the E-protein family of transcription factors. Previous studies have reported context-dependent regulation of E2A-dependent transcription. For example, whereas the E2A portion of the E2A-Pbx1 leukemia fusion protein mediates robust transcriptional activation in t(1;19) acute lymphoblastic leukemia, ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkt855
更新日期:2014-01-01 00:00:00
abstract::A terminator of transcription with bidirectional activity has been located between the translation termination codons of the genes tetA and orfL on Tn10. These genes are transcribed towards each other. Each orientation of the intervening sequence is shown to reduce the expression of the lacZ and galK genes when cloned...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/13.12.4227
更新日期:1985-06-25 00:00:00
abstract::Antisense oligodeoxynucleotides (ODNs) have biological activity in treating various forms of cancer. The antisense effects of two types of 20mer ODNs, phosphorothioate-modified ODNs (S-ODNs) and S-ODNs with 12 2'-O-methyl groups (Me-S-ODNs), targeted to sites 109 and 277 of bcl-2 mRNA, were compared. Both types were a...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkh516
更新日期:2004-04-02 00:00:00
abstract::Besides the nicking-closing (topoisomerase I) activity, an ATP-dependent DNA topoisomerase is present in rat liver nuclei. The enzyme, partially purified, is able to catenate in vitro closed DNA circles in a magnesium-dependent, ATP-dependent, histone H1-dependent reaction, and to decatenate in vitro kinetoplast DNA n...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/11.4.1059
更新日期:1983-02-25 00:00:00
abstract::Kinetically monitored, reverse transcriptase-initiated PCR (kinetic RT-PCR, kRT-PCR) is a novel application of kinetic PCR for high throughput transcript quantitation in total cellular RNA. The assay offers the simplicity and flexibility of an enzyme assay with distinct advantages over DNA microarray hybridization and...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/28.2.e2
更新日期:2000-01-15 00:00:00
abstract::The three dimensional structures for representatives of nearly half of all protein families are now available in public databases. Thus, no matter which protein one investigates, it is increasingly likely that the 3D structure of a homolog will be known and may reveal unsuspected structure-function relationships. The ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/27.1.240
更新日期:1999-01-01 00:00:00
abstract::As protein-protein interactions are crucial in most biological processes, it is valuable to understand how and where protein pairs interact. We developed a web server HOMCOS (Homology Modeling of Complex Structure, http://biunit.naist.jp/homcos) to predict interacting protein pairs and interacting sites by homology mo...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkn218
更新日期:2008-07-01 00:00:00
abstract::Controller (C) proteins regulate the timing of the expression of restriction and modification (R-M) genes through a combination of positive and negative feedback circuits. A single dimer bound to the operator switches on transcription of the C-gene and the endonuclease gene; at higher concentrations, a second dimer bo...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkn448
更新日期:2008-08-01 00:00:00
abstract::We analyzed the effects of site-directed mutations in the SUC2 promoter of Saccharomyces cerevisiae. Analyses were performed in wild-type as well as mig1 and tup1 mutant strains after the promoter mutants were reintroduced into the native SUC2 locus on the left arm of chromosome IX. Mutation of the two GC boxes reveal...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/26.4.1002
更新日期:1998-02-15 00:00:00
abstract::DNA mismatches that occur between vector homology arms and chromosomal target sequences reduce gene targeting frequencies in several species; however, this has not been reported in human cells. Here we demonstrate that even a single mismatched base pair can significantly decrease human gene targeting frequencies. In a...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkt1303
更新日期:2014-03-01 00:00:00
abstract::We have identified and sequenced two members of a chicken middle repetitive DNA sequence family. By reassociation kinetics, members of this family (termed CRl) are estimated to be present in 1500-7000 copies per chicken haploid genome. The first family member sequenced (CRlUla) is located approximately 2 kb upstream f...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/9.20.5383
更新日期:1981-10-24 00:00:00
abstract::About 40% of the genes in the nematode Caenorhabditis elegans have homologs in humans. Based on the history of this model system, it is clear that the application of genetic methods to the study of this set of genes would provide important clues to their function in humans. To facilitate such genetic studies, we are e...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gnf051
更新日期:2002-06-15 00:00:00
abstract::Gene trapping is a method of generating murine embryonic stem (ES) cell lines containing insertional mutations in known and novel genes. A number of international groups have used this approach to create sizeable public cell line repositories available to the scientific community for the generation of mutant mouse str...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkj097
更新日期:2006-01-01 00:00:00
abstract::EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade i...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gks1235
更新日期:2013-01-01 00:00:00
abstract::Human 2'-5' oligoadenylate synthetase-1 (OAS1) is central in innate immune system detection of cytoplasmic double-stranded RNA (dsRNA) and promotion of host antiviral responses. However, the molecular signatures that promote OAS1 activation are currently poorly defined. We show that the 3'-end polyuridine sequence of ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gku1289
更新日期:2015-01-01 00:00:00
abstract::The Genomes On Line Database (GOLD) is a web resource for comprehensive access to information regarding complete and ongoing genome sequencing projects worldwide. The database currently incorporates information on over 1500 sequencing projects, of which 294 have been completed and the data deposited in the public data...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkj145
更新日期:2006-01-01 00:00:00
abstract::Lens epithelium-derived growth factor/p75 (LEDGF/p75) is a transcriptional coactivator involved in stress response, autoimmune disease, cancer and HIV replication. A fusion between the nuclear pore protein NUP98 and LEDGF/p75 has been found in human acute and chronic myeloid leukemia and association of LEDGF/p75 with ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkq410
更新日期:2010-10-01 00:00:00
abstract::In silico prediction of transcription factor binding sites (TFBSs) is central to the task of gene regulatory network elucidation. Genomic DNA sequence information provides a basis for these predictions, due to the sequence specificity of TF-binding events. However, DNA sequence alone is an impoverished source of infor...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkn866
更新日期:2009-01-01 00:00:00
abstract::A newly discovered group of spherical plant viruses contains a bipartite genome consisting of a single-strand linear RNA molecule (RNA 1, Mr 1.5 x 10(6) ), and a single-strand, covalently closed circular viroid-like RNA molecule (RNA 2, Mr approximately 125,000). The nucleotide sequences of the RNA 2 of two of these, ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/10.12.3681
更新日期:1982-06-25 00:00:00
abstract::A cDNA clone covering the whole coding region for a glycinin subunit precursor containing the A1a acidic subunit, one of the A2 family, has been identified from a library of soybean cotyledonary cDNA clones using a mixed oligonucleotide probe. Analysis of the cDNA insert revealed that it contained 1746 nucleotides of ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/13.18.6719
更新日期:1985-09-25 00:00:00
abstract::Patterns in biological sequences frequently signify interesting features in the underlying molecule. Many tools exist to search for well-known patterns. Less support is available for exploratory analysis, where no well-defined patterns are known yet. PatScanUI (https://patscan.secondarymetabolites.org/) provides a hig...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gky321
更新日期:2018-07-02 00:00:00
abstract::Cultured mammalian cells transduced with the Escherichia coli gene, Ecogpt, synthesize the bacterial enzyme xanthine-guanine phosphoribosyl transferase (XGPT) (1). This paper describes a method for measuring XGPT activity in crude cell extracts by following the conversion of 14C-xanthine (X) to 14C-xanthine monophosph...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/13.8.2921
更新日期:1985-04-25 00:00:00
abstract::We have analyzed the ATPase activity exhibited by the UvrABC DNA repair complex. The UvrA protein is an ATPase whose lack of DNA dependence may be related to the ATP induced monomer-dimer transitions. ATP induced dimerization may be responsible for the enhanced DNA binding activity observed in the presence of ATP. Alt...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/17.11.4145
更新日期:1989-06-12 00:00:00
abstract::In order to construct fish specific expression vectors for studies on gene regulation in vitro and in vivo a variety of heterologous enhancers and promoters from mammals and from viruses of higher vertebrate cells were tested for expression of the bacterial chloramphenicol acetyl transferase reporter gene in three tel...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/18.11.3299
更新日期:1990-06-11 00:00:00
abstract::In order to systematically analyze the effects of nucleoside modification of sugar moieties in DNA polymerase reactions, we synthesized 16 modified templates containing 2',4'-bridged nucleotides and three types of 2',4'-bridged nucleoside-5'-triphospates with different bridging structures. Among the five types of ther...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkn404
更新日期:2008-08-01 00:00:00
abstract::Multiplexed high-throughput pyrosequencing is currently limited in complexity (number of samples sequenced in parallel), and in capacity (number of sequences obtained per sample). Physical-space segregation of the sequencing platform into a fixed number of channels allows limited multiplexing, but obscures available s...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkm760
更新日期:2007-01-01 00:00:00
abstract::We have used a polymerase chain reaction (PCR) procedure to analyse low abundance complementary sense RNAs of Digitaria streak virus (DSV) from infected leaves of Digitaria setigera. This study has confirmed that both spliced and unspliced RNAs are synthesised by the same transcription unit. The position of the intron...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/18.24.7259
更新日期:1990-12-25 00:00:00