GeneAlign: a coding exon prediction tool based on phylogenetical comparisons.

Abstract:

GeneAlign is a coding exon prediction tool that predicts protein-coding genes by measuring the homology between a genomic sequence and related, annotated sequences from other genomes. Identifying protein-coding genes is one of the most important tasks in newly sequenced genomes. With increasing numbers of gene annotations verified by experiments, it is feasible to identify genes in newly sequenced genomes by comparing them to annotated genes of phylogenetically close organisms. GeneAlign applies CORAL, a heuristic linear-time alignment tool, to determine whether regions flanked by the candidate signals (initiation codon-GT, AG-GT and AG-STOP codon) are similar to annotated coding exons. Exploiting the conservation of gene structures and the sequence homology between protein-coding regions increases the prediction accuracy. GeneAlign was tested on the Projector dataset of 491 human-mouse homologous sequence pairs. At the gene level, both the average sensitivity and the average specificity of GeneAlign are 81%, and both exceed 96% at the exon level. The rates of missing exons and wrong exons are below 1%. GeneAlign is a free tool available at http://genealign.hccvs.hc.edu.tw.
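
The candidate signals named in the abstract can be illustrated with a minimal sketch. This is not GeneAlign's implementation: the function names, the length limits and the brute-force enumeration are assumptions made for illustration, reading frames are ignored, and the alignment step against annotated exons (done by CORAL in the paper) is left out. The sketch only lists regions flanked by initiation codon-GT, AG-GT and AG-STOP codon.

```python
import re

STOP_CODONS = ("TAA", "TAG", "TGA")


def find_all(seq, motif):
    """Return every start index at which motif occurs in seq (overlaps allowed)."""
    return [m.start() for m in re.finditer(f"(?={motif})", seq)]


def candidate_regions(seq, min_len=3, max_len=300):
    """List (kind, start, end) half-open regions flanked by candidate signals.

    Hypothetical helper: min_len and max_len are illustrative limits,
    not parameters taken from the paper.
    """
    seq = seq.upper()
    starts = find_all(seq, "ATG")    # initiation codon, part of the exon
    donors = find_all(seq, "GT")     # exon ends just before the donor GT
    acceptors = find_all(seq, "AG")  # exon begins just after the acceptor AG
    stops = [i for codon in STOP_CODONS for i in find_all(seq, codon)]

    regions = []

    def add(kind, start, end):
        # Keep only regions whose length falls inside the allowed window.
        if min_len <= end - start <= max_len:
            regions.append((kind, start, end))

    for s in starts:
        for d in donors:
            add("initial", s, d)           # initiation codon ... GT
    for a in acceptors:
        for d in donors:
            add("internal", a + 2, d)      # AG ... GT
        for t in stops:
            add("terminal", a + 2, t + 3)  # AG ... STOP codon (included)
    return sorted(regions)
```

For example, candidate_regions("ATGCCCGT") yields a single initial region covering "ATGCCC". In a real pipeline each candidate would then be scored against annotated coding exons of a related genome, which is the part this sketch does not attempt.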

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Hsieh SJ,Lin CY,Liu NH,Chow WY,Tang CY

doi

10.1093/nar/gkl307

subject

Has Abstract

pub_date

2006-07-01 00:00:00

pages

W280-4

issue

Web Server issue

issn

0305-1048

eissn

1362-4962

pii

34/suppl_2/W280

journal_volume

34

pub_type

Journal article
  • A multiplex platform for digital measurement of circular DNA reaction products.

    abstract::Digital PCR provides high sensitivity and unprecedented accuracy in DNA quantification, but current approaches require dedicated instrumentation and have limited opportunities for multiplexing. Here, we present an isothermal platform for digital enumeration of DNA reaction products in multiplex via standard fluorescen...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkaa419

    authors: Björkesten J,Patil S,Fredolini C,Lönn P,Landegren U

    update_date:2020-07-27 00:00:00

  • SEWAL: an open-source platform for next-generation sequence analysis and visualization.

    abstract::Next-generation DNA sequencing platforms provide exciting new possibilities for in vitro genetic analysis of functional nucleic acids. However, the size of the resulting data sets presents computational and analytical challenges. We present an open-source software package that employs a locality-sensitive hashing algo...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkq661

    authors: Pitt JN,Rajapakse I,Ferré-D'Amaré AR

    update_date:2010-12-01 00:00:00

  • Identifying eIF4E-binding protein translationally-controlled transcripts reveals links to mRNAs bound by specific PUF proteins.

    abstract::eIF4E-binding proteins (4E-BPs) regulate translation of mRNAs in eukaryotes. However the extent to which specific mRNA targets are regulated by 4E-BPs remains unknown. We performed translational profiling by microarray analysis of polysome and monosome associated mRNAs in wild-type and mutant cells to identify mRNAs i...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkq686

    authors: Cridge AG,Castelli LM,Smirnova JB,Selley JN,Rowe W,Hubbard SJ,McCarthy JE,Ashe MP,Grant CM,Pavitt GD

    update_date:2010-12-01 00:00:00

  • Mammalian enzymes for preventing transcriptional errors caused by oxidative damage.

    abstract::8-Oxo-7,8-dihydroguanine (8-oxoGua) is produced in cells by reactive oxygen species normally formed during cellular metabolic processes. This oxidized base can pair with both adenine and cytosine, and thus the existence of this base in messenger RNA would cause translational errors. The MutT protein of Escherichia col...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gki682

    authors: Ishibashi T,Hayakawa H,Ito R,Miyazawa M,Yamagata Y,Sekiguchi M

    update_date:2005-07-07 00:00:00

  • Primary structure, developmentally regulated expression and potential duplication of the zebrafish homeobox gene ZF-21.

    abstract::We report the molecular cloning and characterization of a cDNA derived from a zebrafish gene (ZF-21) related to the mouse homeobox containing gene Hox2.1. Interesting information about the differential conservation of various domains was gained from comparisons between the putative protein sequences from ZF-21 (275 am...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/16.19.9097

    authors: Njølstad PR,Molven A,Hordvik I,Apold J,Fjose A

    update_date:1988-10-11 00:00:00

  • Two-photon fluorescence cross-correlation spectroscopy as a potential tool for high-throughput screening of DNA repair activity.

    abstract::Several lines of evidence indicate that differences in DNA repair capacity are an important source of variability in cancer risk. However, traditional assays for measurement of DNA repair activity in human samples are laborious and time-consuming. DNA glycosylases are the first step in base excision repair of a variet...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gni166

    authors: Collini M,Caccia M,Chirico G,Barone F,Dogliotti E,Mazzei F

    update_date:2005-10-21 00:00:00

  • Identification and mapping of N6-methyladenosine containing sequences in simian virus 40 RNA.

    abstract::Late SV40 16S and 19S mRNAs were found to contain an average of three m6A residues per mRNA molecule. The methylated residues of both the viral and cellular mRNAs occur in two sequences; Gpm6ApC and (Ap)nm6ApC, where n = 1-4. More than 60% of the m6A residues in SV40 16S and 19S mRNAs occur in Gpm6ApC even though ther...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/6.8.2879

    authors: Canaani D,Kahana C,Lavi S,Groner Y

    update_date:1979-06-25 00:00:00

  • BCNTB bioinformatics: the next evolutionary step in the bioinformatics of breast cancer tissue banking.

    abstract::Here, we present an update of Breast Cancer Now Tissue Bank bioinformatics, a rich platform for the sharing, mining, integration and analysis of breast cancer data. Its modalities provide researchers with access to a centralised information gateway from which they can access a network of bioinformatic resources to que...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkx913

    authors: Gadaleta E,Pirrò S,Dayem Ullah AZ,Marzec J,Chelala C

    update_date:2018-01-04 00:00:00

  • Destabilization of tetranucleotide repeats in Haemophilus influenzae mutants lacking RnaseHI or the Klenow domain of PolI.

    abstract::A feature of Haemophilus influenzae genomes is the presence of several loci containing tracts of six or more identical tetranucleotide repeat units. These repeat tracts are unstable and mediate high frequency, reversible alterations in the expression of surface antigens. This process, termed phase variation (PV), enab...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gki180

    authors: Bayliss CD,Sweetman WA,Moxon ER

    update_date:2005-01-14 00:00:00

  • Identification and characterization of transcription factor IIIA from Schizosaccharomyces pombe.

    abstract::Transcription factor IIIA (TFIIIA) is specifically required for transcription of 5S rRNA genes and is the archetypal C2H2 zinc finger protein. All known vertebrate TFIIIAs have a similar organization: nine zinc fingers, followed by a C-terminal domain of unknown structure. The zinc fingers of Saccharomyces cerevisiae ...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkf385

    authors: Schulman DB,Setzer DR

    update_date:2002-07-01 00:00:00

  • Assembly, nuclear import and function of U7 snRNPs studied by microinjection of synthetic U7 RNA into Xenopus oocytes.

    abstract::In Xenopus oocytes in vitro transcribed mouse U7 RNA is assembled into small nuclear ribonucleoproteins (snRNPs) that are functional in histone RNA 3' processing. If the special Sm binding site of U7 (AAUUUGUCUAG, U7 Sm WT) is converted into the canonical Sm sequence derived from the major snRNAs (AAUUUUUGGAG, U7 Sm O...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/23.16.3141

    authors: Stefanovic B,Hackl W,Lührmann R,Schümperli D

    update_date:1995-08-25 00:00:00

  • Binding of oligonucleotides to a viral hairpin forming RNA triplexes with parallel G*G*C triplets.

    abstract::Infrared and UV spectroscopies have been used to study the assembly of a hairpin nucleotide sequence (nucleotides 3-30) of the 5' non-coding region of the hepatitis C virus RNA (5'-GGCGGGGAUUAUCCCCGCUGUGAGGCGG-3') with a RNA 20mer ligand (5'-CCGCCUCACAAAGGUGGGGU-3') in the presence of magnesium ion and spermidine. The...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/30.6.1333

    authors: Carmona P,Molina M

    update_date:2002-03-15 00:00:00

  • Nuclear poly(A) binding protein 1 (PABPN1) and Matrin3 interact in muscle cells and regulate RNA processing.

    abstract::The polyadenylate binding protein 1 (PABPN1) is a ubiquitously expressed RNA binding protein vital for multiple steps in RNA metabolism. Although PABPN1 plays a critical role in the regulation of RNA processing, mutation of the gene encoding this ubiquitously expressed RNA binding protein causes a specific form of mus...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkx786

    authors: Banerjee A,Vest KE,Pavlath GK,Corbett AH

    update_date:2017-10-13 00:00:00

  • Complex formation between ribosomal protein S1, oligo-and polynucleotides: chain length dependence and base specificity.

    abstract::In order to examine the nature of the complex formation between the ribosomal protein S1 and nucleic acids three methods were used: Inhibition of the reaction of n-ethyl[2.3 14C]-maleimide with S1 by the addition of oligonucleotides; adsorption of the complexes to nitrocellulose filters; and equilibrium dialysis. The ...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/4.10.3627

    authors: Lipecky R,Kohlschein J,Gassen HG

    update_date:1977-10-01 00:00:00

  • Identification and functional characterization of a cis-acting positive DNA element regulating CYP 2B1/B2 gene transcription in rat liver.

    abstract::A positive cis-acting DNA element in the near 5'-upstream region of the CYP2B1/B2 genes in rat liver was found to play an important role in the transcription of these genes. An oligonucleotide covering -69 to -98 nt mimicked the gel mobility shift pattern given by the fragment -179 to +29 nt, which was earlier found a...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/20.3.557

    authors: Upadhya P,Rao MV,Venkateswar V,Rangarajan PN,Padmanaban G

    update_date:1992-02-11 00:00:00

  • ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes.

    abstract::A new system, ZCURVE 1.0, for finding protein-coding genes in bacterial and archaeal genomes has been proposed. The current algorithm, which is based on the Z curve representation of the DNA sequences, lays stress on the global statistical features of protein-coding genes by taking the frequencies of bases at three c...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkg254

    authors: Guo FB,Ou HY,Zhang CT

    update_date:2003-03-15 00:00:00

  • Tumour suppressor ING1b maintains genomic stability upon replication stress.

    abstract::The lesion bypass pathway, which is regulated by monoubiquitination of proliferating cell nuclear antigen (PCNA), is essential for resolving replication stalling due to DNA lesions. This process is important for preventing genomic instability and cancer development. Previously, it was shown that cells deficient in tum...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkq1337

    authors: Wong RP,Lin H,Khosravi S,Piche B,Jafarnejad SM,Chen DW,Li G

    update_date:2011-05-01 00:00:00

  • Direct micro-haplotyping by multiple double PCR amplifications of specific alleles (MD-PASA).

    abstract::Analysis of haplotypes is an important tool in population genetics, familial heredity and gene mapping. Determination of haplotypes of multiple single nucleotide polymorphisms (SNPs) or other simple mutations is time consuming and expensive when analyzing large populations, and often requires the help of computational...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gnf062

    authors: Eitan Y,Kashi Y

    update_date:2002-06-15 00:00:00

  • Comparative calorimetric studies on the dynamic conformation of plant 5S rRNA. I. Thermal unfolding pattern of lupin seeds and wheat germ 5S rRNAs, also in the presence of magnesium and sperminium cations.

    abstract::An attempt has been made to correlate differential scanning calorimetry melting profiles of 5S rRNAs from lupin seeds (L.s.) and wheat germ (W.g.) with their structure. It is suggested that the observed differences in thermal unfolding are due to differences in RNA nucleotide sequence and as a consequence in higher or...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/16.2.685

    authors: Barciszewski J,Bratek-Wiewiórowska MD,Górnicki P,Naskret-Barciszewska M,Wiewiórowski M,Zielenkiewicz A,Zielenkiewicz W

    update_date:1988-01-25 00:00:00

  • Mouse genome database 2016.

    abstract::The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these ...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkv1211

    authors: Bult CJ,Eppig JT,Blake JA,Kadin JA,Richardson JE,Mouse Genome Database Group.

    update_date:2016-01-04 00:00:00

  • Molecular structure of a U•A-U-rich RNA triple helix with 11 consecutive base triples.

    abstract::Three-dimensional structures have been solved for several naturally occurring RNA triple helices, although all are limited to six or fewer consecutive base triples, hindering accurate estimation of global and local structural parameters. We present an X-ray crystal structure of a right-handed, U•A-U-rich RNA triple he...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkz1222

    authors: Ruszkowska A,Ruszkowski M,Hulewicz JP,Dauter Z,Brown JA

    update_date:2020-04-06 00:00:00

  • Real-time measurement of in vitro transcription.

    abstract::We have developed a simple method to measure RNA synthesis in real time. In this technique, transcription reactions are performed in the presence of molecular beacons that possess a 2'-O-methylribonucleotide backbone. These probes become fluorescent as they hybridize to nascent RNA during the course of synthesis. We f...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gnh068

    authors: Marras SA,Gold B,Kramer FR,Smith I,Tyagi S

    update_date:2004-05-20 00:00:00

  • Substrate specificity of the Ogg1 protein of Saccharomyces cerevisiae: excision of guanine lesions produced in DNA by ionizing radiation- or hydrogen peroxide/metal ion-generated free radicals.

    abstract::We have investigated the substrate specificity of the Ogg1 protein of Saccharomyces cerevisiae (yOgg1 protein) for excision of modified DNA bases from oxidatively damaged DNA substrates using gas chromatography/isotope dilution mass spectrometry. Four DNA substrates prepared by treatment with H2O2/Fe(III)-EDTA/ascorbi...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/26.5.1228

    authors: Karahalil B,Girard PM,Boiteux S,Dizdaroglu M

    update_date:1998-03-01 00:00:00

  • 3' Alu PCR: a simple and rapid method to isolate human polymorphic markers.

    abstract::Microsatellites, such as (TG)n found at random throughout the genome, or as 3' extensions of Alu sequences are being increasingly used as genetic markers because of their pluriallelic character. The search for polymorphic microsatellites is time consuming, however, as it is necessary to sequence clones containing the ...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/20.6.1333

    authors: Charlieu JP,Laurent AM,Carter DA,Bellis M,Roizès G

    update_date:1992-03-25 00:00:00

  • Predicted structure and phyletic distribution of the RNA-binding protein Hfq.

    abstract::Hfq, a bacterial RNA-binding protein, was recently shown to contain the Sm1 motif, a characteristic of Sm and LSm proteins that function in RNA processing events in archaea and eukaryotes. In this report, comparative structural modeling was used to predict a three-dimensional structure of the Hfq core sequence. The pr...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkf508

    authors: Sun X,Zhulin I,Wartell RM

    update_date:2002-09-01 00:00:00

  • Identification of family-determining residues in PHD fingers.

    abstract::Histone modifications are fundamental to chromatin structure and transcriptional regulation, and are recognized by a limited number of protein folds. Among these folds are PHD fingers, which are present in most chromatin modification complexes. To date, about 15 PHD finger domains have been structurally characterized,...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkq947

    authors: Slama P,Geman D

    update_date:2011-03-01 00:00:00

  • The OMA orthology database in 2018: retrieving evolutionary relationships among all domains of life through richer web and programmatic interfaces.

    abstract::The Orthologous Matrix (OMA) is a leading resource to relate genes across many species from all of life. In this update paper, we review the recent algorithmic improvements in the OMA pipeline, describe increases in species coverage (particularly in plants and early-branching eukaryotes) and introduce several new feat...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkx1019

    authors: Altenhoff AM,Glover NM,Train CM,Kaleb K,Warwick Vesztrocy A,Dylus D,de Farias TM,Zile K,Stevenson C,Long J,Redestig H,Gonnet GH,Dessimoz C

    update_date:2018-01-04 00:00:00

  • Oligomeric properties and DNA binding specificities of repressor isoforms from the Streptomyces bacteriophage phiC31.

    abstract::Three protein isoforms (74, 54 and 42 kDa) are expressed from repressor gene c in the Streptomyces temperate bacteriophage phiC31. Because expression of the two smaller isoforms, 54 and 42 kDa, is sufficient for superinfection immunity, the interaction between these isoforms was studied. The native 42 kDa repressor (N...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/26.10.2457

    authors: Wilson SE,Smith MC

    update_date:1998-05-15 00:00:00

  • Telomeric circles are abundant in the stn1-M1 mutant that maintains its telomeres through recombination.

    abstract::Some human cancers maintain their telomeres using the alternative lengthening of telomeres (ALT) mechanism; a process thought to involve recombination. Different types of recombinational telomere elongation pathways have been identified in yeasts. In senescing yeast telomerase deletion (ter1-Delta) mutants with very s...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkp814

    authors: Basenko EY,Cesare AJ,Iyer S,Griffith JD,McEachern MJ

    update_date:2010-01-01 00:00:00

  • DBSI: DNA-binding site identifier.

    abstract::In this study, we present the DNA-Binding Site Identifier (DBSI), a new structure-based method for predicting protein interaction sites for DNA binding. DBSI was trained and validated on a data set of 263 proteins (TRAIN-263), tested on an independent set of protein-DNA complexes (TEST-206) and data sets of 29 unbound...

    journal_title:Nucleic acids research

    pub_type: Journal article

    doi:10.1093/nar/gkt617

    authors: Zhu X,Ericksen SS,Mitchell JC

    update_date:2013-09-01 00:00:00