Automatic extraction of mutations from Medline and cross-validation with OMIM.

Abstract:

:Mutations help us to understand the molecular origins of diseases. Researchers, therefore, both publish and seek disease-relevant mutations in public databases and in scientific literature, e.g. Medline. The retrieval tends to be time-consuming and incomplete. Automated screening of the literature is more efficient. We developed extraction methods (called MEMA) that scan Medline abstracts for mutations. MEMA identified 24,351 singleton mutations in conjunction with a HUGO gene name out of 16,728 abstracts. From a sample of 100 abstracts we estimated the recall for the identification of mutation-gene pairs to 35% at a precision of 93%. Recall for the mutation detection alone was >67% with a precision rate of >96%. This shows that our system produces reliable data. The subset consisting of protein sequence mutations (PSMs) from MEMA was compared to the entries in OMIM (20,503 entries versus 6699, respectively). We found 1826 PSM-gene pairs to be in common to both datasets (cross-validated). This is 27% of all PSM-gene pairs in OMIM and 91% of those pairs from OMIM which co-occur in at least one Medline abstract. We conclude that Medline covers a large portion of the mutations known to OMIM. Another large portion could be artificially produced mutations from mutagenesis experiments. Access to the database of extracted mutation-gene pairs is available through the web pages of the EBI (refer to http://www.ebi. ac.uk/rebholz/index.html).

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Rebholz-Schuhmann D,Marcel S,Albert S,Tolle R,Casari G,Kirsch H

doi

10.1093/nar/gkh162

keywords:

subject

Has Abstract

pub_date

2004-01-02 00:00:00

pages

135-42

issue

1

eissn

0305-1048

issn

1362-4962

pii

32/1/135

journal_volume

32

pub_type

杂志文章
  • Activation of a dual adenovirus promoter containing nonconsensus TATA motifs in Schizosaccharomyces pombe: role of TATA sequences in the efficiency of transcription.

    abstract::The role of TATA elements in the expression of a mammalian promoter was investigated in the fission yeast Schizosaccharomyces pombe, by studying the human adenovirus E2-early promoter. This is a unique dual promoter with two nonconsensus TATA elements directing transcription from two cap sites, +1 and -26. A sequence ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/21.11.2737

    authors: Swaminathan S,Malhotra P,Manohar CF,Dhar R,Thimmapaya B

    更新日期:1993-06-11 00:00:00

  • Conformation of the 3'-end of beet necrotic yellow vein benevirus RNA 3 analysed by chemical and enzymatic probing and mutagenesis.

    abstract::Secondary structure-sensitive chemical and enzymatic probes have been used to produce a model for the folding of the last 68 residues of the 3'-non-coding region of beet necrotic yellow vein benevirus RNA 3. The structure consists of two stem-loops separated by a single-stranded region. RNA 3-derived transcripts were ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/25.23.4723

    authors: Lauber E,Guilley H,Richards K,Jonard G,Gilmer D

    更新日期:1997-12-01 00:00:00

  • Nucleotide sequence analysis of the linked left and right hand terminal regions of adenovirus type 5 DNA present in the transformed rat cell line 5RK20.

    abstract::A peculiar phenomenon is observed in several adenovirus type 2 or 5 (Ad2 or Ad5) transformed cell lines: the right hand and left hand terminal regions of the viral genome present in the viral DNA insertions of these cell lines are found to be linked together. A large part of the viral DNA insertion present in the Ad5 ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/10.7.2189

    authors: Visser L,Reemst AC,van Mansfeld AD,Rozijn TH

    更新日期:1982-04-10 00:00:00

  • Survey of chimeric IStron elements in bacterial genomes: multiple molecular symbioses between group I intron ribozymes and DNA transposons.

    abstract::IStrons are chimeric genetic elements composed of a group I intron associated with an insertion sequence (IS). The group I intron is a catalytic RNA providing the IStron with self-splicing ability, which renders IStron insertions harmless to the host genome. The IS element is a DNA transposon conferring mobility, and ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku939

    authors: Tourasse NJ,Stabell FB,Kolstø AB

    更新日期:2014-11-10 00:00:00

  • Magnesium-dependent alternative foldings of active and inactive Escherichia coli tRNA(Glu) revealed by chemical probing.

    abstract::A stable conformer of Escherichia coli tRNA(Glu), obtained in the absence of Mg(2+), is inactive in the aminoacylation reaction. Probing it with diethylpyrocarbonate, dimethyl sulfate and ribonuclease V1 revealed that it has a hairpin structure with two internal loops; the helical segments at both extremities have the...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/27.17.3583

    authors: Madore E,Florentz C,Giegé R,Lapointe J

    更新日期:1999-09-01 00:00:00

  • Chromosome XII context is important for rDNA function in yeast.

    abstract::The rDNA cluster in Saccharomyces cerevisiae is located 450 kb from the left end and 610 kb from the right end of chromosome XII and consists of approximately 150 tandemly repeated copies of a 9.1 kb rDNA unit. To explore the biological significance of this specific chromosomal context, chromosome XII was split at bot...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkl293

    authors: Kim YH,Ishikawa D,Ha HP,Sugiyama M,Kaneko Y,Harashima S

    更新日期:2006-05-31 00:00:00

  • A simplified method of generating transgenic Xenopus.

    abstract::Currently transgenic frog embryos are generated using restriction-enzyme-mediated integration (REMI) on decondensed sperm nuclei followed by nuclear transplantation into unfertilized eggs. We have developed a simplified version of this protocol that has the potential to increase the numbers of normally developing tran...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/28.4.e12

    authors: Sparrow DB,Latinkic B,Mohun TJ

    更新日期:2000-02-15 00:00:00

  • An hnRNP-like RNA-binding protein affects alternative splicing by in vivo interaction with transcripts in Arabidopsis thaliana.

    abstract::Alternative splicing (AS) of pre-mRNAs is an important regulatory mechanism shaping the transcriptome. In plants, only few RNA-binding proteins are known to affect AS. Here, we show that the glycine-rich RNA-binding protein AtGRP7 influences AS in Arabidopsis thaliana. Using a high-resolution RT-PCR-based AS panel, we...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks873

    authors: Streitner C,Köster T,Simpson CG,Shaw P,Danisman S,Brown JW,Staiger D

    更新日期:2012-12-01 00:00:00

  • Specific interactions of distamycin with G-quadruplex DNA.

    abstract::Distamycin binds the minor groove of duplex DNA at AT-rich regions and has been a valuable probe of protein interactions with double-stranded DNA. We find that distamycin can also inhibit protein interactions with G-quadruplex (G4) DNA, a stable four-stranded structure in which the repeating unit is a G-quartet. Using...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkg392

    authors: Cocco MJ,Hanakahi LA,Huber MD,Maizels N

    更新日期:2003-06-01 00:00:00

  • Distinct roles of Pcf11 zinc-binding domains in pre-mRNA 3'-end processing.

    abstract::New transcripts generated by RNA polymerase II (RNAPII) are generally processed in order to form mature mRNAs. Two key processing steps include a precise cleavage within the 3' end of the pre-mRNA, and the subsequent polymerization of adenosines to produce the poly(A) tail. In yeast, these two functions are performed ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx674

    authors: Guéguéniat J,Dupin AF,Stojko J,Beaurepaire L,Cianférani S,Mackereth CD,Minvielle-Sébastia L,Fribourg S

    更新日期:2017-09-29 00:00:00

  • Raman spectral studies of nucleic acids. XVII. Conformational structures of polyinosinic acid.

    abstract::Laser-Raman spectra of poly(rI) show the formation of an ordered complex in aqueous solutions of high ionic strength. This structure exhibits the A-helix geometry, contains stacked bases and is apparently stabilized by specific hydrogen bonding involving hypoxanthine C6=0 groups. Thermal dissociation of the poly(rI) c...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/4.7.2407

    authors: Chou CH,Thomas GJ Jr,Arnott S,Smith PJ

    更新日期:1977-07-01 00:00:00

  • The European Bioinformatics Institute in 2017: data coordination and integration.

    abstract::The European Bioinformatics Institute (EMBL-EBI) supports life-science research throughout the world by providing open data, open-source software and analytical tools, and technical infrastructure (https://www.ebi.ac.uk). We accommodate an increasingly diverse range of data types and integrate them, so that biologists...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx1154

    authors: Cook CE,Bergman MT,Cochrane G,Apweiler R,Birney E

    更新日期:2018-01-04 00:00:00

  • Improved nucleotide selectivity and termination of 3'-OH unblocked reversible terminators by molecular tuning of 2-nitrobenzyl alkylated HOMedU triphosphates.

    abstract::We describe a novel 3'-OH unblocked reversible terminator with the potential to improve accuracy and read-lengths in next-generation sequencing (NGS) technologies. This terminator is based on 5-hydroxymethyl-2'-deoxyuridine triphosphate (HOMedUTP), a hypermodified nucleotide found naturally in the genomes of numerous ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq1293

    authors: Litosh VA,Wu W,Stupi BP,Wang J,Morris SE,Hersh MN,Metzker ML

    更新日期:2011-03-01 00:00:00

  • AlleleHMM: a data-driven method to identify allele specific differences in distributed functional genomic marks.

    abstract::How DNA sequence variation influences gene expression remains poorly understood. Diploid organisms have two homologous copies of their DNA sequence in the same nucleus, providing a rich source of information about how genetic variation affects a wealth of biochemical processes. However, few computational methods have ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkz176

    authors: Chou SP,Danko CG

    更新日期:2019-06-20 00:00:00

  • Effect of UV-irradiation on DNA replication of the parvovirus minute-virus-of-mice in mouse fibroblasts.

    abstract::The effect of UV-irradiation on the conversion of the single-stranded DNA of the parvovirus Minute-Virus-of-Mice (MVM) to duplex Replicative Forms (RF) was studied after infection of mouse A9 fibroblasts. UV-irradiation of the virus prior to infection of unirradiated cells resulted in a dose-dependent, single-hit, inh...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/10.8.2577

    authors: Rommelaere J,Ward DC

    更新日期:1982-04-24 00:00:00

  • Mitome: dynamic and interactive database for comparative mitochondrial genomics in metazoan animals.

    abstract::Mitome is a specialized mitochondrial genome database designed for easy comparative analysis of various features of metazoan mitochondrial genomes such as base frequency, A+T skew, codon usage and gene arrangement pattern. A particular function of the database is the automatic reconstruction of phylogenetic relationsh...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkm763

    authors: Lee YS,Oh J,Kim YU,Kim N,Yang S,Hwang UW

    更新日期:2008-01-01 00:00:00

  • Initiation of strand incision at G:T and O(6)-methylguanine:T base mismatches in DNA by human cell extracts.

    abstract::Extracts of the human glioma cell line A1235 (lacking O(6)-methylguanine-DNA methyltransferase) are known to restore a G:T mismatch to a normal G:C pair in a G:T-containing model (45 bp) DNA substrate. Herein we demonstrate that substitution of G:T with O(6)-methylguanine:T (m6G:T) results in extract-induced intra-str...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/29.11.2409

    authors: Lari SU,Day RS,Dobler K,Paterson MC

    更新日期:2001-06-01 00:00:00

  • Tumor-associated mutations of rat mitochondrial transfer RNA genes.

    abstract::Mitochondrial DNA is a sensitive target of chemical carcinogens (Backer and Weinstein (1980) Science 209, 297-299), suggesting that mutations of the mitochondrial genome occur in tumor cells. We examined this point by comparing mitochondrial DNA sequences in four rat tumors with those of normal rat liver. Some novel m...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/11.6.1635

    authors: Taira M,Yoshida E,Kobayashi M,Yaginuma K,Koike K

    更新日期:1983-03-25 00:00:00

  • Identifying functional gene sets from hierarchically clustered expression data: map of abiotic stress regulated genes in Arabidopsis thaliana.

    abstract::We present MultiGO, a web-enabled tool for the identification of biologically relevant gene sets from hierarchically clustered gene expression trees (http://ekhidna.biocenter.helsinki.fi/poxo/multigo). High-throughput gene expression measuring techniques, such as microarrays, are nowadays often used to monitor the exp...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkl694

    authors: Kankainen M,Brader G,Törönen P,Palva ET,Holm L

    更新日期:2006-01-01 00:00:00

  • Relation of cell type and cell density in tissue culture to the isoaccepting spectra of the nucleoside Q containing tRNAs: tRNATyr, tRNAHis, tRNAAsn and tRNAAsp.

    abstract::An examination, using reversed-phase chromatography and cyanogen bromide treatment, of tRNATyr, tRNAHis, tRNAAsn, and tRNAAsp from SV40-transformed mouse fibroblasts grown to different cell densities, untransformed cells grown to confluence, and mouse liver indicates that: (1) The tissue cultured mouse fibroblasts exa...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/5.7.2513

    authors: Katze JR

    更新日期:1978-07-01 00:00:00

  • Uracil-DNA glycosylase affects mismatch repair efficiency in transformation and bisulfite-induced mutagenesis in Streptococcus pneumoniae.

    abstract::The generalized mismatch repair system of Streptococcus pneumoniae (the Hex system) can eliminate base pair mismatches arising in heteroduplex DNA during transformation or by DNA polymerase errors during replication. Mismatch repair is most likely initiated at nicks or gaps. The present work was started to examine the...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/19.20.5525

    authors: Méjean V,Devedjian JC,Rives I,Alloing G,Claverys JP

    更新日期:1991-10-25 00:00:00

  • An alternative protein factor which binds the internal promoter of Xenopus 5S ribosomal RNA genes.

    abstract::In small oocytes of Xenopus species, two sets of 5S RNA genes, oocyte-type and somatic-type, are fully activated. The 5S RNA transcripts are temporarily stored, half in association with TFIIIA to form a 7S particle, the other half in association with tRNA and two proteins (p48 and p43) to form a 42S particle. It has b...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/15.21.8679

    authors: Barrett P,Sommerville J

    更新日期:1987-11-11 00:00:00

  • The DEAH-box RNA helicase RHAU binds an intramolecular RNA G-quadruplex in TERC and associates with telomerase holoenzyme.

    abstract::Guanine-quadruplexes (G4) consist of non-canonical four-stranded helical arrangements of guanine-rich nucleic acid sequences. The bulky and thermodynamically stable features of G4 structures have been shown in many respects to affect normal nucleic acid metabolism. In vivo conversion of G4 structures to single-strande...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkr630

    authors: Lattmann S,Stadler MB,Vaughn JP,Akman SA,Nagamine Y

    更新日期:2011-11-01 00:00:00

  • Analysis of the proximal transcriptional element of the myelin basic protein gene.

    abstract::The gene encoding myelin basic protein (MBP) contains multiple activator sequences spanning upstream of its transcriptional initiation site which differentially promote transcription in glial cells. The proximal activator sequence, designated MB1, activates transcription in a glial cell type specific manner. This sequ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/20.3.545

    authors: Devine-Beach K,Haas S,Khalili K

    更新日期:1992-02-11 00:00:00

  • Transcription through the yeast origin of replication ARS1 ends at the ABFI binding site and affects extrachromosomal maintenance of minichromosomes.

    abstract::When the function of origins of replication in yeast was compromised by placing ARS sequences downstream of strong promoters, ARS activity might have been affected either by transcription or by an altered chromatin configuration induced by the construct. To distinguish between these possibilities, derivatives of the y...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/22.19.3904

    authors: Tanaka S,Halter D,Livingstone-Zatchej M,Reszel B,Thoma F

    更新日期:1994-09-25 00:00:00

  • The nucleosome repeat length increases during erythropoiesis in the chick.

    abstract::During erythropoiesis in the chick, the nucleosome repeat length increases from 190 base pairs to 212 base pairs. This increase is correlated with a dramatic increase in the concentration of the red cell specific histone H5 (from 0.2 molecules per nucleosome to 1 molecule per nucleosome) and with no change in the conc...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/5.4.1179

    authors: Weintraub H

    更新日期:1978-04-01 00:00:00

  • The Borrelia burgdorferi telomere resolvase, ResT, anneals ssDNA complexed with its cognate ssDNA-binding protein.

    abstract::Spirochetes of the genus Borrelia possess unusual genomes that consist in a linear chromosome and multiple linear and circular plasmids. The linear replicons are terminated by covalently closed hairpin ends, referred to as hairpin telomeres. The hairpin telomeres represent a simple solution to the end-replication prob...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkw344

    authors: Huang SH,Kobryn K

    更新日期:2016-06-20 00:00:00

  • DNA polymerase I and a protein complex bind specifically to E. coli palindromic unit highly repetitive DNA: implications for bacterial chromosome organization.

    abstract::Starting from a crude E. coli extract, two activities which specifically protect highly repetitive bacterial DNA sequences (called PU for Palindromic Unit or REP for Repetitive Extragenic Palindromic sequence) against a digestion with Exonuclease III have been purified. We show that one of these activities is due to t...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/18.13.3941

    authors: Gilson E,Perrin D,Hofnung M

    更新日期:1990-07-11 00:00:00

  • Cloning and expression of the hypoxanthine-guanine phosphoribosyltransferase gene from Trypanosoma brucei.

    abstract::The hypoxanthine-guanine phosphoribosyltransferase (HGPRT) enzyme of Trypanosoma brucei and related parasites provides a rational target for the treatment of African sleeping sickness and several other parasitic diseases. To characterize the T. brucei HGPRT enzyme in detail, the T. brucei hgprt was isolated within a 4...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/21.23.5431

    authors: Allen TE,Ullman B

    更新日期:1993-11-25 00:00:00

  • SBSPKS: structure based sequence analysis of polyketide synthases.

    abstract::Polyketide synthases (PKSs) catalyze biosynthesis of a diverse family of pharmaceutically important secondary metabolites. Bioinformatics analysis of sequence and structural features of PKS proteins plays a crucial role in discovery of new natural products by genome mining, as well as in design of novel secondary meta...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq340

    authors: Anand S,Prasad MV,Yadav G,Kumar N,Shehara J,Ansari MZ,Mohanty D

    更新日期:2010-07-01 00:00:00