Abstract:
:How DNA sequence variation influences gene expression remains poorly understood. Diploid organisms have two homologous copies of their DNA sequence in the same nucleus, providing a rich source of information about how genetic variation affects a wealth of biochemical processes. However, few computational methods have been developed to discover allele specific differences in functional genomic data. Existing methods either treat each SNP independently, limiting statistical power, or combine SNPs across gene annotations, preventing the discovery of allele specific differences in unexpected genomic regions. Here we introduce AlleleHMM, a new computational method to identify blocks of neighboring SNPs that share similar allele specific differences in mark abundance. AlleleHMM uses a hidden Markov model to divide the genome into three hidden states based on allele frequencies in genomic data: a symmetric state (state S) which shows no difference between alleles, and regions with a higher signal on the maternal (state M) or paternal (state P) allele. AlleleHMM substantially outperformed naive methods using both simulated and real genomic data, particularly when input data had realistic levels of overdispersion. Using global run-on sequencing (GRO-seq) data, AlleleHMM identified thousands of allele specific blocks of transcription in both coding and non-coding genomic regions. AlleleHMM is a powerful tool for discovering allele specific regions in functional genomic datasets.
journal_name
Nucleic Acids Resjournal_title
Nucleic acids researchauthors
Chou SP,Danko CGdoi
10.1093/nar/gkz176subject
Has Abstractpub_date
2019-06-20 00:00:00pages
e64issue
11eissn
0305-1048issn
1362-4962pii
5421125journal_volume
47pub_type
杂志文章abstract::A computer program has been developed which aids in the determination of restriction enzyme recognition sequences. This is achieved by cleaving DNAs of known sequence with a restriction endonuclease and comparing the fragmentation pattern with a computer-generated set of patterns. The feasibility of this approach has ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/5.11.4105
更新日期:1978-11-01 00:00:00
abstract::We describe a new exonuclease-based method for joining and/or constructing two or more DNA molecules. DNA fragments containing ends complementary to those of a vector or another independent molecules were generated by the polymerase chain reaction. The 3' ends of these molecules as well as the vector DNA were then rec...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/21.8.1889
更新日期:1993-04-25 00:00:00
abstract::UNITE (https://unite.ut.ee/) is a web-based database and sequence management environment for the molecular identification of fungi. It targets the formal fungal barcode-the nuclear ribosomal internal transcribed spacer (ITS) region-and offers all ∼1 000 000 public fungal ITS sequences for reference. These are clustere...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gky1022
更新日期:2019-01-08 00:00:00
abstract::The Bovine Genome Database (BGD; http://BovineGenome.org) strives to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data. BGD includes GBrowse genome browsers, the Apollo Annotation Editor, a quantitative trait loci (QTL) viewer, BLAST databases and gene pages. Genome ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkq1235
更新日期:2011-01-01 00:00:00
abstract::Exome sequencing strategy is promising for finding novel mutations of human monogenic disorders. However, pinpointing the casual mutation in a small number of samples is still a big challenge. Here, we propose a three-level filtration and prioritization framework to identify the casual mutation(s) in exome sequencing ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkr1257
更新日期:2012-04-01 00:00:00
abstract::The high similarity of tunicates and vertebrates during their development coupled with the transparency of tunicate larvae, their well-studied cell lineages and the availability of simple and efficient transgenesis methods makes of this subphylum an ideal system for the investigation of vertebrate physiological and de...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkj064
更新日期:2006-01-01 00:00:00
abstract::Expression of a proof-reading deficient form of mitochondrial DNA (mtDNA) polymerase gamma, POLG, causes early death accompanied by features of premature ageing in mouse. However, the mechanism of cellular senescence remains unresolved. In addition to high levels of point mutations of mtDNA, the POLG mutator mouse har...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkp091
更新日期:2009-04-01 00:00:00
abstract::This review presents detailed information about the structure of triplet repeat RNA and addresses the simple sequence repeats of normal and expanded lengths in the context of the physiological and pathogenic roles played in human cells. First, we discuss the occurrence and frequency of various trinucleotide repeats in...
journal_title:Nucleic acids research
pub_type: 杂志文章,评审
doi:10.1093/nar/gkr729
更新日期:2012-01-01 00:00:00
abstract::Interferon regulatory factors IRF-3 and IRF-7 are transcription factors essential in the activation of interferon-β (IFN-β) gene in response to viral infections. Although, both proteins recognize the same consensus IRF binding site AANNGAAA, they have distinct DNA binding preferences for sites in vivo. The X-ray struc...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkr325
更新日期:2011-09-01 00:00:00
abstract::Analysis of large-scale gene expression studies usually begins with gene clustering. A ubiquitous problem is that different algorithms applied to the same data inevitably give different results, and the differences are often substantial, involving a quarter or more of the genes analyzed. This raises a series of import...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gki536
更新日期:2005-05-10 00:00:00
abstract::The MPI Bioinformatics Toolkit (http://toolkit.tuebingen.mpg.de) is an open, interactive web service for comprehensive and collaborative protein bioinformatic analysis. It offers a wide array of interconnected, state-of-the-art bioinformatics tools to experts and non-experts alike, developed both externally (e.g. BLAS...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkw348
更新日期:2016-07-08 00:00:00
abstract::DNA synthesis of broad host-range plasmid R1162 is initiated from two positions, flanking a large (40 bp stem, 40 bp loop) inverted repeat. Each start-point is located within a highly conserved, but oppositely oriented, 10 base-pair sequence. Synthesis from the two positions converges within the intervening inverted r...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/15.20.8319
更新日期:1987-10-26 00:00:00
abstract::Here we report that the poor binding of methylphosphonate oligodeoxynucleosides (MP-ODNs) to their nucleic acid targets can be improved by additional inversion of the anomeric configuration (from beta to alpha) in the sugar moieties to give a new class of analogs, MP alpha-oligonucleosides. MP alpha-dT12and MP 5' alph...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/26.20.4551
更新日期:1998-10-15 00:00:00
abstract::We have isolated an eight kilobase fragment of Bacillus subtilis DNA by specific integration and excision of a plasmid containing a sequence adjacent to ribosomal operon rrn O. The genetic locus of the cloned fragment was verified by linkage of the integrated vector to nearby genetic markers using both transduction an...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/12.15.6307
更新日期:1984-08-10 00:00:00
abstract::Minichromosome maintenance (MCM) proteins facilitate replication by licensing origins and unwinding the DNA double strand. Interestingly, the number of MCM hexamers greatly exceeds the number of firing origins suggesting additional roles of MCMs. Here we show a hitherto unanticipated function of MCM2 in cilia formatio...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gky945
更新日期:2019-01-10 00:00:00
abstract::Upstream open reading frame (uORF)-mediated translational inhibition is important in controlling key regulatory genes expression. However, understanding the underlying molecular mechanism of such uORF-mediated control system in vivo is challenging in the absence of an animal model. Therefore, we generated a zebrafish ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkr645
更新日期:2011-11-01 00:00:00
abstract::The TcSNP database (http://snps.tcruzi.org) integrates information on genetic variation (polymorphisms and mutations) for different stocks, strains and isolates of Trypanosoma cruzi, the causative agent of Chagas disease. The database incorporates sequences (genes from the T. cruzi reference genome, mRNAs, ESTs and ge...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkn874
更新日期:2009-01-01 00:00:00
abstract::The EMBL Data Library was the first internationally supported central resource for nucleic acid sequence data. Working in close collaboration with its American counterpart, GenBank (1), the library prepares and makes available to the scientific community a comprehensive collection of the published nucleic acid sequenc...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/14.1.5
更新日期:1986-01-10 00:00:00
abstract::A new thermodynamic database for normal and modified nucleic acids has been developed. This Thermodynamic Database for Nucleic Acids (NTDB) includes sequence, structure and thermodynamic information as well as experimental methods and conditions. In this release, there are 1851 sequences containing both normal and mod...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/29.1.230
更新日期:2001-01-01 00:00:00
abstract::Eucaryotic transcription initiation by RNA polymerase II involves protein:DNA interactions during the formation of a transcription complex. In addition to RNA polymerase II there are at least five other general transcription factors necessary for initiation with the adenovirus major late promoter. One of these, TFIIA,...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/18.12.3611
更新日期:1990-06-25 00:00:00
abstract::The purine rich oligodeoxyribonucleotides 1C, d(ATGACGGAATA) and 2C, d(ATGAGCGAATA) alone exhibit highly cooperative melting transitions. Analysis of the concentration dependence of melting, and electrophoretic studies indicate that these oligomers can form an unusual purine rich offset double helix. The unusual duple...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/16.11.5137
更新日期:1988-06-10 00:00:00
abstract::Helicase loading at a DNA replication origin often requires the dynamic interactions between the DNA helicase and an accessory protein. In E. coli, the DNA helicase is DnaB and DnaC is its loading partner. We used the method of hydrogen/deuterium exchange mass spectrometry to address the importance of DnaB-DnaC comple...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkv961
更新日期:2016-01-08 00:00:00
abstract::Mitome is a specialized mitochondrial genome database designed for easy comparative analysis of various features of metazoan mitochondrial genomes such as base frequency, A+T skew, codon usage and gene arrangement pattern. A particular function of the database is the automatic reconstruction of phylogenetic relationsh...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkm763
更新日期:2008-01-01 00:00:00
abstract::The ASTRAL compendium provides several databases and tools to aid in the analysis of protein structures, particularly through the use of their sequences. It is partially derived from the SCOP database of protein domains, and it includes sequences for each domain as well as other resources useful for studying these seq...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/30.1.260
更新日期:2002-01-01 00:00:00
abstract::RNA editing in protozoan parasites is a mitochondrial RNA processing reaction in which exclusively uridylate residues are inserted into, and less frequently deleted from, pre-mRNAs. Molecules central to the process are so-called guide RNAs (gRNAs) which function as templates in the reaction. For a detailed molecular u...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/25.12.2311
更新日期:1997-06-15 00:00:00
abstract::In Eukaryotes, DNA is wound around the histone octamer forming the basic chromatin unit, the nucleosome. Atomic structures have been obtained from crystallography and single particle cryo-electron microscopy (cryoEM) of identical engineered particles. But native nucleosomes are dynamical entities with diverse DNA sequ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gky670
更新日期:2018-09-28 00:00:00
abstract::The contacts of phosphate groups in mRNAs with ribosomes were studied. Two mRNAs were used: one mRNA contained in the middle two defined codons to construct the pre- and the post-translocational states, the other was a sequence around the initiation site of the natural cro-mRNA. Phosphorothioate nucleotides were rando...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/24.12.2228
更新日期:1996-06-15 00:00:00
abstract::Electrophoretic mobility shift assays (EMSA) were used to define the regions of the INO1 promoter that interact with factors present in extracts prepared from the yeast, Saccharomyces cerevisae. These experiments identified three different types of protein:DNA complexes that assemble with the INO1 promoter. Formation ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/19.14.3987
更新日期:1991-07-25 00:00:00
abstract::The side chains of the 20 types of amino acids, owing to a large extent to their different physical properties, have characteristic distributions in interior/surface regions of individual proteins and in interface/non-interface portions of protein surfaces that bind proteins or nucleic acids. These distributions have ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkm231
更新日期:2007-07-01 00:00:00
abstract::As large-scale re-sequencing of genomes reveals many protein mutations, especially in human cancer tissues, prediction of their likely functional impact becomes important practical goal. Here, we introduce a new functional impact score (FIS) for amino acid residue changes using evolutionary conservation patterns. The ...
journal_title:Nucleic acids research
pub_type: 杂志文章
doi:10.1093/nar/gkr407
更新日期:2011-09-01 00:00:00