Estimating population genetic parameters and comparing model goodness-of-fit using DNA sequences with error.

Abstract:

It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood of the observed SNP configurations to infer population mutation rate θ = 4N_eμ, population exponential growth rate R, and error rate ε, simultaneously. Using simulation, we show the combined effects of the parameters θ, n, ε, and R on the accuracy of parameter estimation. We compared our maximum composite likelihood estimator (MCLE) of θ with other θ estimators that take the error into account. The results show the MCLE performs well when the sample size is large or the error rate is high. Using parametric bootstrap, composite likelihood can also be used as a statistic for testing the model goodness-of-fit of the observed DNA sequences. The MCLE method is applied to sequence data on the ANGPTL4 gene in 1832 African American and 1045 European American individuals.
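To make the idea of a composite likelihood over SNP configurations concrete, here is a minimal toy sketch. It is not the authors' method: it assumes a constant-size population (R = 0), treats the count of sites with i derived copies as independent Poisson variables with mean θ/i, and assumes sequencing errors only add false singletons at a rate of n·L·ε per locus. The names `composite_loglik`, `mcle_toy`, and the locus length `L` are hypothetical and introduced only for illustration.

```python
import math

def composite_loglik(sfs, theta, eps, n, L):
    """Poisson composite log-likelihood of an unfolded site frequency spectrum.

    sfs[i-1] is the number of sites with i derived copies (i = 1..n-1).
    Toy assumptions: E[count_i] = theta / i, and errors add n*L*eps
    expected false singletons. The constant log(count!) term is dropped.
    """
    ll = 0.0
    for i, count in enumerate(sfs, start=1):
        lam = theta / i
        if i == 1:
            lam += n * L * eps  # singletons inflated by sequencing error
        ll += count * math.log(lam) - lam
    return ll

def mcle_toy(sfs, n, L):
    """Closed-form maximizer of composite_loglik under the toy model."""
    a_rest = sum(1.0 / i for i in range(2, n))
    # Non-singleton classes are unaffected by error, so they determine theta.
    theta = sum(sfs[1:]) / a_rest
    if sfs[0] >= theta:
        # Excess singletons beyond theta are attributed to error.
        eps = (sfs[0] - theta) / (n * L)
    else:
        # Boundary case eps = 0: Watterson-type estimate from all sites.
        theta = sum(sfs) / (1.0 + a_rest)
        eps = 0.0
    return theta, eps

if __name__ == "__main__":
    # Hypothetical counts for n = 10 sequences over L = 1000 sites.
    counts = [16, 3, 2, 2, 1, 1, 1, 1, 0]
    theta_hat, eps_hat = mcle_toy(counts, n=10, L=1000)
    print(theta_hat, eps_hat, composite_loglik(counts, theta_hat, eps_hat, 10, 1000))
```

Because errors are assumed to affect only singletons, this toy model separates θ from ε in closed form. A model with growth (R ≠ 0) or errors at polymorphic sites would need numerical optimization instead.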

journal_name

Genome Res

journal_title

Genome research

authors

Liu X,Fu YX,Maxwell TJ,Boerwinkle E

doi

10.1101/gr.097543.109

subject

Has Abstract

pub_date

2010-01-01 00:00:00

pages

101-9

issue

1

eissn

1549-5469

issn

1088-9051

pii

gr.097543.109

journal_volume

20

pub_type

Journal Article
  • Multiparameter functional diversity of human C2H2 zinc finger proteins.

    abstract::C2H2 zinc finger proteins represent the largest and most enigmatic class of human transcription factors. Their C2H2-ZF arrays are highly variable, indicating that most will have unique DNA binding motifs. However, most of the binding motifs have not been directly determined. In addition, little is known about whether ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.209643.116

    authors: Schmitges FW,Radovani E,Najafabadi HS,Barazandeh M,Campitelli LF,Yin Y,Jolma A,Zhong G,Guo H,Kanagalingam T,Dai WF,Taipale J,Emili A,Greenblatt JF,Hughes TR

    update_date:2016-12-01 00:00:00

  • A cross-platform analysis of 14,177 expression quantitative trait loci derived from lymphoblastoid cell lines.

    abstract::Gene expression levels can be an important link between DNA variation and phenotypic manifestations. Our previous map of global gene expression, based on ~400K single nucleotide polymorphisms (SNPs) and 50K transcripts in 400 sib pairs from the MRCA family panel, has been widely used to interpret the results of genome...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.142521.112

    authors: Liang L,Morar N,Dixon AL,Lathrop GM,Abecasis GR,Moffatt MF,Cookson WO

    update_date:2013-04-01 00:00:00

  • The Arabidopsis genome: a foundation for plant research.

    abstract::The sequence of the first plant genome was completed and published at the end of 2000. This spawned a series of large-scale projects aimed at discovering the functions of the 25,000+ genes identified in Arabidopsis thaliana (Arabidopsis). This review summarizes progress made in the past five years and speculates about...

    journal_title:Genome research

    pub_type: Journal Article, Review

    doi:10.1101/gr.3723405

    authors: Bevan M,Walsh S

    update_date:2005-12-01 00:00:00

  • The mouse Aire gene: comparative genomic sequencing, gene organization, and expression.

    abstract::Mutations in the human AIRE gene (hAIRE) result in the development of an autoimmune disease named APECED (autoimmune polyendocrinopathy candidiasis ectodermal dystrophy; OMIM 240300). Previously, we have cloned hAIRE and shown that it codes for a putative transcription-associated factor. Here we report the cloning and...

    journal_title:Genome research

    pub_type: Journal Article

    doi:

    authors: Blechschmidt K,Schweiger M,Wertz K,Poulson R,Christensen HM,Rosenthal A,Lehrach H,Yaspo ML

    update_date:1999-02-01 00:00:00

  • Detecting copy number variation with mated short reads.

    abstract::The development of high-throughput sequencing (HTS) technologies has opened the door to novel methods for detecting copy number variants (CNVs) in the human genome. While in the past CNVs have been detected based on array CGH data, recent studies have shown that depth-of-coverage information from HTS technologies can ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.106344.110

    authors: Medvedev P,Fiume M,Dzamba M,Smith T,Brudno M

    update_date:2010-11-01 00:00:00

  • Adenoviral vectors expressing siRNAs for discovery and validation of gene function.

    abstract::RNA interference is a powerful tool for studying gene function and for drug target discovery in diverse organisms and cell types. In mammalian systems, small interfering RNAs (siRNAs), or DNA plasmids expressing these siRNAs, have been used to down-modulate gene expression. However, inefficient transfection protocols,...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.1332603

    authors: Arts GJ,Langemeijer E,Tissingh R,Ma L,Pavliska H,Dokic K,Dooijes R,Mesić E,Clasen R,Michiels F,van der Schueren J,Lambrecht M,Herman S,Brys R,Thys K,Hoffmann M,Tomme P,van Es H

    update_date:2003-10-01 00:00:00

  • Detecting ancient positive selection in humans using extended lineage sorting.

    abstract::Natural selection that affected modern humans early in their evolution has likely shaped some of the traits that set present-day humans apart from their closest extinct and living relatives. The ability to detect ancient natural selection in the human genome could provide insights into the molecular basis for these hu...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.219493.116

    authors: Peyrégne S,Boyle MJ,Dannemann M,Prüfer K

    update_date:2017-09-01 00:00:00

  • CG dinucleotides enhance promoter activity independent of DNA methylation.

    abstract::Most mammalian RNA polymerase II initiation events occur at CpG islands, which are rich in CpGs and devoid of DNA methylation. Despite their relevance for gene regulation, it is unknown to what extent the CpG dinucleotide itself actually contributes to promoter activity. To address this question, we determined the tra...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.241653.118

    authors: Hartl D,Krebs AR,Grand RS,Baubec T,Isbel L,Wirbelauer C,Burger L,Schübeler D

    update_date:2019-04-01 00:00:00

  • Background-suppressed live visualization of genomic loci with an improved CRISPR system based on a split fluorophore.

    abstract::The higher-order structural organization and dynamics of the chromosomes play a central role in gene regulation. To explore this structure-function relationship, it is necessary to directly visualize genomic elements in living cells. Genome imaging based on the CRISPR system is a powerful approach but has limited appl...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.260018.119

    authors: Chaudhary N,Nho SH,Cho H,Gantumur N,Ra JS,Myung K,Kim H

    update_date:2020-09-01 00:00:00

  • A GC-rich sequence feature in the 3' UTR directs UPF1-dependent mRNA decay in mammalian cells.

    abstract::Up-frameshift protein 1 (UPF1) is an ATP-dependent RNA helicase that has essential roles in RNA surveillance and in post-transcriptional gene regulation by promoting the degradation of mRNAs. Previous studies revealed that UPF1 is associated with the 3' untranslated region (UTR) of target mRNAs via as-yet-unknown sequ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.206060.116

    authors: Imamachi N,Salam KA,Suzuki Y,Akimitsu N

    update_date:2017-03-01 00:00:00

  • A periodic pattern of SNPs in the human genome.

    abstract::By surveying a filtered, high-quality set of SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6, or 8 bp apart are more frequent than SNPs positioned 3, 5, 7, or 9 bp apart. The observed pattern is not restricted to genomic regions that are known to cause sequencing or alignment errors, for exampl...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.6223207

    authors: Madsen BE,Villesen P,Wiuf C

    update_date:2007-10-01 00:00:00

  • Genome-wide mapping of human DNA-replication origins: levels of transcription at ORC1 sites regulate origin selection and replication timing.

    abstract::We report the genome-wide mapping of ORC1 binding sites in mammals, by chromatin immunoprecipitation and parallel sequencing (ChIP-seq). ORC1 binding sites in HeLa cells were validated as active DNA replication origins (ORIs) using Repli-seq, a method that allows identification of ORI-containing regions by parallel se...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.142331.112

    authors: Dellino GI,Cittaro D,Piccioni R,Luzi L,Banfi S,Segalla S,Cesaroni M,Mendoza-Maldonado R,Giacca M,Pelicci PG

    update_date:2013-01-01 00:00:00

  • YY1 and CTCF orchestrate a 3D chromatin looping switch during early neural lineage commitment.

    abstract::CTCF is an architectural protein with a critical role in connecting higher-order chromatin folding in pluripotent stem cells. Recent reports have suggested that CTCF binding is more dynamic during development than previously appreciated. Here, we set out to understand the extent to which shifts in genome-wide CTCF occ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.215160.116

    authors: Beagan JA,Duong MT,Titus KR,Zhou L,Cao Z,Ma J,Lachanski CV,Gillis DR,Phillips-Cremins JE

    update_date:2017-07-01 00:00:00

  • Translation initiation downstream from annotated start codons in human mRNAs coevolves with the Kozak context.

    abstract::Eukaryotic translation initiation involves preinitiation ribosomal complex 5'-to-3' directional probing of mRNA for codons suitable for starting protein synthesis. The recognition of codons as starts depends on the codon identity and on its immediate nucleotide context known as Kozak context. When the context is weak ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.257352.119

    authors: Benitez-Cantos MS,Yordanova MM,O'Connor PBF,Zhdanov AV,Kovalchuk SI,Papkovsky DB,Andreev DE,Baranov PV

    update_date:2020-07-01 00:00:00

  • The human protein coevolution network.

    abstract::Coevolution maintains interactions between phenotypic traits through the process of reciprocal natural selection. Detecting molecular coevolution can expose functional interactions between molecules in the cell, generating insights into biological processes, pathways, and the networks of interactions important for cel...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.092452.109

    authors: Tillier ER,Charlebois RL

    update_date:2009-10-01 00:00:00

  • MicroRNAs reinforce repression of PRC2 transcriptional targets independently and through a feed-forward regulatory network.

    abstract::Gene expression can be regulated at multiple levels, but it is not known if and how there is broad coordination between regulation at the transcriptional and post-transcriptional levels. Transcription factors and chromatin regulate gene expression transcriptionally, whereas microRNAs (miRNAs) are small regulatory RNAs...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.238311.118

    authors: Shivram H,Le SV,Iyer VR

    update_date:2019-02-01 00:00:00

  • A transposon-based strategy for sequencing repetitive DNA in eukaryotic genomes.

    abstract::Repetitive DNA is a significant component of eukaryotic genomes. We have developed a strategy to efficiently and accurately sequence repetitive DNA in the nematode Caenorhabditis elegans using integrated artificial transposons and automated fluorescent sequencing. Mapping and assembly tools represent important compone...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.7.5.551

    authors: Devine SE,Chissoe SL,Eby Y,Wilson RK,Boeke JD

    update_date:1997-05-01 00:00:00

  • Schizosaccharomyces pombe essential genes: a pilot study.

    abstract::After completion of the Schizosaccharomyces pombe genome sequence, we have carried out a pilot gene deletion project to assess the feasibility of a genome-wide deletion project and to estimate the percentage of essential genes. Using a PCR-based gene deletion procedure, we investigated 100 genes within a 253-kb region...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.636103

    authors: Decottignies A,Sanchez-Perez I,Nurse P

    update_date:2003-03-01 00:00:00

  • Modeling of epigenome dynamics identifies transcription factors that mediate Polycomb targeting.

    abstract::Although changes in chromatin are integral to transcriptional reprogramming during cellular differentiation, it is currently unclear how chromatin modifications are targeted to specific loci. To systematically identify transcription factors (TFs) that can direct chromatin changes during cell fate decisions, we model t...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.142661.112

    authors: Arnold P,Schöler A,Pachkov M,Balwierz PJ,Jørgensen H,Stadler MB,van Nimwegen E,Schübeler D

    update_date:2013-01-01 00:00:00

  • CADLIVE dynamic simulator: direct link of biochemical networks to dynamic models.

    abstract::We have developed the CADLIVE (Computer-Aided Design of LIVing systEms) Simulator that provided a rule-based automatic way to convert biochemical network maps into dynamic models, which enables simulating their dynamics without going through all of the reactions down to the details of exact kinetic parameters. The sim...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.3463705

    authors: Kurata H,Masaki K,Sumida Y,Iwasaki R

    update_date:2005-04-01 00:00:00

  • Next-generation tag sequencing for cancer gene expression profiling.

    abstract::We describe a new method, Tag-seq, which employs ultra high-throughput sequencing of 21 base pair cDNA tags for sensitive and cost-effective gene expression profiling. We compared Tag-seq data to LongSAGE data and observed improved representation of several classes of rare transcripts, including transcription factors,...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.094482.109

    authors: Morrissy AS,Morin RD,Delaney A,Zeng T,McDonald H,Jones S,Zhao Y,Hirst M,Marra MA

    update_date:2009-10-01 00:00:00

  • Transcript assembly improves expression quantification of transposable elements in single-cell RNA-seq data.

    abstract::Transposable elements (TEs) are an integral part of the host transcriptome. TE-containing noncoding RNAs (ncRNAs) show considerable tissue specificity and play important roles during development, including stem cell maintenance and cell differentiation. Recent advances in single-cell RNA-seq (scRNA-seq) revolutionized...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.265173.120

    authors: Shao W,Wang T

    update_date:2021-01-01 00:00:00

  • Genomic analysis identifies association of Fusobacterium with colorectal carcinoma.

    abstract::The tumor microenvironment of colorectal carcinoma is a complex community of genomically altered cancer cells, nonneoplastic cells, and a diverse collection of microorganisms. Each of these components may contribute to carcinogenesis; however, the role of the microbiota is the least well understood. We have characteri...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.126573.111

    authors: Kostic AD,Gevers D,Pedamallu CS,Michaud M,Duke F,Earl AM,Ojesina AI,Jung J,Bass AJ,Tabernero J,Baselga J,Liu C,Shivdasani RA,Ogino S,Birren BW,Huttenhower C,Garrett WS,Meyerson M

    update_date:2012-02-01 00:00:00

  • A pooling-based approach to mapping genetic variants associated with DNA methylation.

    abstract::DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.183749.114

    authors: Kaplow IM,MacIsaac JL,Mah SM,McEwen LM,Kobor MS,Fraser HB

    update_date:2015-06-01 00:00:00

  • Species-specific class I gene expansions formed the telomeric 1 mb of the mouse major histocompatibility complex.

    abstract::We have determined the complete sequence of 951,695 bp from the class I region of H2, the mouse major histocompatibility complex (Mhc) from strain 129/Sv (haplotype bc). The sequence contains 26 genes. The sequence spans from the last 50 kb of the H2-T region, including 2 class I genes and 3 class I pseudogenes, and i...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.975303

    authors: Takada T,Kumánovics A,Amadou C,Yoshino M,Jones EP,Athanasiou M,Evans GA,Fischer Lindahl K

    update_date:2003-04-01 00:00:00

  • Judging the quality of gene expression-based clustering methods using gene annotation.

    abstract::We compare several commonly used expression-based gene clustering algorithms using a figure of merit based on the mutual information between cluster membership and known gene attributes. By studying various publicly available expression data sets we conclude that enrichment of clusters for biological function is, in g...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.397002

    authors: Gibbons FD,Roth FP

    update_date:2002-10-01 00:00:00

  • Asymmetric nucleosomes flank promoters in the budding yeast genome.

    abstract::Nucleosomes in active chromatin are dynamic, but whether they have distinct structural conformations is unknown. To identify nucleosomes with alternative structures genome-wide, we used H4S47C-anchored cleavage mapping, which revealed that 5% of budding yeast (Saccharomyces cerevisiae) nucleosome positions have asymme...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.182618.114

    authors: Ramachandran S,Zentner GE,Henikoff S

    update_date:2015-03-01 00:00:00

  • Transcriptional enhancement by GATA1-occupied DNA segments is strongly associated with evolutionary constraint on the binding site motif.

    abstract::Tissue development and function are exquisitely dependent on proper regulation of gene expression, but it remains controversial whether the genomic signals controlling this process are subject to strong selective constraint. While some studies show that highly constrained noncoding regions act to enhance transcription...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.083089.108

    authors: Cheng Y,King DC,Dore LC,Zhang X,Zhou Y,Zhang Y,Dorman C,Abebe D,Kumar SA,Chiaromonte F,Miller W,Green RD,Weiss MJ,Hardison RC

    update_date:2008-12-01 00:00:00

  • Unamplified cap analysis of gene expression on a single-molecule sequencer.

    abstract::We report the development of a simplified cap analysis of gene expression (CAGE) protocol adapted for single-molecule sequencers that avoids second strand synthesis, ligation, digestion, and PCR. HeliScopeCAGE directly sequences the 3' end of cap trapped first-strand cDNAs. As with previous versions of CAGE, we better...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.115469.110

    authors: Kanamori-Katayama M,Itoh M,Kawaji H,Lassmann T,Katayama S,Kojima M,Bertin N,Kaiho A,Ninomiya N,Daub CO,Carninci P,Forrest AR,Hayashizaki Y

    update_date:2011-07-01 00:00:00

  • Reconstructing complex regions of genomes using long-read sequencing technology.

    abstract::Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger s...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.168450.113

    authors: Huddleston J,Ranade S,Malig M,Antonacci F,Chaisson M,Hon L,Sudmant PH,Graves TA,Alkan C,Dennis MY,Wilson RK,Turner SW,Korlach J,Eichler EE

    update_date:2014-04-01 00:00:00