Systematic recovery and analysis of full-ORF human cDNA clones.

Abstract:

The Mammalian Gene Collection (MGC) consortium (http://mgc.nci.nih.gov) seeks to establish publicly available collections of full-ORF cDNAs for several organisms of significance to biomedical research, including human. To date over 15,200 human cDNA clones containing full-length open reading frames (ORFs) have been identified via systematic expressed sequence tag (EST) analysis of a diverse set of cDNA libraries; however, further systematic EST analysis is no longer an efficient method for identifying new cDNAs. As part of our involvement in the MGC program, we have developed a scalable method for targeted recovery of cDNA clones to facilitate recovery of genes absent from the MGC collection. First, cDNA is synthesized from various RNAs, followed by polymerase chain reaction (PCR) amplification of transcripts in 96-well plates using gene-specific primer pairs flanking the ORFs. Amplicons are cloned into a sequencing vector, and full-length sequences are obtained. Sequences are processed and assembled using Phred and Phrap, and analyzed using Consed and a number of bioinformatics methods we have developed. Sequences are compared with the Reference Sequence (RefSeq) database, and validation of sequence discrepancies is attempted using other sequence databases including dbEST and dbSNP. Clones with identical sequence to RefSeq or containing only validated changes will become part of the MGC human gene collection. Clones containing novel splice variants or polymorphisms have also been identified. Our approach to clone recovery, applied at large scale, has the potential to recover many and possibly most of the genes absent from the MGC collection.

journal_name

Genome Res

journal_title

Genome research

authors

Baross A,Butterfield YS,Coughlin SM,Zeng T,Griffith M,Griffith OL,Petrescu AS,Smailus DE,Khattra J,McDonald HL,McKay SJ,Moksa M,Holt RA,Marra MA

doi

10.1101/gr.2473704

subject

Has Abstract

pub_date

2004-10-01 00:00:00

pages

2083-92

issue

10B

eissn

1549-5469

issn

1088-9051

pii

14/10b/2083

journal_volume

14

pub_type

Journal Article
  • Immune signatures correlate with L1 retrotransposition in gastrointestinal cancers.

    abstract::Long interspersed nuclear element-1 (LINE-1 or L1) retrotransposons are normally suppressed in somatic tissues mainly due to DNA methylation and antiviral defense. However, the mechanism to suppress L1s may be disrupted in cancers, thus allowing L1s to act as insertional mutagens and cause genomic rearrangement and in...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.231837.117

    authors: Jung H,Choi JK,Lee EA

    update_date:2018-08-01 00:00:00

  • Integrated single-cell genetic and transcriptional analysis suggests novel drivers of chronic lymphocytic leukemia.

    abstract::Intra-tumoral genetic heterogeneity has been characterized across cancers by genome sequencing of bulk tumors, including chronic lymphocytic leukemia (CLL). In order to more accurately identify subclones, define phylogenetic relationships, and probe genotype-phenotype relationships, we developed methods for targeted m...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.217331.116

    authors: Wang L,Fan J,Francis JM,Georghiou G,Hergert S,Li S,Gambe R,Zhou CW,Yang C,Xiao S,Cin PD,Bowden M,Kotliar D,Shukla SA,Brown JR,Neuberg D,Alessi DR,Zhang CZ,Kharchenko PV,Livak KJ,Wu CJ

    update_date:2017-08-01 00:00:00

  • Cytosine modifications modulate the chromatin architecture of transcriptional enhancers.

    abstract::Epigenetic mechanisms are believed to play key roles in the establishment of cell-specific transcription programs. Accordingly, the modified bases 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC) have been observed in DNA of genomic regulatory regions such as enhancers, and oxidation of 5mC into 5hmC by Ten-e...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.211466.116

    authors: Mahé EA,Madigou T,Sérandour AA,Bizot M,Avner S,Chalmel F,Palierne G,Métivier R,Salbert G

    update_date:2017-06-01 00:00:00

  • A linkage map of the rat genome derived from three F2 crosses.

    abstract::We report the construction of a dense linkage map of the rat genome integrating 767 simple sequence length polymorphism markers, combined over three crosses with high rates of polymorphism. F2 populations from WKY x S (n = 159), BN x S (n = 91), and BN x GK (n = 139) were selected and genotyped for combinations of mic...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.7.5.434

    authors: Bihoreau MT,Gauguier D,Kato N,Hyne G,Lindpaintner K,Rapp JP,James MR,Lathrop GM

    update_date:1997-05-01 00:00:00

  • A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation.

    abstract::Recent advances in genome research have accelerated the process of locating candidate genes and the variable sites within them and have simplified the task of genotype measurement. The development of statistical and computational strategies to utilize information on hundreds -- soon thousands -- of variable loci to in...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.172901

    authors: Nelson MR,Kardia SL,Ferrell RE,Sing CF

    update_date:2001-03-01 00:00:00

  • Gene regulation and speciation in house mice.

    abstract::One approach to understanding the process of speciation is to characterize the genetic architecture of post-zygotic isolation. As gene regulation requires interactions between loci, negative epistatic interactions between divergent regulatory elements might underlie hybrid incompatibilities and contribute to reproduct...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.195743.115

    authors: Mack KL,Campbell P,Nachman MW

    update_date:2016-04-01 00:00:00

  • Evolution of gene order in the genomes of two related yeast species.

    abstract::Changes in gene order between the genomes of two related yeast species, Saccharomyces cerevisiae and Saccharomyces bayanus var. uvarum were studied. From the dataset of a previous low coverage sequencing of the S. bayanus var. uvarum genome, 35 different synteny breakpoints between neighboring genes and two cases of l...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.212701

    authors: Fischer G,Neuvéglise C,Durrens P,Gaillardin C,Dujon B

    update_date:2001-12-01 00:00:00

  • Characterization and dynamics of pericentromere-associated domains in mice.

    abstract::Despite recent progress in genome topology knowledge, the role of repeats, which make up the majority of mammalian genomes, remains elusive. Satellite repeats are highly abundant sequences that cluster around centromeres, attract pericentromeric heterochromatin, and aggregate into nuclear chromocenters. These nuclear ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.186643.114

    authors: Wijchers PJ,Geeven G,Eyres M,Bergsma AJ,Janssen M,Verstegen M,Zhu Y,Schell Y,Vermeulen C,de Wit E,de Laat W

    update_date:2015-07-01 00:00:00

  • Conservation, regulation, synteny, and introns in a large-scale C. briggsae-C. elegans genomic alignment.

    abstract::A new algorithm, WABA, was developed for doing large-scale alignments between genomic DNA of different species. WABA was used to align 8 million bases of Caenorhabditis briggsae genomic DNA against the entire 97-million-base Caenorhabditis elegans genome. The alignment, including C. briggsae homologs of 154 geneticall...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.10.8.1115

    authors: Kent WJ,Zahler AM

    update_date:2000-08-01 00:00:00

  • A fine scale phenotype-genotype virulence map of a bacterial pathogen.

    abstract::A large fraction of the genes from sequenced organisms are of unknown function. This limits biological insight, and for pathogenic microorganisms hampers the development of new approaches to battle infections. There is thus a great need for novel strategies that link genotypes to phenotypes for microorganisms. We desc...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.137430.112

    authors: van Opijnen T,Camilli A

    update_date:2012-12-01 00:00:00

  • Software for automated analysis of DNA fingerprinting gels.

    abstract::Here we describe software tools for the automated detection of DNA restriction fragments resolved on agarose fingerprinting gels. We present a mathematical model for the location and shape of the restriction fragments as a function of fragment size, with model parameters determined empirically from "marker" lanes cont...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.904303

    authors: Fuhrmann DR,Krzywinski MI,Chiu R,Saeedi P,Schein JE,Bosdet IE,Chinwalla A,Hillier LW,Waterston RH,McPherson JD,Jones SJ,Marra MA

    update_date:2003-05-01 00:00:00

  • A systematic model to predict transcriptional regulatory mechanisms based on overrepresentation of transcription factor binding profiles.

    abstract::An important aspect of understanding a biological pathway is to delineate the transcriptional regulatory mechanisms of the genes involved. Two important tasks are often encountered when studying transcription regulation, i.e., (1) the identification of common transcriptional regulators of a set of coexpressed genes; (...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.4303406

    authors: Chang LW,Nagarajan R,Magee JA,Milbrandt J,Stormo GD

    update_date:2006-03-01 00:00:00

  • Profiling patterned transcripts in Drosophila embryos.

    abstract::Here we describe a high-throughput screen to isolate transcripts with spatially restricted patterns of expression in early embryos. Our approach utilizes robotic automation for rapid analysis of sequence-selected cDNAs in a whole-mount in situ hybridization assay. We determined the spatial distribution of a random col...

    journal_title:Genome research

    pub_type: Letter

    doi:10.1101/gr.84402

    authors: Simin K,Scuderi A,Reamey J,Dunn D,Weiss R,Metherall JE,Letsou A

    update_date:2002-07-01 00:00:00

  • Whole-genome sequence assembly for mammalian genomes: Arachne 2.

    abstract::We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal change...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.828403

    authors: Jaffe DB,Butler J,Gnerre S,Mauceli E,Lindblad-Toh K,Mesirov JP,Zody MC,Lander ES

    update_date:2003-01-01 00:00:00

  • Sensitive mapping of recombination hotspots using sequencing-based detection of ssDNA.

    abstract::Meiotic DNA double-stranded breaks (DSBs) initiate genetic recombination in discrete areas of the genome called recombination hotspots. DSBs can be directly mapped using chromatin immunoprecipitation followed by sequencing (ChIP-seq). Nevertheless, the genome-wide mapping of recombination hotspots in mammals is still ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.130583.111

    authors: Khil PP,Smagulova F,Brick KM,Camerini-Otero RD,Petukhova GV

    update_date:2012-05-01 00:00:00

  • LSH and G9a/GLP complex are required for developmentally programmed DNA methylation.

    abstract::LSH, a member of the SNF2 family of chromatin remodeling ATPases encoded by the Hells gene, is essential for normal levels of DNA methylation in the mammalian genome. While the role of LSH in the methylation of repetitive DNA sequences is well characterized, its contribution to the regulation of DNA methylation and th...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.108498.110

    authors: Myant K,Termanis A,Sundaram AY,Boe T,Li C,Merusi C,Burrage J,de Las Heras JI,Stancheva I

    update_date:2011-01-01 00:00:00

  • Connecting sequence and biology in the laboratory mouse.

    abstract::The Mouse Genome Sequencing Consortium and the RIKEN Genome Exploration Research group have generated large sets of sequence data representing the mouse genome and transcriptome, respectively. These data provide a valuable foundation for genomic research. The challenges for the informatics community are how to integrat...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.991003

    authors: Baldarelli RM,Hill DP,Blake JA,Adachi J,Furuno M,Bradt D,Corbani LE,Cousins S,Frazer KS,Qi D,Yang L,Ramachandran S,Reed D,Zhu Y,Kasukawa T,Ringwald M,King BL,Maltais LJ,McKenzie LM,Schriml LM,Maglott D,Church DM

    update_date:2003-06-01 00:00:00

  • The repetitive landscape of the chicken genome.

    abstract::Cot-based cloning and sequencing (CBCS) is a powerful tool for isolating and characterizing the various repetitive components of any genome, combining the established principles of DNA reassociation kinetics with high-throughput sequencing. CBCS was used to generate sequence libraries representing the high, middle, an...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.2438004

    authors: Wicker T,Robertson JS,Schulze SR,Feltus FA,Magrini V,Morrison JA,Mardis ER,Wilson RK,Peterson DG,Paterson AH,Ivarie R

    update_date:2005-01-01 00:00:00

  • Multiple waves of recent DNA transposon activity in the bat, Myotis lucifugus.

    abstract::DNA transposons, or class 2 transposable elements, have successfully propagated in a wide variety of genomes. However, it is widely believed that DNA transposon activity has ceased in mammalian genomes for at least the last 40 million years. We recently reported evidence for the relatively recent activity of hAT and H...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.071886.107

    authors: Ray DA,Feschotte C,Pagan HJ,Smith JD,Pritham EJ,Arensburger P,Atkinson PW,Craig NL

    update_date:2008-05-01 00:00:00

  • Genomic organization of TEL: the human ETS-variant gene 6.

    abstract::We have constructed a detailed map of the genomic region containing the ETS-variant gene 6 (ETV6), involved in translocations and deletions associated with hematologic malignancies. Thirty-eight cosmids were characterized belonging to two contigs spanning 340 kb, and an EcoRI restriction map was developed. The gap bet...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.6.5.404

    authors: Baens M,Peeters P,Guo C,Aerssens J,Marynen P

    update_date:1996-05-01 00:00:00

  • Fusion of the human gene for the polyubiquitination coeffector UEV1 with Kua, a newly identified gene.

    abstract::UEV proteins are enzymatically inactive variants of the E2 ubiquitin-conjugating enzymes that regulate noncanonical elongation of ubiquitin chains. In Saccharomyces cerevisiae, UEV is part of the RAD6-mediated error-free DNA repair pathway. In mammalian cells, UEV proteins can modulate c-FOS transcription and the G2-M...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.gr-1405r

    authors: Thomson TM,Lozano JJ,Loukili N,Carrió R,Serras F,Cormand B,Valeri M,Díaz VM,Abril J,Burset M,Merino J,Macaya A,Corominas M,Guigó R

    update_date:2000-11-01 00:00:00

  • Toward the development of a gene index to the human genome: an assessment of the nature of high-throughput EST sequence data.

    abstract::A rigorous analysis of the Merck-sponsored EST data with respect to known gene sequences increases the utility of the data set and helps refine methods for building a gene index. A highly curated human transcript data base was used as a reference data set of known genes. A detailed analysis of EST sequences derived fr...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.6.9.829

    authors: Aaronson JS,Eckman B,Blevins RA,Borkowski JA,Myerson J,Imran S,Elliston KO

    update_date:1996-09-01 00:00:00

  • DNA enrichment by allele-specific hybridization (DEASH): a novel method for haplotyping and for detecting low-frequency base substitutional variants and recombinant DNA molecules.

    abstract::Detecting rare sequence variants in genomic DNA is central to the analysis of de novo mutation and recombination events and the detection of rare pathological mutations in mixed cell populations. Current PCR techniques suffer from noise that limits detection to variants present at a frequency of at least 10^-4 to 10^-5...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.1214603

    authors: Jeffreys AJ,May CA

    update_date:2003-10-01 00:00:00

  • Theories and applications for sequencing randomly selected clones.

    abstract::Theory is developed for the process of sequencing randomly selected large-insert clones. Genome size, library depth, clone size, and clone distribution are considered relevant properties and perfect overlap detection for contig assembly is assumed. Genome-specific and nonrandom effects are neglected. Order of magnitud...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.gr-1339r

    authors: Wendl MC,Marra MA,Hillier LW,Chinwalla AT,Wilson RK,Waterston RH

    update_date:2001-02-01 00:00:00

  • Topologically associating domains and their long-range contacts are established during early G1 coincident with the establishment of the replication-timing program.

    abstract::Mammalian genomes are partitioned into domains that replicate in a defined temporal order. These domains can replicate at similar times in all cell types (constitutive) or at cell type-specific times (developmental). Genome-wide chromatin conformation capture (Hi-C) has revealed sub-megabase topologically associating ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.183699.114

    authors: Dileep V,Ay F,Sima J,Vera DL,Noble WS,Gilbert DM

    update_date:2015-08-01 00:00:00

  • CBX3 regulates efficient RNA processing genome-wide.

    abstract::CBX5, CBX1, and CBX3 (HP1α, β, and γ, respectively) play an evolutionarily conserved role in the formation and maintenance of heterochromatin. In addition, CBX5, CBX1, and CBX3 may also participate in transcriptional regulation of genes. Recently, CBX3 binding to the bodies of a subset of genes has been observed in hu...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.124818.111

    authors: Smallwood A,Hon GC,Jin F,Henry RE,Espinosa JM,Ren B

    update_date:2012-08-01 00:00:00

  • Comparative methylome analysis of benign and malignant peripheral nerve sheath tumors.

    abstract::Aberrant DNA methylation (DNAm) was first linked to cancer over 25 yr ago. Since then, many studies have associated hypermethylation of tumor suppressor genes and hypomethylation of oncogenes to the tumorigenic process. However, most of these studies have been limited to the analysis of promoters and CpG islands (CGIs...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.109678.110

    authors: Feber A,Wilson GA,Zhang L,Presneau N,Idowu B,Down TA,Rakyan VK,Noon LA,Lloyd AC,Stupka E,Schiza V,Teschendorff AE,Schroth GP,Flanagan A,Beck S

    update_date:2011-04-01 00:00:00

  • Centromere repositioning.

    abstract::Primate pericentromeric regions recently have been shown to exhibit extraordinary evolutionary plasticity. In this paper we report an additional peculiar feature of these regions that we discovered while analyzing, by FISH, the evolutionary conservation of primate phylogenetic chromosome IX. If the position of the cen...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.9.12.1184

    authors: Montefalcone G,Tempesta S,Rocchi M,Archidiacono N

    update_date:1999-12-01 00:00:00

  • A genome-wide study of dual coding regions in human alternatively spliced genes.

    abstract::Alternative splicing is a major mechanism for gene product regulation in many multicellular organisms. By using different exon combinations, some coding regions can encode amino acids in multiple reading frames in different transcripts. Here we performed a systematic search through a set of high-quality human transcri...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.4246506

    authors: Liang H,Landweber LF

    update_date:2006-02-01 00:00:00

  • Comparative gene mapping: a fine-scale survey of chromosome rearrangements between ruminants and humans.

    abstract::A total of 202 genes were cytogenetically mapped to goat chromosomes, multiplying by five the total number of regional gene localizations in domestic ruminants (255). This map encompasses 249 and 173 common anchor loci regularly spaced along human and murine chromosomes, respectively, which makes it possible to perfor...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.8.9.901

    authors: Schibler L,Vaiman D,Oustry A,Giraud-Delville C,Cribiu EP

    update_date:1998-09-01 00:00:00