Abstract:
Analyzing vertebrate genomes requires rapid mRNA/DNA and cross-species protein alignments. A new tool, BLAT, is more accurate and 500 times faster than popular existing tools for mRNA/DNA alignments and 50 times faster for protein alignments at sensitivity settings typically used when comparing vertebrate sequences. BLAT's speed stems from an index of all nonoverlapping K-mers in the genome. This index fits inside the RAM of inexpensive computers, and need only be computed once for each genome assembly. BLAT has several major stages. It uses the index to find regions in the genome likely to be homologous to the query sequence. It performs an alignment between homologous regions. It stitches together these aligned regions (often exons) into larger alignments (typically genes). Finally, BLAT revisits small internal exons possibly missed at the first stage and adjusts large gap boundaries that have canonical splice sites where feasible. This paper describes how BLAT was optimized. Effects on speed and sensitivity are explored for various K-mer sizes, mismatch schemes, and number of required index matches. BLAT is compared with other alignment programs on various test sets and then used in several genome-wide applications. http://genome.ucsc.edu hosts a web-based BLAT server for the human genome.
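The indexing idea in the abstract can be illustrated with a minimal sketch. It is not BLAT's actual implementation: the function names `build_index` and `find_hits`, the toy sequences, and the choice to scan the query at every offset are assumptions made for illustration. The sketch covers only the first stage (finding candidate regions through exact K-mer hits), not alignment, stitching, or splice-site adjustment.

```python
from collections import defaultdict

def build_index(genome, k):
    """Map each nonoverlapping k-mer of the genome to its start positions (hypothetical helper)."""
    index = defaultdict(list)
    # Step in strides of k so that indexed k-mers never overlap.
    for start in range(0, len(genome) - k + 1, k):
        index[genome[start:start + k]].append(start)
    return dict(index)

def find_hits(index, query, k):
    """Return (query_pos, genome_pos) pairs for query k-mers present in the index (hypothetical helper)."""
    hits = []
    # Assumption: the query is scanned at every offset, so a stride-k genome
    # k-mer can be matched wherever it falls inside the query.
    for q in range(len(query) - k + 1):
        for g in index.get(query[q:q + k], []):
            hits.append((q, g))
    return hits

# Toy data, chosen only for illustration.
genome = "ACGTACGTTTGACCA"
index = build_index(genome, k=4)
hits = find_hits(index, "CGTTTGAC", k=4)
print(index)
print(hits)
```

Because only every K-th position of the genome is stored, the index is roughly K times smaller than one holding all overlapping K-mers, which is consistent with the abstract's remark that it fits in the RAM of inexpensive computers. Hits that share the same difference between genome and query position would be candidates for the later alignment stage.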
journal_name: Genome Res
journal_title: Genome research
authors: Kent WJ
doi: 10.1101/gr.229202
subject: Has Abstract
pub_date: 2002-04-01 00:00:00
pages: 656-64
issue: 4
eissn: 1088-9051
issn: 1549-5469
journal_volume: 12
pub_type: Journal article

Related literature
GENOME RESEARCH literature collection
abstract::Changes in gene expression drive novel phenotypes, raising interest in how gene expression evolves. In contrast to the static genome, cells modulate gene expression in response to changing environments. Previous comparative studies focused on specific conditions, describing interspecies variation in expression levels,...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.261537.120
update date:2020-07-01 00:00:00
abstract::Identifying transcriptional regulatory elements represents a significant challenge in annotating the genomes of higher vertebrates. We have developed a computational tool, rVista, for high-throughput discovery of cis-regulatory elements that combines clustering of predicted transcription factor binding sites (TFBSs) a...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.225502
update date:2002-05-01 00:00:00
abstract::Here we describe a high-throughput screen to isolate transcripts with spatially restricted patterns of expression in early embryos. Our approach utilizes robotic automation for rapid analysis of sequence-selected cDNAs in a whole-mount in situ hybridization assay. We determined the spatial distribution of a random col...
journal_title:Genome research
pub_type: Letter
doi:10.1101/gr.84402
update date:2002-07-01 00:00:00
abstract::We find that the degree of impairment of protein function by missense variants is predictable by comparative sequence analysis alone. The applicable range of impairment is not confined to binary predictions that distinguish normal from deleterious variants, but extends continuously from mild to severe effects. The acc...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.3804205
update date:2005-07-01 00:00:00
abstract::MicroRNAs (miRNAs) are known to post-transcriptionally regulate target mRNAs through the 3'-UTR, which interacts mainly with the 5'-end of miRNA in animals. Here we identify many endogenous motifs within human 5'-UTRs specific to the 3'-ends of miRNAs. The 3'-end of conserved miRNAs in particular has significant inter...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.089367.108
update date:2009-07-01 00:00:00
abstract::Paternal X chromosome inactivation occurs in rodent extraembryonic membranes and in all tissues of marsupials. Methylation of CpG islands occurs on the inactive X in eutherians and is considered to be a stabilizing mechanism. The only previous study of a marsupial X-linked CpG island was of the G6PD gene of the Virgin...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.6.2.114
update date:1996-02-01 00:00:00
abstract::The genome size of Pseudoalteromonas haloplanktis, a ubiquitous and easily cultured marine bacterium, was measured as a step toward estimating the genome complexity of marine bacterioplankton. To determine total genome size, we digested P. haloplanktis DNA with the restriction endonucleases NotI and SfiI, separated th...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.6.12.1160
update date:1996-12-01 00:00:00
abstract::Disregulation of imprinted genes can be associated with tumorigenesis and altered cell differentiation capacity and so could provide adverse outcomes for stem cell applications. Although the maintenance of mouse and primate embryonic stem cells in a pluripotent state has been reported to disrupt the monoallelic expres...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.6609207
update date:2007-12-01 00:00:00
abstract::The rate of transcription elongation plays an important role in the timing of expression of full-length transcripts as well as in the regulation of alternative splicing. In this study, we coupled Bru-seq technology with 5,6-dichlorobenzimidazole 1-β-D-ribofuranoside (DRB) to estimate the elongation rates of over 2000 ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.171405.113
update date:2014-06-01 00:00:00
abstract::Aberrant DNA methylation (DNAm) was first linked to cancer over 25 yr ago. Since then, many studies have associated hypermethylation of tumor suppressor genes and hypomethylation of oncogenes to the tumorigenic process. However, most of these studies have been limited to the analysis of promoters and CpG islands (CGIs...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.109678.110
update date:2011-04-01 00:00:00
abstract::A-to-I RNA editing is a conserved widespread phenomenon in which adenosine (A) is converted to inosine (I) by adenosine deaminases (ADARs) in double-stranded RNA regions, mainly noncoding. Mutations in ADAR enzymes in Caenorhabditis elegans cause defects in normal development but are not lethal as in human and mouse. ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.211169.116
update date:2017-03-01 00:00:00
abstract::Maize (Zea mays L. ssp. mays), one of the most important agricultural crops in the world, originated by hybridization of two closely related progenitors. To investigate the fate of its genes after tetraploidization, we analyzed the sequence of five duplicated regions from different chromosomal locations. We also compa...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.2701104
update date:2004-10-01 00:00:00
abstract::In contrast to other animal cell lines, the chicken pre-B cell lymphoma line, DT40, exhibits a high level of homologous recombination, which can be exploited to generate site-specific alterations in defined target genes or regions. In addition, the ability to generate human/chicken monochromosomal hybrids in the DT40 ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.8.6.666
update date:1998-06-01 00:00:00
abstract::Comparative genomics provides a general methodology for discovering functional DNA elements and understanding their evolution. The availability of many related genomes enables more powerful analyses, but requires rigorous phylogenetic methods to resolve orthologous genes and regions. Here, we use 12 recently sequenced...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.7105007
update date:2007-12-01 00:00:00
abstract::Large scale gene perturbation experiments generate information about the number of genes whose activity is directly or indirectly affected by a gene perturbation. From this information, one can numerically estimate coarse structural network features such as the total number of direct regulatory interactions and the nu...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.193902
update date:2002-02-01 00:00:00
abstract::More than 25 loci have been linked to type 1 diabetes (T1D) in the nonobese diabetic (NOD) mouse, but identification of the underlying genes remains challenging. We describe here the positional cloning of a T1D susceptibility locus, Idd11, located on mouse chromosome 4. Sequence analysis of a series of congenic NOD mo...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.101881.109
update date:2010-12-01 00:00:00
abstract::MULTIPROSPECTOR, a multimeric threading algorithm for the prediction of protein-protein interactions, is applied to the genome of Saccharomyces cerevisiae. Each possible pairwise interaction among more than 6000 encoded proteins is evaluated against a dimer database of 768 complex structures by using a confidence esti...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.1145203
update date:2003-06-01 00:00:00
abstract::In the attempt to understand human variation and the genetic basis of complex disease, a tremendous number of single nucleotide polymorphisms (SNPs) have been discovered and deposited into NCBI's dbSNP public database. More than 2.7 million SNPs in the database have genotype information. This data provides an invaluab...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.4297805
update date:2005-11-01 00:00:00
abstract::We compare several commonly used expression-based gene clustering algorithms using a figure of merit based on the mutual information between cluster membership and known gene attributes. By studying various publicly available expression data sets we conclude that enrichment of clusters for biological function is, in g...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.397002
update date:2002-10-01 00:00:00
abstract::Using AP-PCR-based DNA profiling we examined some structural features of B chromosomes from yellow-necked mice Apodemus flavicollis. Mice harboring one, two, or three or lacking B chromosomes were examined. Chromosomal structure was scanned for variant bands by using a series of arbitrary primers and from these, infor...
journal_title:Genome research
pub_type: Journal article
doi:
update date:2000-01-01 00:00:00
abstract::The discovery of the genetic code was one of the most important advances of modern biology. But there is more to a DNA code than protein sequence; DNA carries signals for splicing, localization, folding, and regulation that are often embedded within the protein-coding sequence. In this issue, Itzkovitz and Alon show t...
journal_title:Genome research
pub_type: Comment, Journal article, Review
doi:10.1101/gr.6144007
update date:2007-04-01 00:00:00
abstract::Primordial dwarfism (PD) is a disease in which severely impaired fetal growth persists throughout postnatal development and results in stunted adult size. The condition is highly heterogeneous clinically, but the use of certain phenotypic aspects such as head circumference and facial appearance has proven helpful in d...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.160572.113
update date:2014-02-01 00:00:00
abstract::Genome-scale metabolic models promise important insights into cell function. However, the definition of pathways and functional network modules within these models, and in the biochemical literature in general, is often based on intuitive reasoning. Although mathematical methods have been proposed to identify modules,...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.5662207
update date:2007-04-01 00:00:00
abstract::In interphase eukaryotic cells, almost all heterochromatin is located adjacent to the nucleolus or to the nuclear lamina, thus defining nucleolus-associated domains (NADs) and lamina-associated domains (LADs), respectively. Here, we determined the first genome-scale map of murine NADs in mouse embryonic fibroblasts (M...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.247072.118
update date:2019-08-01 00:00:00
abstract::Current methods struggle to reconstruct and visualize the genomic relationships of large numbers of bacterial genomes. GrapeTree facilitates the analyses of large numbers of allelic profiles by a static "GrapeTree Layout" algorithm that supports interactive visualizations of large trees within a web browser window. Gr...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.232397.117
update date:2018-09-01 00:00:00
abstract::Transcript leaders (TLs) can have profound effects on mRNA translation and stability. To map TL boundaries genome-wide, we developed TL-sequencing (TL-seq), a technique combining enzymatic capture of m(7)G-capped mRNA 5' ends with high-throughput sequencing. TL-seq identified mRNA start sites for the majority of yeast...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.150342.112
update date:2013-06-01 00:00:00
abstract::Candida albicans is a commensal fungus of the human gastrointestinal tract and a prevalent opportunistic pathogen. To examine diversity within this species, extensive genomic and phenotypic analyses were performed on 21 clinical C. albicans isolates. Genomic variation was evident in the form of polymorphisms, copy num...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.174623.114
update date:2015-03-01 00:00:00
abstract::Advances in single-cell genomics enable commensurate improvements in methods for uncovering lineage relations among individual cells. Current sequencing-based methods for cell lineage analysis depend on low-resolution bulk analysis or rely on extensive single-cell sequencing, which is not scalable and could be biased ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.202903.115
update date:2016-11-01 00:00:00
abstract::Population genetics has evolved from a theory-driven field with little empirical data into a data-driven discipline in which genome-scale data sets test the limits of available models and computational analysis methods. In humans and a few model organisms, analyses of whole-genome sequence polymorphism data are curren...
journal_title:Genome research
pub_type: Journal article, Review
doi:10.1101/gr.079509.108
update date:2010-03-01 00:00:00
abstract::Mutational analysis of large genes with complex genomic structures plays an important role in medical genetics. Technical limitations associated with current mutation screening protocols have placed increased emphasis on the development of new technologies to simplify these procedures. High-density arrays of >90,000-o...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.8.12.1245
update date:1998-12-01 00:00:00