BLAT--the BLAST-like alignment tool.

Abstract:

Analyzing vertebrate genomes requires rapid mRNA/DNA and cross-species protein alignments. A new tool, BLAT, is more accurate and 500 times faster than popular existing tools for mRNA/DNA alignments and 50 times faster for protein alignments at sensitivity settings typically used when comparing vertebrate sequences. BLAT's speed stems from an index of all nonoverlapping K-mers in the genome. This index fits inside the RAM of inexpensive computers, and need only be computed once for each genome assembly. BLAT has several major stages. It uses the index to find regions in the genome likely to be homologous to the query sequence. It performs an alignment between homologous regions. It stitches together these aligned regions (often exons) into larger alignments (typically genes). Finally, BLAT revisits small internal exons possibly missed at the first stage and adjusts large gap boundaries that have canonical splice sites where feasible. This paper describes how BLAT was optimized. Effects on speed and sensitivity are explored for various K-mer sizes, mismatch schemes, and number of required index matches. BLAT is compared with other alignment programs on various test sets and then used in several genome-wide applications. http://genome.ucsc.edu hosts a web-based BLAT server for the human genome.
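The indexing idea in the abstract can be illustrated with a minimal Python sketch. It is not BLAT's implementation: the function names, the toy sequences, the K-mer size of 4, and the grouping of hits by diagonal are all assumptions made for illustration. The sketch indexes nonoverlapping K-mers of a toy "genome", looks up every overlapping K-mer of the query, and reports diagonals that collect at least a required number of index matches.

```python
from collections import defaultdict

def build_index(genome, k):
    """Map each nonoverlapping K-mer to the genome offsets where it starts."""
    index = defaultdict(list)
    for pos in range(0, len(genome) - k + 1, k):  # step by k: tiles do not overlap
        index[genome[pos:pos + k]].append(pos)
    return index

def find_hits(index, query, k):
    """Return (query_pos, genome_pos) for every query K-mer present in the index."""
    hits = []
    for qpos in range(len(query) - k + 1):  # step by 1: every overlapping K-mer
        for gpos in index.get(query[qpos:qpos + k], ()):
            hits.append((qpos, gpos))
    return hits

def candidate_diagonals(hits, min_hits):
    """Count hits per diagonal (genome_pos - query_pos); keep well-supported ones."""
    counts = defaultdict(int)
    for qpos, gpos in hits:
        counts[gpos - qpos] += 1
    return {diag: n for diag, n in counts.items() if n >= min_hits}

# Toy data (hypothetical): the query is an exact substring of the genome.
genome = "ACGTACGTTTGACCATGGCATTAGCCGATAGGCTA"
query = "CCATGGCATTAGCC"
index = build_index(genome, 4)
hits = find_hits(index, query, 4)
print(candidate_diagonals(hits, 2))
```

In this toy run the three matching tiles all fall on the diagonal at offset 12, which is where the query sits in the genome. A real search would then align and stitch such regions, as the later stages in the abstract describe.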

journal_name

Genome Res

journal_title

Genome research

authors

Kent WJ

doi

10.1101/gr.229202

subject

Has Abstract

pub_date

2002-04-01 00:00:00

pages

656-64

issue

4

eissn

1549-5469

issn

1088-9051

journal_volume

12

pub_type

Journal article
  • Independent evolution of transcript abundance and gene regulatory dynamics.

    abstract::Changes in gene expression drive novel phenotypes, raising interest in how gene expression evolves. In contrast to the static genome, cells modulate gene expression in response to changing environments. Previous comparative studies focused on specific conditions, describing interspecies variation in expression levels,...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.261537.120

    authors: Krieger G,Lupo O,Levy AA,Barkai N

    update_date:2020-07-01 00:00:00

  • rVista for comparative sequence-based discovery of functional transcription factor binding sites.

    abstract::Identifying transcriptional regulatory elements represents a significant challenge in annotating the genomes of higher vertebrates. We have developed a computational tool, rVista, for high-throughput discovery of cis-regulatory elements that combines clustering of predicted transcription factor binding sites (TFBSs) a...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.225502

    authors: Loots GG,Ovcharenko I,Pachter L,Dubchak I,Rubin EM

    update_date:2002-05-01 00:00:00

  • Profiling patterned transcripts in Drosophila embryos.

    abstract::Here we describe a high-throughput screen to isolate transcripts with spatially restricted patterns of expression in early embryos. Our approach utilizes robotic automation for rapid analysis of sequence-selected cDNAs in a whole-mount in situ hybridization assay. We determined the spatial distribution of a random col...

    journal_title:Genome research

    pub_type: letter

    doi:10.1101/gr.84402

    authors: Simin K,Scuderi A,Reamey J,Dunn D,Weiss R,Metherall JE,Letsou A

    update_date:2002-07-01 00:00:00

  • Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity.

    abstract::We find that the degree of impairment of protein function by missense variants is predictable by comparative sequence analysis alone. The applicable range of impairment is not confined to binary predictions that distinguish normal from deleterious variants, but extends continuously from mild to severe effects. The acc...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.3804205

    authors: Stone EA,Sidow A

    update_date:2005-07-01 00:00:00

  • New class of microRNA targets containing simultaneous 5'-UTR and 3'-UTR interaction sites.

    abstract::MicroRNAs (miRNAs) are known to post-transcriptionally regulate target mRNAs through the 3'-UTR, which interacts mainly with the 5'-end of miRNA in animals. Here we identify many endogenous motifs within human 5'-UTRs specific to the 3'-ends of miRNAs. The 3'-end of conserved miRNAs in particular has significant inter...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.089367.108

    authors: Lee I,Ajay SS,Yook JI,Kim HS,Hong SH,Kim NH,Dhanasekaran SM,Chinnaiyan AM,Athey BD

    update_date:2009-07-01 00:00:00

  • Methylation analysis of a marsupial X-linked CpG island by bisulfite genomic sequencing.

    abstract::Paternal X chromosome inactivation occurs in rodent extraembryonic membranes and in all tissues of marsupials. Methylation of CpG islands occurs on the inactive X in eutherians and is considered to be a stabilizing mechanism. The only previous study of a marsupial X-linked CpG island was of the G6PD gene of the Virgin...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.6.2.114

    authors: Loebel DA,Johnston PG

    update_date:1996-02-01 00:00:00

  • The marine bacterium Pseudoalteromonas haloplanktis has a complex genome structure composed of two separate genetic units.

    abstract::The genome size of Pseudoalteromonas haloplanktis, a ubiquitous and easily cultured marine bacterium, was measured as a step toward estimating the genome complexity of marine bacterioplankton. To determine total genome size, we digested P. haloplanktis DNA with the restriction endonucleases Notl and Sfil, separated th...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.6.12.1160

    authors: Lanoil BD,Ciuffetti LM,Giovannoni SJ

    update_date:1996-12-01 00:00:00

  • Gene-specific vulnerability to imprinting variability in human embryonic stem cell lines.

    abstract::Disregulation of imprinted genes can be associated with tumorigenesis and altered cell differentiation capacity and so could provide adverse outcomes for stem cell applications. Although the maintenance of mouse and primate embryonic stem cells in a pluripotent state has been reported to disrupt the monoallelic expres...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.6609207

    authors: Kim KP,Thurston A,Mummery C,Ward-van Oostwaard D,Priddle H,Allegrucci C,Denning C,Young L

    update_date:2007-12-01 00:00:00

  • Rate of elongation by RNA polymerase II is associated with specific gene features and epigenetic modifications.

    abstract::The rate of transcription elongation plays an important role in the timing of expression of full-length transcripts as well as in the regulation of alternative splicing. In this study, we coupled Bru-seq technology with 5,6-dichlorobenzimidazole 1-β-D-ribofuranoside (DRB) to estimate the elongation rates of over 2000 ...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.171405.113

    authors: Veloso A,Kirkconnell KS,Magnuson B,Biewen B,Paulsen MT,Wilson TE,Ljungman M

    update_date:2014-06-01 00:00:00

  • Comparative methylome analysis of benign and malignant peripheral nerve sheath tumors.

    abstract::Aberrant DNA methylation (DNAm) was first linked to cancer over 25 yr ago. Since then, many studies have associated hypermethylation of tumor suppressor genes and hypomethylation of oncogenes to the tumorigenic process. However, most of these studies have been limited to the analysis of promoters and CpG islands (CGIs...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.109678.110

    authors: Feber A,Wilson GA,Zhang L,Presneau N,Idowu B,Down TA,Rakyan VK,Noon LA,Lloyd AC,Stupka E,Schiza V,Teschendorff AE,Schroth GP,Flanagan A,Beck S

    update_date:2011-04-01 00:00:00

  • A-to-I RNA editing promotes developmental stage-specific gene and lncRNA expression.

    abstract::A-to-I RNA editing is a conserved widespread phenomenon in which adenosine (A) is converted to inosine (I) by adenosine deaminases (ADARs) in double-stranded RNA regions, mainly noncoding. Mutations in ADAR enzymes in Caenorhabditis elegans cause defects in normal development but are not lethal as in human and mouse. ...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.211169.116

    authors: Goldstein B,Agranat-Tamir L,Light D,Ben-Naim Zgayer O,Fishman A,Lamm AT

    update_date:2017-03-01 00:00:00

  • Gene loss and movement in the maize genome.

    abstract::Maize (Zea mays L. ssp. mays), one of the most important agricultural crops in the world, originated by hybridization of two closely related progenitors. To investigate the fate of its genes after tetraploidization, we analyzed the sequence of five duplicated regions from different chromosomal locations. We also compa...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.2701104

    authors: Lai J,Ma J,Swigonová Z,Ramakrishna W,Linton E,Llaca V,Tanyolac B,Park YJ,Jeong OY,Bennetzen JL,Messing J

    update_date:2004-10-01 00:00:00

  • Rescue of targeted regions of mammalian chromosomes by in vivo recombination in yeast.

    abstract::In contrast to other animal cell lines, the chicken pre-B cell lymphoma line, DT40, exhibits a high level of homologous recombination, which can be exploited to generate site-specific alterations in defined target genes or regions. In addition, the ability to generate human/chicken monochromosomal hybrids in the DT40 ...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.8.6.666

    authors: Kouprina N,Kawamoto K,Barrett JC,Larionov V,Koi M

    update_date:1998-06-01 00:00:00

  • Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes.

    abstract::Comparative genomics provides a general methodology for discovering functional DNA elements and understanding their evolution. The availability of many related genomes enables more powerful analyses, but requires rigorous phylogenetic methods to resolve orthologous genes and regions. Here, we use 12 recently sequenced...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.7105007

    authors: Rasmussen MD,Kellis M

    update_date:2007-12-01 00:00:00

  • Estimating coarse gene network structure from large-scale gene perturbation data.

    abstract::Large scale gene perturbation experiments generate information about the number of genes whose activity is directly or indirectly affected by a gene perturbation. From this information, one can numerically estimate coarse structural network features such as the total number of direct regulatory interactions and the nu...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.193902

    authors: Wagner A

    update_date:2002-02-01 00:00:00

  • A recombination hotspot leads to sequence variability within a novel gene (AK005651) and contributes to type 1 diabetes susceptibility.

    abstract::More than 25 loci have been linked to type 1 diabetes (T1D) in the nonobese diabetic (NOD) mouse, but identification of the underlying genes remains challenging. We describe here the positional cloning of a T1D susceptibility locus, Idd11, located on mouse chromosome 4. Sequence analysis of a series of congenic NOD mo...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.101881.109

    authors: Tan IK,Mackin L,Wang N,Papenfuss AT,Elso CM,Ashton MP,Quirk F,Phipson B,Bahlo M,Speed TP,Smyth GK,Morahan G,Brodnicki TC

    update_date:2010-12-01 00:00:00

  • Multimeric threading-based prediction of protein-protein interactions on a genomic scale: application to the Saccharomyces cerevisiae proteome.

    abstract::MULTIPROSPECTOR, a multimeric threading algorithm for the prediction of protein-protein interactions, is applied to the genome of Saccharomyces cerevisiae. Each possible pairwise interaction among more than 6000 encoded proteins is evaluated against a dimer database of 768 complex structures by using a confidence esti...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.1145203

    authors: Lu L,Arakaki AK,Lu H,Skolnick J

    update_date:2003-06-01 00:00:00

  • Inference and analysis of haplotypes from combined genotyping studies deposited in dbSNP.

    abstract::In the attempt to understand human variation and the genetic basis of complex disease, a tremendous number of single nucleotide polymorphisms (SNPs) have been discovered and deposited into NCBI's dbSNP public database. More than 2.7 million SNPs in the database have genotype information. This data provides an invaluab...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.4297805

    authors: Zaitlen NA,Kang HM,Feolo ML,Sherry ST,Halperin E,Eskin E

    update_date:2005-11-01 00:00:00

  • Judging the quality of gene expression-based clustering methods using gene annotation.

    abstract::We compare several commonly used expression-based gene clustering algorithms using a figure of merit based on the mutual information between cluster membership and known gene attributes. By studying various publicly available expression data sets we conclude that enrichment of clusters for biological function is, in g...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.397002

    authors: Gibbons FD,Roth FP

    update_date:2002-10-01 00:00:00

  • DNA profiling of B chromosomes from the yellow-necked mouse Apodemus flavicollis (Rodentia, Mammalia).

    abstract::Using AP-PCR-based DNA profiling we examined some structural features of B chromosomes from yellow-necked mice Apodemus flavicollis. Mice harboring one, two, or three or lacking B chromosomes were examined. Chromosomal structure was scanned for variant bands by using a series of arbitrary primers and from these, infor...

    journal_title:Genome research

    pub_type: journal article

    doi:

    authors: Tanic N,Dedovic N,Vujosevic M,Dimitrijevic B

    update_date:2000-01-01 00:00:00

  • Evolution and multilevel optimization of the genetic code.

    abstract::The discovery of the genetic code was one of the most important advances of modern biology. But there is more to a DNA code than protein sequence; DNA carries signals for splicing, localization, folding, and regulation that are often embedded within the protein-coding sequence. In this issue, Itzkovitz and Alon show t...

    journal_title:Genome research

    pub_type: comment, journal article, review

    doi:10.1101/gr.6144007

    authors: Bollenbach T,Vetsigian K,Kishony R

    update_date:2007-04-01 00:00:00

  • Genomic analysis of primordial dwarfism reveals novel disease genes.

    abstract::Primordial dwarfism (PD) is a disease in which severely impaired fetal growth persists throughout postnatal development and results in stunted adult size. The condition is highly heterogeneous clinically, but the use of certain phenotypic aspects such as head circumference and facial appearance has proven helpful in d...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.160572.113

    authors: Shaheen R,Faqeih E,Ansari S,Abdel-Salam G,Al-Hassnan ZN,Al-Shidi T,Alomar R,Sogaty S,Alkuraya FS

    update_date:2014-02-01 00:00:00

  • Evaluation of predicted network modules in yeast metabolism using NMR-based metabolite profiling.

    abstract::Genome-scale metabolic models promise important insights into cell function. However, the definition of pathways and functional network modules within these models, and in the biochemical literature in general, is often based on intuitive reasoning. Although mathematical methods have been proposed to identify modules,...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.5662207

    authors: Bundy JG,Papp B,Harmston R,Browne RA,Clayson EM,Burton N,Reece RJ,Oliver SG,Brindle KM

    update_date:2007-04-01 00:00:00

  • Two contrasting classes of nucleolus-associated domains in mouse fibroblast heterochromatin.

    abstract::In interphase eukaryotic cells, almost all heterochromatin is located adjacent to the nucleolus or to the nuclear lamina, thus defining nucleolus-associated domains (NADs) and lamina-associated domains (LADs), respectively. Here, we determined the first genome-scale map of murine NADs in mouse embryonic fibroblasts (M...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.247072.118

    authors: Vertii A,Ou J,Yu J,Yan A,Pagès H,Liu H,Zhu LJ,Kaufman PD

    update_date:2019-08-01 00:00:00

  • GrapeTree: visualization of core genomic relationships among 100,000 bacterial pathogens.

    abstract::Current methods struggle to reconstruct and visualize the genomic relationships of large numbers of bacterial genomes. GrapeTree facilitates the analyses of large numbers of allelic profiles by a static "GrapeTree Layout" algorithm that supports interactive visualizations of large trees within a web browser window. Gr...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.232397.117

    authors: Zhou Z,Alikhan NF,Sergeant MJ,Luhmann N,Vaz C,Francisco AP,Carriço JA,Achtman M

    update_date:2018-09-01 00:00:00

  • Roles for transcript leaders in translation and mRNA decay revealed by transcript leader sequencing.

    abstract::Transcript leaders (TLs) can have profound effects on mRNA translation and stability. To map TL boundaries genome-wide, we developed TL-sequencing (TL-seq), a technique combining enzymatic capture of m(7)G-capped mRNA 5' ends with high-throughput sequencing. TL-seq identified mRNA start sites for the majority of yeast...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.150342.112

    authors: Arribere JA,Gilbert WV

    update_date:2013-06-01 00:00:00

  • Genetic and phenotypic intra-species variation in Candida albicans.

    abstract::Candida albicans is a commensal fungus of the human gastrointestinal tract and a prevalent opportunistic pathogen. To examine diversity within this species, extensive genomic and phenotypic analyses were performed on 21 clinical C. albicans isolates. Genomic variation was evident in the form of polymorphisms, copy num...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.174623.114

    authors: Hirakawa MP,Martinez DA,Sakthikumar S,Anderson MZ,Berlin A,Gujja S,Zeng Q,Zisson E,Wang JM,Greenberg JM,Berman J,Bennett RJ,Cuomo CA

    update_date:2015-03-01 00:00:00

  • A generic, cost-effective, and scalable cell lineage analysis platform.

    abstract::Advances in single-cell genomics enable commensurate improvements in methods for uncovering lineage relations among individual cells. Current sequencing-based methods for cell lineage analysis depend on low-resolution bulk analysis or rely on extensive single-cell sequencing, which is not scalable and could be biased ...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.202903.115

    authors: Biezuner T,Spiro A,Raz O,Amir S,Milo L,Adar R,Chapal-Ilani N,Berman V,Fried Y,Ainbinder E,Cohen G,Barr HM,Halaban R,Shapiro E

    update_date:2016-11-01 00:00:00

  • Population genetic inference from genomic sequence variation.

    abstract::Population genetics has evolved from a theory-driven field with little empirical data into a data-driven discipline in which genome-scale data sets test the limits of available models and computational analysis methods. In humans and a few model organisms, analyses of whole-genome sequence polymorphism data are curren...

    journal_title:Genome research

    pub_type: journal article, review

    doi:10.1101/gr.079509.108

    authors: Pool JE,Hellmann I,Jensen JD,Nielsen R

    update_date:2010-03-01 00:00:00

  • Strategies for mutational analysis of the large multiexon ATM gene using high-density oligonucleotide arrays.

    abstract::Mutational analysis of large genes with complex genomic structures plays an important role in medical genetics. Technical limitations associated with current mutation screening protocols have placed increased emphasis on the development of new technologies to simplify these procedures. High-density arrays of >90,000-o...

    journal_title:Genome research

    pub_type: journal article

    doi:10.1101/gr.8.12.1245

    authors: Hacia JG,Sun B,Hunt N,Edgemon K,Mosbrook D,Robbins C,Fodor SP,Tagle DA,Collins FS

    update_date:1998-12-01 00:00:00