Preferred and avoided codon pairs in three domains of life.

Abstract:

BACKGROUND: Alternative synonymous codons are not used with equal frequencies. In addition, the contexts of codons (neighboring nucleotides and neighboring codons) can show characteristic patterns. Codon context can influence both translational accuracy and elongation rates. However, it is not known how strong or how conserved codon context preferences are across different organisms. We analyzed 138 organisms (bacteria, archaea and eukaryotes) to find conserved patterns of codon pairs.

RESULTS: After removing the effects of single codon usage and dipeptide biases, we discovered a set of neighboring codons for which avoidances or preferences were conserved in all three domains of life. Such biased codon pairs could be divided into subtypes on the basis of the nucleotide patterns that influence the bias. The most frequently avoided type of codon pair was nnUAnn. We discovered that 95.7% of avoided nnUAnn type patterns contain out-of-frame UAA or UAG triplets on the sense and/or antisense strand. On average, nnUAnn codon pairs are more frequently avoided in ORFeomes than in genomes. We therefore assume that translational selection plays a major role in the avoidance of these codon pairs. Among the preferred codon pairs, nnGCnn was the major type.

CONCLUSION: Translational selection shapes codon pair usage in protein coding sequences by rules that are common to all three domains of life. The most frequently avoided codon pairs contain the patterns nnUAnn, nnGGnn, nnGnnC, nnCGCn, GUCCnn, CUCCnn, nnCnnA or UUCGnn. The most frequently preferred codon pairs contain the patterns nnGCnn, nnCAnn or nnUnCn.
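The nnUAnn observation in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline; the function names are hypothetical, and it assumes that "out-of-frame" means the two triplet windows of a six-nucleotide codon pair that straddle the codon boundary (offsets 1 and 2), checked on the sense strand and on its reverse complement. It only enumerates which codon pairs could carry such a UAA or UAG triplet and says nothing about how often they are avoided in real genomes.

```python
from itertools import product

BASES = "ACGU"
COMPLEMENT = str.maketrans("ACGU", "UGCA")
STOP_TRIPLETS = {"UAA", "UAG"}  # the two triplets named in the abstract


def normalize(codon_pair):
    """Upper-case the six-nucleotide pair and write it in the RNA alphabet."""
    return codon_pair.upper().replace("T", "U")


def reverse_complement(rna):
    """Return the antisense strand, read 5' to 3'."""
    return rna.translate(COMPLEMENT)[::-1]


def out_of_frame_triplets(hexamer):
    """Assumed definition: the two windows that straddle the codon boundary."""
    return [hexamer[1:4], hexamer[2:5]]


def matches_nnUAnn(codon_pair):
    """True if the pair has U at position 3 and A at position 4."""
    seq = normalize(codon_pair)
    return len(seq) == 6 and seq[2:4] == "UA"


def has_out_of_frame_stop(codon_pair):
    """True if UAA or UAG occurs out of frame on the sense or antisense strand."""
    sense = normalize(codon_pair)
    antisense = reverse_complement(sense)
    return any(
        triplet in STOP_TRIPLETS
        for strand in (sense, antisense)
        for triplet in out_of_frame_triplets(strand)
    )


# Enumerate every possible nnUAnn codon pair and flag those with such a triplet.
all_pairs = ["".join(p) for p in product(BASES, repeat=6)]
nnuann_pairs = [p for p in all_pairs if matches_nnUAnn(p)]
pairs_with_stop = [p for p in nnuann_pairs if has_out_of_frame_stop(p)]

print(len(nnuann_pairs), len(pairs_with_stop))
```

Under this window definition, a sense-strand hit requires A or G at position 5 and an antisense hit requires U or C at position 2. The 95.7% figure in the abstract refers to the avoided nnUAnn pairs observed by the authors, not to this enumeration of all possible pairs.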

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Tats A,Tenson T,Remm M

doi

10.1186/1471-2164-9-463

pub_date

2008-10-08 00:00:00

pages

463

issn

1471-2164

pii

1471-2164-9-463

journal_volume

9

pub_type

Journal Article
  • MixClone: a mixture model for inferring tumor subclonal populations.

    abstract:BACKGROUND:Tumor genomes are often highly heterogeneous, consisting of genomes from multiple subclonal types. Complete characterization of all subclonal types is a fundamental need in tumor genome analysis. With the advancement of next-generation sequencing, computational methods have recently been developed to infer t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-16-S2-S1

    authors: Li Y,Xie X

    update_date:2015-01-01 00:00:00

  • Comparative profiling of the transcriptional response to infection in two species of Drosophila by short-read cDNA sequencing.

    abstract:BACKGROUND:Homology-based comparisons of the genes involved in innate immunity across many insect taxa with fully sequenced genomes have revealed a striking pattern of gene gain and loss, particularly among genes that encode proteins involved in clearing pathogens (effectors). However, limited functional annotation in n...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-259

    authors: Sackton TB,Clark AG

    update_date:2009-06-07 00:00:00

  • Nicotiana attenuata Data Hub (NaDH): an integrative platform for exploring genomic, transcriptomic and metabolomic data in wild tobacco.

    abstract:BACKGROUND:Nicotiana attenuata (coyote tobacco) is an ecological model for studying plant-environment interactions and plant gene function under real-world conditions. During the last decade, large amounts of genomic, transcriptomic and metabolomic data have been generated with this plant which has provided new insight...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3465-9

    authors: Brockmöller T,Ling Z,Li D,Gaquerel E,Baldwin IT,Xu S

    update_date:2017-01-13 00:00:00

  • Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples.

    abstract:BACKGROUND:The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite for the identificati...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-16

    authors: Mullen MP,Creevey CJ,Berry DP,McCabe MS,Magee DA,Howard DJ,Killeen AP,Park SD,McGettigan PA,Lucy MC,Machugh DE,Waters SM

    update_date:2012-01-11 00:00:00

  • Transferring knowledge of bacterial protein interaction networks to predict pathogen targeted human genes and immune signaling pathways: a case study on M. tuberculosis.

    abstract:BACKGROUND:Bacterial invasive infection and host immune response is fundamental to the understanding of pathogen pathogenesis and the discovery of effective therapeutic drugs. However, there are very few experimental studies on the signaling cross-talks between bacteria and human host to date. METHODS:In this work, ta...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4873-9

    authors: Mei S,Flemington EK,Zhang K

    update_date:2018-06-28 00:00:00

  • Comparative transcriptomic analysis unveils interactions between the regulatory CarS protein and light response in Fusarium.

    abstract:BACKGROUND:The orange pigmentation of the agar cultures of many Fusarium species is due to the production of carotenoids, terpenoid pigments whose synthesis is stimulated by light. The genes of the carotenoid pathway and their regulation have been investigated in detail in Fusarium fujikuroi. In this and other Fusarium...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5430-x

    authors: Ruger-Herreros M,Parra-Rivero O,Pardo-Medina J,Romero-Campero FJ,Limón MC,Avalos J

    update_date:2019-01-21 00:00:00

  • Genomic comparison of serogroups O159 and O170 with other Vibrio cholerae serogroups.

    abstract:BACKGROUND:Of the hundreds of Vibrio cholerae serogroups, O1 and O139 are the main epidemic-causing ones. Although non-O1/non-O139 serogroups rarely cause epidemics, the possibility exists for strains within them to have pathogenic potential. RESULTS:We selected 25 representative strains within 16 V. cholerae serogrou...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5603-7

    authors: Li Z,Lu X,Wang D,Liang WL,Zhang J,Li J,Xu J,Pang B,Kan B

    update_date:2019-03-25 00:00:00

  • Characterization of SR3 reveals abundance of non-LTR retrotransposons of the RTE clade in the genome of the human blood fluke, Schistosoma mansoni.

    abstract:BACKGROUND:It is becoming apparent that perhaps as much as half of the genome of the human blood fluke Schistosoma mansoni is constituted of mobile genetic element-related sequences. Non-long terminal repeat (LTR) retrotransposons, related to the LINE elements of mammals, comprise much of this repetitive component of t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-6-154

    authors: Laha T,Kewgrai N,Loukas A,Brindley PJ

    update_date:2005-11-04 00:00:00

  • Expression, tandem repeat copy number variation and stability of four macrosatellite arrays in the human genome.

    abstract:BACKGROUND:Macrosatellites are some of the largest variable number tandem repeats in the human genome, but what role these unusual sequences perform is unknown. Their importance to human health is clearly demonstrated by the 4q35 macrosatellite D4Z4 that is associated with the onset of the muscle degenerative disease f...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-632

    authors: Tremblay DC,Alexander G Jr,Moseley S,Chadwick BP

    update_date:2010-11-15 00:00:00

  • Profiling expression changes caused by a segmental aneuploid in maize.

    abstract:BACKGROUND:While changes in chromosome number that result in aneuploidy are associated with phenotypic consequences such as Down syndrome and cancer, the molecular causes of specific phenotypes and genome-wide expression changes that occur in aneuploids are still being elucidated. RESULTS:We employed a segmental aneup...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-7

    authors: Makarevitch I,Phillips RL,Springer NM

    update_date:2008-01-10 00:00:00

  • Analysis of intra-genomic GC content homogeneity within prokaryotes.

    abstract:BACKGROUND:Bacterial genomes possess varying GC content (total guanines (Gs) and cytosines (Cs) per total of the four bases within the genome) but within a given genome, GC content can vary locally along the chromosome, with some regions significantly more or less GC rich than on average. We have examined how the GC co...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-464

    authors: Bohlin J,Snipen L,Hardy SP,Kristoffersen AB,Lagesen K,Dønsvik T,Skjerve E,Ussery DW

    update_date:2010-08-06 00:00:00

  • Sequencing and characterization of the guppy (Poecilia reticulata) transcriptome.

    abstract:BACKGROUND:Next-generation sequencing is providing researchers with a relatively fast and affordable option for developing genomic resources for organisms that are not among the traditional genetic models. Here we present a de novo assembly of the guppy (Poecilia reticulata) transcriptome using 454 sequence reads, and ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-202

    authors: Fraser BA,Weadick CJ,Janowitz I,Rodd FH,Hughes KA

    update_date:2011-04-20 00:00:00

  • Genome-wide association and transcriptional studies reveal novel genes for unsaturated fatty acid synthesis in a panel of soybean accessions.

    abstract:BACKGROUND:The nutritional value of soybean oil is largely influenced by the proportions of unsaturated fatty acids (FAs), including oleic acid (OA, 18:1), linoleic acid (LLA, 18:2), and linolenic acid (LNA, 18:3). Genome-wide association (GWAS) studies along with gene expression studies in soybean [Glycine max (L.) Me...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5449-z

    authors: Zhao X,Jiang H,Feng L,Qu Y,Teng W,Qiu L,Zheng H,Han Y,Li W

    update_date:2019-01-21 00:00:00

  • Genome-wide metabolic (re-) annotation of Kluyveromyces lactis.

    abstract:BACKGROUND:Even before having its genome sequence published in 2004, Kluyveromyces lactis had long been considered a model organism for studies in genetics and physiology. Research on Kluyveromyces lactis is quite advanced and this yeast species is one of the few with which it is possible to perform formal genetic anal...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-517

    authors: Dias O,Gombert AK,Ferreira EC,Rocha I

    update_date:2012-10-01 00:00:00

  • Outlier analysis of functional genomic profiles enriches for oncology targets and enables precision medicine.

    abstract:BACKGROUND:Genome-scale functional genomic screens across large cell line panels provide a rich resource for discovering tumor vulnerabilities that can lead to the next generation of targeted therapies. Their data analysis typically has focused on identifying genes whose knockdown enhances response in various pre-defin...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2807-y

    authors: Zhu Z,Ihle NT,Rejto PA,Zarrinkar PP

    update_date:2016-06-13 00:00:00

  • Detecting fitness epistasis in recently admixed populations with genome-wide data.

    abstract:BACKGROUND:Fitness epistasis, the interaction effect of genes at different loci on fitness, makes an important contribution to adaptive evolution. Although fitness interaction evidence has been observed in model organisms, it is more difficult to detect and remains poorly understood in human populations as a result of ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-06874-7

    authors: Ni X,Zhou M,Wang H,He KY,Broeckel U,Hanis C,Kardia S,Redline S,Cooper RS,Tang H,Zhu X

    update_date:2020-07-11 00:00:00

  • Identification, characterization, and utilization of genome-wide simple sequence repeats to identify a QTL for acidity in apple.

    abstract:BACKGROUND:Apple is an economically important fruit crop worldwide. Developing a genetic linkage map is a critical step towards mapping and cloning of genes responsible for important horticultural traits in apple. To facilitate linkage map construction, we surveyed and characterized the distribution and frequency of pe...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-537

    authors: Zhang Q,Ma B,Li H,Chang Y,Han Y,Li J,Wei G,Zhao S,Khan MA,Zhou Y,Gu C,Zhang X,Han Z,Korban SS,Li S,Han Y

    update_date:2012-10-07 00:00:00

  • Identification of functional regulatory elements in the human genome using pooled CRISPR screens.

    abstract:BACKGROUND:Genome-scale pooled CRISPR screens are powerful tools for identifying genetic dependencies across varied cellular processes. The vast majority of CRISPR screens reported to date have focused exclusively on the perturbation of protein-coding gene function. However, protein-coding genes comprise < 2% of the se...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6497-0

    authors: Borys SM,Younger ST

    update_date:2020-01-31 00:00:00

  • Characterization of a novel chicken muscle disorder through differential gene expression and pathway analysis using RNA-sequencing.

    abstract:BACKGROUND:Improvements in poultry production within the past 50 years have led to increased muscle yield and growth rate, which may be contributing to an increased rate and development of new muscle disorders in chickens. Previously reported muscle disorders and conditions are generally associated with poor meat quali...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1623-0

    authors: Mutryn MF,Brannick EM,Fu W,Lee WR,Abasht B

    update_date:2015-05-21 00:00:00

  • Liver proteome response of pre-harvest Atlantic salmon following exposure to elevated temperature.

    abstract:BACKGROUND:Atlantic salmon production in Tasmania (Southern Australia) occurs near the upper limits of the species thermal tolerance. Summer water temperatures can average over 19 °C over several weeks and have negative effects on performance and health. Liver tissue exerts important metabolic functions in thermal adap...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4517-0

    authors: Nuez-Ortín WG,Carter CG,Nichols PD,Cooke IR,Wilson R

    update_date:2018-02-12 00:00:00

  • Mycoplasma non-coding RNA: identification of small RNAs and targets.

    abstract:BACKGROUND:Bacterial non-coding RNAs act by base-pairing as regulatory elements in crucial biological processes. We performed the identification of trans-encoded small RNAs (sRNA) from the genomes of Mycoplasma hyopneumoniae, Mycoplasma flocculare and Mycoplasma hyorhinis, which are Mycoplasma species that have been ide...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3061-z

    authors: Siqueira FM,de Morais GL,Higashi S,Beier LS,Breyer GM,de Sá Godinho CP,Sagot MF,Schrank IS,Zaha A,de Vasconcelos AT

    update_date:2016-10-25 00:00:00

  • Single feature polymorphism (SFP)-based selective sweep identification and association mapping of growth-related metabolic traits in Arabidopsis thaliana.

    abstract:BACKGROUND:Natural accessions of Arabidopsis thaliana are characterized by a high level of phenotypic variation that can be used to investigate the extent and mode of selection on the primary metabolic traits. A collection of 54 A. thaliana natural accession-derived lines were subjected to deep genotyping through Singl...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-188

    authors: Childs LH,Witucka-Wall H,Günther T,Sulpice R,Korff MV,Stitt M,Walther D,Schmid KJ,Altmann T

    update_date:2010-03-20 00:00:00

  • Transcriptome analysis of a respiratory Saccharomyces cerevisiae strain suggests the expression of its phenotype is glucose insensitive and predominantly controlled by Hap4, Cat8 and Mig1.

    abstract:BACKGROUND:We previously described the first respiratory Saccharomyces cerevisiae strain, KOY.TM6*P, by integrating the gene encoding a chimeric hexose transporter, Tm6*, into the genome of an hxt null yeast. Subsequently we transferred this respiratory phenotype in the presence of up to 50 g/L glucose to a yeast strai...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-365

    authors: Bonander N,Ferndahl C,Mostad P,Wilks MD,Chang C,Showe L,Gustafsson L,Larsson C,Bill RM

    update_date:2008-07-31 00:00:00

  • Gene expression changes during caste-specific neuronal development in the damp-wood termite Hodotermopsis sjostedti.

    abstract:BACKGROUND:One of the key characters of social insects is the division of labor, in which different tasks are allocated to various castes. In termites, one of the representative groups of social insects, morphological differences as well as behavioral differences can be recognized among castes. However, very little is ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-314

    authors: Ishikawa Y,Okada Y,Ishikawa A,Miyakawa H,Koshikawa S,Miura T

    update_date:2010-05-20 00:00:00

  • Comparative transcriptomics of early petal development across four diverse species of Aquilegia reveal few genes consistently associated with nectar spur development.

    abstract:BACKGROUND:Petal nectar spurs, which facilitate pollination through animal attraction and pollen placement, represent a key innovation promoting diversification in the genus Aquilegia (Ranunculaceae). Identifying the genetic components that contribute to the development of these three-dimensional structures will inform...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6002-9

    authors: Ballerini ES,Kramer EM,Hodges SA

    update_date:2019-08-22 00:00:00

  • Tracking the best reference genes for RT-qPCR data normalization in filamentous fungi.

    abstract:BACKGROUND:A critical step in the RT-qPCR workflow for studying gene expression is data normalization, one of the strategies being the use of reference genes. This study aimed to identify and validate a selection of reference genes for relative quantification in Talaromyces versatilis, a relevant industrial filamentous...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1224-y

    authors: Llanos A,François JM,Parrou JL

    update_date:2015-02-14 00:00:00

  • Applicability of DNA pools on 500 K SNP microarrays for cost-effective initial screens in genomewide association studies.

    abstract:BACKGROUND:Genetic influences underpinning complex traits are thought to involve multiple quantitative trait loci (QTLs) of small effect size. Detection of such QTL associations requires systematic screening of large numbers of DNA markers within large sample populations. Using pooled DNA on SNP microarrays to screen f...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-214

    authors: Docherty SJ,Butcher LM,Schalkwyk LC,Plomin R

    update_date:2007-07-04 00:00:00

  • A genome-wide analysis of the phospholipid: diacylglycerol acyltransferase gene family in Gossypium.

    abstract:BACKGROUND:Cotton (Gossypium spp.) is the most important natural fiber crop worldwide, and cottonseed oil is its most important byproduct. Phospholipid: diacylglycerol acyltransferase (PDAT) is important in TAG biosynthesis, as it catalyzes the transfer of a fatty acyl moiety from the sn-2 position of a phospholipid to...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5728-8

    authors: Zang X,Geng X,Ma L,Wang N,Pei W,Wu M,Zhang J,Yu J

    update_date:2019-05-22 00:00:00

  • HumCFS: a database of fragile sites in human chromosomes.

    abstract:BACKGROUND:Fragile sites are the chromosomal regions that are susceptible to breakage, and their frequency varies among the human population. Based on the frequency of fragile site induction, they are categorized as common and rare fragile sites. Common fragile sites are sensitive to replication stress and often rearra...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5330-5

    authors: Kumar R,Nagpal G,Kumar V,Usmani SS,Agrawal P,Raghava GPS

    update_date:2019-04-18 00:00:00

  • Whole exome sequencing (WES) on formalin-fixed, paraffin-embedded (FFPE) tumor tissue in gastrointestinal stromal tumors (GIST).

    abstract:BACKGROUND:Next generation sequencing (NGS) technology has been rapidly introduced into basic and translational research in oncology, but the reduced availability of fresh frozen (FF) tumor tissues and the poor quality of DNA extracted from formalin-fixed, paraffin-embedded (FFPE) has significantly impaired this proces...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1982-6

    authors: Astolfi A,Urbini M,Indio V,Nannini M,Genovese CG,Santini D,Saponara M,Mandrioli A,Ercolani G,Brandi G,Biasco G,Pantaleo MA

    update_date:2015-11-03 00:00:00