Abstract:
BACKGROUND:Advances in human genomics have allowed unprecedented productivity in terms of algorithms, software, and literature available for translating raw next-generation sequence data into high-quality information. The challenges of variant identification in organisms with lower quality reference genomes are less well documented. We explored the consequences of commonly recommended preparatory steps and the effects of single and multi sample variant identification methods using four publicly available software applications (Platypus, HaplotypeCaller, Samtools and UnifiedGenotyper) on whole genome sequence data of 65 key ancestors of Swiss dairy cattle populations. Accuracy of calling next-generation sequence variants was assessed by comparison to the same loci from medium and high-density single nucleotide variant (SNV) arrays. RESULTS:The total number of SNVs identified varied by software and method, with single (multi) sample results ranging from 17.7 to 22.0 (16.9 to 22.0) million variants. Computing time varied considerably between software. Preparatory realignment of insertions and deletions and subsequent base quality score recalibration had only minor effects on the number and quality of SNVs identified by different software, but increased computing time considerably. Average concordance for single (multi) sample results with high-density chip data was 58.3% (87.0%) and average genotype concordance in correctly identified SNVs was 99.2% (99.2%) across software. The average quality of SNVs identified, measured as the ratio of transitions to transversions, was higher using single sample methods than multi sample methods. A consensus approach using results of different software generally provided the highest variant quality in terms of transition/transversion ratio. CONCLUSIONS:Our findings serve as a reference for variant identification pipeline development in non-human organisms and help assess the implication of preparatory steps in next-generation sequencing pipelines for organisms with incomplete reference genomes (pipeline code is included). Benchmarking this information should prove particularly useful in processing next-generation sequencing data for use in genome-wide association studies and genomic selection.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Baes CF,Dolezal MA,Koltes JE,Bapst B,Fritz-Waters E,Jansen S,Flury C,Signer-Hasler H,Stricker C,Fernando R,Fries R,Moll J,Garrick DJ,Reecy JM,Gredler Bdoi
10.1186/1471-2164-15-948subject
Has Abstractpub_date
2014-11-01 00:00:00pages
948issn
1471-2164pii
1471-2164-15-948journal_volume
15pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e.,...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-16-S13-S5
更新日期:2015-01-01 00:00:00
abstract:BACKGROUND:Sugarcane (Saccharum spp.) has become an increasingly important crop for its leading role in biofuel production. The high sugar content species S. officinarum is an octoploid without known diploid or tetraploid progenitors. Commercial sugarcane cultivars are hybrids between S. officinarum and wild species S....
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-261
更新日期:2010-04-23 00:00:00
abstract:BACKGROUND:Although hatching is perhaps the most abrupt and profound metabolic challenge that a chicken must undergo; there have been no attempts to functionally map the metabolic pathways induced in liver during the embryo-to-hatchling transition. Furthermore, we know very little about the metabolic and regulatory fac...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5080-4
更新日期:2018-09-21 00:00:00
abstract:BACKGROUND:Transcriptomics in non-model plant systems has recently reached a point where the examination of nuclear genome-wide patterns in understudied groups is an achievable reality. This progress is especially notable in evolutionary studies of ferns, for which molecular resources to date have been derived primaril...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3034-2
更新日期:2016-08-30 00:00:00
abstract:BACKGROUND:Progress in the fields of protein separation and identification technologies has accelerated research into biofluids proteomics for protein biomarker discovery. Urine has become an ideal and rich source of biomarkers in clinical proteomics. Here we performed a proteomic analysis of urine samples from pregnan...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-777
更新日期:2013-11-11 00:00:00
abstract:BACKGROUND:Because of the advent of high-throughput sequencing and the consequent reduction in the cost of sequencing, many organisms have been completely sequenced and most of their genes identified. It thus has become possible to represent whole genomes as ordered lists of gene identifiers and to study the rearrangem...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-S2-S15
更新日期:2008-09-16 00:00:00
abstract:BACKGROUND:Recent studies have demonstrated an unexpected complexity of transcription in eukaryotes. The majority of the genome is transcribed and only a little fraction of these transcripts is annotated as protein coding genes and their splice variants. Indeed, most transcripts are the result of antisense, overlapping...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-855
更新日期:2013-12-05 00:00:00
abstract:BACKGROUND:Malaria, caused by the parasitic protist Plasmodium falciparum, represents a major public health problem in the developing world. The P. falciparum genome has been sequenced, which provides new opportunities for the identification of novel drug targets. Eukaryotic protein kinases (ePKs) form a large family o...
journal_title:BMC genomics
pub_type: 杂志文章,评审
doi:10.1186/1471-2164-5-79
更新日期:2004-10-12 00:00:00
abstract:BACKGROUND:One of the key characters of social insects is the division of labor, in which different tasks are allocated to various castes. In termites, one of the representative groups of social insects, morphological differences as well as behavioral differences can be recognized among castes. However, very little is ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-314
更新日期:2010-05-20 00:00:00
abstract:BACKGROUND:Anthocyanins are water-soluble colored flavonoids present in multiple organs of various plant species including flowers, fruits, leaves, stems and roots. DNA-binding R2R3-MYB transcription factors, basic helix-loop-helix (bHLH) transcription factors, and WD40 repeat proteins are known to form MYB-bHLH-WD rep...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5135-6
更新日期:2018-11-08 00:00:00
abstract:BACKGROUND:Brucella abortus is an intracellular zoonotic pathogen which causes undulant fever, endocarditis, arthritis and osteomyelitis in human and abortion and infertility in cattle. This bacterium is able to invade and replicate in host macrophage instead of getting removed by this defense mechanism. Therefore, und...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-426
更新日期:2013-06-27 00:00:00
abstract:INTRODUCTION:In the classification of Mass Spectrometry (MS) proteomics data, peak detection, feature selection, and learning classifiers are critical to classification accuracy. To better understand which methods are more accurate when classifying data, some publicly available peak detection algorithms for Matrix assi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-S1-S3
更新日期:2009-07-07 00:00:00
abstract:BACKGROUND:Host-specific environmental factors induce changes in Bacillus anthracis gene transcription during infection. A global transcription regulator, CodY, plays a pivotal role in regulating central metabolism, biosynthesis, and virulence in B. anthracis. In this study, we utilized RNA-sequencing to assess changes...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3004-8
更新日期:2016-08-17 00:00:00
abstract:BACKGROUND:Pitayas are currently attracting considerable interest as a tropical fruit with numerous health benefits. However, as a long-day plant, pitaya plants cannot flower in the winter season from November to April in Hainan, China. To harvest pitayas with high economic value in the winter season, it is necessary t...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-6726-6
更新日期:2020-04-29 00:00:00
abstract:BACKGROUND:Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearra...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-561
更新日期:2014-07-04 00:00:00
abstract:BACKGROUND:Cotton is an important fiber crop but has serious heterosis effects, and cytoplasmic male sterility (CMS) is the major cause of heterosis in plants. However, to the best of our knowledge, no studies have investigated CMS Yamian A in cotton with the genetic background of Australian wild Gossypium bickii. Conj...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-07261-y
更新日期:2020-12-02 00:00:00
abstract:BACKGROUND:The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sen...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-37
更新日期:2014-01-18 00:00:00
abstract:BACKGROUND:The genomic architecture of adaptive traits remains poorly understood in non-model plants. Various approaches can be used to bridge this gap, including the mapping of quantitative trait loci (QTL) in pedigrees, and genetic association studies in non-structured populations. Here we present results on the geno...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-368
更新日期:2013-06-01 00:00:00
abstract:BACKGROUND:Lipopolysaccharide (LPS) endotoxin stimulates pro-inflammatory pathways and is a key player in the pathological mechanisms involved in the development of endometritis. This study aimed to investigate LPS-induced DNA methylation changes in bovine endometrial epithelial cells (bEECs), which may affect endometr...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-06777-7
更新日期:2020-06-03 00:00:00
abstract:BACKGROUND:Oxidative stress caused by ground level ozone is a contributor to yield loss in a number of important crop plants. Soybean (Glycine max) is considered to be ozone sensitive, and current research into its response to oxidative stress is limited. To better understand the genetic response in soybean to oxidativ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1637-7
更新日期:2015-06-04 00:00:00
abstract:BACKGROUND:The obligate intracellular parasite Toxoplasma gondii establishes a life-long chronic infection within any warm-blooded host. After ingestion of an encysted parasite, T. gondii disseminates throughout the body as a rapidly replicating form during acute infection. Over time and after stimulation of the host i...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-806
更新日期:2014-09-20 00:00:00
abstract:BACKGROUND:The most critical step in the pearl formation during aquaculture is issued to the proliferation and differentiation of outer epithelial cells of mantle graft into pearl sac. This pearl sac secretes various matrix proteins to produce pearls by a complex physiological process which has not been well-understood...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-5579-3
更新日期:2019-03-25 00:00:00
abstract:BACKGROUND:Genome-wide disease-gene finding approaches may sometimes provide us with a long list of candidate genes. Since using pure experimental approaches to verify all candidates could be expensive, a number of network-based methods have been developed to prioritize candidates. Such tools usually have a set of para...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-S3-S25
更新日期:2011-11-30 00:00:00
abstract:BACKGROUND:Due to the predominant usage of short-read sequencing to date, most bacterial genome sequences reported in the last years remain at the draft level. This precludes certain types of analyses, such as the in-depth analysis of genome plasticity. RESULTS:Here we report the finalized genome sequence of the envir...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4301-6
更新日期:2018-01-05 00:00:00
abstract:BACKGROUND:Toxin-antitoxin (TA) systems, abundant in prokaryotes, are composed of a toxin gene and its cognate antitoxin. Several toxins are implied to affect the physiological state and stress tolerance of bacteria in a population. We previously identified a chromosomally encoded hok-sok type I TA system in Erwinia am...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-021-07376-w
更新日期:2021-01-22 00:00:00
abstract:BACKGROUND:Moraxella catarrhalis is an important pathogen that often causes otitis media in children, a disease that is not currently vaccine preventable. Asymptomatic colonisation of the human upper respiratory tract is common and lack of clearance by the immune system is likely due to the emergence of seroresistant g...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-2104-1
更新日期:2015-10-24 00:00:00
abstract:BACKGROUND:Sensory hair cells are exquisitely sensitive to mechanical stimuli and as such, are prone to damage and apoptosis during dissections or in vitro manipulations. Thiouracil (TU)-tagging is a noninvasive method to label cell type-specific transcripts in an intact organism, thereby meeting the challenge of how t...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-2072-5
更新日期:2015-10-23 00:00:00
abstract:BACKGROUND:Quantitative trait locus (QTL) studies show that variation in salinity tolerance in Arctic charr and rainbow trout has a genetic basis, even though both these species have low to moderate salinity tolerance capacities. QTL were observed to localize to homologous linkage group segments within putative chromos...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-420
更新日期:2012-08-24 00:00:00
abstract:BACKGROUND:A high caloric diet, in conjunction with low levels of physical activity, promotes obesity. Many studies are available regarding the relation between dietary saturated fats and the etiology of obesity, but most focus on liver, muscle and white adipose tissue. Furthermore, the majority of transcriptomic studi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2424-9
更新日期:2016-02-09 00:00:00
abstract:BACKGROUND:The transition from exponential to stationary phase in Streptomyces coelicolor is accompanied by a major metabolic switch and results in a strong activation of secondary metabolism. Here we have explored the underlying reorganization of the metabolome by combining computational predictions based on constrain...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-202
更新日期:2010-03-26 00:00:00