Abstract:
Background: Conserved nucleic acid sequences play an essential role in transcriptional regulation. Motifs (templates) derived from nucleic acid sequence datasets are commonly used as biomarkers to predict biochemical properties such as protein binding sites, or to identify specific non-coding RNAs. In many cases, template-based nucleic acid sequence classification performs better than feature extraction methods such as N-gram and k-spaced pairs classification. The availability of large-scale experimental data provides an unprecedented opportunity to improve motif extraction methods, and extracting patterns from such data is crucial for building predictive models.
Methods: This article proposes CFSP, a Teiresias-like feature extraction algorithm that discovers frequent sub-sequences. Some motif discovery algorithms allow gaps, but they limit the length and number of gaps. The proposed algorithm can find frequent sequence pairs separated by a larger gap. Combinations of frequent sub-sequences within long sequences capture long-distance correlations, which imply a specific molecular biological property, so the algorithm is designed to discover these combinations. An ordered set of frequent sub-sequences derived from nucleic acid sequences serves as a base frequent sub-sequence array. Mutation information is attached to each sub-sequence array to implement fuzzy matching: each mutation record stores a single nucleotide variant or a nucleotide insertion/deletion (indel), encoding a slight difference between a frequent sub-sequence and the matched subsequence of the sequence under investigation.
Conclusions: The proposed algorithm has been validated in several nucleic acid sequence prediction case studies. On experimental data sets such as miRNA, piRNA, and Sigma 54 promoters, it gives better results than recently available methods based on feature descriptors.
CFSP is implemented in C++ and shell script; the source code and related data are available at https://github.com/HePeng2016/CFSP.
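The ideas in the Methods paragraph can be illustrated with a minimal Python sketch. This is not the CFSP implementation (which is in C++) and makes several simplifying assumptions: sub-sequences have a fixed length k, "frequent" means present in at least min_support sequences, and fuzzy matching tolerates only one substitution (indels are not handled). The function names frequent_kmers, frequent_pairs, and fuzzy_find are hypothetical and chosen only for this illustration.

```python
from collections import Counter


def frequent_kmers(seqs, k, min_support):
    """Return the k-mers that occur in at least min_support sequences."""
    support = Counter()
    for s in seqs:
        # Use a set so each sequence counts a k-mer at most once.
        support.update({s[i:i + k] for i in range(len(s) - k + 1)})
    return {kmer for kmer, n in support.items() if n >= min_support}


def frequent_pairs(seqs, k, min_support):
    """Return ordered pairs (a, b) of frequent k-mers, with a ending before
    b starts (any gap length), found in at least min_support sequences."""
    kmers = frequent_kmers(seqs, k, min_support)
    support = Counter()
    for s in seqs:
        hits = [(i, s[i:i + k]) for i in range(len(s) - k + 1)
                if s[i:i + k] in kmers]
        # Non-overlapping occurrences only: b must start at or after i + k.
        pairs = {(a, b) for i, a in hits for j, b in hits if j >= i + k}
        support.update(pairs)
    return {p: n for p, n in support.items() if n >= min_support}


def fuzzy_find(pattern, seq):
    """Find the first window of seq that matches pattern with at most one
    substitution. Returns (position, mutation), where mutation is None for
    an exact match or (offset, expected, observed); returns None if no
    window qualifies."""
    k = len(pattern)
    for i in range(len(seq) - k + 1):
        diffs = [(o, p, c)
                 for o, (p, c) in enumerate(zip(pattern, seq[i:i + k]))
                 if p != c]
        if len(diffs) <= 1:
            return i, (diffs[0] if diffs else None)
    return None


if __name__ == "__main__":
    seqs = ["ACGTTTTTGGCA", "ACGAAGGCT", "TTACGCGGC"]
    print(frequent_pairs(seqs, k=3, min_support=3))
    print(fuzzy_find("ACG", "TTAGGTT"))
```

In this toy input the pair ("ACG", "GGC") is found in all three sequences even though the gap between the two sub-sequences differs in each, which is the kind of long-distance combination the abstract describes. The mutation tuple returned by fuzzy_find plays the role of a mutation record for a single nucleotide variant.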
journal_name: PeerJ
journal_title: PeerJ
authors: Peng H
doi: 10.7717/peerj.8965
subject: Has Abstract
pub_date: 2020-04-20 00:00:00
pages: e8965
issn: 2167-8359
pii: 8965
journal_volume: 8
pub_type: journal article