Abstract:
:Many data sets exhibit well-defined structure that can be exploited to design faster search tools, but it is not always clear when such acceleration is possible. Here we introduce a framework for similarity search based on characterizing a data set's entropy and fractal dimension. We prove that searching scales in time with metric entropy (number of covering hyperspheres), if the fractal dimension of the data set is low, and scales in space with the sum of metric entropy and information-theoretic entropy (randomness of the data). Using these ideas, we present accelerated versions of standard tools, with no loss in specificity and little loss in sensitivity, for use in three domains-high-throughput drug screening (Ammolite, 150x speedup), metagenomics (MICA, 3.5x speedup of DIAMOND (3700x BLASTX)), and protein structure search (esFragBag, 10x speedup of FragBag). Our framework can be used to achieve 'compressive omics,' and the general theory can be readily applied to data science problems outside of biology. Source code: http://gems.csail.mit.edu.
journal_name
Cell Systjournal_title
Cell systemsauthors
Yu YW,Daniels NM,Danko DC,Berger Bdoi
10.1016/j.cels.2015.08.004subject
Has Abstractpub_date
2015-08-26 00:00:00pages
130-140issue
2eissn
2405-4712issn
2405-4720pii
S2405-4712(15)00058-7journal_volume
1pub_type
杂志文章相关文献
Cell Systems文献大全abstract::Although molecular mechanisms that prompt cell-cycle arrest in response to DNA damage have been elucidated, the systems-level properties of DNA damage checkpoints are not understood. Here, using time-lapse microscopy and simulations that model the cell cycle as a series of Poisson processes, we characterize DNA damage...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2017.09.015
更新日期:2017-11-22 00:00:00
abstract::We combine a genome-scale RNAi screen in mouse epiblast stem cells (EpiSCs) with genetic interaction, protein localization, and "protein-level dependency" studies-a systematic technique that uncovers post-transcriptional regulation-to delineate the network of factors that control the expression of Oct4, a key regulato...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2015.08.002
更新日期:2015-08-26 00:00:00
abstract::Understanding the complex interactions that occur between heterologous and native biochemical pathways represents a major challenge in metabolic engineering and synthetic biology. We present a workflow that integrates metabolomics, proteomics, and genome-scale models of Escherichia coli metabolism to study the effects...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2016.04.004
更新日期:2016-05-25 00:00:00
abstract::Establishing a quantitative connection between the population growth rate and the generation times of single cells is a prerequisite for understanding evolutionary dynamics of microbes. However, existing theories fail to account for the experimentally observed correlations between mother-daughter generation times that...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2017.08.015
更新日期:2017-10-25 00:00:00
abstract::Isogenic cells in a common environment show substantial cell-to-cell variation in gene expression, often referred to as "expression noise." Here, we use multiple single-cell RNA-sequencing datasets to identify features associated with high or low expression noise in mouse embryonic stem cells. These include the core p...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2017.10.003
更新日期:2017-11-22 00:00:00
abstract::It is known that the calcium-dependent transcription factor NFAT initiates transcription in response to pulsatile loads of calcium signal. However, the relative contributions of calcium oscillation frequency, amplitude, and duty cycle to transcriptional activity remain unclear. Here, we engineer HeLa cells to permit o...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2016.03.010
更新日期:2016-04-27 00:00:00
abstract::The human interactome is instrumental in the systems-level study of the cell and the contextualization of disease-associated gene perturbations. However, reference organismal interactomes do not capture the cell-type-specific context in which proteins and modules preferentially act. Here, we introduce SCINET, a comput...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2019.10.007
更新日期:2019-12-18 00:00:00
abstract::The accumulation of protein damage in aging organisms is thought to contribute to many aging-related diseases. Yet the properties determining which proteins are most susceptible remain poorly understood. Are certain conformations more vulnerable? Which chaperones are the main guardians? We address these questions with...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2018.05.001
更新日期:2018-06-27 00:00:00
abstract::Temporal interference (TI) is a non-invasive neurostimulation technique that utilizes high-frequency external electric fields to stimulate deep neuronal structures without affecting superficial, off-target structures. TI represents a potential breakthrough for treating conditions, such as Parkinson's disease and chron...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2020.10.004
更新日期:2020-12-16 00:00:00
abstract::The recent increase of immunopeptidomics data, obtained by mass spectrometry or binding assays, opens up possibilities for investigating endogenous antigen presentation by the highly polymorphic human leukocyte antigen class I (HLA-I) protein. State-of-the-art methods predict with high accuracy presentation by HLA all...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2020.11.005
更新日期:2020-12-11 00:00:00
abstract::Signaling proteins display remarkable cell-to-cell heterogeneity in their dynamic responses to stimuli, but the consequences of this heterogeneity remain largely unknown. For instance, the contribution of the dynamics of the innate immune transcription factor nuclear factor κB (NF-κB) to gene expression output is disp...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2017.03.010
更新日期:2017-04-26 00:00:00
abstract::The tumor-suppressing transcription factor p53 is highly conserved at the protein level and plays a key role in the DNA damage response. One important aspect of p53 regulation is its dynamics in response to DNA damage, which include oscillations. Here, we observe that, while the qualitative oscillatory nature of p53 d...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2017.09.012
更新日期:2017-10-25 00:00:00
abstract::One snapshot of the peer review process for "Death Rate of E. coli during Starvation Is Set by Maintenance Cost and Biomass Recycling" (Schink et al., 2019). ...
journal_title:Cell systems
pub_type: 评论,杂志文章
doi:10.1016/j.cels.2019.07.004
更新日期:2019-07-24 00:00:00
abstract::A principled approach to integrating metabolomics, proteomics, and genome-scale metabolic modeling facilitaties rational pathway engineering of E. coli. ...
journal_title:Cell systems
pub_type: 评论,杂志文章
doi:10.1016/j.cels.2016.05.005
更新日期:2016-06-22 00:00:00
abstract::Understanding the effects of genetic perturbations on the cellular state has been challenging using traditional pooled screens, which typically rely on the delivery of a single perturbation per cell and unidimensional phenotypic readouts. Here, we use barcoded open reading frame overexpression libraries coupled with s...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2018.10.008
更新日期:2018-11-28 00:00:00
abstract::This month: selected work from the 2018 RECOMB meeting, organized by Ecole Polytechnique and held last April in Paris. ...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2018.08.005
更新日期:2018-08-22 00:00:00
abstract::Cell-to-cell variation in gene expression and the propagation of such variation (PoV or "noise propagation") from one gene to another in the gene network, as reflected by gene-gene correlation across single cells, are commonly observed in single-cell transcriptomic studies and can shape the phenotypic diversity of cel...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2017.03.002
更新日期:2017-04-26 00:00:00
abstract::Two recent studies in Cell and Science demonstrate the reconstruction of global mechanistic networks and identification of regulatory principles from multi-omics data. ...
journal_title:Cell systems
pub_type: 评论,杂志文章
doi:10.1016/j.cels.2017.01.007
更新日期:2017-01-25 00:00:00
abstract::Here, we outline p-Creode, a new algorithm to construct multi-branching cell lineage trajectories from single-cell data. Application of this platform to diverse sources of single-cell data demonstrates its robustness and scalability, while the discovery of a new origin for rare gut tuft cells showcases the utility of ...
journal_title:Cell systems
pub_type: 评论,杂志文章
doi:10.1016/j.cels.2017.12.015
更新日期:2018-01-24 00:00:00
abstract::Studying autism genes in the context of the protein complexes to which they belong illustrates the potential of network-centric approaches for understanding complex genetic disease. ...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2015.11.004
更新日期:2015-11-25 00:00:00
abstract::Racism and COVID-19 represent a pandemic on a pandemic for Blacks. The pandemics find themselves synergized to the detriment of Blacks and their health. The complexity of the combination of these pandemics are evident when examining the interplay between racist policing practices and health. ...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2020.07.002
更新日期:2020-07-22 00:00:00
abstract::Recent advances in synthetic biology and biological system engineering have allowed the design and construction of engineered live biotherapeutics targeting a range of human clinical applications. In this review, we outline how systems approaches have been used to move from simple constitutive systems, where a single ...
journal_title:Cell systems
pub_type: 杂志文章,评审
doi:10.1016/j.cels.2018.06.008
更新日期:2018-07-25 00:00:00
abstract::Two studies show that noise is a key ingredient of new mechanisms for entraining the NF-κB system. ...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2016.12.008
更新日期:2016-12-21 00:00:00
abstract::An integrated approach unifies experimental observations and mathematical modeling to represent differentiation dynamics as discrete transition events underpinned by stochastic transitions between hidden states. ...
journal_title:Cell systems
pub_type: 评论,杂志文章
doi:10.1016/j.cels.2017.09.009
更新日期:2017-09-27 00:00:00
abstract::Hi-C experiments study how genomes fold in 3D, generating contact maps containing features as small as 20 bp and as large as 200 Mb. Here we introduce Juicebox, a tool for exploring Hi-C and other contact map data. Juicebox allows users to zoom in and out of Hi-C maps interactively, just as a user of Google Earth migh...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2015.07.012
更新日期:2016-07-01 00:00:00
abstract::The construction of complex gene-regulatory networks requires both inhibitory and upregulatory modules. However, the vast majority of RNA-based regulatory "parts" are inhibitory. Using a synthetic biology approach combined with SHAPE-seq, we explored the regulatory effect of RNA-binding protein (RBP)-RNA interactions ...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2019.04.007
更新日期:2019-07-24 00:00:00
abstract::Alterations of circadian clock genes are associated with patient survival, tumor stage, and clinical subtype across various cancer types, highlighting the importance of timing in cancer treatment. ...
journal_title:Cell systems
pub_type: 评论
doi:10.1016/j.cels.2018.03.006
更新日期:2018-03-28 00:00:00
abstract::Translation of mRNA into protein is a fundamental yet complex biological process with multiple factors that can potentially affect its efficiency. Here, we study a stochastic model describing the traffic flow of ribosomes along the mRNA and identify the key parameters that govern the overall rate of protein synthesis,...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2019.12.003
更新日期:2020-02-26 00:00:00
abstract::Effective design of combination therapies requires understanding the changes in cell physiology that result from drug interactions. Here, we show that the genome-wide transcriptional response to combinations of two drugs, measured at a rigorously controlled growth rate, can predict higher-order antagonism with a third...
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2019.10.004
更新日期:2019-11-27 00:00:00
abstract::Gram-negative bacteria are intrinsically resistant to drugs because of their double-membrane envelope structure that acts as a permeability barrier and as an anchor for efflux pumps. Antibiotics are blocked and expelled from cells and cannot reach high-enough intracellular concentrations to exert a therapeutic effect....
journal_title:Cell systems
pub_type: 杂志文章
doi:10.1016/j.cels.2019.03.008
更新日期:2019-04-24 00:00:00