Abstract:
:In the last few decades, data-driven methods have come to dominate many fields of scientific inquiry. Open data and open-source software have enabled the rapid implementation of novel methods to manage and analyze the growing flood of data. However, it has become apparent that many scientific fields exhibit distressingly low rates of reproducibility. Although there are many dimensions to this issue, we believe that there is a lack of formalism used when describing end-to-end published results, from the data source to the analysis to the final published results. Even when authors do their best to make their research and data accessible, this lack of formalism reduces the clarity and efficiency of reporting, which contributes to issues of reproducibility. Data provenance aids both reproducibility through systematic and formal records of the relationships among data sources, processes, datasets, publications and researchers.
journal_name
Sci Datajournal_title
Scientific dataauthors
Pasquier T,Lau MK,Trisovic A,Boose ER,Couturier B,Crosas M,Ellison AM,Gibson V,Jones CR,Seltzer Mdoi
10.1038/sdata.2017.114subject
Has Abstractpub_date
2017-09-05 00:00:00pages
170114issn
2052-4463pii
sdata2017114journal_volume
4pub_type
杂志文章相关文献
Scientific Data文献大全abstract::Between the 2011 and 2016 national censuses, the Australian Bureau of Statistics changed its anonymity policy compliance system for the distribution of census data. The new method has resulted in dramatic inconsistencies when comparing low-resolution data to aggregated high-resolution data. Hence, aggregated totals do...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0137-z
更新日期:2019-08-16 00:00:00
abstract::Plants use surface receptors to perceive information about many aspects of their local environment. These receptors physically interact to form both steady state and signalling competent complexes. The signalling events downstream of receptor activation impact both plant developmental and immune responses. Here, we pr...
journal_title:Scientific data
pub_type:
doi:10.1038/sdata.2019.25
更新日期:2019-02-26 00:00:00
abstract::The COVID-19 pandemic has ignited interest in age-specific manifestations of infection but surprisingly little is known about relative severity of infectious disease between the extremes of age. In a systematic analysis we identified 142 datasets with information on severity of disease by age for 32 different infectio...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00668-y
更新日期:2020-10-15 00:00:00
abstract::High-quality and high-throughput sequencing technologies are required for therapeutic and diagnostic analyses of human gut microbiota. Here, we evaluated the advantages and disadvantages of the various commercial sequencing platforms for studying human gut microbiota. We generated fecal bacterial sequences from 170 Ko...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2018.68
更新日期:2018-04-24 00:00:00
abstract::Understanding dynamic human mobility changes and spatial interaction patterns at different geographic scales is crucial for assessing the impacts of non-pharmaceutical interventions (such as stay-at-home orders) during the COVID-19 pandemic. In this data descriptor, we introduce a regularly-updated multiscale dynamic ...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00734-5
更新日期:2020-11-12 00:00:00
abstract::The behaviors of building occupants have continued to perplex scholars for years in our attempts to develop models for energy efficient housing. Building simulations, project delivery approaches, policies, and more have fell short of their optimistic goals due to the complexity of human behavior. As a part of a multip...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0275-3
更新日期:2019-11-26 00:00:00
abstract::The data record contains Material Intensity data for buildings (MI). MI coefficients are often used for different types of analysis of socio-economic systems and in particular for environmental assessments. Until now, MI values were compiled and reported ad-hoc with few cross-study comparisons. We extracted and conver...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0021-x
更新日期:2019-04-09 00:00:00
abstract::Surveys for more than 9,500 households were conducted in the growing seasons 2002/2003 or 2003/2004 in eleven African countries: Burkina Faso, Cameroon, Ghana, Niger and Senegal in western Africa; Egypt in northern Africa; Ethiopia and Kenya in eastern Africa; South Africa, Zambia and Zimbabwe in southern Africa. Hous...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2016.20
更新日期:2016-05-24 00:00:00
abstract::We describe a screen for cellular response to drugs that makes use of haploid embryonic stem cells. We generated ten libraries of mutants with piggyBac gene trap transposon integrations, totalling approximately 100,000 mutant clones. Random barcode sequences were inserted into the transposon vector to allow the number...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2017.20
更新日期:2017-03-01 00:00:00
abstract::This article presents a practical roadmap for scholarly data repositories to implement data citation in accordance with the Joint Declaration of Data Citation Principles, a synopsis and harmonization of the recommendations of major science policy bodies. The roadmap was developed by the Repositories Expert Group, as p...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0031-8
更新日期:2019-04-10 00:00:00
abstract::It is estimated that approximately 4-5% of national energy consumption can be saved through corrections to existing commercial building controls infrastructure and resulting improvements to efficiency. Correspondingly, automated fault detection and diagnostics (FDD) algorithms are designed to identify the presence of ...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-0398-6
更新日期:2020-02-24 00:00:00
abstract::A comprehensive transcriptome analysis of an expressed sequence tag (EST) database of the spider Dolomedes fimbriatus venom glands using single-residue distribution analysis (SRDA) identified 7,169 unique sequences. Mature chains of 163 different toxin-like polypeptides were predicted on the basis of well-established ...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2014.23
更新日期:2014-08-05 00:00:00
abstract::In angiogenesis with concurrent inflammation, many pathways are activated, some linked to VEGF and others largely VEGF-independent. Pathways involving inflammatory mediators, chemokines, and micro-RNAs may play important roles in maintaining a pro-angiogenic environment or mediating angiogenic regression. Here, we des...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2016.103
更新日期:2016-11-22 00:00:00
abstract::The London Planetree (Platanus acerifolia) are present throughout the world. The tree is considered a greening plant and is commonly planted in streets, parks, and courtyards. The Sycamore lace bug (Corythucha ciliata) is a serious pest of this tree. To determine the molecular mechanism behind the interaction between ...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0111-9
更新日期:2019-07-22 00:00:00
abstract::In this work, we compare the resolution of V2-V3 and V3-V4 16S rRNA regions for the purposes of estimating microbial community diversity using paired-end Illumina MiSeq reads, and show that the fragment, including V2 and V3 regions, has higher resolution for lower-rank taxa (genera and species). It allows for a more p...
journal_title:Scientific data
pub_type:
doi:10.1038/sdata.2019.7
更新日期:2019-02-05 00:00:00
abstract::Land-atmosphere interactions at different temporal and spatial scales are important for our understanding of the Earth system and its modeling. The Landscape Evolution Observatory (LEO) at Biosphere 2, managed by the University of Arizona, hosts three nearly identical artificial bare-soil hillslopes with dimensions of...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00645-5
更新日期:2020-09-15 00:00:00
abstract::As a World Health Organization Research and Development Blueprint priority pathogen, there is a need to better understand the geographic distribution of Middle East Respiratory Syndrome Coronavirus (MERS-CoV) and its potential to infect mammals and humans. This database documents cases of MERS-CoV globally, with speci...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0330-0
更新日期:2019-12-13 00:00:00
abstract::The zooplankter Calanus finmarchicus is a member of the so-called "Calanus Complex", a group of copepods that constitutes a key element of the Arctic polar marine ecosystem, providing a crucial link between primary production and higher trophic levels. Climate change induces the shift of C. finmarchicus to higher lati...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00751-4
更新日期:2020-11-24 00:00:00
abstract::Asymptomatic ebolavirus infection could greatly influence transmission dynamics, but there is little consensus on how frequently it occurs or even if it exists. This paper summarises the available evidence on seroprevalence of Ebola, Sudan and Bundibugyo virus IgG in people without known ebolavirus disease. Through sy...
journal_title:Scientific data
pub_type: 杂志文章,meta分析,评审
doi:10.1038/sdata.2016.133
更新日期:2017-01-31 00:00:00
abstract::Studies of chimpanzee vocal communication provide valuable insights into the evolution of communication in complex societies, and also comparative data for understanding the evolution of human language. One particularly valuable dataset of recordings from free-living chimpanzees was collected by Frans X. Plooij and th...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2015.27
更新日期:2015-05-26 00:00:00
abstract::Motion capture is necessary to quantify gait deviations in individuals with lower-limb amputations. However, access to the patient population and the necessary equipment is limited. Here we present the first open biomechanics dataset for 18 individuals with unilateral above-knee amputations walking at different speeds...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-0494-7
更新日期:2020-05-21 00:00:00
abstract::Trait-based approaches advance ecological and evolutionary research because traits provide a strong link to an organism's function and fitness. Trait-based research might lead to a deeper understanding of the functions of, and services provided by, ecosystems, thereby improving management, which is vital in the curren...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2016.17
更新日期:2016-03-29 00:00:00
abstract::Neural microarchitecture is heterogeneous, varying both across and within brain regions. The consistent identification of regions of interest is one of the most critical aspects in examining neurocircuitry, as these structures serve as the vital landmarks with which to map brain pathways. Access to continuous, three-d...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00692-y
更新日期:2020-10-20 00:00:00
abstract::When visual input has conflicting interpretations, conscious perception can alternate spontaneously between these possible interpretations. This is called bistable perception. Previous neuroimaging studies have indicated the involvement of two right parietal areas in resolving perceptual ambiguity (ant-SPLr and post-S...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2016.65
更新日期:2016-08-16 00:00:00
abstract::Cross sectional imaging is essential for the patient-specific planning and delivery of radiotherapy, a primary determinant of head and neck cancer outcomes. Due to challenges ensuring data quality and patient de-identification, publicly available datasets including diagnostic and radiation treatment planning imaging a...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2018.173
更新日期:2018-09-04 00:00:00
abstract::Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing G...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2017.115
更新日期:2017-09-21 00:00:00
abstract::Direct-infusion mass spectrometry (DIMS) metabolomics is an important approach for characterising molecular responses of organisms to disease, drugs and the environment. Increasingly large-scale metabolomics studies are being conducted, necessitating improvements in both bioanalytical and computational workflows to ma...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2014.12
更新日期:2014-06-10 00:00:00
abstract::Soybean aphid (Aphis glycines; SBA) and soybean cyst nematode (Heterodera glycines; SCN) are two major pests of soybean (Glycine max) in the United States of America. This study aims to characterize three-way interactions among soybean, SBA, and SCN using both demographic and genetic datasets. SCN-resistant and SCN-su...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0140-4
更新日期:2019-07-24 00:00:00
abstract::Metadata that are structured using principled schemas and that use terms from ontologies are essential to making biomedical data findable and reusable for downstream analyses. The largest source of metadata that describes the experimental protocol, funding, and scientific leadership of clinical studies is ClinicalTria...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00780-z
更新日期:2020-12-18 00:00:00
abstract::Polyadenylation plays an important role in gene regulation, thus affecting a wide variety of biological processes. In the rice blast fungus Magnaporthe oryzae the cleavage factor I protein Rpb35 is required for pre-mRNA polyadenylation and fungal virulence. Here we present the bioinformatic approach and output data re...
journal_title:Scientific data
pub_type:
doi:10.1038/sdata.2018.271
更新日期:2018-11-27 00:00:00