Enabling precision medicine in neonatology, an integrated repository for preterm birth research.

Abstract:

:Preterm birth, or the delivery of an infant prior to 37 weeks of gestation, is a significant cause of infant morbidity and mortality. In the last decade, the advent and continued development of molecular profiling technologies has enabled researchers to generate vast amount of 'omics' data, which together with integrative computational approaches, can help refine the current knowledge about disease mechanisms, diagnostics, and therapeutics. Here we describe the March of Dimes' Database for Preterm Birth Research (http://www.immport.org/resources/mod), a unique resource that contains a variety of 'omics' datasets related to preterm birth. The database is open publicly, and as of January 2018, links 13 molecular studies with data across tens of thousands of patients from 6 measurement modalities. The data in the repository are highly diverse and include genomic, transcriptomic, immunological, and microbiome data. Relevant datasets are augmented with additional molecular characterizations of almost 25,000 biological samples from public databases. We believe our data-sharing efforts will lead to enhanced research collaborations and coordination accelerating the overall pace of discovery in preterm birth research.

journal_name

Sci Data

journal_title

Scientific data

authors

Sirota M,Thomas CG,Liu R,Zuhl M,Banerjee P,Wong RJ,Quaintance CC,Leite R,Chubiz J,Anderson R,Chappell J,Kim M,Grobman W,Zhang G,Rokas A,England SK,Parry S,Shaw GM,Simpson JL,Thomson E,Butte AJ,March of Dimes Pre

doi

10.1038/sdata.2018.219

subject

Has Abstract

pub_date

2018-11-06 00:00:00

pages

180219

issn

2052-4463

pii

sdata2018219

journal_volume

5

pub_type

杂志文章
  • Oral microbiota and dental caries data from monozygotic and dizygotic twin children.

    abstract::There are recent studies which aimed to detect the inheritance on the etiology of dental caries exploring oral composition. We present data on the oral microbiota and its relation with dental caries and other factors in monozygotic (MZ) and dizygotic (DZ) twin children. Following clinical investigation, DNA samples we...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00691-z

    authors: Kasimoglu Y,Koruyucu M,Birant S,Karacan I,Topcuoglu N,Tuna EB,Gencay K,Seymen F

    更新日期:2020-10-13 00:00:00

  • Corrigendum: High-throughput RNAi screen for essential genes and drug synergistic combinations in colorectal cancer.

    abstract::This corrects the article DOI: 10.1038/sdata.2017.139. ...

    journal_title:Scientific data

    pub_type: 杂志文章,已发布勘误

    doi:10.1038/sdata.2018.215

    authors: Williams SP,Barthorpe AS,Lightfoot H,Garnett MJ,McDermott U

    更新日期:2018-10-09 00:00:00

  • Construction, complete sequence, and annotation of a BAC contig covering the silkworm chorion locus.

    abstract::The silkmoth chorion was studied extensively by F.C. Kafatos' group for almost 40 years. However, the complete structure of the chorion locus was not obtained in the genome sequence of Bombyx mori published in 2008 due to repetitive sequences, resulting in gaps and an incomplete view of the locus. To obtain the comple...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.62

    authors: Chen Z,Nohata J,Guo H,Li S,Liu J,Guo Y,Yamamoto K,Kadono-Okuda K,Liu C,Arunkumar KP,Nagaraju J,Zhang Y,Liu S,Labropoulou V,Swevers L,Tsitoura P,Iatrou K,Gopinathan KP,Goldsmith MR,Xia Q,Mita K

    更新日期:2015-11-10 00:00:00

  • Survey-based socio-economic data from slums in Bangalore, India.

    abstract::In 2010, an estimated 860 million people were living in slums worldwide, with around 60 million added to the slum population between 2000 and 2010. In 2011, 200 million people in urban Indian households were considered to live in slums. In order to address and create slum development programmes and poverty alleviation...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.200

    authors: Roy D,Palavalli B,Menon N,King R,Pfeffer K,Lees M,Sloot PMA

    更新日期:2018-01-09 00:00:00

  • Thermodynamic and transport properties of hydrogen containing streams.

    abstract::The use of hydrogen (H2) as a substitute for fossil fuel, which accounts for the majority of the world's energy, is environmentally the most benign option for the reduction of CO2 emissions. This will require gigawatt-scale storage systems and as such, H2 storage in porous rocks in the subsurface will be required. Acc...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-0568-6

    authors: Hassanpouryouzband A,Joonaki E,Edlmann K,Heinemann N,Yang J

    更新日期:2020-07-09 00:00:00

  • A draft genome assembly of spotted hyena, Crocuta crocuta.

    abstract::The spotted hyena (Crocuta crocuta), one of the largest terrestrial predators native to sub-Saharan Africa, is well known for its matriarchal social system and large-sized social group in which larger females dominate smaller males. Spotted hyenas are highly adaptable predators as they both actively hunt prey and scav...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-0468-9

    authors: Yang C,Li F,Xiong Z,Koepfli KP,Ryder O,Perelman P,Li Q,Zhang G

    更新日期:2020-04-28 00:00:00

  • A catalogue of 863 Rett-syndrome-causing MECP2 mutations and lessons learned from data integration.

    abstract::Rett syndrome (RTT) is a rare neurological disorder mostly caused by a genetic variation in MECP2. Making new MECP2 variants and the related phenotypes available provides data for better understanding of disease mechanisms and faster identification of variants for diagnosis. This is, however, currently hampered by the...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00794-7

    authors: Ehrhart F,Jacobsen A,Rigau M,Bosio M,Kaliyaperumal R,Laros JFJ,Willighagen EL,Valencia A,Roos M,Capella-Gutierrez S,Curfs LMG,Evelo CT

    更新日期:2021-01-15 00:00:00

  • Unbalanced historical phenotypic data from seed regeneration of a barley ex situ collection.

    abstract::The scarce knowledge on phenotypic characterization restricts the usage of genetic diversity of plant genetic resources in research and breeding. We describe original and ready-to-use processed data for approximately 60% of ~22,000 barley accessions hosted at the Federal ex situ Genebank for Agricultural and Horticult...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.278

    authors: Gonzalez MY,Weise S,Zhao Y,Philipp N,Arend D,Börner A,Oppermann M,Graner A,Reif JC,Schulthess AW

    更新日期:2018-12-04 00:00:00

  • A band-gap database for semiconducting inorganic materials calculated with hybrid functional.

    abstract::Semiconducting inorganic materials with band gaps ranging between 0 and 5 eV constitute major components in electronic, optoelectronic and photovoltaic devices. Since the band gap is a primary material property that affects the device performance, large band-gap databases are useful in selecting optimal materials in e...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00723-8

    authors: Kim S,Lee M,Hong C,Yoon Y,An H,Lee D,Jeong W,Yoo D,Kang Y,Youn Y,Han S

    更新日期:2020-11-11 00:00:00

  • Spatiotemporal dataset on Chinese population distribution and its driving factors from 1949 to 2013.

    abstract::Spatio-temporal data on human population and its driving factors is critical to understanding and responding to population problems. Unfortunately, such spatio-temporal data on a large scale and over the long term are often difficult to obtain. Here, we present a dataset on Chinese population distribution and its driv...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2016.47

    authors: Wang L,Chen L

    更新日期:2016-07-05 00:00:00

  • A multi-species repository of social networks.

    abstract::Social network analysis is an invaluable tool to understand the patterns, evolution, and consequences of sociality. Comparative studies over a range of social systems across multiple taxonomic groups are particularly valuable. Such studies however require quantitative social association or interaction data across mult...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0056-z

    authors: Sah P,Méndez JD,Bansal S

    更新日期:2019-04-29 00:00:00

  • Transcriptome dataset of human corneal endothelium based on ribosomal RNA-depleted RNA-Seq data.

    abstract::The corneal endothelium maintains corneal transparency; consequently, damage to this endothelium by a number of pathological conditions results in severe vision loss. Publicly available expression databases of human tissues are useful for investigating the pathogenesis of diseases and for developing new therapeutic mo...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00754-1

    authors: Tokuda Y,Okumura N,Komori Y,Hanada N,Tashiro K,Koizumi N,Nakano M

    更新日期:2020-11-20 00:00:00

  • De novo transcriptome assembly databases for the butterfly orchid Phalaenopsis equestris.

    abstract::Orchids are renowned for their spectacular flowers and ecological adaptations. After the sequencing of the genome of the tropical epiphytic orchid Phalaenopsis equestris, we combined Illumina HiSeq2000 for RNA-Seq and Trinity for de novo assembly to characterize the transcriptomes for 11 diverse P. equestris tissues r...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2016.83

    authors: Niu SC,Xu Q,Zhang GQ,Zhang YQ,Tsai WC,Hsu JL,Liang CK,Luo YB,Liu ZJ

    更新日期:2016-09-27 00:00:00

  • Outlier analyses of the Protein Data Bank archive using a probability-density-ranking approach.

    abstract::Outlier analyses are central to scientific data assessments. Conventional outlier identification methods do not work effectively for Protein Data Bank (PDB) data, which are characterized by heavy skewness and the presence of bounds and/or long tails. We have developed a data-driven nonparametric method to identify out...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.293

    authors: Shao C,Liu Z,Yang H,Wang S,Burley SK

    更新日期:2018-12-11 00:00:00

  • Construction of the REACHES climate database based on historical documents of China.

    abstract::This paper describes the methodology of an ongoing project of constructing an East Asian climate database REACHES based on Chinese historical documents. The record source is Compendium of Meteorological Records of China in the Last 3000 Years which collects meteorology and climate related records from mainly official ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.288

    authors: Wang PK,Lin KE,Liao YC,Liao HM,Lin YS,Hsu CT,Hsu SM,Wan CW,Lee SY,Fan IC,Tan PH,Ting TT

    更新日期:2018-12-18 00:00:00

  • Nationwide registry of sepsis patients in Japan focused on disseminated intravascular coagulation 2011-2013.

    abstract::Sepsis is a syndrome with physiologic, pathologic, and biochemical abnormalities induced by infection. Sepsis can induce the dysregulation of systemic coagulation and fibrinolytic systems, resulting in disseminated intravascular coagulation (DIC), which is associated with a high mortality rate. Although there is no in...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.243

    authors: Hayakawa M,Yamakawa K,Saito S,Uchino S,Kudo D,Iizuka Y,Sanui M,Takimoto K,Mayumi T

    更新日期:2018-12-11 00:00:00

  • RE-Europe, a large-scale dataset for modeling a highly renewable European electricity system.

    abstract::Future highly renewable energy systems will couple to complex weather and climate dynamics. This coupling is generally not captured in detail by the open models developed in the power and energy system communities, where such open models exist. To enable modeling such a future energy system, we describe a dedicated la...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.175

    authors: Jensen TV,Pinson P

    更新日期:2017-11-28 00:00:00

  • A database seed for a community-driven material intensity research platform.

    abstract::The data record contains Material Intensity data for buildings (MI). MI coefficients are often used for different types of analysis of socio-economic systems and in particular for environmental assessments. Until now, MI values were compiled and reported ad-hoc with few cross-study comparisons. We extracted and conver...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0021-x

    authors: Heeren N,Fishman T

    更新日期:2019-04-09 00:00:00

  • Japan prefectural emission accounts and socioeconomic data 2007 to 2015.

    abstract::In the wake of the Fukushima nuclear disaster, Japan largely moved away from nuclear power generation and turned back towards an energy sector dominated by fossil fuels. As a result, the pace towards reaching emission reduction targets has largely slowed down. This situation indicates that higher emissions will contin...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-0571-y

    authors: Long Y,Yoshida Y,Zhang H,Zheng H,Shan Y,Guan D

    更新日期:2020-07-13 00:00:00

  • Extended regions of suspected mis-assembly in the rat reference genome.

    abstract::We performed whole-genome sequencing for eight inbred rat strains commonly used in genetic mapping studies. They are the founders of the NIH heterogeneous stock (HS) outbred colony. We provide their sequences and variant calls to the rat genomics community. When analyzing the variant calls we identified regions with u...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0041-6

    authors: Ramdas S,Ozel AB,Treutelaar MK,Holl K,Mandel M,Woods LCS,Li JZ

    更新日期:2019-04-23 00:00:00

  • A global view on the effect of water uptake on aerosol particle light scattering.

    abstract::A reference dataset of multi-wavelength particle light scattering and hemispheric backscattering coefficients for different relative humidities (RH) between RH = 30 and 95% and wavelengths between λ = 450 nm and 700 nm is described in this work. Tandem-humidified nephelometer measurements from 26 ground-based sites ar...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0158-7

    authors: Burgos MA,Andrews E,Titos G,Alados-Arboledas L,Baltensperger U,Day D,Jefferson A,Kalivitis N,Mihalopoulos N,Sherman J,Sun J,Weingartner E,Zieger P

    更新日期:2019-08-22 00:00:00

  • MatchingLand, geospatial data testbed for the assessment of matching methods.

    abstract::This article presents datasets prepared with the aim of helping the evaluation of geospatial matching methods for vector data. These datasets were built up from mapping data produced by official Spanish mapping agencies. The testbed supplied encompasses the three geometry types: point, line and area. Initial datasets ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.180

    authors: Xavier EMA,Ariza-López FJ,Ureña-Cámara MA

    更新日期:2017-12-05 00:00:00

  • An annotated fluorescence image dataset for training nuclear segmentation methods.

    abstract::Fully-automated nuclear image segmentation is the prerequisite to ensure statistically significant, quantitative analyses of tissue preparations,applied in digital pathology or quantitative microscopy. The design of segmentation methods that work independently of the tissue type or preparation is complex, due to varia...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00608-w

    authors: Kromp F,Bozsaky E,Rifatbegovic F,Fischer L,Ambros M,Berneder M,Weiss T,Lazic D,Dörr W,Hanbury A,Beiske K,Ambros PF,Ambros IM,Taschner-Mandl S

    更新日期:2020-08-11 00:00:00

  • Linking in silico MS/MS spectra with chemistry data to improve identification of unknowns.

    abstract::Confident identification of unknown chemicals in high resolution mass spectrometry (HRMS) screening studies requires cohesive workflows and complementary data, tools, and software. Chemistry databases, screening libraries, and chemical metadata have become fixtures in identification workflows. To increase confidence i...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0145-z

    authors: McEachran AD,Balabin I,Cathey T,Transue TR,Al-Ghoul H,Grulke C,Sobus JR,Williams AJ

    更新日期:2019-08-02 00:00:00

  • Viruses of the Nahant Collection, characterization of 251 marine Vibrionaceae viruses.

    abstract::Viruses are highly discriminating in their interactions with host cells and are thought to play a major role in maintaining diversity of environmental microbes. However, large-scale ecological and genomic studies of co-occurring virus-host pairs, required to characterize the mechanistic and genomic foundations of viru...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.114

    authors: Kauffman KM,Brown JM,Sharma RS,VanInsberghe D,Elsherbini J,Polz M,Kelly L

    更新日期:2018-07-03 00:00:00

  • Systematic analysis of infectious disease outcomes by age shows lowest severity in school-age children.

    abstract::The COVID-19 pandemic has ignited interest in age-specific manifestations of infection but surprisingly little is known about relative severity of infectious disease between the extremes of age. In a systematic analysis we identified 142 datasets with information on severity of disease by age for 32 different infectio...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00668-y

    authors: Glynn JR,Moss PAH

    更新日期:2020-10-15 00:00:00

  • Sample descriptors linked to metagenomic sequencing data from human and animal enteric samples from Vietnam.

    abstract::There is still limited information on the diversity of viruses co-circulating in humans and animals. Here, we report data obtained from a large field collection of enteric samples taken from humans, pigs, rodents and other mammal hosts in Vietnam between 2012 and 2016. Each of 2100 stool or rectal swab samples was sub...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0215-2

    authors: Woolhouse M,Ashworth J,Bogaardt C,Tue NT,Baker S,Thwaites G,Phuc TM

    更新日期:2019-10-15 00:00:00

  • Harmonised LUCAS in-situ land cover and use database for field surveys from 2006 to 2018 in the European Union.

    abstract::Accurately characterizing land surface changes with Earth Observation requires geo-located ground truth. In the European Union (EU), a tri-annual surveyed sample of land cover and land use has been collected since 2006 under the Land Use/Cover Area frame Survey (LUCAS). A total of 1351293 observations at 651780 unique...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00675-z

    authors: d'Andrimont R,Yordanov M,Martinez-Sanchez L,Eiselt B,Palmieri A,Dominici P,Gallego J,Reuter HI,Joebges C,Lemoine G,van der Velde M

    更新日期:2020-10-16 00:00:00

  • A global compendium of human Crimean-Congo haemorrhagic fever virus occurrence.

    abstract::In order to map global disease risk, a geographic database of human Crimean-Congo haemorrhagic fever virus (CCHFV) occurrence was produced by surveying peer-reviewed literature and case reports, as well as informal online sources. Here we present this database, comprising occurrence data linked to geographic point or ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.16

    authors: Messina JP,Pigott DM,Duda KA,Brownstein JS,Myers MF,George DB,Hay SI

    更新日期:2015-04-14 00:00:00

  • Phase contrast time-lapse microscopy datasets with automated and manual cell tracking annotations.

    abstract::Phase contrast time-lapse microscopy is a non-destructive technique that generates large volumes of image-based information to quantify the behaviour of individual cells or cell populations. To guide the development of algorithms for computer-aided cell tracking and analysis, 48 time-lapse image sequences, each spanni...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.237

    authors: Ker DFE,Eom S,Sanami S,Bise R,Pascale C,Yin Z,Huh SI,Osuna-Highley E,Junkers SN,Helfrich CJ,Liang PY,Pan J,Jeong S,Kang SS,Liu J,Nicholson R,Sandbothe MF,Van PT,Liu A,Chen M,Kanade T,Weiss LE,Campbell PG

    更新日期:2018-11-13 00:00:00