An annotated fluorescence image dataset for training nuclear segmentation methods.

Abstract:

:Fully-automated nuclear image segmentation is the prerequisite to ensure statistically significant, quantitative analyses of tissue preparations,applied in digital pathology or quantitative microscopy. The design of segmentation methods that work independently of the tissue type or preparation is complex, due to variations in nuclear morphology, staining intensity, cell density and nuclei aggregations. Machine learning-based segmentation methods can overcome these challenges, however high quality expert-annotated images are required for training. Currently, the limited number of annotated fluorescence image datasets publicly available do not cover a broad range of tissues and preparations. We present a comprehensive, annotated dataset including tightly aggregated nuclei of multiple tissues for the training of machine learning-based nuclear segmentation algorithms. The proposed dataset covers sample preparation methods frequently used in quantitative immunofluorescence microscopy. We demonstrate the heterogeneity of the dataset with respect to multiple parameters such as magnification, modality, signal-to-noise ratio and diagnosis. Based on a suggested split into training and test sets and additional single-nuclei expert annotations, machine learning-based image segmentation methods can be trained and evaluated.

journal_name

Sci Data

journal_title

Scientific data

authors

Kromp F,Bozsaky E,Rifatbegovic F,Fischer L,Ambros M,Berneder M,Weiss T,Lazic D,Dörr W,Hanbury A,Beiske K,Ambros PF,Ambros IM,Taschner-Mandl S

doi

10.1038/s41597-020-00608-w

subject

Has Abstract

pub_date

2020-08-11 00:00:00

pages

262

issue

1

issn

2052-4463

pii

10.1038/s41597-020-00608-w

journal_volume

7

pub_type

杂志文章
  • VLUIS, a land use data product for Victoria, Australia, covering 2006 to 2013.

    abstract::Land Use Information is a key dataset required to enable an understanding of the changing nature of our landscapes and the associated influences on natural resources and regional communities. The Victorian Land Use Information System (VLUIS) data product has been created within the State Government of Victoria to supp...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.70

    authors: Morse-McNabb E,Sheffield K,Clark R,Lewis H,Robson S,Cherry D,Williams S

    更新日期:2015-11-24 00:00:00

  • Synthetic skull bone defects for automatic patient-specific craniofacial implant design.

    abstract::Patient-specific craniofacial implants are used to repair skull bone defects after trauma or surgery. Currently, cranial implants are designed and produced by third-party suppliers, which is usually time-consuming and expensive. Recent advances in additive manufacturing made the in-hospital or in-operation-room fabric...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-021-00806-0

    authors: Li J,Gsaxner C,Pepe A,Morais A,Alves V,von Campe G,Wallner J,Egger J

    更新日期:2021-01-29 00:00:00

  • An analecta of visualizations for foodborne illness trends and seasonality.

    abstract::Disease surveillance systems worldwide face increasing pressure to maintain and distribute data in usable formats supplemented with effective visualizations to enable actionable policy and programming responses. Annual reports and interactive portals provide access to surveillance data and visualizations depicting tem...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00677-x

    authors: Simpson RB,Zhou B,Alarcon Falconi TM,Naumova EN

    更新日期:2020-10-13 00:00:00

  • HYSOGs250m, global gridded hydrologic soil groups for curve-number-based runoff modeling.

    abstract::Hydrologic soil groups (HSGs) are a fundamental component of the USDA curve-number (CN) method for estimation of rainfall runoff; yet these data are not readily available in a format or spatial-resolution suitable for regional- and global-scale modeling applications. We developed a globally consistent, gridded dataset...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.91

    authors: Ross CW,Prihodko L,Anchang J,Kumar S,Ji W,Hanan NP

    更新日期:2018-05-15 00:00:00

  • CU-BEMS, smart building electricity consumption and indoor environmental sensor datasets.

    abstract::This paper describes the release of the detailed building operation data, including electricity consumption and indoor environmental measurements, of the seven-story 11,700-m2 office building located in Bangkok, Thailand. The electricity consumption data (kW) are that of individual air conditioning units, lighting, an...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00582-3

    authors: Pipattanasomporn M,Chitalia G,Songsiri J,Aswakul C,Pora W,Suwankawin S,Audomvongseree K,Hoonchareon N

    更新日期:2020-07-20 00:00:00

  • A statistical atlas of cerebral arteries generated using multi-center MRA datasets from healthy subjects.

    abstract::Magnetic resonance angiography (MRA) can capture the variation of cerebral arteries with high spatial resolution. These measurements include valuable information about the morphology, geometry, and density of brain arteries, which may be useful to identify risk factors for cerebrovascular and neurological diseases at ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0034-5

    authors: Mouches P,Forkert ND

    更新日期:2019-04-11 00:00:00

  • Optical motion capture dataset of selected techniques in beginner and advanced Kyokushin karate athletes.

    abstract::Human motion capture is commonly used in various fields, including sport, to analyze, understand, and synthesize kinematic and kinetic data. Specialized computer vision and marker-based optical motion capture techniques constitute the gold-standard for accurate and robust human motion capture. The dataset presented co...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-021-00801-5

    authors: Szczęsna A,Błaszczyszyn M,Pawlyta M

    更新日期:2021-01-18 00:00:00

  • Large-scale modeled contemporary and future water temperature estimates for 10774 Midwestern U.S. Lakes.

    abstract::Climate change has already influenced lake temperatures globally, but understanding future change is challenging. The response of lakes to changing climate drivers is complex due to the nature of lake-atmosphere coupling, ice cover, and stratification. To better understand the diversity of lake responses to climate ch...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.53

    authors: Winslow LA,Hansen GJA,Read JS,Notaro M

    更新日期:2017-04-25 00:00:00

  • Building fault detection data to aid diagnostic algorithm creation and performance testing.

    abstract::It is estimated that approximately 4-5% of national energy consumption can be saved through corrections to existing commercial building controls infrastructure and resulting improvements to efficiency. Correspondingly, automated fault detection and diagnostics (FDD) algorithms are designed to identify the presence of ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-0398-6

    authors: Granderson J,Lin G,Harding A,Im P,Chen Y

    更新日期:2020-02-24 00:00:00

  • Flow and detailed 3D morphodynamic data from laboratory experiments of fluvial dike breaching.

    abstract::This paper presents a dataset obtained from fifty four laboratory experiments of the breaching of fluvial dikes due to flow overtopping. Data were collected on two complementary experimental setups, each consisting of a main channel representing the river, an erodible lateral dike and a floodplain. The dataset covers ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0057-y

    authors: Rifai I,El Kadi Abderrezzak K,Erpicum S,Archambeau P,Violeau D,Pirotton M,Dewals B

    更新日期:2019-05-13 00:00:00

  • A Mediterranean coastal database for assessing the impacts of sea-level rise and associated hazards.

    abstract::We have developed a new coastal database for the Mediterranean basin that is intended for coastal impact and adaptation assessment to sea-level rise and associated hazards on a regional scale. The data structure of the database relies on a linear representation of the coast with associated spatial assessment units. Us...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.44

    authors: Wolff C,Vafeidis AT,Muis S,Lincke D,Satta A,Lionello P,Jimenez JA,Conte D,Hinkel J

    更新日期:2018-03-27 00:00:00

  • Human pluripotent stem cell derived HLC transcriptome data enables molecular dissection of hepatogenesis.

    abstract::Induced pluripotent stem cells (iPSCs) and human embryonic stem cells (hESCs) differentiated into hepatocyte-like cells (HLCs) provide a defined and renewable source of cells for drug screening, toxicology and regenerative medicine. We previously reprogrammed human fetal foreskin fibroblast cells (HFF1) into iPSCs emp...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.35

    authors: Wruck W,Adjaye J

    更新日期:2018-03-13 00:00:00

  • Facial model collection for medical augmented reality in oncologic cranio-maxillofacial surgery.

    abstract::Medical augmented reality (AR) is an increasingly important topic in many medical fields. AR enables x-ray vision to see through real world objects. In medicine, this offers pre-, intra- or post-interventional visualization of "hidden" structures. In contrast to a classical monitor view, AR applications provide visual...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0327-8

    authors: Gsaxner C,Wallner J,Chen X,Zemann W,Egger J

    更新日期:2019-12-09 00:00:00

  • A data set of global river networks and corresponding water resources zones divisions.

    abstract::As basic data, the river networks and water resources zones (WRZ) are critical for planning, utilization, development, conservation and management of water resources. Currently, the river network and WRZ of world are most obtained based on digital elevation model data automatically, which are not accuracy enough, espe...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0243-y

    authors: Yan D,Wang K,Qin T,Weng B,Wang H,Bi W,Li X,Li M,Lv Z,Liu F,He S,Ma J,Shen Z,Wang J,Bai H,Man Z,Sun C,Liu M,Shi X,Jing L,Sun R,Cao S,Hao C,Wang L,Pei M,Dorjsuren B,Gedefaw M,Girma A,Abiyu A

    更新日期:2019-10-22 00:00:00

  • The odonate phenotypic database, a new open data resource for comparative studies of an old insect order.

    abstract::We present The Odonate Phenotypic Database (OPD): an online data resource of dragonfly and damselfly phenotypes (Insecta: Odonata). Odonata is a relatively small insect order that currently consists of about 6400 species belonging to 32 families. The database consists of multiple morphological, life-history and behavi...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0318-9

    authors: Waller JT,Willink B,Tschol M,Svensson EI

    更新日期:2019-12-12 00:00:00

  • Genotoype-by-sequencing of three geographically distinct populations of Olympia oysters, Ostrea lurida.

    abstract::Olympia oysters are found along the west coast of North America and as the only native oyster species in the region, receive considerable attention with regard to restoration and conservation. Knowledge of genetic structure of this species is essential for resource managers. Here we provide genetic data for three dist...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.130

    authors: White SJ,Vadopalas B,Silliman K,Roberts SB

    更新日期:2017-09-12 00:00:00

  • Linking in silico MS/MS spectra with chemistry data to improve identification of unknowns.

    abstract::Confident identification of unknown chemicals in high resolution mass spectrometry (HRMS) screening studies requires cohesive workflows and complementary data, tools, and software. Chemistry databases, screening libraries, and chemical metadata have become fixtures in identification workflows. To increase confidence i...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0145-z

    authors: McEachran AD,Balabin I,Cathey T,Transue TR,Al-Ghoul H,Grulke C,Sobus JR,Williams AJ

    更新日期:2019-08-02 00:00:00

  • Quantitative mapping of RNA-mediated nuclear estrogen receptor β interactome in human breast cancer cells.

    abstract::The nuclear receptor estrogen receptor 2 (ESR2, ERβ) modulates cancer cell proliferation and tumor growth, exerting an oncosuppressive role in breast cancer (BC). Interaction proteomics by tandem affinity purification coupled to mass spectrometry was previously applied in BC cells to identify proteins acting in concer...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.31

    authors: Giurato G,Nassa G,Salvati A,Alexandrova E,Rizzo F,Nyman TA,Weisz A,Tarallo R

    更新日期:2018-03-06 00:00:00

  • A dataset describing a suite of novel antibody reagents for the RAS signaling network.

    abstract::RAS genes are frequently mutated in cancer and have for decades eluded effective therapeutic attack. The National Cancer Institute's RAS Initiative has a focus on understanding pathways and discovering therapies for RAS-driven cancers. Part of these efforts is the generation of novel reagents to enable the quantificat...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0166-7

    authors: Schoenherr RM,Huang D,Voytovich UJ,Ivey RG,Kennedy JJ,Saul RG,Colantonio S,Roberts RR,Knotts JG,Kaczmarczyk JA,Perry C,Hewitt SM,Bocik W,Whiteley GR,Hiltke T,Boja ES,Rodriguez H,Whiteaker JR,Paulovich AG

    更新日期:2019-08-29 00:00:00

  • An open science resource for establishing reliability and reproducibility in functional connectomics.

    abstract::Efforts to identify meaningful functional imaging-based biomarkers are limited by the ability to reliably characterize inter-individual differences in human brain function. Although a growing number of connectomics-based measures are reported to have moderate to high test-retest reliability, the variability in data ac...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2014.49

    authors: Zuo XN,Anderson JS,Bellec P,Birn RM,Biswal BB,Blautzik J,Breitner JC,Buckner RL,Calhoun VD,Castellanos FX,Chen A,Chen B,Chen J,Chen X,Colcombe SJ,Courtney W,Craddock RC,Di Martino A,Dong HM,Fu X,Gong Q,Gorgolews

    更新日期:2014-12-09 00:00:00

  • A multi-species repository of social networks.

    abstract::Social network analysis is an invaluable tool to understand the patterns, evolution, and consequences of sociality. Comparative studies over a range of social systems across multiple taxonomic groups are particularly valuable. Such studies however require quantitative social association or interaction data across mult...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0056-z

    authors: Sah P,Méndez JD,Bansal S

    更新日期:2019-04-29 00:00:00

  • The effect of 16S rRNA region choice on bacterial community metabarcoding results.

    abstract::In this work, we compare the resolution of V2-V3 and V3-V4 16S rRNA regions for the purposes of estimating microbial community diversity using paired-end Illumina MiSeq reads, and show that the fragment, including V2 and V3 regions, has higher resolution for lower-rank taxa (genera and species). It allows for a more p...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2019.7

    authors: Bukin YS,Galachyants YP,Morozov IV,Bukin SV,Zakharenko AS,Zemskaya TI

    更新日期:2019-02-05 00:00:00

  • Comprehensive high-resolution multiple-reaction monitoring mass spectrometry for targeted eicosanoid assays.

    abstract::Eicosanoids comprise a class of bioactive lipids derived from a unique group of essential fatty acids that mediate a variety of important physiological functions. Owing to the structural diversity of these lipids, their analysis in biological samples is often a major challenge. Advancements in mass spectrometric have ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.167

    authors: Sorgi CA,Peti APF,Petta T,Meirelles AFG,Fontanari C,Moraes LAB,Faccioli LH

    更新日期:2018-08-21 00:00:00

  • Construction of the REACHES climate database based on historical documents of China.

    abstract::This paper describes the methodology of an ongoing project of constructing an East Asian climate database REACHES based on Chinese historical documents. The record source is Compendium of Meteorological Records of China in the Last 3000 Years which collects meteorology and climate related records from mainly official ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.288

    authors: Wang PK,Lin KE,Liao YC,Liao HM,Lin YS,Hsu CT,Hsu SM,Wan CW,Lee SY,Fan IC,Tan PH,Ting TT

    更新日期:2018-12-18 00:00:00

  • A data citation roadmap for scholarly data repositories.

    abstract::This article presents a practical roadmap for scholarly data repositories to implement data citation in accordance with the Joint Declaration of Data Citation Principles, a synopsis and harmonization of the recommendations of major science policy bodies. The roadmap was developed by the Repositories Expert Group, as p...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0031-8

    authors: Fenner M,Crosas M,Grethe JS,Kennedy D,Hermjakob H,Rocca-Serra P,Durand G,Berjon R,Karcher S,Martone M,Clark T

    更新日期:2019-04-10 00:00:00

  • Curated compendium of human transcriptional biomarker data.

    abstract::One important use of genome-wide transcriptional profiles is to identify relationships between transcription levels and patient outcomes. These translational insights can guide the development of biomarkers for clinical application. Data from thousands of translational-biomarker studies have been deposited in public r...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.66

    authors: Golightly NP,Bell A,Bischoff AI,Hollingsworth PD,Piccolo SR

    更新日期:2018-04-17 00:00:00

  • The downed and dead wood inventory of forests in the United States.

    abstract::The quantity and condition of downed dead wood (DDW) is emerging as a major factor governing forest ecosystem processes such as carbon cycling, fire behavior, and tree regeneration. Despite this, systematic inventories of DDW are sparse if not absent across major forest biomes. The Forest Inventory and Analysis progra...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.303

    authors: Woodall CW,Monleon VJ,Fraver S,Russell MB,Hatfield MH,Campbell JL,Domke GM

    更新日期:2019-01-08 00:00:00

  • Evaluating FAIR maturity through a scalable, automated, community-governed framework.

    abstract::Transparent evaluations of FAIRness are increasingly required by a wide range of stakeholders, from scientists to publishers, funding agencies and policy makers. We propose a scalable, automatable framework to evaluate digital resources that encompasses measurable indicators, open source tools, and participation guide...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0184-5

    authors: Wilkinson MD,Dumontier M,Sansone SA,Bonino da Silva Santos LO,Prieto M,Batista D,McQuilton P,Kuhn T,Rocca-Serra P,Crosas M,Schultes E

    更新日期:2019-09-20 00:00:00

  • A suite of global accessibility indicators.

    abstract::Good access to resources and opportunities is essential for sustainable development. Improving access, especially in rural areas, requires useful measures of current access to the locations where these resources and opportunities are found. Recent work has developed a global map of travel times to cities with more tha...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0265-5

    authors: Nelson A,Weiss DJ,van Etten J,Cattaneo A,McMenomy TS,Koo J

    更新日期:2019-11-07 00:00:00

  • Enabling precision medicine in neonatology, an integrated repository for preterm birth research.

    abstract::Preterm birth, or the delivery of an infant prior to 37 weeks of gestation, is a significant cause of infant morbidity and mortality. In the last decade, the advent and continued development of molecular profiling technologies has enabled researchers to generate vast amount of 'omics' data, which together with integra...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.219

    authors: Sirota M,Thomas CG,Liu R,Zuhl M,Banerjee P,Wong RJ,Quaintance CC,Leite R,Chubiz J,Anderson R,Chappell J,Kim M,Grobman W,Zhang G,Rokas A,England SK,Parry S,Shaw GM,Simpson JL,Thomson E,Butte AJ,March of Dimes Pre

    更新日期:2018-11-06 00:00:00