Abstract:
:Chest radiography is an extremely powerful imaging modality, allowing for a detailed inspection of a patient's chest, but requires specialized training for proper interpretation. With the advent of high performance general purpose computer vision algorithms, the accurate automated analysis of chest radiographs is becoming increasingly of interest to researchers. Here we describe MIMIC-CXR, a large dataset of 227,835 imaging studies for 65,379 patients presenting to the Beth Israel Deaconess Medical Center Emergency Department between 2011-2016. Each imaging study can contain one or more images, usually a frontal view and a lateral view. A total of 377,110 images are available in the dataset. Studies are made available with a semi-structured free-text radiology report that describes the radiological findings of the images, written by a practicing radiologist contemporaneously during routine clinical care. All images and reports have been de-identified to protect patient privacy. The dataset is made freely available to facilitate and encourage a wide range of research in computer vision, natural language processing, and clinical data mining.
journal_name
Sci Datajournal_title
Scientific dataauthors
Johnson AEW,Pollard TJ,Berkowitz SJ,Greenbaum NR,Lungren MP,Deng CY,Mark RG,Horng Sdoi
10.1038/s41597-019-0322-0subject
Has Abstractpub_date
2019-12-12 00:00:00pages
317issue
1issn
2052-4463pii
10.1038/s41597-019-0322-0journal_volume
6pub_type
杂志文章相关文献
Scientific Data文献大全abstract::Rett syndrome (RTT) is a rare neurological disorder mostly caused by a genetic variation in MECP2. Making new MECP2 variants and the related phenotypes available provides data for better understanding of disease mechanisms and faster identification of variants for diagnosis. This is, however, currently hampered by the...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00794-7
更新日期:2021-01-15 00:00:00
abstract::RAS genes are frequently mutated in cancer and have for decades eluded effective therapeutic attack. The National Cancer Institute's RAS Initiative has a focus on understanding pathways and discovering therapies for RAS-driven cancers. Part of these efforts is the generation of novel reagents to enable the quantificat...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0166-7
更新日期:2019-08-29 00:00:00
abstract::Viruses are highly discriminating in their interactions with host cells and are thought to play a major role in maintaining diversity of environmental microbes. However, large-scale ecological and genomic studies of co-occurring virus-host pairs, required to characterize the mechanistic and genomic foundations of viru...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2018.114
更新日期:2018-07-03 00:00:00
abstract::The human HCC1806 cell line is frequently used as a preclinical model for triple negative breast cancer (TNBC). Given that dysregulated epigenetic mechanisms are involved in cancer pathogenesis, emerging therapeutic strategies target chromatin regulators, such as histone deacetylases. A comprehensive understanding of ...
journal_title:Scientific data
pub_type:
doi:10.1038/sdata.2019.33
更新日期:2019-03-05 00:00:00
abstract::In prokaryotes, protein phosphorylation plays a critical role in regulating a broad spectrum of biological processes and occurs mainly on various amino acids, including serine (S), threonine (T), tyrosine (Y), arginine (R), aspartic acid (D), histidine (H) and cysteine (C) residues of protein substrates. Through liter...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-0506-7
更新日期:2020-05-29 00:00:00
abstract::We introduce the Precipitation Probability DISTribution (PPDIST) dataset, a collection of global high-resolution (0.1°) observation-based climatologies (1979-2018) of the occurrence and peak intensity of precipitation (P) at daily and 3-hourly time-scales. The climatologies were produced using neural networks trained ...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00631-x
更新日期:2020-09-11 00:00:00
abstract::This paper describes the release of the detailed building operation data, including electricity consumption and indoor environmental measurements, of the seven-story 11,700-m2 office building located in Bangkok, Thailand. The electricity consumption data (kW) are that of individual air conditioning units, lighting, an...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00582-3
更新日期:2020-07-20 00:00:00
abstract::Semiconducting inorganic materials with band gaps ranging between 0 and 5 eV constitute major components in electronic, optoelectronic and photovoltaic devices. Since the band gap is a primary material property that affects the device performance, large band-gap databases are useful in selecting optimal materials in e...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00723-8
更新日期:2020-11-11 00:00:00
abstract::Transparent evaluations of FAIRness are increasingly required by a wide range of stakeholders, from scientists to publishers, funding agencies and policy makers. We propose a scalable, automatable framework to evaluate digital resources that encompasses measurable indicators, open source tools, and participation guide...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0184-5
更新日期:2019-09-20 00:00:00
abstract::The use of hydrogen (H2) as a substitute for fossil fuel, which accounts for the majority of the world's energy, is environmentally the most benign option for the reduction of CO2 emissions. This will require gigawatt-scale storage systems and as such, H2 storage in porous rocks in the subsurface will be required. Acc...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-0568-6
更新日期:2020-07-09 00:00:00
abstract::The COVID-19 pandemic has ignited interest in age-specific manifestations of infection but surprisingly little is known about relative severity of infectious disease between the extremes of age. In a systematic analysis we identified 142 datasets with information on severity of disease by age for 32 different infectio...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00668-y
更新日期:2020-10-15 00:00:00
abstract::High-quality and high-throughput sequencing technologies are required for therapeutic and diagnostic analyses of human gut microbiota. Here, we evaluated the advantages and disadvantages of the various commercial sequencing platforms for studying human gut microbiota. We generated fecal bacterial sequences from 170 Ko...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2018.68
更新日期:2018-04-24 00:00:00
abstract::Animal muscles must maintain their function and structure while bearing substantial mechanical loads. How muscles withstand persistent mechanical strain is presently not well understood. Understanding the mechanisms by which tissues maintain their complex architecture is a key goal of cell biology. This dataset repres...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2014.2
更新日期:2014-03-11 00:00:00
abstract::Lysosomes are the main degradative organelles of cells and involved in a variety of processes including the recycling of macromolecules, storage of compounds, and metabolic signaling. Despite an increasing interest in the proteomic analysis of lysosomes, no systematic study of sample preparation protocols for lysosome...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-0399-5
更新日期:2020-02-26 00:00:00
abstract::With the help of the bacteria in the rumen, ruminants can effectively convert human inedible plant fiber to edible food (meat and milk). However, the understanding of rumen bacteriome in dairy cows is still limited, especially in a large population under the same diet, breed, and milking period. Here we described the ...
journal_title:Scientific data
pub_type:
doi:10.1038/sdata.2018.301
更新日期:2019-01-22 00:00:00
abstract::This article presents data derived from a 36 year-long uninterrupted observational study of amphibian populations living in the city and vicinity of Yekaterinburg, Russia. This area is inhabited by six amphibian species. Based on a degree of anthropogenic transformation, the urban territory is divided into five highly...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2015.18
更新日期:2015-05-12 00:00:00
abstract::Future highly renewable energy systems will couple to complex weather and climate dynamics. This coupling is generally not captured in detail by the open models developed in the power and energy system communities, where such open models exist. To enable modeling such a future energy system, we describe a dedicated la...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2017.175
更新日期:2017-11-28 00:00:00
abstract::Induced pluripotent stem cells (iPSCs) and human embryonic stem cells (hESCs) differentiated into hepatocyte-like cells (HLCs) provide a defined and renewable source of cells for drug screening, toxicology and regenerative medicine. We previously reprogrammed human fetal foreskin fibroblast cells (HFF1) into iPSCs emp...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2018.35
更新日期:2018-03-13 00:00:00
abstract::When visual input has conflicting interpretations, conscious perception can alternate spontaneously between these possible interpretations. This is called bistable perception. Previous neuroimaging studies have indicated the involvement of two right parietal areas in resolving perceptual ambiguity (ant-SPLr and post-S...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2016.65
更新日期:2016-08-16 00:00:00
abstract::We provide a detailed description of a gadoteridol-derivatized lysozyme (gadolinium lysozyme) two-colour serial femtosecond crystallography (SFX) dataset for multiple wavelength anomalous dispersion (MAD) structure determination. The data was collected at the Spring-8 Angstrom Compact free-electron LAser (SACLA) facil...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2017.188
更新日期:2017-12-12 00:00:00
abstract::Efficient energy consumption at the building level is vital for sustainability. Providing energy efficient systems and solutions requires an understanding of how energy gets consumed. However, there is a general lack of large-scale open datasets about the energy consumption of buildings, which hinders the research. Th...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2019.15
更新日期:2019-02-19 00:00:00
abstract::In angiogenesis with concurrent inflammation, many pathways are activated, some linked to VEGF and others largely VEGF-independent. Pathways involving inflammatory mediators, chemokines, and micro-RNAs may play important roles in maintaining a pro-angiogenic environment or mediating angiogenic regression. Here, we des...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2016.103
更新日期:2016-11-22 00:00:00
abstract::This article presents a practical roadmap for scholarly data repositories to implement data citation in accordance with the Joint Declaration of Data Citation Principles, a synopsis and harmonization of the recommendations of major science policy bodies. The roadmap was developed by the Repositories Expert Group, as p...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0031-8
更新日期:2019-04-10 00:00:00
abstract::Both poly(A) enrichment and ribosomal RNA depletion are commonly used for RNA sequencing. Either has its advantages and disadvantages that may lead to biases in the downstream analyses. To better access these effects, we carried out both ribosomal RNA-depleted and poly(A)-selected RNA-seq for CD4+ T naive cells isolat...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00719-4
更新日期:2020-11-09 00:00:00
abstract::The experimental dataset presented was collected in an 18 m long and 1 m wide laboratory flume. Low to high flood flows through an urbanized floodplain were modelled. The floodplain bed is rough, modelled with dense artificial grass. A square cylinder array, representing house models, was set on the rough bed. The cyl...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-020-00791-w
更新日期:2021-01-11 00:00:00
abstract::The data record contains Material Intensity data for buildings (MI). MI coefficients are often used for different types of analysis of socio-economic systems and in particular for environmental assessments. Until now, MI values were compiled and reported ad-hoc with few cross-study comparisons. We extracted and conver...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0021-x
更新日期:2019-04-09 00:00:00
abstract::Bats, including African straw-coloured fruit bats (Eidolon helvum), have been highlighted as reservoirs of many recently emerged zoonotic viruses. This common, widespread and ecologically important species was the focus of longitudinal and continent-wide studies of the epidemiological and ecology of Lagos bat virus, h...
journal_title:Scientific data
pub_type: 评论,杂志文章
doi:10.1038/sdata.2016.49
更新日期:2016-08-01 00:00:00
abstract::Efforts to identify meaningful functional imaging-based biomarkers are limited by the ability to reliably characterize inter-individual differences in human brain function. Although a growing number of connectomics-based measures are reported to have moderate to high test-retest reliability, the variability in data ac...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/sdata.2014.49
更新日期:2014-12-09 00:00:00
abstract::While tick-borne zoonoses, such as Lyme disease and tick-borne encephalitis, present an increasing global concern, knowledge of their vectors' distribution remains limited, especially for China. In this paper, we present the first comprehensive dataset of known tick species and their distributions in China, derived fr...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0115-5
更新日期:2019-07-01 00:00:00
abstract::As basic data, the river networks and water resources zones (WRZ) are critical for planning, utilization, development, conservation and management of water resources. Currently, the river network and WRZ of world are most obtained based on digital elevation model data automatically, which are not accuracy enough, espe...
journal_title:Scientific data
pub_type: 杂志文章
doi:10.1038/s41597-019-0243-y
更新日期:2019-10-22 00:00:00