Evaluating FAIR maturity through a scalable, automated, community-governed framework.

Abstract:

Transparent evaluations of FAIRness are increasingly required by a wide range of stakeholders, from scientists to publishers, funding agencies and policy makers. We propose a scalable, automatable framework to evaluate digital resources that encompasses measurable indicators, open source tools, and participation guidelines, which come together to accommodate domain relevant community-defined FAIR assessments. The components of the framework are: (1) Maturity Indicators - community-authored specifications that delimit a specific automatically-measurable FAIR behavior; (2) Compliance Tests - small Web apps that test digital resources against individual Maturity Indicators; and (3) the Evaluator, a Web application that registers, assembles, and applies community-relevant sets of Compliance Tests against a digital resource, and provides a detailed report about what a machine "sees" when it visits that resource. We discuss the technical and social considerations of FAIR assessments, and how this translates to our community-driven infrastructure. We then illustrate how the output of the Evaluator tool can serve as a roadmap to assist data stewards to incrementally and realistically improve the FAIRness of their resources.
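The three components named in the abstract can be pictured with a minimal in-memory sketch. This is only an illustration under stated assumptions, not the authors' implementation: in the framework the Compliance Tests are Web apps and the Evaluator is a Web application, whereas here each test is a plain Python function applied to an already-harvested metadata dictionary. All names (`TestResult`, `has_identifier`, `has_license`, `evaluate`, `summarize`) and the indicator labels are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class TestResult:
    indicator: str  # hypothetical label for the Maturity Indicator being tested
    passed: bool
    comment: str    # a note on what the "machine" saw


# Assumption: each test receives metadata that was already harvested
# from the resource, rather than fetching it over the Web itself.
ComplianceTest = Callable[[Dict[str, str]], TestResult]


def has_identifier(metadata: Dict[str, str]) -> TestResult:
    """Hypothetical test: the identifier is present and looks like a URL."""
    value = metadata.get("identifier", "")
    ok = value.startswith(("http://", "https://"))
    return TestResult("identifier-looks-like-url", ok, f"identifier={value!r}")


def has_license(metadata: Dict[str, str]) -> TestResult:
    """Hypothetical test: a non-empty license field is present."""
    value = metadata.get("license", "")
    return TestResult("license-present", bool(value), f"license={value!r}")


def evaluate(metadata: Dict[str, str],
             tests: List[ComplianceTest]) -> List[TestResult]:
    """Apply a community-chosen set of tests and collect a per-test report."""
    return [test(metadata) for test in tests]


def summarize(report: List[TestResult]) -> str:
    """Condense a report into a pass count."""
    passed = sum(r.passed for r in report)
    return f"{passed}/{len(report)} tests passed"


# Usage: evaluate a record that has an identifier but no license.
metadata = {"identifier": "https://doi.org/10.1038/s41597-019-0184-5"}
report = evaluate(metadata, [has_identifier, has_license])
for result in report:
    print(result.indicator, "PASS" if result.passed else "FAIL", result.comment)
print(summarize(report))
```

Keeping one function per indicator mirrors the abstract's point that each Compliance Test targets an individual Maturity Indicator, so a community can assemble its own set, and the failing entries in the report read as a to-do list for a data steward.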

journal_name

Sci Data

journal_title

Scientific data

authors

Wilkinson MD,Dumontier M,Sansone SA,Bonino da Silva Santos LO,Prieto M,Batista D,McQuilton P,Kuhn T,Rocca-Serra P,Crosas M,Schultes E

doi

10.1038/s41597-019-0184-5

subject

Has Abstract

pub_date

2019-09-20 00:00:00

pages

174

issue

1

issn

2052-4463

pii

10.1038/s41597-019-0184-5

journal_volume

6

pub_type

Journal Article
  • Optical motion capture dataset of selected techniques in beginner and advanced Kyokushin karate athletes.

    abstract::Human motion capture is commonly used in various fields, including sport, to analyze, understand, and synthesize kinematic and kinetic data. Specialized computer vision and marker-based optical motion capture techniques constitute the gold-standard for accurate and robust human motion capture. The dataset presented co...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-021-00801-5

    authors: Szczęsna A,Błaszczyszyn M,Pawlyta M

    update_date:2021-01-18 00:00:00

  • Tracking vegetation phenology across diverse biomes using Version 2.0 of the PhenoCam Dataset.

    abstract::Monitoring vegetation phenology is critical for quantifying climate change impacts on ecosystems. We present an extensive dataset of 1783 site-years of phenological data derived from PhenoCam network imagery from 393 digital cameras, situated from tropics to tundra across a wide range of plant functional types, biomes...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0229-9

    authors: Seyednasrollah B,Young AM,Hufkens K,Milliman T,Friedl MA,Frolking S,Richardson AD

    update_date:2019-10-22 00:00:00

  • A lake data set for the Tibetan Plateau from the 1960s, 2005, and 2014.

    abstract::Long-term datasets of number and size of lakes over the Tibetan Plateau (TP) are among the most critical components for better understanding the interactions among the cryosphere, hydrosphere, and atmosphere at regional and global scales. Due to the harsh environment and the scarcity of data over the TP, data accumula...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2016.39

    authors: Wan W,Long D,Hong Y,Ma Y,Yuan Y,Xiao P,Duan H,Han Z,Gu X

    update_date:2016-06-21 00:00:00

  • Synthetic skull bone defects for automatic patient-specific craniofacial implant design.

    abstract::Patient-specific craniofacial implants are used to repair skull bone defects after trauma or surgery. Currently, cranial implants are designed and produced by third-party suppliers, which is usually time-consuming and expensive. Recent advances in additive manufacturing made the in-hospital or in-operation-room fabric...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-021-00806-0

    authors: Li J,Gsaxner C,Pepe A,Morais A,Alves V,von Campe G,Wallner J,Egger J

    update_date:2021-01-29 00:00:00

  • Daily transcriptomes of the copepod Calanus finmarchicus during the summer solstice at high Arctic latitudes.

    abstract::The zooplankter Calanus finmarchicus is a member of the so-called "Calanus Complex", a group of copepods that constitutes a key element of the Arctic polar marine ecosystem, providing a crucial link between primary production and higher trophic levels. Climate change induces the shift of C. finmarchicus to higher lati...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00751-4

    authors: Payton L,Noirot C,Hoede C,Hüppe L,Last K,Wilcockson D,Ershova EA,Valière S,Meyer B

    update_date:2020-11-24 00:00:00

  • Genotoype-by-sequencing of three geographically distinct populations of Olympia oysters, Ostrea lurida.

    abstract::Olympia oysters are found along the west coast of North America and as the only native oyster species in the region, receive considerable attention with regard to restoration and conservation. Knowledge of genetic structure of this species is essential for resource managers. Here we provide genetic data for three dist...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2017.130

    authors: White SJ,Vadopalas B,Silliman K,Roberts SB

    update_date:2017-09-12 00:00:00

  • Transcriptomic profiling of 39 commonly-used neuroblastoma cell lines.

    abstract::Neuroblastoma cell lines are an important and cost-effective model used to study oncogenic drivers of the disease. While many of these cell lines have been previously characterized with SNP, methylation, and/or mRNA expression microarrays, there has not been an effort to comprehensively sequence these cell lines. Here...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2017.33

    authors: Harenza JL,Diamond MA,Adams RN,Song MM,Davidson HL,Hart LS,Dent MH,Fortina P,Reynolds CP,Maris JM

    update_date:2017-03-28 00:00:00

  • Transcriptome data of temporal and cingulate cortex in the Rett syndrome brain.

    abstract::Rett syndrome is an X-linked neurodevelopmental disorder caused by mutation in the methyl-CpG-binding protein 2 gene (MECP2) in the majority of cases. We describe an RNA sequencing dataset of postmortem brain tissue samples from four females clinically diagnosed with Rett syndrome and four age-matched female donors. T...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-0527-2

    authors: Aldinger KA,Timms AE,MacDonald JW,McNamara HK,Herstein JS,Bammler TK,Evgrafov OV,Knowles JA,Levitt P

    update_date:2020-06-19 00:00:00

  • flEECe, an energy use and occupant behavior dataset for net-zero energy affordable senior residential buildings.

    abstract::The behaviors of building occupants have continued to perplex scholars for years in our attempts to develop models for energy efficient housing. Building simulations, project delivery approaches, policies, and more have fell short of their optimistic goals due to the complexity of human behavior. As a part of a multip...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0275-3

    authors: Paige F,Agee P,Jazizadeh F

    update_date:2019-11-26 00:00:00

  • Spatiotemporal dataset on Chinese population distribution and its driving factors from 1949 to 2013.

    abstract::Spatio-temporal data on human population and its driving factors is critical to understanding and responding to population problems. Unfortunately, such spatio-temporal data on a large scale and over the long term are often difficult to obtain. Here, we present a dataset on Chinese population distribution and its driv...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2016.47

    authors: Wang L,Chen L

    update_date:2016-07-05 00:00:00

  • A global compendium of human Crimean-Congo haemorrhagic fever virus occurrence.

    abstract::In order to map global disease risk, a geographic database of human Crimean-Congo haemorrhagic fever virus (CCHFV) occurrence was produced by surveying peer-reviewed literature and case reports, as well as informal online sources. Here we present this database, comprising occurrence data linked to geographic point or ...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2015.16

    authors: Messina JP,Pigott DM,Duda KA,Brownstein JS,Myers MF,George DB,Hay SI

    update_date:2015-04-14 00:00:00

  • A structured open dataset of government interventions in response to COVID-19.

    abstract::In response to the COVID-19 pandemic, governments have implemented a wide range of non-pharmaceutical interventions (NPIs). Monitoring and documenting government strategies during the COVID-19 crisis is crucial to understand the progression of the epidemic. Following a content analysis strategy of existing public info...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00609-9

    authors: Desvars-Larrive A,Dervic E,Haug N,Niederkrotenthaler T,Chen J,Di Natale A,Lasser J,Gliga DS,Roux A,Sorger J,Chakraborty A,Ten A,Dervic A,Pacheco A,Jurczak A,Cserjan D,Lederhilger D,Bulska D,Berishaj D,Tames EF,Álv

    update_date:2020-08-27 00:00:00

  • I-BLEND, a campus-scale commercial and residential buildings electrical energy dataset.

    abstract::Efficient energy consumption at the building level is vital for sustainability. Providing energy efficient systems and solutions requires an understanding of how energy gets consumed. However, there is a general lack of large-scale open datasets about the energy consumption of buildings, which hinders the research. Th...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2019.15

    authors: Rashid H,Singh P,Singh A

    update_date:2019-02-19 00:00:00

  • Erratum: Genomes and phenomes of a population of outbred rats and its progenitors.

    abstract::[This corrects the article DOI: 10.1038/sdata.2014.11.]. ...

    journal_title:Scientific data

    pub_type: Published Erratum

    doi:10.1038/sdata.2014.16

    authors: Baud A,Guryev V,Hummel O,Johannesson M,Rat Genome Sequencing and Mapping Consortium.,Flint J

    update_date:2014-07-08 00:00:00

  • Highly sampled measurements in a controlled atmosphere at the Biosphere 2 Landscape Evolution Observatory.

    abstract::Land-atmosphere interactions at different temporal and spatial scales are important for our understanding of the Earth system and its modeling. The Landscape Evolution Observatory (LEO) at Biosphere 2, managed by the University of Arizona, hosts three nearly identical artificial bare-soil hillslopes with dimensions of...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00645-5

    authors: Arevalo J,Zeng X,Durcik M,Sibayan M,Pangle L,Abramson N,Bugaj A,Ng WR,Kim M,Barron-Gafford G,van Haren J,Niu GY,Adams J,Ruiz J,Troch PA

    update_date:2020-09-15 00:00:00

  • Viruses of the Nahant Collection, characterization of 251 marine Vibrionaceae viruses.

    abstract::Viruses are highly discriminating in their interactions with host cells and are thought to play a major role in maintaining diversity of environmental microbes. However, large-scale ecological and genomic studies of co-occurring virus-host pairs, required to characterize the mechanistic and genomic foundations of viru...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2018.114

    authors: Kauffman KM,Brown JM,Sharma RS,VanInsberghe D,Elsherbini J,Polz M,Kelly L

    update_date:2018-07-03 00:00:00

  • A comprehensive collection of systems biology data characterizing the host response to viral infection.

    abstract::The Systems Biology for Infectious Diseases Research program was established by the U.S. National Institute of Allergy and Infectious Diseases to investigate host-pathogen interactions at a systems level. This program generated 47 transcriptomic and proteomic datasets from 30 studies that investigate in vivo and in vi...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2014.33

    authors: Aevermann BD,Pickett BE,Kumar S,Klem EB,Agnihothram S,Askovich PS,Bankhead A 3rd,Bolles M,Carter V,Chang J,Clauss TR,Dash P,Diercks AH,Eisfeld AJ,Ellis A,Fan S,Ferris MT,Gralinski LE,Green RR,Gritsenko MA,Hatta M

    update_date:2014-10-14 00:00:00

  • The BenBioDen database, a global database for meio-, macro- and megabenthic biomass and densities.

    abstract::Benthic fauna refers to all fauna that live in or on the seafloor, which researchers typically divide into size classes meiobenthos (32/64 µm-0.5/1 mm), macrobenthos (250 µm-1 cm), and megabenthos (>1 cm). Benthic fauna play important roles in bioturbation activity, mineralization of organic matter, and in marine food...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-0551-2

    authors: Stratmann T,van Oevelen D,Martínez Arbizu P,Wei CL,Liao JX,Cusson M,Scrosati RA,Archambault P,Snelgrove PVR,Ramey-Balci PA,Burd BJ,Kenchington E,Gilkinson K,Belley R,Soetaert K

    update_date:2020-06-29 00:00:00

  • Long-term surveys of age structure in 13 ungulate and one ostrich species in the Serengeti, 1926-2018.

    abstract::The Serengeti ecosystem spans an extensive network of protected areas in Tanzania, eastern Africa, and a UNESCO World Heritage Site. It is home to some of the largest animal migrations on the planet. Here, we describe a dataset consisting of the sample counts of three age classes (infant, juvenile and adult) of 13 ungu...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00701-0

    authors: Rogy P,Sinclair ARE

    update_date:2020-10-21 00:00:00

  • Multi-year whole-blood transcriptome data for the study of onset and progression of Parkinson's Disease.

    abstract::Parkinson's disease (PD) is an age-related, chronic and progressive neurodegenerative disorder characterized by a loss of multifocal neurons, resulting in both non-motor and motor symptoms. While several genetic and environmental contributory risk factors have been identified, more exact methods for diagnosing and ass...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0022-9

    authors: Valentine MNZ,Hashimoto K,Fukuhara T,Saiki S,Ishikawa KI,Hattori N,Carninci P

    update_date:2019-04-05 00:00:00

  • The Coral Trait Database, a curated database of trait information for coral species from the global oceans.

    abstract::Trait-based approaches advance ecological and evolutionary research because traits provide a strong link to an organism's function and fitness. Trait-based research might lead to a deeper understanding of the functions of, and services provided by, ecosystems, thereby improving management, which is vital in the curren...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2016.17

    authors: Madin JS,Anderson KD,Andreasen MH,Bridge TC,Cairns SD,Connolly SR,Darling ES,Diaz M,Falster DS,Franklin EC,Gates RD,Harmer A,Hoogenboom MO,Huang D,Keith SA,Kosnik MA,Kuo CY,Lough JM,Lovelock CE,Luiz O,Martinelli J

    update_date:2016-03-29 00:00:00

  • An accurate registration of the BigBrain dataset with the MNI PD25 and ICBM152 atlases.

    abstract::Brain atlases that encompass detailed anatomical or physiological features are instrumental in the research and surgical planning of various neurological conditions. Magnetic resonance imaging (MRI) has played important roles in neuro-image analysis while histological data remain crucial as a gold standard to guide an...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-019-0217-0

    authors: Xiao Y,Lau JC,Anderson T,DeKraker J,Collins DL,Peters T,Khan AR

    update_date:2019-10-17 00:00:00

  • A kinematic and kinetic dataset of 18 above-knee amputees walking at various speeds.

    abstract::Motion capture is necessary to quantify gait deviations in individuals with lower-limb amputations. However, access to the patient population and the necessary equipment is limited. Here we present the first open biomechanics dataset for 18 individuals with unilateral above-knee amputations walking at different speeds...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-0494-7

    authors: Hood S,Ishmael MK,Gunnell A,Foreman KB,Lenzi T

    update_date:2020-05-21 00:00:00

  • Spatial and temporal analysis of extreme sea level and storm surge events around the coastline of the UK.

    abstract::In this paper we analyse the spatial footprint and temporal clustering of extreme sea level and skew surge events around the UK coast over the last 100 years (1915-2014). The vast majority of the extreme sea level events are generated by moderate, rather than extreme skew surges, combined with spring astronomical high...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2016.107

    authors: Haigh ID,Wadey MP,Wahl T,Ozsoy O,Nicholls RJ,Brown JM,Horsburgh K,Gouldby B

    update_date:2016-12-06 00:00:00

  • Genome-wide identification of accessible chromatin regions in bumblebee by ATAC-seq.

    abstract::Bumblebees (Hymenoptera: Apidae) are important pollinating insects that play pivotal roles in crop production and natural ecosystem services. Although protein-coding genes in bumblebees have been extensively annotated, regulatory sequences of the genome, such as promoters and enhancers, have been poorly annotated. To ...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00713-w

    authors: Zhao X,Su L,Xu W,Schaack S,Sun C

    update_date:2020-10-26 00:00:00

  • Experimental flows through an array of emerged or slightly submerged square cylinders over a rough bed.

    abstract::The experimental dataset presented was collected in an 18 m long and 1 m wide laboratory flume. Low to high flood flows through an urbanized floodplain were modelled. The floodplain bed is rough, modelled with dense artificial grass. A square cylinder array, representing house models, was set on the rough bed. The cyl...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00791-w

    authors: Oukacine M,Proust S,Larrarte F,Goutal N

    update_date:2021-01-11 00:00:00

  • Quantitative mapping of RNA-mediated nuclear estrogen receptor β interactome in human breast cancer cells.

    abstract::The nuclear receptor estrogen receptor 2 (ESR2, ERβ) modulates cancer cell proliferation and tumor growth, exerting an oncosuppressive role in breast cancer (BC). Interaction proteomics by tandem affinity purification coupled to mass spectrometry was previously applied in BC cells to identify proteins acting in concer...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2018.31

    authors: Giurato G,Nassa G,Salvati A,Alexandrova E,Rizzo F,Nyman TA,Weisz A,Tarallo R

    update_date:2018-03-06 00:00:00

  • A band-gap database for semiconducting inorganic materials calculated with hybrid functional.

    abstract::Semiconducting inorganic materials with band gaps ranging between 0 and 5 eV constitute major components in electronic, optoelectronic and photovoltaic devices. Since the band gap is a primary material property that affects the device performance, large band-gap databases are useful in selecting optimal materials in e...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-00723-8

    authors: Kim S,Lee M,Hong C,Yoon Y,An H,Lee D,Jeong W,Yoo D,Kang Y,Youn Y,Han S

    update_date:2020-11-11 00:00:00

  • Tesco Grocery 1.0, a large-scale dataset of grocery purchases in London.

    abstract::We present the Tesco Grocery 1.0 dataset: a record of 420 M food items purchased by 1.6 M fidelity card owners who shopped at the 411 Tesco stores in Greater London over the course of the entire year of 2015, aggregated at the level of census areas to preserve anonymity. For each area, we report the number of transact...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/s41597-020-0397-7

    authors: Aiello LM,Quercia D,Schifanella R,Del Prete L

    update_date:2020-02-18 00:00:00

  • Human pluripotent stem cell derived HLC transcriptome data enables molecular dissection of hepatogenesis.

    abstract::Induced pluripotent stem cells (iPSCs) and human embryonic stem cells (hESCs) differentiated into hepatocyte-like cells (HLCs) provide a defined and renewable source of cells for drug screening, toxicology and regenerative medicine. We previously reprogrammed human fetal foreskin fibroblast cells (HFF1) into iPSCs emp...

    journal_title:Scientific data

    pub_type: Journal Article

    doi:10.1038/sdata.2018.35

    authors: Wruck W,Adjaye J

    update_date:2018-03-13 00:00:00