A catalogue of 863 Rett-syndrome-causing MECP2 mutations and lessons learned from data integration.

Abstract:

:Rett syndrome (RTT) is a rare neurological disorder mostly caused by a genetic variation in MECP2. Making new MECP2 variants and the related phenotypes available provides data for better understanding of disease mechanisms and faster identification of variants for diagnosis. This is, however, currently hampered by the lack of interoperability between genotype-phenotype databases. Here, we demonstrate on the example of MECP2 in RTT that by making the genotype-phenotype data more Findable, Accessible, Interoperable, and Reusable (FAIR), we can facilitate prioritization and analysis of variants. In total, 10,968 MECP2 variants were successfully integrated. Among these variants 863 unique confirmed RTT causing and 209 unique confirmed benign variants were found. This dataset was used for comparison of pathogenicity predicting tools, protein consequences, and identification of ambiguous variants. Prediction tools generally recognised the RTT causing and benign variants, however, there was a broad range of overlap Nineteen variants were identified that were annotated as both disease-causing and benign, suggesting that there are additional factors in these cases contributing to disease development.

journal_name

Sci Data

journal_title

Scientific data

authors

Ehrhart F,Jacobsen A,Rigau M,Bosio M,Kaliyaperumal R,Laros JFJ,Willighagen EL,Valencia A,Roos M,Capella-Gutierrez S,Curfs LMG,Evelo CT

doi

10.1038/s41597-020-00794-7

subject

Has Abstract

pub_date

2021-01-15 00:00:00

pages

10

issue

1

issn

2052-4463

pii

10.1038/s41597-020-00794-7

journal_volume

8

pub_type

杂志文章
  • Daily transcriptomes of the copepod Calanus finmarchicus during the summer solstice at high Arctic latitudes.

    abstract::The zooplankter Calanus finmarchicus is a member of the so-called "Calanus Complex", a group of copepods that constitutes a key element of the Arctic polar marine ecosystem, providing a crucial link between primary production and higher trophic levels. Climate change induces the shift of C. finmarchicus to higher lati...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00751-4

    authors: Payton L,Noirot C,Hoede C,Hüppe L,Last K,Wilcockson D,Ershova EA,Valière S,Meyer B

    更新日期:2020-11-24 00:00:00

  • De novo transcriptomes of 14 gammarid individuals for proteogenomic analysis of seven taxonomic groups.

    abstract::Gammarids are amphipods found worldwide distributed in fresh and marine waters. They play an important role in aquatic ecosystems and are well established sentinel species in ecotoxicology. In this study, we sequenced the transcriptomes of a male individual and a female individual for seven different taxonomic groups ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0192-5

    authors: Cogne Y,Degli-Esposti D,Pible O,Gouveia D,François A,Bouchez O,Eché C,Ford A,Geffard O,Armengaud J,Chaumot A,Almunia C

    更新日期:2019-09-27 00:00:00

  • Genome-wide identification of accessible chromatin regions in bumblebee by ATAC-seq.

    abstract::Bumblebees (Hymenoptera: Apidae) are important pollinating insects that play pivotal roles in crop production and natural ecosystem services. Although protein-coding genes in bumblebees have been extensively annotated, regulatory sequences of the genome, such as promoters and enhancers, have been poorly annotated. To ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00713-w

    authors: Zhao X,Su L,Xu W,Schaack S,Sun C

    更新日期:2020-10-26 00:00:00

  • Publisher Correction: The Scales Project, a cross-national dataset on the interpretation of thermal perception scales.

    abstract::An amendment to this paper has been published and can be accessed via a link at the top of the paper. ...

    journal_title:Scientific data

    pub_type: 杂志文章,已发布勘误

    doi:10.1038/s41597-019-0348-3

    authors: Schweiker M,Abdul-Zahra A,André M,Al-Atrash F,Al-Khatri H,Alprianti RR,Alsaad H,Amin R,Ampatzi E,Arsano AY,Azadeh M,Azar E,Bahareh B,Batagarawa A,Becker S,Buonocore C,Cao B,Choi JH,Chun C,Daanen H,Damiati SA,Dan

    更新日期:2020-01-06 00:00:00

  • Unbalanced historical phenotypic data from seed regeneration of a barley ex situ collection.

    abstract::The scarce knowledge on phenotypic characterization restricts the usage of genetic diversity of plant genetic resources in research and breeding. We describe original and ready-to-use processed data for approximately 60% of ~22,000 barley accessions hosted at the Federal ex situ Genebank for Agricultural and Horticult...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.278

    authors: Gonzalez MY,Weise S,Zhao Y,Philipp N,Arend D,Börner A,Oppermann M,Graner A,Reif JC,Schulthess AW

    更新日期:2018-12-04 00:00:00

  • Age-related dataset on the mechanical properties and collagen fibril structure of tendons from a murine model.

    abstract::Connective tissues such as tendon, ligament and skin are biological fibre composites comprising collagen fibrils reinforcing the weak proteoglycan-rich ground substance in extracellular matrix (ECM). One of the hallmarks of ageing of connective tissues is the progressive and irreversible change in the tissue mechanica...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.140

    authors: Goh KL,Holmes DF,Lu YH,Kadler KE,Purslow PP

    更新日期:2018-07-24 00:00:00

  • Two-colour serial femtosecond crystallography dataset from gadoteridol-derivatized lysozyme for MAD phasing.

    abstract::We provide a detailed description of a gadoteridol-derivatized lysozyme (gadolinium lysozyme) two-colour serial femtosecond crystallography (SFX) dataset for multiple wavelength anomalous dispersion (MAD) structure determination. The data was collected at the Spring-8 Angstrom Compact free-electron LAser (SACLA) facil...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.188

    authors: Gorel A,Motomura K,Fukuzawa H,Doak RB,Grünbein ML,Hilpert M,Inoue I,Kloos M,Nass Kovács G,Nango E,Nass K,Roome CM,Shoeman RL,Tanaka R,Tono K,Foucar L,Joti Y,Yabashi M,Iwata S,Ueda K,Barends TRM,Schlichting I

    更新日期:2017-12-12 00:00:00

  • Transcriptome dataset of human corneal endothelium based on ribosomal RNA-depleted RNA-Seq data.

    abstract::The corneal endothelium maintains corneal transparency; consequently, damage to this endothelium by a number of pathological conditions results in severe vision loss. Publicly available expression databases of human tissues are useful for investigating the pathogenesis of diseases and for developing new therapeutic mo...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00754-1

    authors: Tokuda Y,Okumura N,Komori Y,Hanada N,Tashiro K,Koizumi N,Nakano M

    更新日期:2020-11-20 00:00:00

  • Construction, complete sequence, and annotation of a BAC contig covering the silkworm chorion locus.

    abstract::The silkmoth chorion was studied extensively by F.C. Kafatos' group for almost 40 years. However, the complete structure of the chorion locus was not obtained in the genome sequence of Bombyx mori published in 2008 due to repetitive sequences, resulting in gaps and an incomplete view of the locus. To obtain the comple...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.62

    authors: Chen Z,Nohata J,Guo H,Li S,Liu J,Guo Y,Yamamoto K,Kadono-Okuda K,Liu C,Arunkumar KP,Nagaraju J,Zhang Y,Liu S,Labropoulou V,Swevers L,Tsitoura P,Iatrou K,Gopinathan KP,Goldsmith MR,Xia Q,Mita K

    更新日期:2015-11-10 00:00:00

  • VLUIS, a land use data product for Victoria, Australia, covering 2006 to 2013.

    abstract::Land Use Information is a key dataset required to enable an understanding of the changing nature of our landscapes and the associated influences on natural resources and regional communities. The Victorian Land Use Information System (VLUIS) data product has been created within the State Government of Victoria to supp...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.70

    authors: Morse-McNabb E,Sheffield K,Clark R,Lewis H,Robson S,Cherry D,Williams S

    更新日期:2015-11-24 00:00:00

  • A database of human gait performance on irregular and uneven surfaces collected by wearable sensors.

    abstract::Gait analysis has traditionally relied on laborious and lab-based methods. Data from wearable sensors, such as Inertial Measurement Units (IMU), can be analyzed with machine learning to perform gait analysis in real-world environments. This database provides data from thirty participants (fifteen males and fifteen fem...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-0563-y

    authors: Luo Y,Coppola SM,Dixon PC,Li S,Dennerlein JT,Hu B

    更新日期:2020-07-08 00:00:00

  • The Centennial Trends Greater Horn of Africa precipitation dataset.

    abstract::East Africa is a drought prone, food and water insecure region with a highly variable climate. This complexity makes rainfall estimation challenging, and this challenge is compounded by low rain gauge densities and inhomogeneous monitoring networks. The dearth of observations is particularly problematic over the past ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.50

    authors: Funk C,Nicholson SE,Landsfeld M,Klotter D,Peterson P,Harrison L

    更新日期:2015-09-29 00:00:00

  • Long-term observation of amphibian populations inhabiting urban and forested areas in Yekaterinburg, Russia.

    abstract::This article presents data derived from a 36 year-long uninterrupted observational study of amphibian populations living in the city and vicinity of Yekaterinburg, Russia. This area is inhabited by six amphibian species. Based on a degree of anthropogenic transformation, the urban territory is divided into five highly...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.18

    authors: Vershinin VL,Vershinina SD,Berzin DL,Zmeeva DV,Kinev AV

    更新日期:2015-05-12 00:00:00

  • HYSOGs250m, global gridded hydrologic soil groups for curve-number-based runoff modeling.

    abstract::Hydrologic soil groups (HSGs) are a fundamental component of the USDA curve-number (CN) method for estimation of rainfall runoff; yet these data are not readily available in a format or spatial-resolution suitable for regional- and global-scale modeling applications. We developed a globally consistent, gridded dataset...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.91

    authors: Ross CW,Prihodko L,Anchang J,Kumar S,Ji W,Hanan NP

    更新日期:2018-05-15 00:00:00

  • Comprehensive draft of the mouse embryonic fibroblast lysosomal proteome by mass spectrometry based proteomics.

    abstract::Lysosomes are the main degradative organelles of cells and involved in a variety of processes including the recycling of macromolecules, storage of compounds, and metabolic signaling. Despite an increasing interest in the proteomic analysis of lysosomes, no systematic study of sample preparation protocols for lysosome...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-0399-5

    authors: Ponnaiyan S,Akter F,Singh J,Winter D

    更新日期:2020-02-26 00:00:00

  • Flow cytometry analysis of adrenoceptors expression in human adipose-derived mesenchymal stem/stromal cells.

    abstract::Mesenchymal stem/stromal cells (MSCs) were identified in most tissues of an adult organism. MSCs mediate physiological renewal, as well as regulation of tissue homeostasis, reparation and regeneration. Functions of MSCs are regulated by endocrine and neuronal signals, and noradrenaline is one of the most important MSC...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.196

    authors: Tyurin-Kuzmin PA,Dyikanov DT,Fadeeva JI,Sysoeva VY,Kalinina NI

    更新日期:2018-10-02 00:00:00

  • A statistical atlas of cerebral arteries generated using multi-center MRA datasets from healthy subjects.

    abstract::Magnetic resonance angiography (MRA) can capture the variation of cerebral arteries with high spatial resolution. These measurements include valuable information about the morphology, geometry, and density of brain arteries, which may be useful to identify risk factors for cerebrovascular and neurological diseases at ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0034-5

    authors: Mouches P,Forkert ND

    更新日期:2019-04-11 00:00:00

  • A test-retest dataset for assessing long-term reliability of brain morphology and resting-state brain activity.

    abstract::We present a test-retest dataset for evaluation of long-term reliability of measures from structural and resting-state functional magnetic resonance imaging (sMRI and rfMRI) scans. The repeated scan dataset was collected from 61 healthy adults in two sessions using highly similar imaging parameters at an interval of 1...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2016.16

    authors: Huang L,Huang T,Zhen Z,Liu J

    更新日期:2016-03-15 00:00:00

  • An archive of longitudinal recordings of the vocalizations of adult Gombe chimpanzees.

    abstract::Studies of chimpanzee vocal communication provide valuable insights into the evolution of communication in complex societies, and also comparative data for understanding the evolution of human language. One particularly valuable dataset of recordings from free-living chimpanzees was collected by Frans X. Plooij and th...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.27

    authors: Plooij FX,van de Rijt-Plooij H,Fischer M,Wilson ML,Pusey A

    更新日期:2015-05-26 00:00:00

  • A structured open dataset of government interventions in response to COVID-19.

    abstract::In response to the COVID-19 pandemic, governments have implemented a wide range of non-pharmaceutical interventions (NPIs). Monitoring and documenting government strategies during the COVID-19 crisis is crucial to understand the progression of the epidemic. Following a content analysis strategy of existing public info...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00609-9

    authors: Desvars-Larrive A,Dervic E,Haug N,Niederkrotenthaler T,Chen J,Di Natale A,Lasser J,Gliga DS,Roux A,Sorger J,Chakraborty A,Ten A,Dervic A,Pacheco A,Jurczak A,Cserjan D,Lederhilger D,Bulska D,Berishaj D,Tames EF,Álv

    更新日期:2020-08-27 00:00:00

  • Data for training and testing radiation detection algorithms in an urban environment.

    abstract::The detection, identification, and localization of illicit nuclear materials in urban environments is of utmost importance for national security. Most often, the process of performing these operations consists of a team of trained individuals equipped with radiation detection devices that have built-in algorithms to a...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00672-2

    authors: Ghawaly JM Jr,Nicholson AD,Peplow DE,Anderson-Cook CM,Myers KL,Archer DE,Willis MJ,Quiter BJ

    更新日期:2020-10-05 00:00:00

  • A dataset of distribution and diversity of ticks in China.

    abstract::While tick-borne zoonoses, such as Lyme disease and tick-borne encephalitis, present an increasing global concern, knowledge of their vectors' distribution remains limited, especially for China. In this paper, we present the first comprehensive dataset of known tick species and their distributions in China, derived fr...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0115-5

    authors: Zhang G,Zheng D,Tian Y,Li S

    更新日期:2019-07-01 00:00:00

  • Machine learning for the detection of early immunological markers as predictors of multi-organ dysfunction.

    abstract::The immune response to major trauma has been analysed mainly within post-hospital admission settings where the inflammatory response is already underway and the early drivers of clinical outcome cannot be readily determined. Thus, there is a need to better understand the immediate immune response to injury and how thi...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0337-6

    authors: Bravo-Merodio L,Acharjee A,Hazeldine J,Bentley C,Foster M,Gkoutos GV,Lord JM

    更新日期:2019-12-19 00:00:00

  • Publisher Correction: Tracking vegetation phenology across diverse biomes using Version 2.0 of the PhenoCam Dataset.

    abstract::An amendment to this paper has been published and can be accessed via a link at the top of the paper. ...

    journal_title:Scientific data

    pub_type: 杂志文章,已发布勘误

    doi:10.1038/s41597-019-0270-8

    authors: Seyednasrollah B,Young AM,Hufkens K,Milliman T,Friedl MA,Frolking S,Richardson AD

    更新日期:2019-11-01 00:00:00

  • A systematic review and meta-analysis of seroprevalence surveys of ebolavirus infection.

    abstract::Asymptomatic ebolavirus infection could greatly influence transmission dynamics, but there is little consensus on how frequently it occurs or even if it exists. This paper summarises the available evidence on seroprevalence of Ebola, Sudan and Bundibugyo virus IgG in people without known ebolavirus disease. Through sy...

    journal_title:Scientific data

    pub_type: 杂志文章,meta分析,评审

    doi:10.1038/sdata.2016.133

    authors: Bower H,Glynn JR

    更新日期:2017-01-31 00:00:00

  • Genotoype-by-sequencing of three geographically distinct populations of Olympia oysters, Ostrea lurida.

    abstract::Olympia oysters are found along the west coast of North America and as the only native oyster species in the region, receive considerable attention with regard to restoration and conservation. Knowledge of genetic structure of this species is essential for resource managers. Here we provide genetic data for three dist...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.130

    authors: White SJ,Vadopalas B,Silliman K,Roberts SB

    更新日期:2017-09-12 00:00:00

  • Linking in silico MS/MS spectra with chemistry data to improve identification of unknowns.

    abstract::Confident identification of unknown chemicals in high resolution mass spectrometry (HRMS) screening studies requires cohesive workflows and complementary data, tools, and software. Chemistry databases, screening libraries, and chemical metadata have become fixtures in identification workflows. To increase confidence i...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0145-z

    authors: McEachran AD,Balabin I,Cathey T,Transue TR,Al-Ghoul H,Grulke C,Sobus JR,Williams AJ

    更新日期:2019-08-02 00:00:00

  • A multi-omics digital research object for the genetics of sleep regulation.

    abstract::With the aim to uncover the molecular pathways underlying the regulation of sleep, we recently assembled an extensive and comprehensive systems genetics dataset interrogating a genetic reference population of mice at the levels of the genome, the brain and liver transcriptomes, the plasma metabolome, and the sleep-wak...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0171-x

    authors: Jan M,Gobet N,Diessler S,Franken P,Xenarios I

    更新日期:2019-10-31 00:00:00

  • Systematic analysis of infectious disease outcomes by age shows lowest severity in school-age children.

    abstract::The COVID-19 pandemic has ignited interest in age-specific manifestations of infection but surprisingly little is known about relative severity of infectious disease between the extremes of age. In a systematic analysis we identified 142 datasets with information on severity of disease by age for 32 different infectio...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00668-y

    authors: Glynn JR,Moss PAH

    更新日期:2020-10-15 00:00:00

  • Multi-omics profile of the mouse dentate gyrus after kainic acid-induced status epilepticus.

    abstract::Temporal lobe epilepsy (TLE) can develop from alterations in hippocampal structure and circuit characteristics, and can be modeled in mice by administration of kainic acid (KA). Adult neurogenesis in the dentate gyrus (DG) contributes to hippocampal functions and has been reported to contribute to the development of T...

    journal_title:Scientific data

    pub_type: 评论,杂志文章

    doi:10.1038/sdata.2016.68

    authors: Schouten M,Bielefeld P,Fratantoni SA,Hubens CJ,Piersma SR,Pham TV,Voskuyl RA,Lucassen PJ,Jimenez CR,Fitzsimons CP

    更新日期:2016-08-16 00:00:00