Machine learning method using position-specific mutation based classification outperforms one hot coding for disease severity prediction in haemophilia 'A'.

Abstract:

:Haemophilia is an X-linked genetic disorder in which A and B types are the most common that occur due to absence or lack of protein factors VIII and IX, respectively. Severity of the disease depends on mutation. Available Machine Learning (ML) methods that predict the mutational severity by using traditional encoding approaches, generally have high time complexity and compromised accuracy. In this study, Haemophilia 'A' patient mutation dataset containing 7784 mutations was processed by the proposed Position-Specific Mutation (PSM) and One-Hot Encoding (OHE) technique to predict the disease severity. The dataset processed by PSM and OHE methods was analyzed and trained for classification of mutation severity level using various ML algorithms. Surprisingly, PSM outperformed OHE, both in terms of time efficiency and accuracy, with training and prediction time improvement in the range of approximately 91 to 98% and 80 to 99% respectively. The severity prediction accuracy also improved by using PSM with different ML algorithms.

journal_name

Genomics

journal_title

Genomics

authors

Singh VK,Maurya NS,Mani A,Yadav RS

doi

10.1016/j.ygeno.2020.09.020

subject

Has Abstract

pub_date

2020-11-01 00:00:00

pages

5122-5128

issue

6

eissn

0888-7543

issn

1089-8646

pii

S0888-7543(20)30819-3

journal_volume

112

pub_type

杂志文章

相关文献

GENOMICS文献大全
  • Molecular cloning and chromosomal localization of the human alpha 7-nicotinic receptor subunit gene (CHRNA7).

    abstract::We have isolated cDNA and genomic clones coding for the human alpha 7 neuronal nicotinic receptor subunit, the major component of brain nicotinic receptors that are blocked by alpha-bungarotoxin. The human alpha 7 neuronal nicotinic cDNA encodes a mature protein of 479 amino acids that is highly homologous to the rat ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1994.1075

    authors: Chini B,Raimond E,Elgoyhen AB,Moralli D,Balzaretti M,Heinemann S

    更新日期:1994-01-15 00:00:00

  • Cloning, structural organization, and chromosomal mapping of the human phenol sulfotransferase STP2 gene.

    abstract::Phenol- and monoamine-metabolizing sulfotransferases (STP and STM, respectively) are members of a superfamily of enzymes that add sulfate to a variety of xenobiotics and endobiotics containing hydroxyl or amino functional groups. To characterize related sulfotransferase genes further, we used extra-long PCR (XL-PCR) t...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1996.4575

    authors: Gaedigk A,Beatty BG,Grant DM

    更新日期:1997-03-01 00:00:00

  • Physical mapping by PFGE localizes the COL3A1 and COL5A2 genes to a 35-kb region on human chromosome 2.

    abstract::The genes encoding the alpha 1 chain of Type III collagen (COL3A1) and the alpha 2 chain of Type V (COL5A2) collagen have been mapped to the long arm of human chromosome 2. Linkage analysis in CEPH families indicated that these two genes are close to each other, with no recombination in 37 informative meioses. In the ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(90)90302-b

    authors: Cutting GR,McGinniss MJ,Kasch LM,Tsipouras P,Antonarakis SE

    更新日期:1990-10-01 00:00:00

  • Genome-wide analysis of AP2/ERF transcription factors in pineapple reveals functional divergence during flowering induction mediated by ethylene and floral organ development.

    abstract::The APETALA2/ethylene-responsive factor (AP2/ERF) has important roles in regulating developmental processes and hormone signaling transduction in plants. Pineapple demonstrates a special sensitivity to ethylene, and AP2/ERFs may contribute to this distinct sensitivity of pineapples to ethylene. However, little informa...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2020.10.040

    authors: Zhang H,Pan X,Liu S,Lin W,Li Y,Zhang X

    更新日期:2021-01-20 00:00:00

  • A class III myosin expressed in the retina is a potential candidate for Bardet-Biedl syndrome.

    abstract::Class III myosins are actin-based motors with amino-terminal kinase domains. Expression of these motors is highly enhanced in retinal photoreceptors. As mutations in the gene encoding NINAC, a Drosophila melanogaster class III myosin, cause retinal degeneration, human homologs of this gene are potential candidates for...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.2002.6749

    authors: Dosé AC,Burnside B

    更新日期:2002-05-01 00:00:00

  • Identification and functional characterization of methyl-CpG binding domain protein from Tribolium castaneum.

    abstract::Methyl-CpG binding domain proteins (MBD) can specifically bind to methylated CpG sites and play important roles in epigenetic gene regulation. Here, we identified and functionally characterized the MBD protein in Tribolium castaneum. T. castaneum genome encodes only one MBD protein: TcMBD2/3. RNA interference targetin...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2019.12.018

    authors: Song X,Zhang Y,Zhong Q,Zhan K,Bi J,Tang J,Xie J,Li B

    更新日期:2020-05-01 00:00:00

  • Isolation of a human chromosome 14-only somatic cell hybrid: analysis using Alu and LINE-based PCR.

    abstract::Interspecific somatic cell hybrids containing single human chromosomes are valuable reagents for localization of cloned genes and DNA fragments to specific chromosomes, for the development of chromosome-specific libraries, and for generation of hybrid cell lines containing subchromosomal regions. A CHO somatic cell hy...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(91)90122-u

    authors: Mares A Jr,Ledbetter SA,Ledbetter DH,Roberts R,Hejtmancik JF

    更新日期:1991-09-01 00:00:00

  • Comparative analysis of neurological disorders focuses genome-wide search for autism genes.

    abstract::The behaviors of autism overlap with a diverse array of other neurological disorders, suggesting common molecular mechanisms. We conducted a large comparative analysis of the network of genes linked to autism with those of 432 other neurological diseases to circumscribe a multi-disorder subcomponent of autism. We leve...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2008.09.015

    authors: Wall DP,Esteban FJ,Deluca TF,Huyck M,Monaghan T,Velez de Mendizabal N,Goñí J,Kohane IS

    更新日期:2009-02-01 00:00:00

  • Genomic organization and genetic mapping of the neuroimmune gene I2rf5 to mouse chromosome 4.

    abstract::The nervous and immune systems share many functional and molecular similarities, including shared surface antigens, secretions of soluble factors, and cross-modulatory effects. We have identified previously a novel mRNA termed F5, which is expressed only in activated T lymphocytes and mature, postmitotic neurons. Tiss...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(95)80137-b

    authors: Autieri MV,Kozak CA,Cohen JA,Prystowsky MB

    更新日期:1995-01-01 00:00:00

  • Sequence analysis of 139 kb in Xp22.1 containing spermine synthase and the 5' region of PEX.

    abstract::Human Xp22.1 contains genes involved in mineral balance that are implicated in X-linked hypophosphatemia (XLH) in humans, its murine homologue (Hyp), and another distinct murine hypophosphatemic disorder (Gy). In XLH, a gene, PEX, has been found to be mutated in up to 83% of patients but the sequences of the promoter ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1997.4876

    authors: Grieff M,Whyte MP,Thakker RV,Mazzarella R

    更新日期:1997-09-01 00:00:00

  • Blind analysis of denaturing high-performance liquid chromatography as a tool for mutation detection.

    abstract::Denaturing high-performance liquid chromatography (DHPLC) is a novel high-capacity technique for detecting new mutations. We have evaluated the sensitivity and specificity of this method in a blind analysis of exon H of the factor IX gene and exon 16 of the neurofibromatosis type 1 gene. Under a single set of conditio...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1998.5411

    authors: O'Donovan MC,Oefner PJ,Roberts SC,Austin J,Hoogendoorn B,Guy C,Speight G,Upadhyaya M,Sommer SS,McGuffin P

    更新日期:1998-08-15 00:00:00

  • Analysis of expressed sequence tags from a fetal human heart cDNA library.

    abstract::Single-pass sequencing of randomly selected cDNA clones to generate expressed sequence tags (ESTs) has been widely used to identify novel genes and to study gene expression in a variety of tissues. We have generated 2244 ESTs from a human fetal heart library (GenBank Accession Nos. R30692-30774 and R56965-58824), whic...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1995.9874

    authors: Hwang DM,Fung YW,Wang RX,Laurenssen CM,Ng SH,Lam WY,Tsui KW,Fung KP,Waye M,Lee CY

    更新日期:1995-11-20 00:00:00

  • The gene for the muscle-specific enolase is on the short arm of human chromosome 17.

    abstract::The human gene encoding the muscle-specific beta-enolase has been isolated. The beta-enolase gene was mapped to chromosome 17 by analysis of a panel of rodent-human somatic cell hybrids. The gene was further localized to the short arm and tentatively to the region 17pter-p11 by analysis of cell hybrids and transfectan...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(90)90467-9

    authors: Feo S,Oliva D,Barbieri G,Xu WM,Fried M,Giallongo A

    更新日期:1990-01-01 00:00:00

  • Genome scanning of human breast carcinomas using micro- and minisatellite core probes.

    abstract::We have analyzed tumor and lymphocyte DNA from six breast cancer patients by one- and two-dimensional DNA fingerprinting using micro- and minisatellite core probes to estimate the extent and nature of DNA alterations in tumors. Both approaches were compared regarding sensitivity in genome analysis. We find that the nu...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1993.1284

    authors: Hovig E,Mullaart E,Børresen AL,Uitterlinden AG,Vijg J

    更新日期:1993-07-01 00:00:00

  • Genomic features and copper biosorption potential of a new Alcanivorax sp. VBW004 isolated from the shallow hydrothermal vent (Azores, Portugal).

    abstract::A new Alcanivorax sp. VBW004 was isolated from a shallow hydrothermal vent in Azores Island, Portugal. In this study, we determined VBW004 was resistant to copper. This strain showed maximum tolerance of copper concentrations up to 600 μg/mL. Based on 16S rRNA gene sequencing and phylogeny revealed that this strain wa...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2020.06.015

    authors: Ramasamy KP,Rajasabapathy R,Lips I,Mohandass C,James RA

    更新日期:2020-09-01 00:00:00

  • Refined mapping of the Usher syndrome type III locus on chromosome 3, exclusion of candidate genes, and identification of the putative mouse homologous region.

    abstract::A locus for Usher syndrome type III (USH3; MIM No. 276902) was recently assigned to a 5-cM region on chromosome 3q. We constructed a yeast artificial chromosome contig that allowed us to position novel polymorphisms in the region. These were typed in a total of 32 pedigrees from a geographically isolated Finnish found...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1996.0626

    authors: Joensuu T,Blanco G,Pakarinen L,Sistonen P,Kääriäinen H,Brown S,Chapelle A,Sankila EM

    更新日期:1996-12-15 00:00:00

  • Maturational arrest of thymocyte development is caused by a deletion in the receptor-like protein tyrosine phosphatase kappa gene in LEC rats.

    abstract::The Long-Evans Cinnamon (LEC) rat has a spontaneous mutation, T helper immunodeficiency (thid), which causes a markedly reduced CD4(+) thymocyte population. Here we positionally clone the locus and identify a deletion in the gene encoding a receptor-like protein tyrosine phosphatase kappa (Ptprk) that led to complete ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2007.03.001

    authors: Kose H,Sakai T,Tsukumo S,Wei K,Yamada T,Yasutomo K,Matsumoto K

    更新日期:2007-06-01 00:00:00

  • The NACP/synuclein gene: chromosomal assignment and screening for alterations in Alzheimer disease.

    abstract::The major component of the vascular and plaque amyloid deposits in Alzheimer disease is the amyloid beta peptide (A beta). A second intrinsic component of amyloid, the NAC (non-A beta component of amyloid) peptide, has recently been identified, and its precursor protein was named NACP. A computer homology search allow...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(95)80208-4

    authors: Campion D,Martin C,Heilig R,Charbonnier F,Moreau V,Flaman JM,Petit JL,Hannequin D,Brice A,Frebourg T

    更新日期:1995-03-20 00:00:00

  • PCR amplification of chromosome-specific DNA isolated from flow cytometry-sorted chromosomes.

    abstract::We have established a method for amplifying and obtaining large quantities of chromosome-specific DNA by linker/adaptor ligation and polymerase chain reaction (PCR). Small quantities of DNA isolated from flow cytometry-sorted chromosomes 17 and 21 were digested with MboI, ligated to a linker/adaptor, and then subjecte...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(92)90378-6

    authors: Chang KS,Vyas RC,Deaven LL,Trujillo JM,Stass SA,Hittelman WN

    更新日期:1992-02-01 00:00:00

  • A human homologue of the Drosophila polarity gene frizzled has been identified and mapped to 17q21.1.

    abstract::The frizzled (fz) locus in Drosophila is required for the transmission of polarity signals across the plasma membrane in epidermal cells, as well as to their neighboring cells in the developing wing. The identification of a tissue polarity gene from the fz locus in Drosophila melanogaster has been reported. The fz gen...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1995.1060

    authors: Zhao Z,Lee CC,Baldini A,Caskey CT

    更新日期:1995-05-20 00:00:00

  • Molecular cloning, localization, and developmental expression of mouse brain finger protein (Bfp)/ZNF179: distribution of bfp mRNA partially coincides with the affected areas of Smith-Magenis syndrome.

    abstract::Bfp (brain finger protein) is a member of the RING finger protein family, which is highly expressed in the brain. We have previously shown that one copy of the human bfp gene, mapped at 17p11.2, was actually deleted in six of six Smith-Magenis syndrome (SMS) patients. Now we have isolated the mouse bfp cDNA. Using in ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1998.5541

    authors: Orimo A,Inoue S,Ikeda K,Sato M,Kato A,Tominaga N,Suzuki M,Noda T,Watanabe M,Muramatsu M

    更新日期:1998-11-15 00:00:00

  • The gene for murine CTP:phosphocholine cytidylyltransferase (Ctpct) is located on mouse chromosome 16.

    abstract::CTP:phosphocholine cytidylyltransferase is the rate-controlling enzyme in phosphatidylcholine biosynthesis and is essential for the survival of eukaryotic cells. The murine cDNA for the cytidylyltransferase was cloned and sequenced. A genomic clone was isolated and the chromosomal location of the Ctpct locus determine...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/s0888-7543(05)80377-5

    authors: Rutherford MS,Rock CO,Jenkins NA,Gilbert DJ,Tessner TG,Copeland NG,Jackowski S

    更新日期:1993-12-01 00:00:00

  • Identification and characterization of long non-coding RNA in prenatal and postnatal skeletal muscle of sheep.

    abstract::lncRNAs are a class of transcriptional RNA molecules of >200 nucleotides in length. However, the overall expression pattern and function of lncRNAs in sheep muscle is not clear. Here, we identified 1566 lncRNAs and 404 differentially expressed lncRNAs in sheep muscle from prenatal (110 days of fetus) and postnatal (2 ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2018.01.009

    authors: Li CY,Li X,Liu Z,Ni W,Zhang X,Hazi W,Ma Q,Zhang Y,Cao Y,Qi J,Yao Y,Feng L,Wang D,Hou X,Yu S,Liu L,Zhang M,Hu S

    更新日期:2019-03-01 00:00:00

  • A YAC contig joining the desmocollin and desmoglein loci on human chromosome 18 and ordering of the desmocollin genes.

    abstract::The desmocollins and desmogleins are members of the cadherin family of adhesive proteins present in the desmosome type of cell-cell junction. All of the known desmoglein and desmocollin isoforms, which have differing tissue and developmental distributions, are coded by very closely linked genes at 18q12.1. We have pre...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1997.4718

    authors: Cowley CM,Simrak D,Marsden MD,King IA,Arnemann J,Buxton RS

    更新日期:1997-06-01 00:00:00

  • Chromosomal assignment of the genes for proprotein convertases PC4, PC5, and PACE 4 in mouse and human.

    abstract::The genes for three subtilisin/kexin-like proprotein convertases, PC4, PC5, and PACE4, were mapped in the mouse by RFLP analysis of a DNA panel from a (C57BL/6JEi x SPRET/Ei)F1 x SPRET/Ei backcross. The chromosomal locations of the human homologs were determined by Southern blot analysis of a DNA panel from human-rode...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(95)80090-9

    authors: Mbikay M,Seidah NG,Chrétien M,Simpson EM

    更新日期:1995-03-01 00:00:00

  • Structural organization and chromosomal localization of Hyal2, a gene encoding a lysosomal hyaluronidase.

    abstract::The human HYAL2 gene encodes a lysosomal hyaluronidase that is related to the testicular PH-20 hyaluronidase. Regions conserved in these proteins have been used to design PCR primers suitable for the isolation of a fragment of the murine Hyal2 gene. This fragment was used to isolate the Hyal2 cDNA from a cDNA library....

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1998.5472

    authors: Strobl B,Wechselberger C,Beier DR,Lepperdinger G

    更新日期:1998-10-15 00:00:00

  • Direct evidence for homologous sequences on the paracentric regions of human chromosome 1.

    abstract::Calcyclin is a member of the S100 family of proteins, many of which are encoded by genes that have been localized to the proximal long arm of human chromosome 1 (bands q21-q22). A 450-kb yeast artificial chromosome clone containing the human calcyclin gene was identified by PCR screening and used as a probe for fluore...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1994.1277

    authors: Hardas BD,Zhang J,Trent JM,Elder JT

    更新日期:1994-05-15 00:00:00

  • The cloning and nucleotide sequence of human ST2L cDNA.

    abstract::The ST2 gene is a member of the IL-1 receptor family and is hypothesized to be involved in helper T cell function, but its functional ligand and physiological role remain unknown. We have cloned the human ST2L cDNA that encodes a distinct type of membrane-bound ST2 protein. The predicted 556-amino-acid sequence showed...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.2000.6269

    authors: Li H,Tago K,Io K,Kuroiwa K,Arai T,Iwahana H,Tominaga S,Yanagisawa K

    更新日期:2000-08-01 00:00:00

  • Multipoint mapping of the central core disease locus.

    abstract::A linkage analysis with 12 DNA markers from proximal 19q was performed in eight families with central core disease (CCO). Two-point analysis gave a peak lod score of Z = 4.95 at theta = 0.00 for the anonymous marker D19S190 and of Z = 2.53 at theta = 0.00 for the ryanodine receptor (RYR1) candidate gene. Multipoint li...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1993.1302

    authors: Schwemmle S,Wolff K,Palmucci LM,Grimm T,Lehmann-Horn F,Hübner C,Hauser E,Iles DE,MacLennan DH,Müller CR

    更新日期:1993-07-01 00:00:00

  • Partial gene structure and assignment to chromosome 2q37 of the human inwardly rectifying K+ channel (Kir7.1) gene (KCNJ13).

    abstract::The novel weakly inward rectifying potassium channel Kir7.1 is a low-conductance channel that is predominantly expressed in epithelial cells. Here we describe a partial genomic characterization and the chromosomal assignment of the human Kir7.1 gene (KCNJ13). Analysis of the genomic structure using a PCR-based approac...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1998.5598

    authors: Derst C,Döring F,Preisig-Müller R,Daut J,Karschin A,Jeck N,Weber S,Engel H,Grzeschik KH

    更新日期:1998-12-15 00:00:00