Categorizing biomedicine images using novel image features and sparse coding representation.

Abstract:

BACKGROUND:Images embedded in biomedical publications carry rich information that often concisely summarize key hypotheses adopted, methods employed, or results obtained in a published study. Therefore, they offer valuable clues for understanding main content in a biomedical publication. Prior studies have pointed out the potential of mining images embedded in biomedical publications for automatically understanding and retrieving such images' associated source documents. Within the broad area of biomedical image processing, categorizing biomedical images is a fundamental step for building many advanced image analysis, retrieval, and mining applications. Similar to any automatic categorization effort, discriminative image features can provide the most crucial aid in the process. METHOD:We observe that many images embedded in biomedical publications carry versatile annotation text. Based on the locations of and the spatial relationships between these text elements in an image, we thus propose some novel image features for image categorization purpose, which quantitatively characterize the spatial positions and distributions of text elements inside a biomedical image. We further adopt a sparse coding representation (SCR) based technique to categorize images embedded in biomedical publications by leveraging our newly proposed image features. RESULTS:we randomly selected 990 images of the JPG format for use in our experiments where 310 images were used as training samples and the rest were used as the testing cases. We first segmented 310 sample images following the our proposed procedure. This step produced a total of 1035 sub-images. We then manually labeled all these sub-images according to the two-level hierarchical image taxonomy proposed by 1. Among our annotation results, 316 are microscopy images, 126 are gel electrophoresis images, 135 are line charts, 156 are bar charts, 52 are spot charts, 25 are tables, 70 are flow charts, and the remaining 155 images are of the type "others". A serial of experimental results are obtained. Firstly, each image categorizing results is presented, and next image categorizing performance indexes such as precision, recall, F-score, are all listed. Different features which include conventional image features and our proposed novel features indicate different categorizing performance, and the results are demonstrated. Thirdly, we conduct an accuracy comparison between support vector machine classification method and our proposed sparse representation classification method. At last, our proposed approach is compared with three peer classification method and experimental results verify our impressively improved performance. CONCLUSIONS:Compared with conventional image features that do not exploit characteristics regarding text positions and distributions inside images embedded in biomedical publications, our proposed image features coupled with the SR based representation model exhibit superior performance for classifying biomedical images as demonstrated in our comparative benchmark study.

journal_name

BMC Med Genomics

journal_title

BMC medical genomics

authors

Sheng J,Xu S,Luo X

doi

10.1186/1755-8794-6-S3-S8

subject

Has Abstract

pub_date

2013-01-01 00:00:00

pages

S8

issn

1755-8794

pii

1755-8794-6-S3-S8

journal_volume

6 Suppl 3

pub_type

杂志文章
  • Advancing research in NeuroAIDS using collaboration and public data sharing.

    abstract::In this issue of BMC Medical Genomics Griffin et al. present a user-friendly and freely accessible HIV-associated neurocognitive disorder (HAND) genomic database that compiles viral (HIV-1) genetic sequences and other relevant clinical and treatment data. We discuss the benefits and caveats of public data sharing in N...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-015-0150-9

    authors: Cysique LA

    更新日期:2015-11-11 00:00:00

  • Chronic insulin treatment of diabetes does not fully normalize alterations in the retinal transcriptome.

    abstract:BACKGROUND:Diabetic retinopathy (DR) is a leading cause of blindness in working age adults. Approximately 95% of patients with Type 1 diabetes develop some degree of retinopathy within 25 years of diagnosis despite normalization of blood glucose by insulin therapy. The goal of this study was to identify molecular chang...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-40

    authors: Bixler GV,Vanguilder HD,Brucklacher RM,Kimball SR,Bronson SK,Freeman WM

    更新日期:2011-05-15 00:00:00

  • Whole exome sequencing in adult-onset hearing loss reveals a high load of predicted pathogenic variants in known deafness-associated genes and identifies new candidate genes.

    abstract:BACKGROUND:Deafness is a highly heterogenous disorder with over 100 genes known to underlie human non-syndromic hearing impairment. However, many more remain undiscovered, particularly those involved in the most common form of deafness: adult-onset progressive hearing loss. Despite several genome-wide association studi...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0395-1

    authors: Lewis MA,Nolan LS,Cadge BA,Matthews LJ,Schulte BA,Dubno JR,Steel KP,Dawson SJ

    更新日期:2018-09-04 00:00:00

  • Screening significantly hypermethylated genes in fetal tissues compared with maternal blood using a methylated-CpG island recovery assay-based microarray.

    abstract:BACKGROUND:The noninvasive prenatal diagnosis procedures that are currently used to detect genetic diseases do not achieve desirable levels of sensitivity and specificity. Recently, fetal methylated DNA biomarkers in maternal peripheral blood have been explored for the noninvasive prenatal detection of genetic disorder...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-5-26

    authors: Yin A,Zhang X,Wu J,Du L,He T,Zhang X

    更新日期:2012-06-18 00:00:00

  • Role of caveolin 1, E-cadherin, Enolase 2 and PKCalpha on resistance to methotrexate in human HT29 colon cancer cells.

    abstract:BACKGROUND:Methotrexate is one of the earliest cytotoxic drugs used in cancer therapy, and despite the isolation of multiple other folate antagonists, methotrexate maintains its significant role as a treatment for different types of cancer and other disorders. The usefulness of treatment with methotrexate is limited by...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-1-35

    authors: Selga E,Morales C,Noé V,Peinado MA,Ciudad CJ

    更新日期:2008-08-11 00:00:00

  • Functional microarray analysis suggests repressed cell-cell signaling and cell survival-related modules inhibit progression of head and neck squamous cell carcinoma.

    abstract:BACKGROUND:Cancer shows a great diversity in its clinical behavior which cannot be easily predicted using the currently available clinical or pathological markers. The identification of pathways associated with lymph node metastasis (N+) and recurrent head and neck squamous cell carcinoma (HNSCC) may increase our under...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-33

    authors: Coló AE,Simoes AC,Carvalho AL,Melo CM,Fahham L,Kowalski LP,Soares FA,Neves EJ,Reis LF,Carvalho AF

    更新日期:2011-04-13 00:00:00

  • Modified entropy-based procedure detects gene-gene-interactions in unconventional genetic models.

    abstract:BACKGROUND:Since it is assumed that genetic interactions play an important role in understanding the mechanisms of complex diseases, different statistical approaches have been suggested in recent years for this task. One interesting approach is the entropy-based IGENT method by Kwon et al. that promises an efficient de...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-020-0703-4

    authors: Malten J,König IR

    更新日期:2020-04-23 00:00:00

  • The International Conference on Intelligent Biology and Medicine (ICIBM) 2020: Data-driven analytics in biomedical genomics.

    abstract::This editorial summarizes eight research articles included in this supplement issue for the 2020 International Conference on Intelligent Biology and Medicine (ICIBM 2020) conference, that was held on August 9-10, 2020 (virtual conference), with a topic on data-driven analytics in biomedical genomics. These articles co...

    journal_title:BMC medical genomics

    pub_type: 社论

    doi:10.1186/s12920-020-00833-7

    authors: Shi X,Zhao Z,Wang K,Shen L

    更新日期:2020-12-28 00:00:00

  • OncoRep: an n-of-1 reporting tool to support genome-guided treatment for breast cancer patients using RNA-sequencing.

    abstract:BACKGROUND:Breast cancer comprises multiple tumor entities associated with different biological features and clinical behaviors, making individualized medicine a powerful tool to bring the right drug to the right patient. Next generation sequencing of RNA (RNA-Seq) is a suitable method to detect targets for individuali...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-015-0095-z

    authors: Meißner T,Fisch KM,Gioia L,Su AI

    更新日期:2015-05-21 00:00:00

  • A computational procedure for functional characterization of potential marker genes from molecular data: Alzheimer's as a case study.

    abstract:BACKGROUND:A molecular characterization of Alzheimer's Disease (AD) is the key to the identification of altered gene sets that lead to AD progression. We rely on the assumption that candidate marker genes for a given disease belong to specific pathogenic pathways, and we aim at unveiling those pathways stable across ti...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-55

    authors: Squillario M,Barla A

    更新日期:2011-07-05 00:00:00

  • The functional cancer map: a systems-level synopsis of genetic deregulation in cancer.

    abstract:BACKGROUND:Cancer cells are characterized by massive dysegulation of physiological cell functions with considerable disruption of transcriptional regulation. Genome-wide transcriptome profiling can be utilized for early detection and molecular classification of cancers. Accurate discrimination of functionally different...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-53

    authors: Krupp M,Maass T,Marquardt JU,Staib F,Bauer T,König R,Biesterfeld S,Galle PR,Tresch A,Teufel A

    更新日期:2011-06-30 00:00:00

  • A genome-wide association study of serum uric acid in African Americans.

    abstract:BACKGROUND:Uric acid is the primary byproduct of purine metabolism. Hyperuricemia is associated with body mass index (BMI), sex, and multiple complex diseases including gout, hypertension (HTN), renal disease, and type 2 diabetes (T2D). Multiple genome-wide association studies (GWAS) in individuals of European ancestry...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-17

    authors: Charles BA,Shriner D,Doumatey A,Chen G,Zhou J,Huang H,Herbert A,Gerry NP,Christman MF,Adeyemo A,Rotimi CN

    更新日期:2011-02-04 00:00:00

  • Cell cycle and aging, morphogenesis, and response to stimuli genes are individualized biomarkers of glioblastoma progression and survival.

    abstract:BACKGROUND:Glioblastoma is a complex multifactorial disorder that has swift and devastating consequences. Few genes have been consistently identified as prognostic biomarkers of glioblastoma survival. The goal of this study was to identify general and clinical-dependent biomarker genes and biological processes of three...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-49

    authors: Serão NV,Delfino KR,Southey BR,Beever JE,Rodriguez-Zas SL

    更新日期:2011-06-07 00:00:00

  • DNA methylation changes in ovarian cancer are cumulative with disease progression and identify tumor stage.

    abstract:BACKGROUND:Hypermethylation of promoter CpG islands with associated loss of gene expression, and hypomethylation of CpG-rich repetitive elements that may destabilize the genome are common events in most, if not all, epithelial cancers. METHODS:The methylation of 6,502 CpG-rich sequences spanning the genome was analyze...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-1-47

    authors: Watts GS,Futscher BW,Holtan N,Degeest K,Domann FE,Rose SL

    更新日期:2008-09-30 00:00:00

  • Identification of exon skipping events associated with Alzheimer's disease in the human hippocampus.

    abstract:BACKGROUND:At least 90% of human genes are alternatively spliced. Alternative splicing has an important function regulating gene expression and miss-splicing can contribute to risk for human diseases, including Alzheimer's disease (AD). METHODS:We developed a splicing decision model as a molecular mechanism to identif...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0453-8

    authors: Han S,Miller JE,Byun S,Kim D,Risacher SL,Saykin AJ,Lee Y,Nho K,for Alzheimer’s Disease Neuroimaging Initiative.

    更新日期:2019-01-31 00:00:00

  • Transcriptome sequencing of lncRNA, miRNA, mRNA and interaction network constructing in coronary heart disease.

    abstract:BACKGROUND:Non-coding RNA has been shown to participate in numerous biological and pathological processes and has attracted increasing attention in recent years. Recent studies have demonstrated that long non-coding RNA and micro RNA can interact through various mechanisms to regulate mRNA. Yet the gene-gene interactio...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-019-0570-z

    authors: Liao J,Wang J,Liu Y,Li J,Duan L

    更新日期:2019-08-23 00:00:00

  • Differential gene expression in disease: a comparison between high-throughput studies and the literature.

    abstract:BACKGROUND:Differential gene expression is important to understand the biological differences between healthy and diseased states. Two common sources of differential gene expression data are microarray studies and the biomedical literature. METHODS:With the aid of text mining and gene expression analysis we have exami...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-017-0293-y

    authors: Rodriguez-Esteban R,Jiang X

    更新日期:2017-10-11 00:00:00

  • Glucocorticoid-driven transcriptomes in human airway epithelial cells: commonalities, differences and functional insight from cell lines and primary cells.

    abstract:BACKGROUND:Glucocorticoids act on the glucocorticoid receptor (GR; NR3C1) to resolve inflammation and, as inhaled corticosteroids (ICS), are the cornerstone of treatment for asthma. However, reduced efficacy in severe disease or exacerbations indicates a need to improve ICS actions. METHODS:Glucocorticoid-driven trans...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0467-2

    authors: Mostafa MM,Rider CF,Shah S,Traves SL,Gordon PMK,Miller-Larsson A,Leigh R,Newton R

    更新日期:2019-01-31 00:00:00

  • Combinations of newly confirmed Glioma-Associated loci link regions on chromosomes 1 and 9 to increased disease risk.

    abstract:BACKGROUND:Glioblastoma multiforme (GBM) tends to occur between the ages of 45 and 70. This relatively early onset and its poor prognosis make the impact of GBM on public health far greater than would be suggested by its relatively low frequency. Tissue and blood samples have now been collected for a number of populati...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-63

    authors: Yang TH,Kon M,Hung JH,Delisi C

    更新日期:2011-08-09 00:00:00

  • Transcriptional profiling of mycobacterial antigen-induced responses in infants vaccinated with BCG at birth.

    abstract:BACKGROUND:Novel tuberculosis (TB) vaccines recently tested in humans have been designed to boost immunity induced by the current vaccine, Mycobacterium bovis Bacille Calmette-Guérin (BCG). Because BCG vaccination is used extensively in infants, this population group is likely to be the first in which efficacy trials o...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-2-10

    authors: Fletcher HA,Keyser A,Bowmaker M,Sayles PC,Kaplan G,Hussey G,Hill AV,Hanekom WA

    更新日期:2009-02-24 00:00:00

  • Mosaic chromosome 18 anomaly delineated in a child with dysmorphism using a three-pronged cytogenetic techniques approach: a case report.

    abstract:BACKGROUND:A plethora of cases are reported in the literature with iso- and ring-chromosome 18. However, co-occurrence of these two abnormalities in an individual along with a third cell line and absence of numerical anomaly is extremely rare. CASE PRESENTATION:A 7-year-old female was referred for diagnosis due to gro...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-020-00796-9

    authors: Sheth H,Trivedi S,Liehr T,Patel K,Jain D,Sheth J,Sheth F

    更新日期:2020-09-24 00:00:00

  • The International Conference on Intelligent Biology and Medicine (ICIBM) 2018: genomics meets medicine.

    abstract::During June 10-12, 2018, the International Conference on Intelligent Biology and Medicine (ICIBM 2018) was held in Los Angeles, California, USA. The conference included 11 scientific sessions, four tutorials, one poster session, four keynote talks and four eminent scholar talks that covered a wide range of topics rang...

    journal_title:BMC medical genomics

    pub_type: 社论

    doi:10.1186/s12920-018-0448-5

    authors: Zhi D,Zhao Z,Li F,Wu Z,Liu X,Wang K

    更新日期:2019-01-31 00:00:00

  • Routine use of microarray-based gene expression profiling to identify patients with low cytogenetic risk acute myeloid leukemia: accurate results can be obtained even with suboptimal samples.

    abstract:BACKGROUND:Gene expression profiling has shown its ability to identify with high accuracy low cytogenetic risk acute myeloid leukemia such as acute promyelocytic leukemia and leukemias with t(8;21) or inv(16). The aim of this gene expression profiling study was to evaluate to what extent suboptimal samples with low leu...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-5-6

    authors: de la Blétière DR,Blanchet O,Cornillet-Lefèbvre P,Coutolleau A,Baranger L,Geneviève F,Luquet I,Hunault-Berger M,Beucher A,Schmidt-Tanguy A,Zandecki M,Delneste Y,Ifrah N,Guardiola P

    更新日期:2012-01-30 00:00:00

  • Molecular sampling of prostate cancer: a dilemma for predicting disease progression.

    abstract:BACKGROUND:Current prostate cancer prognostic models are based on pre-treatment prostate specific antigen (PSA) levels, biopsy Gleason score, and clinical staging but in practice are inadequate to accurately predict disease progression. Hence, we sought to develop a molecular panel for prostate cancer progression by re...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-3-8

    authors: Sboner A,Demichelis F,Calza S,Pawitan Y,Setlur SR,Hoshida Y,Perner S,Adami HO,Fall K,Mucci LA,Kantoff PW,Stampfer M,Andersson SO,Varenhorst E,Johansson JE,Gerstein MB,Golub TR,Rubin MA,Andrén O

    更新日期:2010-03-16 00:00:00

  • Gene profiling of the erythro- and megakaryoblastic leukaemias induced by the Graffi murine retrovirus.

    abstract:BACKGROUND:Acute erythro- and megakaryoblastic leukaemias are associated with very poor prognoses and the mechanism of blastic transformation is insufficiently elucidated. The murine Graffi leukaemia retrovirus induces erythro- and megakaryoblastic leukaemias when inoculated into NFS mice and represents a good model to...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-3-2

    authors: Voisin V,Legault P,Ospina DP,Ben-David Y,Rassart E

    更新日期:2010-01-26 00:00:00

  • Association of adipocyte genes with ASP expression: a microarray analysis of subcutaneous and omental adipose tissue in morbidly obese subjects.

    abstract:BACKGROUND:Prevalence of obesity is increasing to pandemic proportions. However, obese subjects differ in insulin resistance, adipokine production and co-morbidities. Based on fasting plasma analysis, obese subjects were grouped as Low Acylation Stimulating protein (ASP) and Triglyceride (TG) (LAT) vs High ASP and TG (...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-3-3

    authors: MacLaren RE,Cui W,Lu H,Simard S,Cianflone K

    更新日期:2010-01-27 00:00:00

  • Pharmacogenetic testing through the direct-to-consumer genetic testing company 23andMe.

    abstract:BACKGROUND:Rapid advances in scientific research have led to an increase in public awareness of genetic testing and pharmacogenetics. Direct-to-consumer (DTC) genetic testing companies, such as 23andMe, allow consumers to access their genetic information directly through an online service without the involvement of hea...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-017-0283-0

    authors: Lu M,Lewis CM,Traylor M

    更新日期:2017-06-19 00:00:00

  • Genomic approaches to identifying targets for treating β hemoglobinopathies.

    abstract::Sickle cell disease and β thalassemia are common severe diseases with little effective pathophysiologically-based treatment. Their phenotypic heterogeneity prompted genomic approaches to identify modifiers that ultimately might be exploited therapeutically. Fetal hemoglobin (HbF) is the major modulator of the phenotyp...

    journal_title:BMC medical genomics

    pub_type: 杂志文章,评审

    doi:10.1186/s12920-015-0120-2

    authors: Ngo DA,Steinberg MH

    更新日期:2015-07-29 00:00:00

  • 12q14 microduplication: a new clinical entity reciprocal to the microdeletion syndrome?

    abstract:BACKGROUND:12q14 microdeletion syndrome is characterized by low birth weight and failure to thrive, proportionate short stature and developmental delay. The opposite syndrome (microduplication) has not yet been characterized. Our main objective is the recognition of a new clinical entity - 12q14 microduplication syndro...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-019-0653-x

    authors: Dória S,Alves D,Pinho MJ,Pinto J,Leão M

    更新日期:2020-01-03 00:00:00

  • Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data.

    abstract:BACKGROUND:Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. Th...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-6-S1-S10

    authors: Teng S,Yang JY,Wang L

    更新日期:2013-01-01 00:00:00