Towards Applying Text Mining and Natural Language Processing for Biomedical Ontology Acquisition
Inniss T., Light M., Thomas G., Lee J., Grassi M., Williams A.
TMBIO(2006)
John G
CMPUT 605
2006
Focus
Ontology for describing age-related macular degeneration (AMD)
Comparison of the accuracy of three methods for ontology acquisition:
Natural Language Processing (NLP)
Text Mining (SAS Text Miner)
Human Expert
Manual and ad hoc knowledge acquisition
IDOCS (Intelligent Distributed Ontology Consensus System)
Introduction
No common, standardized vocabulary exists for classifying disease types for certain eye diseases
Clinicians, dispersed geographically, may use different terms to describe the same condition
The research aims to extract feature and attribute descriptions for the vocabulary of AMD and to build an ontology from them
Related Work
Much research since the 1990s has applied NLP techniques in medicine, biomedicine, etc.
NLP and text data mining are recognized as playing an important role in this endeavor
Research has focused on online repositories such as Medline and PubMed
IDOCS
Methodology
Four clinical experts in retinal diseases were enlisted to view 100 sample eye images of AMD
The experts were in different geographic locations
They described their observations using digital voice recorders
No artificially imposed vocabulary constraints
Another retinal expert manually parsed the transcribed text: extracting keywords, organizing keywords into categories, etc.
Methodology: NLP
NLP: used for information extraction and automatic summarization
Collocation discovery: identify short sequences of words whose meaning goes over and above the meaning composed directly from their parts (e.g., "extreme programming")
Ngram Statistics Package (NSP) used for collocation discovery in the case of bigrams
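Bigram collocation discovery can be illustrated with a minimal sketch. This is not NSP (a Perl toolkit) and not necessarily the association measure the authors used; it assumes pointwise mutual information (PMI) as the score, and the function names, the `min_count` threshold, and the transcript snippet are all hypothetical, invented for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bigram_pmi(tokens, min_count=2):
    """Score adjacent word pairs by pointwise mutual information (PMI).

    Pairs seen fewer than min_count times are dropped, because PMI
    overrates pairs built from rare words.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)
    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue
        p_pair = count / n_bi
        p_w1 = unigrams[w1] / n_uni
        p_w2 = unigrams[w2] / n_uni
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))
    # Highest-scoring (most strongly associated) pairs first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical transcript snippet, invented for illustration only.
sample = (
    "Large soft drusen near the macula. Soft drusen with pigment changes. "
    "The macula shows pigment changes and geographic atrophy. "
    "Geographic atrophy is present near the macula."
)

ranked = bigram_pmi(tokenize(sample))
for pair, score in ranked:
    print(" ".join(pair), round(score, 2))
```

On real transcripts a stop-word filter would usually be applied as well, since frequent function-word pairs such as "near the" can pass the count threshold.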
Results: NLP
Comparison
Text mining is thus a viable and effective method for determining the vocabulary to describe a particular disease
Text mining found many of the terms that NLP found
The human expert remains the best ground truth
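One way to make such a comparison concrete is to measure how much of the expert's term list an automatic method recovers. The sketch below assumes set-based precision and recall against the expert list as ground truth; the slides do not state which accuracy measure the authors used, and the function name and term lists are hypothetical, not results from the paper.

```python
def term_overlap(candidate_terms, expert_terms):
    """Compare a method's extracted terms against an expert's term list."""
    candidate = {t.strip().lower() for t in candidate_terms}
    expert = {t.strip().lower() for t in expert_terms}
    shared = candidate & expert
    # Precision: share of the method's terms that the expert also listed.
    precision = len(shared) / len(candidate) if candidate else 0.0
    # Recall: share of the expert's terms that the method recovered.
    recall = len(shared) / len(expert) if expert else 0.0
    return {"shared": sorted(shared), "precision": precision, "recall": recall}


# Hypothetical term lists, invented for illustration only.
expert = ["drusen", "geographic atrophy", "pigment changes", "hemorrhage"]
text_mining = ["Drusen", "pigment changes", "macula", "hemorrhage"]

result = term_overlap(text_mining, expert)
print(result)
```

The same function can be applied to the NLP term list, or to the text mining list against the NLP list, to quantify how much the two automatic methods agree.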
Ontology Generation