Benvenuto in Scribd!

Salta carosello

Data Mining: Ontologies

Caricato da

JohnGagliano

Il 0% ha trovato utile questo documento (0 voti)

34 visualizzazioni17 pagine

Towards Applying Text Mining and Natural Language Processing for Biomedical Ontology Acquisition

Copyright

Formati disponibili

PPTX, PDF, TXT o leggi online da Scribd

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Segnala questo documento

Towards Applying Text Mining and Natural Language Processing for Biomedical Ontology Acquisition

Copyright:

Attribution Non-Commercial (BY-NC)

Formati disponibili

Scarica in formato PPTX, PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Il 0% ha trovato utile questo documento (0 voti)

34 visualizzazioni17 pagine

Data Mining: Ontologies

Caricato da

JohnGagliano

Towards Applying Text Mining and Natural Language Processing for Biomedical Ontology Acquisition

Copyright:

Attribution Non-Commercial (BY-NC)

Formati disponibili

Scarica in formato PPTX, PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Salta alla pagina

Sei sulla pagina 1di 17

Cerca all'interno del documento

Faculty of Computer Science

Towards Applying Text Mining and Natural Language Processing for Biomedical Ontology Acquisition
Inniss T., Light M., Thomas G., Lee J., Grassi M., Williams A.
TMBIO(2006)

John G

CMPUT 605

March 31, 2013

2006

Department of Computing Science

Focus
Ontology for describing age-related macular degeneration (AMD) Comparison of the accuracy of three methods for Ontology
Natural Language Processing (NLP) Text Mining (SAS Text Miner) Human Expert

Manual and adhoc knowledge acquisition IDOCS (Intelligent Distributed Ontology Consensus System)

CMPUT 605

2006

Department of Computing Science

Introduction
No existing common and standardized vocabulary for classification of disease types for certain eyediseases
Clinicians, dispersed geographically, may use different terms to describe the same condition Research aimed at extracting the feature and attribute descriptions for the vocabulary of AMD, and build an Ontology from that.

CMPUT 605

2006

Department of Computing Science

Related Work
Lot of research done, since 1990s, for applying NLP techniques in medicine, bio-medicine etc.
NLP & Text Data Mining have been recognized to play an important role in this endeavor Research focused on online repositories such as Medline & PubMed

NLP systems developed: MedLee, UMLS, GENIES etc.

CMPUT 605

2006

Department of Computing Science

IDOCS

CMPUT 605

2006

Department of Computing Science

Methodology
Four clinical experts in retinal diseases enlisted to view 100 eye sample images of AMD
Experts in different geographic locations Described the observations using digital voice recorders no artificially imposed vocabulary constraints Another retinal expert for manual parsing of the transcribed text extracting key words, organization of key-words into categories etc.

CMPUT 605

2006

Department of Computing Science

Methodology: NLP
NLP: Used for information extraction and automatic summarization.
Identify short sequences of words having meaning over and above a meaning composed directly from their parts extreme programming Ngram Statistics Package (NSP) used for collocation discovery in case of bi-grams

Word-pair associations measured by PMI

CMPUT 605

2006

Department of Computing Science

Methodology: NLP

Large PMI for larger degree of association between the words s

CMPUT 605

2006

Department of Computing Science

Methodology:Text Mining (SAS Text Miner)

Collection of documents (corpus) used as input to any text mining algorithm
Corpus broken into tokens or terms (tokens in a particular language) Term weighting Measures: Entropy, Inverse Document Frequency (IDF), Global Frequency (GF) IDF, None (Global weight of 1) & Normal term wt.

CMPUT 605

2006

Department of Computing Science

Results: Human Experts

CMPUT 605

2006

Department of Computing Science

Results: NLP

CMPUT 605

2006

Department of Computing Science

Results: Text Miner

Frequency wt. None
Term wt. Normal

CMPUT 605

2006

Department of Computing Science

Comparison
sss

CMPUT 605

2006

Department of Computing Science

Comparison

Thus text mining is a viable and effective method for determining vocabulary to describe a particular disease
Text Mining found a lot of terms that NLP found Human Expert is the best Ground Truth

CMPUT 605

2006

Department of Computing Science

Ontology Generation

CMPUT 605

2006

Department of Computing Science

Conclusion and Future Work

Human experts are the best, but they did miss some key descriptors
Text Mining and NLP can enhance the generation of feature generations, by preventing the above case As a consequence more robust vocabulary can be generated Extension evaluate the effectiveness of the automated tools, text mining & NLP Different weighting schemes will be tried in the future
CMPUT 605

2006

Department of Computing Science

Thank You For Your Attention!

CMPUT 605

2006

Potrebbero piacerti anche

Grit: The Power of Passion and Perseverance
Da Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Valutazione: 4 su 5 stelle
4/5 (588)
A Meta-Analysis of Vocabulary Learning Strategies of EFL Learners
Documento11 pagine
A Meta-Analysis of Vocabulary Learning Strategies of EFL Learners
NAJDCO
Nessuna valutazione finora
The Yellow House: A Memoir (2019 National Book Award Winner)
Da Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Valutazione: 4 su 5 stelle
4/5 (98)
Ethiopia Information Revolution Practice Spotlight FINAL 508 Compliant
Documento10 pagine
Ethiopia Information Revolution Practice Spotlight FINAL 508 Compliant
Geremew Tarekegne Tsegaye
Nessuna valutazione finora
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Da Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Valutazione: 4 su 5 stelle
4/5 (5795)
NMDC
Documento42 pagine
NMDC
Lost Humera
Nessuna valutazione finora
Never Split the Difference: Negotiating As If Your Life Depended On It
Da Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Valutazione: 4.5 su 5 stelle
4.5/5 (838)
Good Movies For Research Papers
Documento7 pagine
Good Movies For Research Papers
ngqcodbkf
100% (1)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Da Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Valutazione: 4 su 5 stelle
4/5 (895)
1st Quaretrly Exam in PR 1
Documento3 pagine
1st Quaretrly Exam in PR 1
Niño Jay C. Gastones
100% (1)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Da Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Valutazione: 4.5 su 5 stelle
4.5/5 (345)
00dissertation PDF
Documento161 pagine
00dissertation PDF
lady diane ancheta
Nessuna valutazione finora
Shoe Dog: A Memoir by the Creator of Nike
Da Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Valutazione: 4.5 su 5 stelle
4.5/5 (537)
Green Human Resource Management
Documento7 pagine
Green Human Resource Management
Sergey
Nessuna valutazione finora
Yes Please
Da Everand
Yes Please
Amy Poehler
Valutazione: 4 su 5 stelle
4/5 (1891)
Research Methodology of Sip
Documento2 pagine
Research Methodology of Sip
Paras Jain
0% (1)
The Little Book of Hygge: Danish Secrets to Happy Living
Da Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Valutazione: 3.5 su 5 stelle
3.5/5 (400)
ICTICT608 - Main Assessment
Documento10 pagine
ICTICT608 - Main Assessment
samwel
0% (1)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Da Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Valutazione: 4.5 su 5 stelle
4.5/5 (474)
Thesis Projects For Computer Science
Documento6 pagine
Thesis Projects For Computer Science
gbxfr1p1
100% (2)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Da Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Valutazione: 3.5 su 5 stelle
3.5/5 (231)
INFOSYS Placement Paper 2 - Freshers Choice
Documento20 pagine
INFOSYS Placement Paper 2 - Freshers Choice
fresherschoice
Nessuna valutazione finora
On Fire: The (Burning) Case for a Green New Deal
Da Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Valutazione: 4 su 5 stelle
4/5 (74)
How To Write Your First Research Paper: Focus: Education - Career Advice
Documento10 pagine
How To Write Your First Research Paper: Focus: Education - Career Advice
Ricardo Chavarria
Nessuna valutazione finora
The Emperor of All Maladies: A Biography of Cancer
Da Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Valutazione: 4.5 su 5 stelle
4.5/5 (271)
Titela VILCEANU - Introduction To Pragmatics - Portfolio Guidebook
Documento29 pagine
Titela VILCEANU - Introduction To Pragmatics - Portfolio Guidebook
Roxana Panache
Nessuna valutazione finora
Angela's Ashes: A Memoir
Da Everand
Angela's Ashes: A Memoir
Frank McCourt
Valutazione: 4.5 su 5 stelle
4.5/5 (440)
Zerihun Tsegayepdf
Documento20 pagine
Zerihun Tsegayepdf
Mohamud Hanad
Nessuna valutazione finora
Bad Feminist: Essays
Da Everand
Bad Feminist: Essays
Roxane Gay
Valutazione: 4 su 5 stelle
4/5 (1016)
DAC21801 - FYP Rubric CeDS 20222023 - EX - Final
Documento4 pagine
DAC21801 - FYP Rubric CeDS 20222023 - EX - Final
Kimaii Zaki
Nessuna valutazione finora
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Da Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Valutazione: 4.5 su 5 stelle
4.5/5 (266)
Claroty Company of The Year - Frost & Sullivan
Documento10 pagine
Claroty Company of The Year - Frost & Sullivan
Tuan MA
Nessuna valutazione finora
The Unwinding: An Inner History of the New America
Da Everand
The Unwinding: An Inner History of the New America
George Packer
Valutazione: 4 su 5 stelle
4/5 (45)
Part-Time Job Students' Difficulties in Studying and Working
Documento5 pagine
Part-Time Job Students' Difficulties in Studying and Working
Jennie KIm
Nessuna valutazione finora
Team of Rivals: The Political Genius of Abraham Lincoln
Da Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Valutazione: 4.5 su 5 stelle
4.5/5 (234)
DLL Ucsp 2017
Documento68 pagine
DLL Ucsp 2017
Ma. Luisa A. Angsinco
Nessuna valutazione finora
Principles: Life and Work
Da Everand
Principles: Life and Work
Ray Dalio
Valutazione: 4 su 5 stelle
4/5 (599)
LGBTQIA
Documento12 pagine
LGBTQIA
John Enmar Pantig Malonzo
Nessuna valutazione finora
Fear: Trump in the White House
Da Everand
Fear: Trump in the White House
Bob Woodward
Valutazione: 3.5 su 5 stelle
3.5/5 (738)
Constitutional Law
Documento19 pagine
Constitutional Law
May Marie Ann Aragon-Jimenez WESTERN MINDANAO STATE UNIVERSITY, COLLEGE OF LAW
Nessuna valutazione finora
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Da Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Valutazione: 3.5 su 5 stelle
3.5/5 (2259)
Resume Guide Princeton University
Documento12 pagine
Resume Guide Princeton University
Bảo Vy
Nessuna valutazione finora
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Da Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Valutazione: 4 su 5 stelle
4/5 (1091)
Digital Distraction Among College Students of The University of Batangas PDF
Documento32 pagine
Digital Distraction Among College Students of The University of Batangas PDF
Prince Miranda
100% (2)
Steve Jobs
Da Everand
Steve Jobs
Walter Isaacson
Valutazione: 4.5 su 5 stelle
4.5/5 (806)
Factors Affecting Potential Consumers To Variable Life Insurance: Based On Theory of Planned Behavior
Documento8 pagine
Factors Affecting Potential Consumers To Variable Life Insurance: Based On Theory of Planned Behavior
Motiram paudel
Nessuna valutazione finora
Rise of ISIS: A Threat We Can't Ignore
Da Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Valutazione: 3.5 su 5 stelle
3.5/5 (137)
PR1 Peta 2
Documento7 pagine
PR1 Peta 2
rycowyne06
Nessuna valutazione finora
John Adams
Da Everand
John Adams
David McCullough
Valutazione: 4.5 su 5 stelle
4.5/5 (2409)
Executive: Maulana Puji Kusumadewi
Documento4 pagine
Executive: Maulana Puji Kusumadewi
Barra Selabean Sasiwou
Nessuna valutazione finora
The Glass Castle: A Memoir
Da Everand
The Glass Castle: A Memoir
Jeannette Walls
Valutazione: 4.5 su 5 stelle
4.5/5 (1713)
+ Koe,-Costello-and-Taylor-2016-Luxury-Branding-Review-of-the-Literature
Documento9 pagine
+ Koe,-Costello-and-Taylor-2016-Luxury-Branding-Review-of-the-Literature
Danishev
Nessuna valutazione finora
The Outsider: A Novel
Da Everand
The Outsider: A Novel
Stephen King
Valutazione: 4 su 5 stelle
4/5 (1839)
Crimsocio Notes RKM 1
Documento159 pagine
Crimsocio Notes RKM 1
Carlo Jay Cajandab
Nessuna valutazione finora
Brooklyn: A Novel
Da Everand
Brooklyn: A Novel
Colm Toibin
Valutazione: 3.5 su 5 stelle
3.5/5 (1937)
Project Report On Training and Development
Documento50 pagine
Project Report On Training and Development
kaur_simran232
75% (144)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Da Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Valutazione: 4.5 su 5 stelle
4.5/5 (121)
Examining Market Behavior and Firm Risk Patterns: An Empirical Analysis On Hispanic Female-Owned Businesses Enterprises
Documento16 pagine
Examining Market Behavior and Firm Risk Patterns: An Empirical Analysis On Hispanic Female-Owned Businesses Enterprises
TI Journals Publishing
Nessuna valutazione finora
The Light Between Oceans: A Novel
Da Everand
The Light Between Oceans: A Novel
M.L. Stedman
Valutazione: 4.5 su 5 stelle
4.5/5 (789)
Artificial Intelligence Risks and Benefits PDF
Documento5 pagine
Artificial Intelligence Risks and Benefits PDF
Gopinath
Nessuna valutazione finora
The Art of Racing in the Rain: A Novel
Da Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Valutazione: 4 su 5 stelle
4/5 (4200)
Manhattan Beach: A Novel
Da Everand
Manhattan Beach: A Novel
Jennifer Egan
Valutazione: 3.5 su 5 stelle
3.5/5 (792)
The Woman in Cabin 10
Da Everand
The Woman in Cabin 10
Ruth Ware
Valutazione: 3.5 su 5 stelle
3.5/5 (2322)
The Perks of Being a Wallflower
Da Everand
The Perks of Being a Wallflower
Stephen Chbosky
Valutazione: 4.5 su 5 stelle
4.5/5 (2104)
Wolf Hall: A Novel
Da Everand
Wolf Hall: A Novel
Hilary Mantel
Valutazione: 4 su 5 stelle
4/5 (3811)
A Man Called Ove: A Novel
Da Everand
A Man Called Ove: A Novel
Fredrik Backman
Valutazione: 4.5 su 5 stelle
4.5/5 (4610)
Little Women
Da Everand
Little Women
Louisa May Alcott
Valutazione: 4 su 5 stelle
4/5 (104)
Sing, Unburied, Sing: A Novel
Da Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Valutazione: 4 su 5 stelle
4/5 (1103)
A Tree Grows in Brooklyn
Da Everand
A Tree Grows in Brooklyn
Betty Smith
Valutazione: 4.5 su 5 stelle
4.5/5 (1929)
Her Body and Other Parties: Stories
Da Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Valutazione: 4 su 5 stelle
4/5 (821)
The Constant Gardener: A Novel
Da Everand
The Constant Gardener: A Novel
John le Carré
Valutazione: 3.5 su 5 stelle
3.5/5 (104)