
Automatic Genre-Specific Text Classification

its performance was similar to that of the methods deemed best, such as Information Gain and Chi Square, and it is simple and efficient. Therefore, we chose DF as our general feature selection method. In our previous work [Yu et al., 2008], we concluded that a DF threshold of 30 is a good setting to balance computation complexity against classification accuracy. With this feature selection setting, we obtained 1754 features from the 63963 unique words in the training corpus. (A brief sketch of this selection step is given after this feature list.)
2. Genre Features - Each defined class has its own characteristics beyond the general features. Many keywords, such as ‘grading policy’, occur in a true syllabus, probably along with a link to the content page. On the other hand, a false syllabus might contain syllabus keywords without enough keywords related to the syllabus components. In addition, the position of a keyword within a page matters. For example, a keyword within the anchor text of a link, or near the link, would suggest a syllabus component outside the current page, while a capitalized keyword at the beginning of a page would suggest a syllabus component with a heading in the page itself. Motivated by these observations, we manually selected 84 features to classify our data set into the four classes. We used both content and structure features for syllabus classification, as they have been found useful in the detection of other genres [Kennedy & Shepherd, 2005]. These features mainly concern the occurrence of keywords, the positions of keywords, and the co-occurrence of keywords and links. Details of these features are in [Yu et al., 2008]. (A brief sketch of a few such features is also given after this list.)
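As an illustration of the general feature selection step above, the sketch below filters terms by document frequency with a threshold of 30. It assumes the threshold is a minimum document frequency and that the training corpus is already tokenized; the function name and interface are ours, not the paper's.

    from collections import Counter

    def select_general_features(tokenized_docs, df_threshold=30):
        # tokenized_docs: iterable of documents, each given as a list of word tokens.
        # Keep terms whose document frequency (the number of training documents
        # containing the term) reaches the threshold; return them as the vocabulary.
        df = Counter()
        for doc in tokenized_docs:
            for term in set(doc):        # count each term at most once per document
                df[term] += 1
        return sorted(term for term, count in df.items() if count >= df_threshold)

The genre features could be computed along similar lines. The sketch below shows three illustrative features of the kinds described above (keyword occurrence, keyword position, and co-occurrence of keywords and links); the keyword list and feature names are placeholders, not the 84 features actually used, which are detailed in [Yu et al., 2008].

    import re

    # Placeholder keyword list; the real keywords belong to the 84 features in [Yu et al., 2008].
    SYLLABUS_KEYWORDS = ["grading policy", "course description", "prerequisites"]

    def genre_features(html_source, plain_text):
        # Return a few example genre features, each with a value in [0.0, 1.0].
        text = plain_text.lower()
        features = {}
        # Occurrence of syllabus keywords anywhere in the page.
        features["kw_present"] = sum(kw in text for kw in SYLLABUS_KEYWORDS) / len(SYLLABUS_KEYWORDS)
        # Co-occurrence of keywords and links: a keyword inside anchor text suggests
        # a syllabus component outside the current page.
        anchor_text = " ".join(re.findall(r"<a[^>]*>(.*?)</a>", html_source, flags=re.I | re.S)).lower()
        features["kw_in_anchor"] = float(any(kw in anchor_text for kw in SYLLABUS_KEYWORDS))
        # Keyword position: a capitalized keyword near the top of the page suggests
        # a syllabus component with a heading in the page itself.
        features["kw_heading_at_top"] = float(any(kw.title() in plain_text[:200] for kw in SYLLABUS_KEYWORDS))
        return features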
After extracting free text from these documents, our training corpus consisted of 63963 unique terms. We represented it by the feature attributes described above: 1754 unique general features and 84 unique genre features, i.e. 1838 unique features in total. Each of these feature attributes has a numeric value between 0.0 and 1.0.
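To make the representation concrete, one way of assembling such a vector is sketched below; normalizing the general features by document length is an assumption on our part, since the text only states that every attribute value lies between 0.0 and 1.0.

    def document_vector(doc_tokens, vocabulary, genre_feature_values):
        # Concatenate general and genre feature values into one numeric vector.
        # General feature values are term frequencies normalized by document length,
        # which is one (assumed) way to keep every value within [0.0, 1.0].
        length = max(len(doc_tokens), 1)
        general = [doc_tokens.count(term) / length for term in vocabulary]
        return general + list(genre_feature_values)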
Classifiers

NB and SVM are two well-known, best-performing supervised learning models in text classification applications [Kim, Han, Rim, & Myaeng, 2006; Joachims, 1998]. NB, a simple and efficient approach, succeeds in various data mining tasks, while SVM, a highly complex one, outperforms NB especially in text mining tasks [Kim, Han, Rim, & Myaeng, 2006]. We describe them below.

1. Naïve Bayes - A Naïve Bayes classifier can be viewed as a Bayesian network in which the feature attributes X1, X2, …, Xn are conditionally independent given the class attribute C [John & Langley, 1995]. Let C be a random variable and X be a vector of random variables X1, X2, …, Xn. The probability of a document x being in class c is calculated using Bayes’ rule as below, and the document is classified into the most probable class.

p(C = c | X = x) = p(X = x | C = c) p(C = c) / p(X = x)

Since the feature attribute values (x1, x2, …, xn) represent the document x and are assumed to be conditionally independent given the class, we obtain the equation below.

p(X = x | C = c) = ∏_i p(Xi = xi | C = c)

One assumption for estimating these probabilities for numeric attributes is that the values of such an attribute follow a normal distribution within each class. We can then estimate p(Xi = xi | C = c) using the mean and the standard deviation of that normal distribution computed from the training data. This assumption about the distribution may not hold for some domains, so we also applied the kernel method from [John & Langley, 1995] to estimate the distribution of each numeric attribute in our syllabus classification application. (A brief sketch of this classifier is given after this list.)
2. Support Vector Machines - SVM is a two-class classifier (Figure 1) that finds the hyperplane maximizing the minimum distance between the hyperplane and the training data points [Boser, Guyon, & Vapnik, 1992]. Specifically, the hyperplane ωᵀx = γ is found by minimizing the objective function

(1/2) ||ω||²  such that  D(Aω − eγ) ≥ e,

where A is the matrix whose rows are the training points, D is the diagonal matrix of their ±1 class labels, and e is a vector of ones. The margin is 2/||ω||, the distance between the two bounding planes ωᵀx = γ + 1 and ωᵀx = γ − 1.
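To make the Naïve Bayes description concrete, a minimal sketch of the normal-distribution variant is given below; class and method names are ours, not the paper's. The kernel method from [John & Langley, 1995] would replace the single per-class normal density with an average of Gaussian kernels centred on the training values of each attribute.

    import math
    from collections import defaultdict

    class GaussianNaiveBayes:
        # Naive Bayes with a per-class normal density for every numeric attribute.

        def fit(self, vectors, labels):
            # Estimate the class priors p(C = c) and, for each attribute, the mean
            # and standard deviation of its values within each class.
            grouped = defaultdict(list)
            for x, c in zip(vectors, labels):
                grouped[c].append(x)
            self.priors = {c: len(rows) / len(vectors) for c, rows in grouped.items()}
            self.stats = {c: [(self._mean(col), self._std(col)) for col in zip(*rows)]
                          for c, rows in grouped.items()}
            return self

        def predict(self, x):
            # Pick the class maximizing log p(C = c) + sum_i log p(Xi = xi | C = c).
            def log_posterior(c):
                total = math.log(self.priors[c])
                for xi, (mu, sigma) in zip(x, self.stats[c]):
                    total += (-0.5 * math.log(2 * math.pi * sigma ** 2)
                              - (xi - mu) ** 2 / (2 * sigma ** 2))
                return total
            return max(self.priors, key=log_posterior)

        @staticmethod
        def _mean(values):
            values = list(values)
            return sum(values) / len(values)

        @staticmethod
        def _std(values, floor=1e-6):
            values = list(values)
            mu = sum(values) / len(values)
            variance = sum((v - mu) ** 2 for v in values) / len(values)
            return max(math.sqrt(variance), floor)   # floor avoids zero variance

For the SVM, a usage-level sketch with a standard library implementation could look as follows; the data and class names are placeholders, and the exact solver and kernel used in the study are not restated here.

    from sklearn.svm import LinearSVC   # linear maximum-margin classifier;
                                        # handles more than two classes one-vs-rest

    # Toy stand-ins for the real data: rows are feature vectors with values in
    # [0, 1]; the label strings are placeholders for the paper's four classes.
    X_train = [[0.9, 0.1, 0.8], [0.8, 0.2, 0.7],
               [0.1, 0.9, 0.0], [0.2, 0.8, 0.1]]
    y_train = ["syllabus", "syllabus", "non-syllabus", "non-syllabus"]

    clf = LinearSVC(C=1.0)              # C trades margin width against violations
    clf.fit(X_train, y_train)
    print(clf.predict([[0.85, 0.15, 0.75]]))    # expected: ['syllabus']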


