BDA

Caricato da

Gaurav Kulat

Il 0% ha trovato utile questo documento (0 voti)

31 visualizzazioni2 pagine

Big Data

Copyright

Formati disponibili

DOCX, PDF, TXT o leggi online da Scribd

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Segnala questo documento

Big Data

Copyright:

Formati disponibili

Scarica in formato DOCX, PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Il 0% ha trovato utile questo documento (0 voti)

31 visualizzazioni2 pagine

BDA

Caricato da

Gaurav Kulat

Big Data

Copyright:

Formati disponibili

Scarica in formato DOCX, PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Salta alla pagina

Sei sulla pagina 1di 2

Cerca all'interno del documento

Classification

Data classification is the process of organizing data into categories for its usage to be done in efficient and
effective way. It makes essential data easy to find and retrieve. Once a data-classification scheme has
been created, security standards that specify appropriate handling practices for each category and storage
standards that define the data's lifecycle requirements should be addressed.
It is a systematic process for obtaining important and relevant information about data and metadata data
about data. The classification analysis helps identifying to which a set of categories different types of data
belong. Classification analysis is closely linked to cluster analysis as the classification can be used to
cluster data. For example, the email provider performs a well-known example of classification analysis:
they use algorithms that are capable of classifying your email as legitimate or mark it as spam.
To handle large storage data is a tedious job for users to identify accurate data from huge unstructured
data. So, there should be some mechanism which classifies unstructured data into organized form which
helps user to easily access required data. Classification techniques over big transactional database provide
required data to the users from large datasets more simple way. There are two main classification
techniques, supervised and unsupervised and thus big data concept comes into existence. The objective of
classification is to analyze huge data and to develop an accurate description or model for each organized
class using the feature present in the data. We use that training data to build a model of what a typical data
set looks like when it has one of the various target values. We then apply that model to data for which that
target value is currently unknown. The algorithm identifies new data points that match the model of each
target value. This model is used to classify test data for which the class descriptions are unknown.

Importance

Risk management

Legal Discovery

Compliances

predicts categorical class labels

Classifies data (constructs a model) based on the training set and the values (class labels) in a
classifying attribute and uses it in classifying new data

Application
Highly sensitive corporate and customer data that if disclosed could put the organization at
financial or legal risk.
Example: Employee social security numbers, customer credit card numbers
Sensitive internal data that if disclosed could negatively affect operations.

Example: Contracts with third-party suppliers, employee reviews

Internal data that is not meant for public disclosure.
Example: Sales contest rules, organizational charts
Data that may be freely disclosed with the public.
credit approval
target marketing
medical diagnosis
treatment effectiveness analysis

Classification technique is used to solve the challenges which classify the big data according to the format
of the data that must be processed, the type of analysis to be applied, the processing techniques at work,
and the data sources for the data that the target system is required to acquire, load, analyze, store and
process. Supervised classification techniques (Decision Tree and support vector machine) are also known
as directed or predictive classification. In this method, set of possible class is known in advanced.
Unsupervised classification techniques are also known as descriptive or undirected. In this method, set of
possible class is unknown, after classification we can assign name to that class.

Potrebbero piacerti anche

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Da Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Valutazione: 4 su 5 stelle
4/5 (5794)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Da Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Valutazione: 4 su 5 stelle
4/5 (1090)
Never Split the Difference: Negotiating As If Your Life Depended On It
Da Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Valutazione: 4.5 su 5 stelle
4.5/5 (838)
Principles: Life and Work
Da Everand
Principles: Life and Work
Ray Dalio
Valutazione: 4 su 5 stelle
4/5 (599)
The Glass Castle: A Memoir
Da Everand
The Glass Castle: A Memoir
Jeannette Walls
Valutazione: 4.5 su 5 stelle
4.5/5 (1712)
Sing, Unburied, Sing: A Novel
Da Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Valutazione: 4 su 5 stelle
4/5 (1103)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Da Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Valutazione: 4 su 5 stelle
4/5 (894)
Grit: The Power of Passion and Perseverance
Da Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Valutazione: 4 su 5 stelle
4/5 (587)
Shoe Dog: A Memoir by the Creator of Nike
Da Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Valutazione: 4.5 su 5 stelle
4.5/5 (537)
The Perks of Being a Wallflower
Da Everand
The Perks of Being a Wallflower
Stephen Chbosky
Valutazione: 4.5 su 5 stelle
4.5/5 (2099)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Da Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Valutazione: 4.5 su 5 stelle
4.5/5 (474)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Da Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Valutazione: 4.5 su 5 stelle
4.5/5 (344)
Bad Feminist: Essays
Da Everand
Bad Feminist: Essays
Roxane Gay
Valutazione: 4 su 5 stelle
4/5 (1015)
The Outsider: A Novel
Da Everand
The Outsider: A Novel
Stephen King
Valutazione: 4 su 5 stelle
4/5 (1839)
Her Body and Other Parties: Stories
Da Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Valutazione: 4 su 5 stelle
4/5 (821)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Da Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Valutazione: 4.5 su 5 stelle
4.5/5 (119)
The Emperor of All Maladies: A Biography of Cancer
Da Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Valutazione: 4.5 su 5 stelle
4.5/5 (271)
Angela's Ashes: A Memoir
Da Everand
Angela's Ashes: A Memoir
Frank McCourt
Valutazione: 4.5 su 5 stelle
4.5/5 (440)
The Little Book of Hygge: Danish Secrets to Happy Living
Da Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Valutazione: 3.5 su 5 stelle
3.5/5 (399)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Da Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Valutazione: 3.5 su 5 stelle
3.5/5 (2219)
A Man Called Ove: A Novel
Da Everand
A Man Called Ove: A Novel
Fredrik Backman
Valutazione: 4.5 su 5 stelle
4.5/5 (4609)
Brooklyn: A Novel
Da Everand
Brooklyn: A Novel
Colm Toibin
Valutazione: 3.5 su 5 stelle
3.5/5 (1937)
The Art of Racing in the Rain: A Novel
Da Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Valutazione: 4 su 5 stelle
4/5 (4200)
A Tree Grows in Brooklyn
Da Everand
A Tree Grows in Brooklyn
Betty Smith
Valutazione: 4.5 su 5 stelle
4.5/5 (1929)
The Yellow House: A Memoir (2019 National Book Award Winner)
Da Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Valutazione: 4 su 5 stelle
4/5 (98)
Steve Jobs
Da Everand
Steve Jobs
Walter Isaacson
Valutazione: 4.5 su 5 stelle
4.5/5 (806)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Da Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Valutazione: 4.5 su 5 stelle
4.5/5 (265)
The Woman in Cabin 10
Da Everand
The Woman in Cabin 10
Ruth Ware
Valutazione: 3.5 su 5 stelle
3.5/5 (2322)
Yes Please
Da Everand
Yes Please
Amy Poehler
Valutazione: 4 su 5 stelle
4/5 (1891)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Da Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Valutazione: 3.5 su 5 stelle
3.5/5 (231)
Team of Rivals: The Political Genius of Abraham Lincoln
Da Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Valutazione: 4.5 su 5 stelle
4.5/5 (234)
Fear: Trump in the White House
Da Everand
Fear: Trump in the White House
Bob Woodward
Valutazione: 3.5 su 5 stelle
3.5/5 (738)
John Adams
Da Everand
John Adams
David McCullough
Valutazione: 4.5 su 5 stelle
4.5/5 (2409)
Wolf Hall: A Novel
Da Everand
Wolf Hall: A Novel
Hilary Mantel
Valutazione: 4 su 5 stelle
4/5 (3811)
On Fire: The (Burning) Case for a Green New Deal
Da Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Valutazione: 4 su 5 stelle
4/5 (73)
The Light Between Oceans: A Novel
Da Everand
The Light Between Oceans: A Novel
M.L. Stedman
Valutazione: 4.5 su 5 stelle
4.5/5 (789)
The Unwinding: An Inner History of the New America
Da Everand
The Unwinding: An Inner History of the New America
George Packer
Valutazione: 4 su 5 stelle
4/5 (45)
Manhattan Beach: A Novel
Da Everand
Manhattan Beach: A Novel
Jennifer Egan
Valutazione: 3.5 su 5 stelle
3.5/5 (792)
The Constant Gardener: A Novel
Da Everand
The Constant Gardener: A Novel
John le Carré
Valutazione: 3.5 su 5 stelle
3.5/5 (104)
Rise of ISIS: A Threat We Can't Ignore
Da Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Valutazione: 3.5 su 5 stelle
3.5/5 (137)
Little Women
Da Everand
Little Women
Louisa May Alcott
Valutazione: 4 su 5 stelle
4/5 (104)
FB $ Ig Link Dating Ansd Hookup Link
Documento3 pagine
FB $ Ig Link Dating Ansd Hookup Link
itz kingsley
75% (4)
New Gateway To Computer Science 10
Documento289 pagine
New Gateway To Computer Science 10
Labu Rai
100% (1)
GSN PWS 9 9 Install Instructions Non Xerox
Documento3 pagine
GSN PWS 9 9 Install Instructions Non Xerox
Kostas Gus
Nessuna valutazione finora
Unit V:: Design and Analysis of Algorithms
Documento7 pagine
Unit V:: Design and Analysis of Algorithms
Sairam N
100% (1)
Opencv 2 Refman
Documento929 pagine
Opencv 2 Refman
Shobith Narayanan
Nessuna valutazione finora
Oracle® Warehouse Builder API and Scripting Reference
Documento206 pagine
Oracle® Warehouse Builder API and Scripting Reference
ranusofi
Nessuna valutazione finora
Software Requirements Specification
Documento17 pagine
Software Requirements Specification
Aditya Sajja
Nessuna valutazione finora
1 Oilwell Cementing Test Equipment
Documento42 pagine
1 Oilwell Cementing Test Equipment
Flixxforchill2
Nessuna valutazione finora
Assignment Managing A Successful Business Project Assignment
Documento32 pagine
Assignment Managing A Successful Business Project Assignment
Tooba Tanvir
Nessuna valutazione finora
Fanuc LR Mate 200ib 200ib 3l
Documento4 pagine
Fanuc LR Mate 200ib 200ib 3l
Hector Calvillo Gtz
Nessuna valutazione finora
Cambridge International AS & A Level: Computer Science 9618/04
Documento10 pagine
Cambridge International AS & A Level: Computer Science 9618/04
Shakila Shaki
Nessuna valutazione finora
Tugas 2 Bahasa Inggris
Documento8 pagine
Tugas 2 Bahasa Inggris
rizky
Nessuna valutazione finora
XPT System - Application Notes - R6.0 PDF
Documento118 pagine
XPT System - Application Notes - R6.0 PDF
Camilo T
Nessuna valutazione finora
6 Data Sheet - Credential Manager
Documento4 pagine
6 Data Sheet - Credential Manager
John Andrade
Nessuna valutazione finora
14.6.2021-Maths-Periodic Test-1 PDF
Documento4 pagine
14.6.2021-Maths-Periodic Test-1 PDF
Ismail S
Nessuna valutazione finora
PHD Thesis On Elliptic Curve Cryptography
Documento6 pagine
PHD Thesis On Elliptic Curve Cryptography
lorigilbertgilbert
100% (2)
Auditing-Data-Privacy Joa Eng 0518
Documento5 pagine
Auditing-Data-Privacy Joa Eng 0518
Spit Fire
Nessuna valutazione finora
Olympus D 705 Instruction Manual 777253 PDF
Documento76 pagine
Olympus D 705 Instruction Manual 777253 PDF
juanpablo
Nessuna valutazione finora
BW XspCLIAdminGuide R210
Documento533 pagine
BW XspCLIAdminGuide R210
Francoj DA
Nessuna valutazione finora
Defibrillators, External, Automated Semiautomated - 030223112503
Documento105 pagine
Defibrillators, External, Automated Semiautomated - 030223112503
CLAUDIA PAOLA HOLGUIN VELEZ
Nessuna valutazione finora
IR AL INICIO EQUIPOS CON GARANTIA DE 2 AÑOS
Documento7 pagine
IR AL INICIO EQUIPOS CON GARANTIA DE 2 AÑOS
NachoAlmeidaRuiz
Nessuna valutazione finora
Manual MXK-194-198
Documento586 pagine
Manual MXK-194-198
Vinicius Zanicheli
Nessuna valutazione finora
JSIKA Vol 2 No 2 (2013)/ISSN 2338-137X
Documento10 pagine
JSIKA Vol 2 No 2 (2013)/ISSN 2338-137X
17.Fernando Dwi septian
Nessuna valutazione finora
Steps For Setting Up Environment For Developing Custom Forms in E-Business Suite
Documento3 pagine
Steps For Setting Up Environment For Developing Custom Forms in E-Business Suite
Pramod
Nessuna valutazione finora
HU10293GYB
Documento8 pagine
HU10293GYB
daniela vega
Nessuna valutazione finora
MECHANICAL ENGINEERING ANALYSIS USING HYPERMESH
Documento25 pagine
MECHANICAL ENGINEERING ANALYSIS USING HYPERMESH
fatin
Nessuna valutazione finora
Aomei Backupper: User Manual
Documento80 pagine
Aomei Backupper: User Manual
Pedro Vera
Nessuna valutazione finora
ECS401: Cryptography and Network Security: Module 5: Authentication Protocols
Documento19 pagine
ECS401: Cryptography and Network Security: Module 5: Authentication Protocols
Shabnam Smile
Nessuna valutazione finora
Investigating Afan Oromo Language Structure and de
Documento9 pagine
Investigating Afan Oromo Language Structure and de
getachew
Nessuna valutazione finora
Kafka Remanere
Documento3 pagine
Kafka Remanere
Ricardo Benavides
Nessuna valutazione finora