
4 V's of big data

Volume of big data


IBM researchers estimated that by 2020 the volume of data on our planet would be
approximately 40 zettabytes (43 trillion gigabytes, or about 4 × 10²² bytes). See the
corresponding Wikipedia article for more details.

This enormous amount of data is largely the result of the "internetification" of devices
(the Internet of Things). In addition, the conversion of analog media to digital, coupled
with the growing storage capacity of digital devices, has driven this increase in data.
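
As a quick sanity check on the scale involved, the unit conversion can be worked out directly. This is a minimal sketch assuming the decimal (SI) definitions of zettabyte and gigabyte; the 2020 figure itself is IBM's estimate, not something the snippet verifies.

# Unit conversion behind the 40-zettabyte estimate (SI/decimal units assumed).
ZETTABYTE = 10**21   # bytes per zettabyte
GIGABYTE = 10**9     # bytes per gigabyte

volume_bytes = 40 * ZETTABYTE            # 40 ZB expressed in bytes
volume_gb = volume_bytes / GIGABYTE      # the same volume in gigabytes

print(f"{volume_bytes:.1e} bytes")                     # 4.0e+22 bytes
print(f"{volume_gb / 10**12:.0f} trillion gigabytes")  # ~40 trillion GB

The strict decimal conversion gives roughly 40 trillion gigabytes; IBM's rounded "43 trillion gigabytes" figure is of the same order of magnitude.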

Velocity of big data


Velocity refers to the throughput of big data, or how fast data is arriving and being
processed. Real-time analytics has helped keep up with this pace of data processing
to some extent. In finance, risk management and trading systems demand
near-instantaneous analysis in order to make effective, data-driven decisions.
High-speed data and communication networks, coupled with the growing use of cell
phones, tablets, and other connected devices, are the primary factors driving
information velocity. A rough measure of this velocity is the number of tweets, emails,
or Facebook comments created every second.
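
To make the "per second" measure concrete, the sketch below counts events in one-second buckets from a list of timestamps. The timestamps are invented sample data; a real system would read them from a streaming source such as a message queue.

from collections import Counter
from datetime import datetime

# Hypothetical timestamped events (e.g. tweets or emails).
events = [
    "2023-01-01 12:00:00", "2023-01-01 12:00:00", "2023-01-01 12:00:01",
    "2023-01-01 12:00:01", "2023-01-01 12:00:01", "2023-01-01 12:00:02",
]

# Count how many events fall in each one-second bucket.
per_second = Counter(datetime.fromisoformat(ts) for ts in events)

for second, count in sorted(per_second.items()):
    print(f"{second}: {count} events/second")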

Variety of big data


Data arriving from digital cameras, cell phones, sensors, and the web comes in many
different formats, which makes it difficult to process and keeps new technologies in
constant demand. NoSQL databases rose to prominence to handle what is known
as unstructured data, that is, data whose format is flexible or constantly changing.
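
As an illustration of such constantly changing formats, the sketch below parses a few invented JSON-style documents whose fields differ from record to record, the way data often looks in a NoSQL/document store, and processes them without assuming a fixed schema.

import json

# Invented semi-structured records, as they might arrive from different devices;
# each document carries a different set of fields (schema-less, NoSQL style).
raw_docs = [
    '{"user": "alice", "device": "camera", "resolution": "12MP"}',
    '{"user": "bob", "device": "phone", "os": "Android", "gps": true}',
    '{"user": "carol", "sensor_id": 42, "reading": 21.7}',
]

docs = [json.loads(d) for d in raw_docs]

# The union of keys shows how the "schema" drifts from document to document.
all_fields = sorted({key for doc in docs for key in doc})
print("fields seen so far:", all_fields)

# Fields are looked up defensively rather than assumed to exist.
for doc in docs:
    print(doc.get("user", "<unknown>"), "->", doc.get("device", "no device field"))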

Veracity of big data


Veracity refers to the need to verify the correctness of the data that exists. Data
from sources that are not always trustworthy is costly to process. According to an
IBM estimate, poor data quality costs the US economy about $3.1 trillion a year.
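
A first-pass veracity check often amounts to profiling a data set for missing, duplicated, or implausible values. The sketch below does this on a small invented set of transactions; the field names and validity rule (no negative amounts) are assumptions made purely for illustration.

# Invented transaction records; one duplicate, one missing amount, one negative amount.
transactions = [
    {"order_id": 101, "amount": 25.0},
    {"order_id": 102, "amount": 40.0},
    {"order_id": 102, "amount": 40.0},   # duplicate entry
    {"order_id": 103, "amount": None},   # missing value
    {"order_id": 104, "amount": -5.0},   # implausible value
]

seen = set()
duplicates = missing = negative = 0
for t in transactions:
    key = (t["order_id"], t["amount"])
    if key in seen:
        duplicates += 1
    seen.add(key)
    if t["amount"] is None:
        missing += 1
    elif t["amount"] < 0:
        negative += 1

print({"rows": len(transactions), "duplicates": duplicates,
       "missing": missing, "negative": negative})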
What is Pandas?

pandas is a high-performance open source library for data analysis in Python,
developed by Wes McKinney in 2008. Over the years, it has become the de facto
standard library for data analysis in Python. The tool has seen wide adoption and has
a large community behind it (220+ contributors and 9,000+ commits as of 03/2014),
with rapid iteration and continuous addition of features and enhancements.

Some key features of pandas include the following:

- It can process a variety of data sets in different formats: time series, tabular heterogeneous, and matrix data.
- It facilitates loading/importing data from varied sources such as CSV and DB/SQL.
- It can handle a myriad of operations on data sets: subsetting, slicing, filtering, merging, groupBy, re-ordering, and re-shaping (see the sketch after this list).
- It can deal with missing data according to rules defined by the user/developer: ignore, convert to 0, and so on.
- It can be used for parsing and munging (conversion) of data as well as modeling and statistical analysis.
- It integrates well with other Python libraries such as statsmodels, SciPy, and scikit-learn.
- It delivers fast performance and can be sped up even more by making use of Cython (C extensions to Python).
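
The following sketch touches several of the operations listed above: loading a CSV, filling missing values, filtering, and a groupBy aggregation. The column names and data are invented, so this is a shape of typical usage rather than a recipe taken from the pandas documentation.

import io
import pandas as pd

# Stand-in for a CSV file on disk (e.g. pd.read_csv("sales.csv")); the data is invented.
csv_data = io.StringIO(
    "region,product,units\n"
    "north,widget,10\n"
    "north,gadget,\n"      # missing value
    "south,widget,7\n"
)

df = pd.read_csv(csv_data)

# Handle missing data according to a user-defined rule: treat missing units as 0.
df["units"] = df["units"].fillna(0)

# Subsetting / filtering: keep only rows with at least one unit sold.
sold = df[df["units"] > 0]

# groupBy and aggregation: total units per region.
totals = sold.groupby("region")["units"].sum()
print(totals)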
For more information, see the official pandas documentation available at
http://pandas.pydata.org/pandas-docs/stable/.
