Benvenuto in Scribd!

Salta carosello

An Introduction To Apache Spark

Caricato da

Mike Frampton

Il 0% ha trovato utile questo documento (0 voti)

351 visualizzazioni7 pagine

A introduction to Apache Spark, what is it and how does it work ? Why use it and some examples of use.

Titolo originale

An introduction to Apache Spark

Copyright

Formati disponibili

PDF, TXT o leggi online da Scribd

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Segnala questo documento

A introduction to Apache Spark, what is it and how does it work ? Why use it and some examples of use.

Copyright:

Attribution Non-Commercial (BY-NC)

Formati disponibili

Scarica in formato PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Il 0% ha trovato utile questo documento (0 voti)

351 visualizzazioni7 pagine

An Introduction To Apache Spark

Caricato da

Mike Frampton

A introduction to Apache Spark, what is it and how does it work ? Why use it and some examples of use.

Copyright:

Attribution Non-Commercial (BY-NC)

Formati disponibili

Scarica in formato PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Salta alla pagina

Sei sulla pagina 1di 7

Cerca all'interno del documento

Apache Spark

What is it ? How does it work ? Benefits Tuning Examples

www.semtech-solutions.co.nz

info@semtech-solutions.co.nz

Spark What is it ?

Open Source Alternative to Map Reduce for certain applications A low latency cluster computing system For very large data sets May be 100 times faster than Map Reduce for

Iterative algorithms Interactive data mining

Used with Hadoop / HDFS Released under BSD License

www.semtech-solutions.co.nz

info@semtech-solutions.co.nz

Spark How does it work ?

Uses in memory cluster computing Memory access faster than disk access Has API's written in

Scala Java Python

Can be accessed from Scala and Python shells Currently an Apache incubator project

www.semtech-solutions.co.nz

info@semtech-solutions.co.nz

Spark Benefits

Scales to very large clusters Uses in memory processing for increased speed High Level API's

Java, Scala, Python

Low latency shell access

www.semtech-solutions.co.nz

info@semtech-solutions.co.nz

Spark Tuning

Bottlenecks can occur in the cluster via

CPU, memory or network bandwidth Java ObjectOutputStream vs Kryo Use primitive types Set JVM Flags Store objects in serialized form i.e.

Tune data serialization method i.e.

Memory Tuning

RDD Persistence MEMORY_ONLY_SER

www.semtech-solutions.co.nz

info@semtech-solutions.co.nz

Spark Examples
Example from spark-project.org, Spark job in Scala. Showing a simple text count from a system log.
/*** SimpleJob.scala ***/ import spark.SparkContext import SparkContext._ object SimpleJob { def main(args: Array[String]) { val logFile = "/var/log/syslog" // Should be some file on your system val sc = new SparkContext("local", "Simple Job", "$YOUR_SPARK_HOME", List("target/scala-2.9.3/simple-project_2.9.3-1.0.jar")) val logData = sc.textFile(logFile, 2).cache() val numAs = logData.filter(line => line.contains("a")).count() val numBs = logData.filter(line => line.contains("b")).count() println("Lines with a: %s, Lines with b: %s".format(numAs, numBs))

www.semtech-solutions.co.nz

info@semtech-solutions.co.nz

Feel free to contact us at

www.semtech-solutions.co.nz info@semtech-solutions.co.nz

We offer IT project consultancy We are happy to hear about your problems You can just pay for those hours that you need To solve your problems

Potrebbero piacerti anche

What Is Apache Ranger ?
Documento17 pagine
What Is Apache Ranger ?
Mike Frampton
Nessuna valutazione finora
Apache SystemML AI/ML
Documento11 pagine
Apache SystemML AI/ML
Mike Frampton
Nessuna valutazione finora
What Is Apache Airavata ?
Documento12 pagine
What Is Apache Airavata ?
Mike Frampton
Nessuna valutazione finora
Apache Gobblin
Documento14 pagine
Apache Gobblin
Mike Frampton
Nessuna valutazione finora
What Is Apache Phoenix ?
Documento11 pagine
What Is Apache Phoenix ?
Mike Frampton
Nessuna valutazione finora
What Is Apache Edgent ?
Documento12 pagine
What Is Apache Edgent ?
Mike Frampton
Nessuna valutazione finora
Apache Kudu
Documento12 pagine
Apache Kudu
Mike Frampton
Nessuna valutazione finora
An Introduction To Titan
Documento8 pagine
An Introduction To Titan
Mike Frampton
Nessuna valutazione finora
An Introduction To Pentaho
Documento13 pagine
An Introduction To Pentaho
Mike Frampton
Nessuna valutazione finora
What Is Apache Couchdb ?
Documento12 pagine
What Is Apache Couchdb ?
Mike Frampton
Nessuna valutazione finora
Apache ActiveMQ
Documento11 pagine
Apache ActiveMQ
Mike Frampton
Nessuna valutazione finora
Apache Tez
Documento10 pagine
Apache Tez
Mike Frampton
Nessuna valutazione finora
Apache Beam
Documento13 pagine
Apache Beam
Mike Frampton
Nessuna valutazione finora
Kubernetes
Documento14 pagine
Kubernetes
Mike Frampton
100% (1)
Ni Fi
Documento15 pagine
Ni Fi
Mike Frampton
Nessuna valutazione finora
Apache Tinkerpop - Odp
Documento11 pagine
Apache Tinkerpop - Odp
Mike Frampton
Nessuna valutazione finora
Apache Tinkerpop - Odp
Documento11 pagine
Apache Tinkerpop - Odp
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Spark MLlib
Documento8 pagine
An Introduction To Apache Spark MLlib
Mike Frampton
Nessuna valutazione finora
An Introduction To 0xdata H2O
Documento10 pagine
An Introduction To 0xdata H2O
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Storm
Documento10 pagine
An Introduction To Apache Storm
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Mesos
Documento9 pagine
An Introduction To Apache Mesos
Mike Frampton
Nessuna valutazione finora
An Introduction To Databricks
Documento10 pagine
An Introduction To Databricks
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Cordova
Documento9 pagine
An Introduction To Apache Cordova
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Bigtop
Documento9 pagine
An Introduction To Apache Bigtop
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Maven
Documento9 pagine
An Introduction To Apache Maven
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Thrift
Documento12 pagine
An Introduction To Apache Thrift
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Gora
Documento11 pagine
An Introduction To Apache Gora
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Falcon
Documento8 pagine
An Introduction To Apache Falcon
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache S4
Documento8 pagine
An Introduction To Apache S4
Mike Frampton
Nessuna valutazione finora
An Introduction To Apache Crunch
Documento8 pagine
An Introduction To Apache Crunch
Mike Frampton
Nessuna valutazione finora
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Da Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Valutazione: 4 su 5 stelle
4/5 (5794)
The Yellow House: A Memoir (2019 National Book Award Winner)
Da Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Valutazione: 4 su 5 stelle
4/5 (98)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Da Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Valutazione: 3.5 su 5 stelle
3.5/5 (231)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Da Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Valutazione: 4 su 5 stelle
4/5 (895)
The Little Book of Hygge: Danish Secrets to Happy Living
Da Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Valutazione: 3.5 su 5 stelle
3.5/5 (400)
Shoe Dog: A Memoir by the Creator of Nike
Da Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Valutazione: 4.5 su 5 stelle
4.5/5 (537)
Never Split the Difference: Negotiating As If Your Life Depended On It
Da Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Valutazione: 4.5 su 5 stelle
4.5/5 (838)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Da Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Valutazione: 4.5 su 5 stelle
4.5/5 (474)
Grit: The Power of Passion and Perseverance
Da Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Valutazione: 4 su 5 stelle
4/5 (588)
Yes Please
Da Everand
Yes Please
Amy Poehler
Valutazione: 4 su 5 stelle
4/5 (1891)
The Emperor of All Maladies: A Biography of Cancer
Da Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Valutazione: 4.5 su 5 stelle
4.5/5 (271)
On Fire: The (Burning) Case for a Green New Deal
Da Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Valutazione: 4 su 5 stelle
4/5 (74)
Team of Rivals: The Political Genius of Abraham Lincoln
Da Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Valutazione: 4.5 su 5 stelle
4.5/5 (234)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Da Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Valutazione: 4.5 su 5 stelle
4.5/5 (266)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Da Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Valutazione: 4.5 su 5 stelle
4.5/5 (344)
Rise of ISIS: A Threat We Can't Ignore
Da Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Valutazione: 3.5 su 5 stelle
3.5/5 (137)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Da Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Valutazione: 3.5 su 5 stelle
3.5/5 (2259)
Fear: Trump in the White House
Da Everand
Fear: Trump in the White House
Bob Woodward
Valutazione: 3.5 su 5 stelle
3.5/5 (738)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Da Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Valutazione: 4 su 5 stelle
4/5 (1090)
Principles: Life and Work
Da Everand
Principles: Life and Work
Ray Dalio
Valutazione: 4 su 5 stelle
4/5 (599)
John Adams
Da Everand
John Adams
David McCullough
Valutazione: 4.5 su 5 stelle
4.5/5 (2409)
The Unwinding: An Inner History of the New America
Da Everand
The Unwinding: An Inner History of the New America
George Packer
Valutazione: 4 su 5 stelle
4/5 (45)
The Glass Castle: A Memoir
Da Everand
The Glass Castle: A Memoir
Jeannette Walls
Valutazione: 4.5 su 5 stelle
4.5/5 (1713)
Angela's Ashes: A Memoir
Da Everand
Angela's Ashes: A Memoir
Frank McCourt
Valutazione: 4.5 su 5 stelle
4.5/5 (440)
Steve Jobs
Da Everand
Steve Jobs
Walter Isaacson
Valutazione: 4.5 su 5 stelle
4.5/5 (806)
Bad Feminist: Essays
Da Everand
Bad Feminist: Essays
Roxane Gay
Valutazione: 4 su 5 stelle
4/5 (1016)
The Outsider: A Novel
Da Everand
The Outsider: A Novel
Stephen King
Valutazione: 4 su 5 stelle
4/5 (1839)
The Light Between Oceans: A Novel
Da Everand
The Light Between Oceans: A Novel
M.L. Stedman
Valutazione: 4.5 su 5 stelle
4.5/5 (789)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Da Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Valutazione: 4.5 su 5 stelle
4.5/5 (121)
Brooklyn: A Novel
Da Everand
Brooklyn: A Novel
Colm Tóibín
Valutazione: 3.5 su 5 stelle
3.5/5 (1937)
The Woman in Cabin 10
Da Everand
The Woman in Cabin 10
Ruth Ware
Valutazione: 3.5 su 5 stelle
3.5/5 (2322)
A Man Called Ove: A Novel
Da Everand
A Man Called Ove: A Novel
Fredrik Backman
Valutazione: 4.5 su 5 stelle
4.5/5 (4609)
The Perks of Being a Wallflower
Da Everand
The Perks of Being a Wallflower
Stephen Chbosky
Valutazione: 4.5 su 5 stelle
4.5/5 (2104)
Wolf Hall: A Novel
Da Everand
Wolf Hall: A Novel
Hilary Mantel
Valutazione: 4 su 5 stelle
4/5 (3811)
Little Women
Da Everand
Little Women
Louisa May Alcott
Valutazione: 4 su 5 stelle
4/5 (104)
Manhattan Beach: A Novel
Da Everand
Manhattan Beach: A Novel
Jennifer Egan
Valutazione: 3.5 su 5 stelle
3.5/5 (792)
The Art of Racing in the Rain: A Novel
Da Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Valutazione: 4 su 5 stelle
4/5 (4200)
The Constant Gardener: A Novel
Da Everand
The Constant Gardener: A Novel
John le Carré
Valutazione: 3.5 su 5 stelle
3.5/5 (104)
A Tree Grows in Brooklyn
Da Everand
A Tree Grows in Brooklyn
Betty Smith
Valutazione: 4.5 su 5 stelle
4.5/5 (1929)
Her Body and Other Parties: Stories
Da Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Valutazione: 4 su 5 stelle
4/5 (821)
Sing, Unburied, Sing: A Novel
Da Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Valutazione: 4 su 5 stelle
4/5 (1103)
PLK-B Technical Manual USA
Documento65 pagine
PLK-B Technical Manual USA
lapogunevas
Nessuna valutazione finora
CourceMeterials DEL34
Documento206 pagine
CourceMeterials DEL34
Jeramaine Torres
100% (1)
Ldom Troubleshooting
Documento8 pagine
Ldom Troubleshooting
Kaushal727
Nessuna valutazione finora
Unit One Introduction To Information Technology
Documento13 pagine
Unit One Introduction To Information Technology
Bradley Musonza
Nessuna valutazione finora
How To Setup Distribution Point in SCCM 2012 R2
Documento11 pagine
How To Setup Distribution Point in SCCM 2012 R2
Fernando Mossi García
Nessuna valutazione finora
Primergy Rx200 s5 Config
Documento4 pagine
Primergy Rx200 s5 Config
Angel Cabrera
Nessuna valutazione finora
Storage Design and Implementation in Vsphere 6 A Technology Deep Dive (Mostafa Khalil) (Z-Library)
Documento1.757 pagine
Storage Design and Implementation in Vsphere 6 A Technology Deep Dive (Mostafa Khalil) (Z-Library)
Juan Alvarez Meneses
Nessuna valutazione finora
LNL TS 2220
Documento2 pagine
LNL TS 2220
David YH
Nessuna valutazione finora
Chapter 2 PDF
Documento18 pagine
Chapter 2 PDF
Amandeep Singh
Nessuna valutazione finora
Universal Serial Bus Mass Storage Class Specification Overview
Documento14 pagine
Universal Serial Bus Mass Storage Class Specification Overview
borland6538
Nessuna valutazione finora
Audio Information and Media
Documento12 pagine
Audio Information and Media
Hannah Charis L. Barabat
Nessuna valutazione finora
Lesson 5 Understanding Computer (Autosaved)
Documento27 pagine
Lesson 5 Understanding Computer (Autosaved)
alma agnas
Nessuna valutazione finora
Silo - Tips - Clearcase Vob Database Troubleshooting Carem Bennett Ccna Mcse NT Cip
Documento7 pagine
Silo - Tips - Clearcase Vob Database Troubleshooting Carem Bennett Ccna Mcse NT Cip
pradeepya
Nessuna valutazione finora
Manual HP Laptop 15-Bw0xx
Documento116 pagine
Manual HP Laptop 15-Bw0xx
guili guili
Nessuna valutazione finora
Replace Internal FibreChannel (FC) Disks Controlled by VXVM
Documento7 pagine
Replace Internal FibreChannel (FC) Disks Controlled by VXVM
res0nat0r
Nessuna valutazione finora
OXUF934DSB 514902 Datasheet
Documento3 pagine
OXUF934DSB 514902 Datasheet
karkera
Nessuna valutazione finora
CSC 302 Computer Hardware and Software Concepts
Documento4 pagine
CSC 302 Computer Hardware and Software Concepts
Don Mlambo
Nessuna valutazione finora
SoftPerfect-RAM Disk User Manual
Documento8 pagine
SoftPerfect-RAM Disk User Manual
Salvatore Bonaffino
Nessuna valutazione finora
Komku Blog Install Windows XP Using USB Flash Disk Flash Drive - Step by Step Guide
Documento45 pagine
Komku Blog Install Windows XP Using USB Flash Disk Flash Drive - Step by Step Guide
ahmadhilmi89
0% (1)
ACPI Embedded SATAIII mSATA SSD MSS4Q-L 3K PE Datasheet 20190611
Documento16 pagine
ACPI Embedded SATAIII mSATA SSD MSS4Q-L 3K PE Datasheet 20190611
Daniel Crespo
Nessuna valutazione finora
Red Hat Enterprise Linux-6-Global File System 2-ko-KR
Documento64 pagine
Red Hat Enterprise Linux-6-Global File System 2-ko-KR
오명훈
Nessuna valutazione finora
Dell PowerEdge T630 Spec Sheet
Documento2 pagine
Dell PowerEdge T630 Spec Sheet
PatrickChanCF
Nessuna valutazione finora
A Critical Review of 7 Years of Mobile Device Forensics
Documento27 pagine
A Critical Review of 7 Years of Mobile Device Forensics
Liam Lam
Nessuna valutazione finora
Ozone Architecture v1
Documento11 pagine
Ozone Architecture v1
Himanshu Rathore
Nessuna valutazione finora
5-Practicas+BigData Trabajar Hdfs
Documento10 pagine
5-Practicas+BigData Trabajar Hdfs
Christiam Niño
Nessuna valutazione finora
Design and Implementation of A Computerized Educational Administrative Information System
Documento83 pagine
Design and Implementation of A Computerized Educational Administrative Information System
Yekeen Abiodun
100% (2)
Helix Version 3
Documento14 pagine
Helix Version 3
Divik Shrivastava
Nessuna valutazione finora
Virtualizing SQL Server 2008 r2 On Hitachi Compute Rack 220 and Hus 150
Documento33 pagine
Virtualizing SQL Server 2008 r2 On Hitachi Compute Rack 220 and Hus 150
random123vn
Nessuna valutazione finora
LZQJ Manual
Documento54 pagine
LZQJ Manual
thomaswangkoro
100% (3)
Ashtech Mobilemapper Field 3 Software - 371 PDF
Documento72 pagine
Ashtech Mobilemapper Field 3 Software - 371 PDF
Damon Feitosa Gomes Sobrinho
Nessuna valutazione finora