Sei sulla pagina 1di 13

REVIEW QUESTION 1

(1 point possible)
What gives Spark its speed advantage for complex applications?

Spark extends the MapReduce model Various libraries provide Spark with
additional functionality Spark can cover a wide range of workloads under
one system Spark can cover a wide range of workloads under one system -
incorrect Spark makes extensive use of in-memory computations All of
the above

REVIEW QUESTION 2
(1 point possible)
For what purpose would an Engineer use Spark? Select all that apply.

Analyzing data to obtain insights Programming with Spark’s API


Transforming data into a useable form for analysis Developing a data
processing system Tuning an application for a business use case
Analyzing data to obtain insights, Transforming data into a useable form for
analysis, Developing a data processing system, - incorrect

REVIEW QUESTION 3
(1 point possible)
Which of the following statements are true of the Resilient Distributed Dataset
(RDD)? Select all that apply.

There are three types of RDD operations. RDDs allow Spark to


reconstruct transformations RDDs only add a small amount of code due to
tight integration RDD action operations do not return a value RDD is a
distributed collection of elements parallelized across the cluster.

REVIEW QUESTION 1
(1 point possible)
Which of the following methods can be used to create a Resilient Distributed
Dataset (RDD)? Select all that apply.

Creating a directed acyclic graph (DAG) Parallelizing an existing Spark


collection Referencing a Hadoop-supported dataset Using data that
resides in Spark Transforming an existing RDD to form a new one
REVIEW QUESTION 2
(1/1 point)
What happens when an action is executed?

Executors prepare the data for operation in parallel The driver sends
code to be executed on each block A cache is created for storing partial
results in memory Data is partitioned into different blocks across the cluster
All of the above All of the above - correct

REVIEW QUESTION 3
(1 point possible)
Which of the following statements is true of RDD persistence? Select all that
apply.

Persistence through caching provides fault tolerance Future actions can


be performed significantly faster Each partition is replicated on two cluster
nodes RDD persistence always improves space efficiency By default,
objects that are too big for memory are stored on the disk

REVIEW QUESTION 1
(1 point possible)
What is SparkContext?

An object that represents the connection to a Spark cluster A tool for


linking to nodes A tool for linking to nodes - incorrect A tool that provides
fault tolerance The built-in shell for the Spark engine A programming
language for applications

REVIEW QUESTION 2
(1 point possible)
Which of the following methods can be used to pass functions to Spark? Select
all that apply.

Transformations and actions Passing by reference Static methods in a


global singleton Import statements Anonymous function syntax
Transformations and actions, Anonymous function syntax, - incorrect
REVIEW QUESTION 3
(1 point possible)
Which of the following is a main component of a Spark application’s source
code?

SparkContext object Transformations and actions Business Logic


Import statements Import statements - incorrect All of the above
REVIEW QUESTION 1
(1 point possible)
Which of the following is NOT an example of a Spark library?

Hive MLlib Spark Streaming Spark SQL Spark SQL - incorrect


GraphX

REVIEW QUESTION 2
(1 point possible)
From which of the following sources can Spark Streaming receive data? Select
all that apply.

Kafka JSON Parquet HDFS Hive


REVIEW QUESTION 3
(1/1 point)
In Spark Streaming, processing begins immediately when an element of the
application is executed. True or false?

True False False - correct


You have used 1 of 1 submissions

REVIEW QUESTION 1
(1 point possible)
Which of the following is a main component of a Spark cluster? Select all that
apply.

Driver Program SparkContext Cluster Manager Worker node


Cache
Driver Program, Cluster Manager, Cache, - incorrect
REVIEW QUESTION 2
(1 point possible)
What are the main locations for Spark configuration? Select all that apply.

The SparkConf object The Spark Shell Executor Processes


Environment variables Logging properties
Executor Processes, Logging properties, - incorrect
REVIEW QUESTION 3
(1/1 point)
Which of the following techniques can improve Spark performance? Select all
that apply.

Scheduler Configuration Memory Tuning Data Serialization Using


Broadcast variables Using nested structures
Memory Tuning, Data Serialization, Using Broadcast variables, - correct
Examen

QUESTION 1
(1 point possible)
1. Which of the following is a type of Spark RDD operation? Select all that
apply.

Parallelization Action Persistence Transformation Evaluation


Action, Persistence, Transformation, - incorrect

QUESTION 2
(1/1 point)
2. Spark must be installed and run on top of a Hadoop cluster. True or false

True False False - correct


QUESTION 3
(1/1 point)
Which of the following operations will work improperly when using a
Combiner?

Count Maximum Minimum Average Average - correct All of the


above operations will work properly
QUESTION 4
(1/1 point)
Spark supports which of the following libraries?

GraphX Spark Streaming MLlib Spark SQL All of the above All of
the above - correct
QUESTION 5
(1/1 point)
Spark supports which of the following programming languages?
C++ and Python Scala, Java, C++, Python, Perl Scala, Perl, Java
Scala, Python, Java, R Scala, Python, Java, R - correct Java and Scala
You have used 2 of 2 submissions

QUESTION 6
(1/1 point)
A transformation is evaluated immediately. True or false?

True False False - correct


Final Check your answer Save your answer

QUESTION 7
(1 point possible)
Which storage level does the cache() function use?

MEMORY_AND_DISK_SER MEMORY_AND_DISK MEMORY_ONLY_SER


MEMORY_ONLY_SER - incorrect MEMORY_ONLY

QUESTION 8
(1 point possible)
Which of the following statements does NOT describe accumulators?

They can only be read by the driver Programmers can extend them
beyond numeric types Programmers can extend them beyond numeric types -
incorrect They implement counters and sums They can only be added
through an associative operation They are read-only

QUESTION 9
(1/1 point)
You must explicitly initialize the SparkContext when creating a Spark
application. True or false?

True True - correct False


You have used 2 of 2 submissions

QUESTION 10
(1/1 point)
The "local" parameter can be used to specify the number of cores to use for
the application. True or false?

True True - correct False


QUESTION 11
(1/1 point)
Spark applications can ONLY be packaged using one, specific build tool. True
or false?

True False False - correct


QUESTION 12
(1 point possible)
Which of the following parameters of the “spark-submit” script determine
where the application will run?

--class --master --deploy-mode --deploy-mode - incorrect --conf


None of the above
QUESTION 13
(1 point possible)
Which of the following is NOT supported as a cluster manager?

YARN YARN - incorrect Helix Mesos Spark All of the above are
supported
QUESTION 14
(1 point possible)
Spark SQL allows relational queries to be expressed in which of the following?

HiveQL only HiveQL only - incorrect Scala, SQL, and HiveQL Scala and
SQL Scala and HiveQL SQL only

QUESTION 15
(1/1 point)
15. Spark Streaming processes live streaming data in real-time. True or false?

True False False - correct


QUESTION 16
(1/1 point)
The MLlib library contains which of the following algorithms?

Dimensionality Reduction Regression Classification Clustering All


of the above All of the above - correct
QUESTION 17
(1/1 point)
What is the purpose of the GraphX library?
To create a visual representation of the data To generate data-parallel
models To create a visual representation of a directed acyclic graph (DAG)
To perform graph-parallel computations To perform graph-parallel
computations - correct To convert from data-parallel to graph-parallel
algorithms
QUESTION 18
(1 point possible)
Which list describes the correct order of precedence for Spark configuration,
from highest to lowest?

Properties set on SparkConf, values in spark-defaults.conf, flags passed to


spark-submit Flags passed to spark-submit, values in spark-defaults.conf,
properties set on SparkConf Values in spark-defaults.conf, properties set on
SparkConf, flags passed to spark-submit Values in spark-defaults.conf, flags
passed to spark-submit, properties set on SparkConf Values in spark-
defaults.conf, flags passed to spark-submit, properties set on SparkConf -
incorrect Properties set on SparkConf, flags passed to spark-submit, values
in spark-defaults.conf

QUESTION 19
(1/1 point)
Spark monitoring can be performed with external tools. True or false?

True True - correct False


Which serialization libraries are supported in Spark? Select all that apply.

Apache Avro Java Serialization Protocol Buffers Kyro Serialization


TPL
Java Serialization, Protocol Buffers, - incorrect

Examen 2

Question 1
(1 point possible)

1. Which of the following is a type of Spark RDD operation? Select all that apply.

Parallelization Action Persistence Transformation Evaluation


Parallelization, Persistence, Evaluation, - incorrect

Question 1

You have used 2 of 2 submissions


Question 2
(1/1 point)

2. Spark must be installed and run on top of a Hadoop cluster. True or false

True False False - correct

Question 2

You have used 2 of 2 submissions

Question 3
(1/1 point)

Which of the following operations will work improperly when using a Combiner?

Count Maximum Minimum Average Average - correct All of the above


operations will work properly

Question 3

You have used 2 of 2 submissions

Question 4
(1/1 point)

Spark supports which of the following libraries?

GraphX Spark Streaming MLlib Spark SQL All of the above All of the
above - correct

Question 4

You have used 2 of 2 submissions

Question 5
(1/1 point)

Spark supports which of the following programming languages?

C++ and Python Scala, Java, C++, Python, Perl Scala, Perl, Java Scala,
Python, Java, R Scala, Python, Java, R - correct Java and Scala

Question 5

You have used 2 of 2 submissions


Question 6
(1/1 point)

A transformation is evaluated immediately. True or false?

True False False - correct

Question 6

You have used 2 of 2 submissions

Question 7
(1/1 point)

Which storage level does the cache() function use?

MEMORY_AND_DISK_SER MEMORY_AND_DISK MEMORY_ONLY_SER


MEMORY_ONLY MEMORY_ONLY - correct

Question 7

You have used 2 of 2 submissions

Question 8
(1 point possible)

Which of the following statements does NOT describe accumulators?

They can only be read by the driver Programmers can extend them beyond
numeric types They implement counters and sums They implement counters and
sums - incorrect They can only be added through an associative operation They
are read-only

Question 8

You have used 2 of 2 submissions

Question 9
(1/1 point)

You must explicitly initialize the SparkContext when creating a Spark application. True or false?

True True - correct False

Question 9
You have used 2 of 2 submissions

Question 10
(1/1 point)

The "local" parameter can be used to specify the number of cores to use for the application. True
or false?

True True - correct False

Question 10

You have used 2 of 2 submissions

Question 11
(1/1 point)

Spark applications can ONLY be packaged using one, specific build tool. True or false?

True False False - correct

Question 11

You have used 2 of 2 submissions

Question 12
(1/1 point)

Which of the following parameters of the “spark-submit” script determine where the application
will run?

--class --master --master - correct --deploy-mode --conf None of the


above

Question 12

You have used 2 of 2 submissions

Question 13
(1/1 point)

Which of the following is NOT supported as a cluster manager?

YARN Helix Helix - correct Mesos Spark All of the above are supported

Question 13
You have used 2 of 2 submissions

Question 14
(1/1 point)

Spark SQL allows relational queries to be expressed in which of the following?

HiveQL only Scala, SQL, and HiveQL Scala, SQL, and HiveQL - correct Scala
and SQL Scala and HiveQL SQL only

Question 14

You have used 2 of 2 submissions

Question 15
(1/1 point)

15. Spark Streaming processes live streaming data in real-time. True or false?

True False False - correct

Question 15

You have used 2 of 2 submissions

Question 16
(1/1 point)

The MLlib library contains which of the following algorithms?

Dimensionality Reduction Regression Classification Clustering All of


the above All of the above - correct

Question 16

You have used 2 of 2 submissions

Question 17
(1/1 point)

What is the purpose of the GraphX library?

To create a visual representation of the data To generate data-parallel models


To create a visual representation of a directed acyclic graph (DAG) To perform
graph-parallel computations To perform graph-parallel computations - correct To
convert from data-parallel to graph-parallel algorithms

Question 17

You have used 2 of 2 submissions

Question 18
(1 point possible)

Which list describes the correct order of precedence for Spark configuration, from highest to
lowest?

Properties set on SparkConf, values in spark-defaults.conf, flags passed to spark-


submit Flags passed to spark-submit, values in spark-defaults.conf, properties set
on SparkConf Flags passed to spark-submit, values in spark-defaults.conf, properties
set on SparkConf - incorrect Values in spark-defaults.conf, properties set on
SparkConf, flags passed to spark-submit Values in spark-defaults.conf, flags
passed to spark-submit, properties set on SparkConf Properties set on SparkConf,
flags passed to spark-submit, values in spark-defaults.conf

Question 18

You have used 2 of 2 submissions

Question 19
(1/1 point)

Spark monitoring can be performed with external tools. True or false?

True True - correct False

Question 19

You have used 2 of 2 submissions

Question 20
(1 point possible)

Which serialization libraries are supported in Spark? Select all that apply.

Apache Avro Java Serialization Protocol Buffers Kyro Serialization TPL


Apache Avro, Java Serialization, Kyro Serialization, - incorrect

Question 20
You have used 2 of 2 submissions

Potrebbero piacerti anche