CSE 511: Data Processing at Scale 


 
 

About this course 


Database systems are used to provide convenient access to disk-resident data through efficient query 
processing, indexing structures, concurrency control, and recovery. This course delves into new frameworks 
for processing and generating large-scale datasets with parallel and distributed algorithms, covering the 
design, deployment and use of state-of-the-art data processing systems, which provide scalable access to 
data. 

Specific topics covered include: 


● Efficient query processing 
● Indexing structures 
● Distributed database design 
● Parallel query execution 
● Concurrency control in distributed parallel database systems 
● Data management in cloud computing environments 
● Data management in Map/Reduce-based systems 
● NoSQL database systems 

Required prior knowledge and skills 


● Basic statistics and computer science knowledge including computer organization and architecture, 
discrete mathematics, data structures, and algorithms 
● Knowledge of high-level programming languages (e.g., C++, Java) and scripting languages (e.g., Python) 

Learning Outcomes 
Learners completing this course will be able to: 
● Differentiate among major data models such as relational, spatial, and NoSQL 
● Perform queries (e.g., SQL) and analytics tasks in state-of-the-art database systems 
● Apply leading-edge techniques to design/tune distributed and parallel database systems 
● Utilize existing NoSQL database systems as appropriate for specified cases 
● Perform database operations (e.g., selection, projection, join, and groupby) in state-of-the-art cluster 
computing systems such as Hadoop/Spark (see the sketch after this list) 
● Perform scalable data processing operations (e.g., selection, projection, join, and groupby) in cloud 
computing environments, including Amazon AWS 
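
For illustration only, here is a minimal PySpark sketch of the kind of operations named above (selection, projection, join, and groupby). The table names (orders, customers), column names, and the S3 paths are hypothetical placeholders, not course-provided datasets:

# Minimal PySpark sketch; dataset paths, tables, and columns are hypothetical.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("cse511-sketch").getOrCreate()

orders = spark.read.parquet("s3://example-bucket/orders")        # hypothetical path
customers = spark.read.parquet("s3://example-bucket/customers")  # hypothetical path

result = (orders
          .filter(F.col("amount") > 100)                   # selection
          .select("customer_id", "amount")                 # projection
          .join(customers, on="customer_id", how="inner")  # join
          .groupBy("country")                              # groupby
          .agg(F.sum("amount").alias("total_amount")))

result.show()

The same pattern runs unchanged on a local Spark installation or on a cloud cluster (e.g., Amazon EMR); only the input paths and cluster configuration differ.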

Estimated Workload/Time Commitment Per Week 


15 - 20 hours per week 

Technology Requirements 
Hardware - Standard hardware with a major OS 
Software and Other (programs, platforms, services, etc.) - To complete course projects, some of the following 
may be required: Amazon AWS, Cloud, Hadoop/Spark, GitHub, PostgreSQL, MongoDB, Neo4j. 
 

 
 
Creators 
 

Dr. Mohamed Sarwat  


Mohamed Sarwat is an Assistant Professor of Computer Science and the director of the Data Systems 
(DataSys) lab at Arizona State University (ASU). He is also an affiliate member of the Center for Assured and 
Scalable Data Engineering (CASCADE). Before joining ASU, Mohamed obtained his MSc and PhD degrees in 
computer science from the University of Minnesota. His research interest lies in the broad area of data 
management systems. 

 
 

 
Dr. Ming Zhao 
Ming Zhao is an associate professor in the ASU School of Computing, Informatics, and Decision Systems 
Engineering. Before joining ASU, he was an associate professor in the School of Computing and Information 
Sciences (SCIS) at Florida International University. He directs the Research Laboratory for Virtualized 
Infrastructure, Systems, and Applications (VISA). His research interests are in distributed/cloud computing, big 
data, high-performance computing, autonomic computing, virtualization, storage systems, and operating 
systems. 

 
 

