Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Program Curriculum
Learning outcomes:
• Learn to implement Machine Learning techniques using Python
• Learn data visualization techniques
• Learn to analyze raw data
• Learn Big Data and Spark
Python
1. Introduction to Python
2. Python Environment
3. Executing Python
• Interactive Mode
• Scripting Mode
• Integrated Development Environment
• Python Identifiers
• Reserved Words
• Lines and Indentation
www.edyoda.com hello@edyoda.com
5. Python Variable Types
• Arithmetic Operators
• Comparison Operators
• Assignment Operators
• Bitwise Operators
• Logical Operators
• Membership Operators
• Identity Operators
• Operators Precedence
• IF statements
• IF...ELIF...ELSE Statements
• Nested IF statements
8. Python Loops
• While loop
• For loop
• Nested loop
• Break control statement
• Continue statement
• Pass statement
9. Python Numbers
www.edyoda.com hello@edyoda.com
10. Python Strings
www.edyoda.com hello@edyoda.com
15. Python Modules
• Importing Modules
• Namespaces and scoping
• Packages
• Creating Classes
• Creating instance objects
• Destroying Objects (Garbage Collection)
• Custom Classes
• Attributes and Methods
• Inheritance and Polymorphism
• Using Properties to Control Attribute Access
• Lambda
• Filter
• Map
• Functools
www.edyoda.com hello@edyoda.com
20. Iterators and Generators
• Itertools
• Generators
• Decorators
21. Collections
• Deque
• Counter
• OrderedDict
• ChainMap
• Pdb
• Breakpoints
• Pip
• Virtualenv
• The init.py files
• The setup.py file
• Installing the package
• Software deployment in Python
www.edyoda.com hello@edyoda.com
Data Wrangling
2. Essential NumPy
• Introduction to NumPy
• Creation
• Access
• Stacking and Splitting
• Methods
• Broadcasting
• Introduction to Pandas
• Understanding Series & DataFrames
• Loading CSV,JSON
• Connecting databases
• Descriptive Statistics
• Accessing subsets of data - Rows, Columns, Filters
• Handling Missing Data
• Dropping rows & columns
• Handling Duplicates
• Function Application - map, apply, groupby, rolling, str
• Merge, Join & Concatenate
• Stacking, Unstacking & Melting
• Pivot-tables
• Normalizing JSON
• Application - EDA on Employee data, sales data
www.edyoda.com hello@edyoda.com
4. Understanding Visualization:
Mathematics Fundamentals
Machine Learning
www.edyoda.com hello@edyoda.com
2. Preprocessing for Machine Learning
• Introduction to Preprocessing
• StandardScaler
• MinMaxScaler
• RobustScaler
• Normalization
• Binarization
• Encoding Categorical (Ordinal & Nominal) Features
• Imputation
• Polynomial Features
• Custom Transformer
• Text Processing
• CountVectorizer
• TfIdf
• HashingVectorizer
• Image using skimage
3. Decision Trees
4. Naive Bayes
www.edyoda.com hello@edyoda.com
5. Composite Estimators using Pipelines & FeatureUnions
• Cross Validation
• Hyperparameter Tuning
• Model Evaluation
• Model Persistence
• Validation Curves
• Learning Curves
8. Nearest Neighbors
www.edyoda.com hello@edyoda.com
• Nearest Neighbors for Regression
• Nearest Centroid Classifier
• Application - Nearest neighbour for face inpainting
9. Clustering Techniques
www.edyoda.com hello@edyoda.com
12. Dealing with Imbalanced Classes
www.edyoda.com hello@edyoda.com
Big Data Ecosystem
• Big Data
• Understanding distributed computing
• Introduction to Hadoop
• HDFS, YARN, MapReduce
• Limitations of Hadoop
• Introduction to Spark
• Introduction to Kafka
• Hive
• Cassandra
• Driver
• Executors
• Partitions
• Jobs
• Stages
• Tasks
• Resilient Distributed Datastructure
• DataFrames as a High Level Datastructure
www.edyoda.com hello@edyoda.com
• DataFrame Column
• Creating tables from dataframe
• SQL query
• DataFrame Grouping
• DataFrame Functions
• User Defined Functions (UDF)
www.edyoda.com hello@edyoda.com