Sei sulla pagina 1di 2

09/03/16- 01/14/17

Class Hours: 4 hours/week


Work Load: 10-15 hours/week

Data Application Lab Data Application Lab

Data Scientist Training Program 925 S Atlantic Blvd, Monterey Park, CA


info@datalaus.com; 1-800-485-7918
OOver

Syllabus

Week 1: Week 2:
Python Eco-system for Data Analytics Data Exploration & Visualization
- 1. What is data scientist - 1. Python basic visualization
- 2. Numpy, scipy, pandas, ipython book - 2. OOP concept and plotting principle
- 3. Basic analytics using pandas - 3. Advanced visualization
- 4. Case study using pandas - 4. Exploratory data analysis
Machine Learning Algorithm-1 Machine Learning Algorithm-2
– Brief Introduction to Machine Learning Algorithm – SVM Classifiers

Week 3: Week 4:
Scikit Learn Eco-system for Machine Learning Regression & Classification
- 1. Machine learning introduction - 1. Regularization (Lasso, Ridge, Elastic-Net)
- 2. Scikit Learn package - 2. Basic classification model (Logistic, Tree, SVM)
- 3. Basic regression model - 3. Model measurement (classification)
- 4. Cross validation & model measurement - 4. Bias variance trade off
(regression)
Machine Learning Algorithm-3 Machine Learning Algorithm-4
– ANN – Decision Tree

Week 5: Week 6:
Dimension Reduction & Unsupervised Learning Data Analysis using Hadoop Hive
- 1. Model feature selection - 1. Hadoop ecosystem introduction
- 2. Principle component analysis - 2. HDFS
- 3. Clustering analysis (K-mean, KBSCAN, etc) - 3. Data analysis with Hive
- 4. Other ML methodology (ensemble method,
random forest, etc)
Data Analysis using Python-1 Data Analysis using Python-2
– Machine Learning Algorithms Implementation – Web Crawler

Week 7: Week 8:
Data Analysis using Apache Pig Data Processing using Spark SQL and data frame
– 1. Pig Introduction – 1. Spark SQL
– 2. Pig Latin language (continue at the next page) – 2. Spark data frame (continue at the next page)

Data Scientist Training Program 1


– 3. Pig works as ETL Data Analysis using Python-4
Data Analysis using Python-3 – Basic CS Algorithm 2
– Basic CS Algorithm
Week 10:
Week 9: Data Science in different industries including finance,
Machine Learning using Spark MLLib marketing, online advertising
– 1. MLLib (Kaggle Case Study)
– 2. GraphX Data Analysis using Python-6
Data Analysis using Python-5 – Web Front-end
– MapReduce

Project Internship (10 weeks) Career Assist (4 hours)


5 Weeks - Live Kaggle Competition – Mock Interview
– Career consulting
5 Weeks – End-to-End Data App Development – Resume polish
Choose 1 of the 2:
1. Steam Gaming Platform Recommendation System

2. Fintech: Lending Club Investment Consulting Service

Data Scientist Training Program 2

Potrebbero piacerti anche