Engineering Big Data with R and Hadoop Ecosystem
Essentials of Applied Predictive Analytics
http://www.insofe.edu.in
Sessions 1 to 13

Lecture Session topics, in order:
Introduction to Big Data & Applications
The Hadoop Eco-system
Parallel architectures and concurrent algorithms
Distributed File Systems, GFS & HDFS
HDFS (continued), CDH4 HDFS
Map Reduce
Map Reduce (continued)
Map Reduce (continued), YARN, Hadoop Streaming
Sqoop, Hive
R-Hadoop
NoSQL databases including HBase
PIG, Oozie
Machine Learning on Hadoop, Mahout

Lab Session (15-30min) activities, in order:
Live demo of an Internet-based big data application (10-15min)
Different Hadoop installations (20min)
Linux shell, Java basics demo (5+20min)
Using HDFS from shell & from programs, HDFS configuration & log files (30min)
MR configuration and log files (15min)
Word Count with MR (20min)
Hadoop Streaming (in some language popular with this batch); CDH4 features demo? (20+5-10min)
Sqoop, Hive demo (5+20min)
Demonstration of Word Count in RHadoop, contrast with MR version (30min)
More examples on Hive and R-Hadoop. Small demo of HBase (20-25min)
PIG, Oozie demo (20+5min)
Demonstrate Mahout. Run on movie recommendation data (30min)
MR demo of text index building. Assign Text Search homework (homework can be done in any one of R-Hadoop / PIG / Hive / Java MR / Hadoop Streaming, as per individual preference) (25+15min)

Sessions 14 to 20

Lecture Session topics, in order:
Other ecosystem components
Text Classification, text clustering
Graph processing & Applications including SSSP
PageRank, BSP, Hama
Pregel, Giraph, Social Network Mining
Certification & Wrap up

Lab Session activities, in order:
Mahout for text classification. Text search student submissions discussion (15+20min)
MR demo of SSSP on a non-trivial graph (20min). Assign graph processing homework
PageRank demo on MR and Hama (10+10min)
Graph homework student submissions discussion (20min)
Interaction session with certified professionals (20min)
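The Word Count lab listed in the schedule is the customary first MapReduce exercise. As a rough illustration only, the three phases (map, shuffle, reduce) can be imitated in plain Python without any Hadoop installation. The function names below are hypothetical and are not taken from the course material or from the Hadoop API; a real job would run the mapper and reducer on separate machines and let the framework do the shuffle.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group emitted values by key, the job the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence on an in-memory list of lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data", "Big Hadoop data"]))
```

Keeping the mapper and reducer free of shared state is what lets a framework such as Hadoop run many copies of each in parallel.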
Day 10: Generalizing Decision Trees; Information Content and Gain Ratio; dealing with numerical variables; other measures of randomness
Day 11: Inductive learning from a 500-ft view; issues in inductive learning such as the curse of dimensionality; overfitting; bias-variance tradeoff
Day 12: Pruning a Decision Tree; cost as a consideration; unwrapping trees as rules
Day 13: A mathematical model for association analysis
Day 14: Large itemsets and Association Rules; Apriori: constructing large itemsets that meet minimum support (minsup) by iteration
Day 15: Interestingness of discovered association rules; application examples; association analysis vs. classification
Day 16: Using Association Rules to compare stores; Dissociation Rules; sequential analysis using Association Rules
Day 17: Data visualization and story-telling: anatomy of a graph
Day 18: Animated graphs, BI dashboards and the latest trends in data visualization
Days 19 and 20: An end-to-end case study in R involving understanding the data, filling in missing values, applying and assessing models, and reporting the results
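As an illustration of the Day 10 topics, the sketch below computes information content (entropy) and gain ratio for a single categorical attribute. It follows the standard textbook definitions (gain ratio = information gain divided by split information); the function names and the toy data are assumptions made for this example and do not come from the course material.

```python
import math
from collections import Counter


def entropy(items):
    """Shannon entropy, in bits, of the value distribution in `items`."""
    total = len(items)
    return -sum((c / total) * math.log2(c / total) for c in Counter(items).values())


def gain_ratio(values, labels):
    """Gain ratio of splitting `labels` on the categorical attribute `values`."""
    total = len(labels)
    # Partition the class labels by attribute value.
    subsets = {}
    for value, label in zip(values, labels):
        subsets.setdefault(value, []).append(label)
    # Weighted entropy that remains after the split.
    remainder = sum(len(s) / total * entropy(s) for s in subsets.values())
    info_gain = entropy(labels) - remainder
    # Split information penalizes attributes with many distinct values.
    split_info = entropy(values)
    return info_gain / split_info if split_info > 0 else 0.0


if __name__ == "__main__":
    # Hypothetical toy data: the attribute separates the two classes perfectly.
    print(gain_ratio(["a", "a", "b", "b"], ["yes", "yes", "no", "no"]))
```

An attribute with a single value carries no split information, so the sketch returns 0.0 in that case rather than dividing by zero.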
Mentor Profiles
Dr. SREERAMA K MURTHY
Co-founder and CEO, Teqnium Consultancy Services
PhD in Data Mining, Johns Hopkins University

Classes Taught
Engineering Big Data with R and Hadoop Ecosystem

Brief Profile
Ph.D. - Johns Hopkins University
M.Tech. - IIT, Chennai (Madras)
B.E. - NIT, Allahabad
17 years of work experience after Ph.D. (USA: 5 years, India: 11 years)
21 US patent applications (8 issued), 2 Indian patent applications
Many invention disclosures, numerous journal and conference papers
Designed, managed, built and deployed large software systems
Technocrat, combining love for technology with entrepreneurship and business management
Helped conceptualize business plans of three ventures; obtained millions of dollars in funding

Chairman & CEO - Teqnium Consultancy Services
Director, Technology - Globarena ITeknowledge Pvt Ltd
Managing Director - Globarena Web Technologies
Senior Manager and Head, E-Commerce Research group - IBM India Research Lab
Researcher - Siemens Corporate Research

Areas of Expertise: Technology Enabled Education and Training, e-Skilling, Outsourced R&D, Data Mining, Digital Security, Healthcare Informatics
Specialties: Education Strategy, Role of Technology in Skills Development, Instructional Design, Research, Intellectual Property, Novel Product Design
Classes Taught
Essentials of Applied Predictive Analytics
Brief Profile
Ph.D. - Carnegie Mellon University (CMU)
M.S. - Carnegie Mellon University (CMU)
B.E. - NIT, Tiruchirapalli
15 years of work experience after Ph.D. in diverse organizations ranging from defense research to a Web startup and mid-size IT services companies

President - International School of Engineering
Chief Research Officer - Prithvi Information Solutions Ltd., Hyderabad
Founder and Managing Director - Axaya Cybertech Pvt Ltd
Co-founder and Managing Director - Globarena ITeknowledge Pvt. Ltd
Scientist - Defence Metallurgical Research Laboratory, Hyderabad

During his years of experience as a scientist and entrepreneur, Dr. Murthy has applied his strengths in logical thinking, math and science to solving industrial and societal problems, designing solutions from fundamentals, identifying, training and motivating high-quality individuals, and articulating the findings in a lucid manner to all the stakeholders.

Over the past few years, Dr. Murthy has been actively teaching Data Analytics to working professionals with a wide range of experience and from diverse industries. He has also been consulting on Data Science projects for companies ranging from Fortune 25 firms to IT services companies and startups.

He built the Business Analytics and Optimization division of a mid-tier IT services company from scratch and filed for 5 patents in Retail and Telecom Analytics, during which time he also acquired Fortune 500 clients and turned the division into a profitable delivery center.
Fee Structure
Program Fee for Each Individual Module:
For International Students: $9 application fee and $640 program fee
For Indian Students: Rs. 500 application fee and Rs. 35,000 program fee

Program Fee for Two Modules:
For International Students: $9 application fee and $960 program fee
For Indian Students: Rs. 500 application fee and Rs. 54,000 program fee

For more details, please visit: http://insofe.edu.in/init/default/elearning_engineering_big_data