7/11/14
Bill Howe, UW
Plan
context (8 min)
panelists (5 x (5 min + 2 min))
discussion
Statistics
traditional analysis
Data Munging
parsing, scraping, and formatting data
Visualization
graphs, tools, etc.
Huge number of relevant courses, new and existing.
[Figure: four axes for positioning courses. Tools: tools vs. abstractions. Math: structures vs. statistics. Scale: desktop vs. cloud. Audience: hackers vs. analysts.]
William W. Cohen, Machine Learning
Dan Suciu, Magda Balazinska
Bill Howe
Session 1, Spring 2013
Session 2 (starts Monday!)
Participation numbers
Registered:
Clicked play in first 2 weeks:
Turned in 1st homework:
Completed all assignments:
Passed:
Forum threads:
Forum posts:
Syllabus
Data Science Landscape (~1 week)
Data Manipulation at Scale
  Relational Databases (~1 week)
  MapReduce (~1 week)
  NoSQL (~1 week)
Analytics
  Statistics Pearls (~1 week)
    multiple hypothesis testing, effect size, Bayesian methods, bootstrap
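As a rough illustration of the bootstrap topic named above, here is a minimal sketch of a percentile bootstrap confidence interval for a sample mean. The function name `bootstrap_ci`, its parameters, and the toy data are assumptions made for this example; they are not taken from the course materials.

```python
import random
import statistics


def bootstrap_ci(sample, stat=statistics.mean, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for stat(sample).

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(sample)
    # Resample with replacement, recompute the statistic, and sort the results.
    estimates = sorted(
        stat([rng.choice(sample) for _ in range(n)])
        for _ in range(n_resamples)
    )
    # Take the alpha/2 and 1 - alpha/2 percentiles of the resampled statistics.
    lo_idx = int(round((alpha / 2) * n_resamples))
    hi_idx = int(round((1 - alpha / 2) * n_resamples)) - 1
    return estimates[lo_idx], estimates[hi_idx]


# Toy measurements, made up for the example.
data = [2.1, 2.5, 1.9, 3.2, 2.8, 2.4, 3.0, 2.2, 2.7, 2.6]
low, high = bootstrap_ci(data)
print(f"mean = {statistics.mean(data):.2f}, 95% CI ~ ({low:.2f}, {high:.2f})")
```

The percentile method is only one of several bootstrap interval constructions; it is used here because it is the shortest to write down.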
Pandas (Python)
merge(left, right, on=key)
dplyr (R)
filter(x), select(x), arrange(x), group_by(x),
inner_join(x, y), left_join(x, y), ...
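The join operations listed above can be sketched in pandas as follows. The two toy tables and their column names are hypothetical; `pd.merge` with `how="inner"` (the default) and `how="left"` plays roughly the role of dplyr's `inner_join` and `left_join`.

```python
import pandas as pd

# Hypothetical toy tables; names and values are illustrative only.
students = pd.DataFrame({"student_id": [1, 2, 3], "name": ["Ana", "Ben", "Caro"]})
grades = pd.DataFrame({"student_id": [1, 2, 4], "score": [90, 75, 60]})

# Inner join (merge's default): keep only keys present in both tables.
inner = pd.merge(students, grades, on="student_id")

# Left join: keep every row of the left table; unmatched scores become NaN.
left = pd.merge(students, grades, on="student_id", how="left")

print(inner)
print(left)
```

The student with id 3 has no grade, so that row is dropped by the inner join and kept with a missing score by the left join.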
Possible Responses
Data science is just a buzzword; there's no substance to it.
I'm already teaching all this stuff; there's nothing new here.
This is a job for statistics departments / B-schools / I-schools / applied math / anyone else.