IEOR E4725 Spring 2016 Syllabus

Big Data in Finance - Syllabus
The vast proliferation of data and increasing technological complexities continue to transform the way
industries operate and compete. Over the last two years, 90 percent of the data in the world has been
created as a result of the creation of 2.5 quintillion bytes of data on a daily basis. Commonly referred to
as big data, this rapid growth and storage creates opportunities for collection, processing and analysis of
structured and unstructured data.
Financial services, in particular, have widely adopted big data analytics to inform better investment
decisions with consistent returns. In conjunction with big data, algorithmic trading uses vast historical
data with complex mathematical models to maximize portfolio returns. The continued adoption of big
data will inevitably transform the landscape of financial services. However, along with its apparent
benefits, significant challenges remain in regards to big data’s ability to capture the mounting volume of
data.
The increasing volume of market data poses a big challenge for financial institutions. Along with vast
historical data, banking and capital markets need to actively manage ticker data. Likewise, investment
banks and asset management firms use voluminous data to make sound investment decisions. Insurance
and retirement firms can access past policy and claims information for active risk management.
The course will be a mix of Theory and practice with real big data cases in finance. We will invite guest
lecturers mostly for real Big Data Finance Applications.
Theory
1. Introduction: What Is Data Science?

2. Structured and Unstructured Data. Providing Structure to Unstructured Data
3. Analyzing data. Statistical Inference, Exploratory Data Analysis, and the Data Science Process.
Clustering, classifying, recommending, and modeling.
4. Modern Applied Statistics: Learning and Data Mining. Machine Learning. Supervised,
Unsupervised, Reinforced and Deep Learning.
1. Classification: k-nearest neighbors, the optimal Bayes classifier, naive Bayes, LDA and QDA,
reduced rank LDA, logistic regression. Support Vector Machines.
2. Regression: linear regression, bias-variance decomposition, subset selection, shrinkage

methods including ridge regression and the Lasso, regression in high dimensions. Support Vector
Machines.
3. Cluster analysis: BIRCH, Hierarchical, k-means, Expectation-maximization (EM), DBSCAN,

OPTICS and Mean-shift.
4. Dimension Reduction : principal components analysis (PCA), kernel PCA, non-negative matrix
decomposition, PageRank, Factor Modeling and Copulas.
5. Bayesian Models and Inference using Markov Chain Monte-Carlo (MCMC) including Gibbs
sampling and Metropolis-Hastings.
6. Introduction to Graphical Models: Bayesian networks, Markov networks, inference in

graphical models.
7. Deep learning
8. Estimation and Robustness
9. Optimization Techniques
5. Spam Filters, Naive Bayes, and Wrangling

6. Extracting Meaning from Data
7. Recommendation Engines: Building a User-Facing Data Product at Scale
8. Data Visualization and Fraud Detection
9. Social Networks and Data
10. Data Engineering: MapReduce, Pregel, and Hadoop
11. Big Data in Physics – The CERN case
Practice - Big data in Finance applications
1. Predictive Analytics / Trading

2. Sentiment Analysis
3. Financial Fraud
4. Credit Ratings
5. Pricing
6. Customer Segmentation
7. Know Your Customer
Course Grading
 Homework : 25%
 Final Exam : 35%
 Big data Finance project : 40%
Instructor : Miquel Noguer i Alonso PhD
Textbook
No textbook but recommended reading:
 Doing Data Science, O’Reilly Media

 Principles of Big Data, Elsevier
 Python for Data Analysis, O’Reilly Media
 Machine Learning for Hackers, O’Reilly Media.
 A translation of the R examples in Machine Learning for Hackers to Python can be found here:
http://slendrmeans.wordpress.com/will-it-python/
 Probabilistic Programming and Bayesian Methods for Hackers

IEOR E4725 Spring 2016 Syllabus

Caricato da

Informazioni sul documento

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

IEOR E4725 Spring 2016 Syllabus

Caricato da

Copyright:

Formati disponibili

Big Data in Finance - Syllabus

1. Introduction: What Is Data Science?

2. Regression: linear regression, bias-variance decomposition, subset selection, shrinkage

3. Cluster analysis: BIRCH, Hierarchical, k-means, Expectation-maximization (EM), DBSCAN,

6. Introduction to Graphical Models: Bayesian networks, Markov networks, inference in

8. Estimation and Robustness

5. Spam Filters, Naive Bayes, and Wrangling

Practice - Big data in Finance applications

1. Predictive Analytics / Trading

Instructor : Miquel Noguer i Alonso PhD

No textbook but recommended reading:

 Doing Data Science, O’Reilly Media

Potrebbero piacerti anche