Mobility Patterns, Big Data and Transport Analytics: Tools and Applications for Modeling
Ebook: 768 pages, 12 hours

About this ebook

Mobility Patterns, Big Data and Transport Analytics provides a guide to the new analytical framework and its relation to big data, focusing on capturing, predicting, visualizing and controlling mobility patterns - a key aspect of transportation modeling. The book features prominent international experts who provide overviews of new analytical frameworks, applications and concepts in mobility analysis and transportation systems. Users will find a detailed mobility ‘structural’ analysis and a look at the extensive behavioral characteristics of transport, observability requirements and limitations for realistic transportation applications, and transportation systems analysis related to complex processes and phenomena.

This book bridges the gap between big data, data science, and transportation systems analysis with a study of big data’s impact on mobility and an introduction to the tools necessary to apply new techniques.


  • Guides readers through the paradigm-shifting opportunities and challenges of handling Big Data in transportation modeling and analytics
  • Covers current analytical innovations focused on capturing, predicting, visualizing, and controlling mobility patterns, while discussing future trends
  • Delivers an introduction to transportation-related information advances, providing a benchmark reference by world-leading experts in the field
  • Captures and manages mobility patterns, covering multiple purposes and alternative transport modes, in a multi-disciplinary approach
  • Companion website features videos showing the analyses performed, as well as test codes and data-sets, allowing readers to recreate the presented analyses and apply the highlighted techniques to their own data
Language: English
Release date: Nov 27, 2018
ISBN: 9780128129715

    Book preview

    Mobility Patterns, Big Data and Transport Analytics - Constantinos Antoniou

    Chapter 1

    Big Data and Transport Analytics: An Introduction

    Constantinos Antoniou⁎; Loukas Dimitriou†; Francisco Câmara Pereira‡

    ⁎ Department of Civil, Geo and Environmental Engineering, Technical University of Munich, Munich, Germany

    † Laboratory for Transport Engineering, Department of Civil and Environmental Engineering, University of Cyprus, Nicosia, Cyprus

    ‡ Department of Management Engineering, Technical University of Denmark (DTU), Lyngby, Denmark

    Abstract

    The aim of this book is to address the question of how the transportation profession and research community can benefit from the new era of Big Data and Data Science: the opportunities that arise, the new threats that are emerging, and how old and new challenges can be addressed with the enormous quantities of information that are expected to become available.

    Keywords

    Moore's law; Data Science; Big Data; Transport analytics

    Chapter Outline

    1 Introduction

    2 Book Structure

    Special Acknowledgments

    References

    Further Reading

    1 Introduction

    The aim of this book is to address the question of how the transportation profession and research community can benefit from the new era of Big Data and Data Science: the opportunities that arise, the new threats that are emerging, and how old and new challenges can be addressed with the enormous quantities of information that are expected to become available.

    The current era can be characterized by three main components:

    1. an unprecedented availability of (structured and unstructured) information, collected not only through traditional sources/sensors, but also from the extensive wealth of nontraditional sources, such as the Internet of Things and crowdsourcing;

    2. a vast expansion of computational means (hardware and—most significantly—paradigms) exceeding Moore's law (Moore, 1965); and

    3. the development of powerful new computational methods able to address the challenges of extensive information, executable only on powerful computational means (interconnected and cloud integrated).

    These three elements have triggered a tremendous boost in inspiration and incentives for new developments in business and industrial applications, in the associated research community, and in social and governmental organizations overall. The stage has changed.

    This constitutes the new, vibrant scientific area of Data Science, adding a data-driven analytical paradigm that complements the three existing traditional ones, viz., the empirical, the theoretical, and the computational. Like any newcomer, Data Science has been received by many with some reluctance (Pigliucci, 2009; Milne and Watling, 2017), but by others as a path to new (and easy) revelations.

    Famously, Chris Anderson declared that this is the End of Theory,¹ following the long tradition of human ambition to conquer knowledge and the future, from the biblical Tree of Knowledge to statements by prolific figures of science such as (purportedly) Charles Holland Duell's "Everything that can be invented has been invented," Lord Kelvin's "There is nothing new to be discovered in physics now; all that remains is more and more precise measurement," and David Hilbert's "We must know; we will know!", all of which were eventually defeated (e.g., Gödel, 1931).

    However, Anderson's (2008) statement that "correlation supersedes causation, and science can advance even without coherent models, unified theories, or really any mechanistic explanation at all," naively taken, may lead to the misconception that the fundamental use of models (hypothesis testing and explanatory analysis) is obsolete. Of course, this is not the essence of data analysis, though data-centric analysis does have an impact on the experimental design, the type of information that is used for a particular research purpose, and the type of confirmatory criteria used for evaluating results. These differences in data use (in data availability and volume) and model building (in model typology and fundamental assumptions) signify a turning point in scientific reasoning, requiring new theoretical and practical developments for treating the new scientific threats, as well as the preparation of a new generation of scientists able to appropriately handle the new tools that will increasingly become available.

    Focusing on the field of transportation systems analysis, Data Science endeavors are well suited to its characteristics, which, succinctly, include:

    Complex and large scale, composed of multiple distinctive units, arranged in multiple sequences, layers, or parallel operations;

    Spatially distributed, establishing connectivity and service among remote locations by a synthesis of supply means (transport infrastructure and transport modes);

    Multiple-agent engagement, with agents involved in cooperative, noncooperative, and competitive relationships among themselves and with the transport infrastructure;

    Dynamic/Transient, since transport is by definition a dynamic phenomenon of movement in space and time; and

    Stochastic, since transport operations are the manifestation of the decision-making processes of agents (travelers, shippers, carriers, etc.) with different characteristics, properties, opportunities, flavors, and criteria, while decisions are made in an environment that fluctuates in its physical, economic, and other elements.

    The above fundamental characteristics of transportation systems are sources of complexity, reflected as inaccuracies (or failures) of the typical/traditional analytical paradigms, especially when applied in real-world circumstances. The use of Big Data, treated within the new analytical field of Data Science, in our view marks a promising new era for understanding and managing existing and future transportation phenomena.

    The effective exploitation of the promises of Big Data and Data Science depends on the rate of adoption of emerging methods and applications by the relevant scientific and industrial communities. It should be highlighted that the general public (such as end-users and markets) anticipates new developments, with the community of "early adopters" growing rapidly.

    But what are the characteristics of the relevant contemporary (and future) transportation scientist? How should the new generation of transportation scientists be equipped? Is it all about data handling/processing/analysis? The view reflected throughout this book is that a strong scientific background in the field, topic, or system is a compulsory prerequisite for testing or adopting data-centric applications. This is far removed from so-called black-box approaches and the risks they involve. Advanced data analysis and Data Science concepts stand for an additional tool of the transportation professional, who should be formally prepared (possibly through dedicated programs) to embrace them. However, this should not be viewed as a shortcut around the fundamentals.

    Finally, the idea for this book was conceived during a Summer School on these topics organized by the Editors and held in June 2016 at the premises of the University of Cyprus, Laboratory for Transport Engineering, with the participation of most of the (co)authors. During the Summer School, the multidisciplinary composition of both the instructors and the attendees became immediately evident. This pluralism of ideas, approaches, and concepts is reflected here. We are confident that readers will benefit from the contents of this book and will enjoy this guided trip through the different topics, models, and applications, which aims to cover some of the most important fields of the transportation profession where Big Data applications have matured enough.

    2 Book Structure

    Rather than taking the form of a textbook, the book is structured to provide a guided trip for the reader, from introductory concepts on the use of Big Data and Data Science methods in transportation to the presentation of indicative (though mature) applications. In this way, the reader may gain helpful insight into how concepts may be treated with new methods in order to develop innovative applications. We think that this innovation-oriented process is more inspiring and more timely than a strictly closed-form, self-contained textbook of specific applications on selected topics of mobility. To achieve this, the book is divided into four sections: the introductory, the methodological, the applications, and the outlook (Fig. 1).

    Fig. 1 Flowchart of the book structure.

    In the current introductory part (this chapter), the general view of the book on the use and value of data-centric analysis is provided. Then, in the second part of the book, a review and presentation of the fundamentals of machine-learning methods is provided (Chapter 2), while a discussion of the combination of theory-driven and data-driven methods is offered in Chapter 3. The setting of mobility analysis, human activities, and the living structure within geographical space is then offered in Chapter 4. Issues regarding data preparation (Chapter 5) and data visualization (Chapter 6) provide important preparation for prospective transportation data-analytics professionals and researchers. The preparatory part of the book also comprises the theoretical underpinnings of integrating model-based machine-learning approaches in transportation (Chapter 7) and the use of nontraditional textual data for analyzing mobility (Chapter 8). After having read the methodological part, the reader will be equipped with the essential methodological tools to better follow the applications-oriented part. In detail, indicative applications of nontraditional examples using data-analytic approaches are selected, facilitating the understanding of how new methods have been adopted to analyze transportation demand and systems (Chapters 9–11 and 15), road safety (Chapter 12), mobility patterns (Chapter 13), and transport infrastructure (Chapter 14). By the end of the book's third part, the reader will have gained knowledge of applications of current state-of-the-art methods in various elements of the transportation field and, hopefully, inspiration for extending or improving the use of Big Data and Data Science methods in the field. The fourth and final part of the book provides an outlook on the use of advanced data analytics methods in transport, aiming to offer useful foresight to prospective transport professionals, developers, and researchers.

    Special Acknowledgments

    The Editors of this book would like to gratefully acknowledge the patient and helpful support of the Editorial Managers (especially Mr. Tom Stover, Acquisitions Editor, Transport, and Ms. Naomi Robertson) throughout the development process, as they provided their valuable experience and organizational support for putting together and concluding this work.

    Furthermore, the Editors would like to express their sincere and deep gratitude to all the distinguished authors who contributed to this effort and agreed to share their academic and professional experience with the scientific community through this volume. Without their invaluable contributions, this book would not have been possible.

    References

    Anderson C. The End of Theory: The Data Deluge Makes the Scientific Method Obsolete. Wired. 2008.

    Gödel K. Über formal unentscheidbare Sätze der Principia Mathematica und verwandter Systeme, I. Monatsh. Math. Phys. 1931;38:173–198.

    Milne D., Watling D. Big data and understanding change in the context of planning transport systems. J. Transp. Geogr. 2017;doi:10.1016/j.jtrangeo.2017.11.004.

    Moore G. Cramming more components onto integrated circuits. Electronics. 1965;38(8).

    Pigliucci M. The end of theory in science?. EMBO Rep. 2009;10(6):534 Science and Society, Opinions.

    Further Reading

    Hilbert D. Address to the Society of German Scientists and Physicians, Königsberg. 1930.


    ¹ Wired, 06.23.08.

    Part I

    Methodological

    Chapter 2

    Machine Learning Fundamentals

    Francisco Câmara Pereira; Stanislav S. Borysov

    Department of Management Engineering, Technical University of Denmark (DTU), Lyngby, Denmark

    Abstract

    This chapter aims to be a smooth introduction to the basic concepts of machine learning and, building on them, to explain some of the latest advanced techniques. After a brief historical perspective, we overview the two currently most popular machine learning frameworks—deep learning and probabilistic graphical models. We conclude the chapter with practical advice about machine learning experiments that a beginner needs to know. A good understanding of these fundamentals opens up a wide portfolio of opportunities for predictive models in transportation, and is hopefully a good basis for the remainder of this book.

    Keywords

    Machine learning; Deep learning; Probabilistic graphical models

    Chapter Outline

    1 Introduction

    2 A Little Bit of History

    3 Deep Neural Networks and Optimization

    4 Bayesian Models

    5 Basics of Machine Learning Experiments

    6 Concluding Remarks

    References

    Further Reading

    1 Introduction

    At the time of writing this chapter, with so much buzz around concepts such as artificial intelligence, big data, deep learning, and probabilistic graphical models (PGMs), machine learning has gained a bit of a mystical fog around it. This excitement has more recently led to a series of dystopian visions that, among others, literally suggest the end of the world (e.g., Kurzweil, 2005; Bostrom, 2014)! We will delve into some of these aspects below, in the historical perspective, but first we want to demystify machine learning as a general discipline.

    Machine learning combines statistics, optimization, and computer science. The statistics that we mention here is exactly the same discipline that has been taught for several centuries. No more, no less. In their vast majority, machine learning models also consist of functions that directly or indirectly end up as

    y = f(x; β) + ϵ

    In other words, we want to estimate (or predict) the value of a response variable, y, through a function f(x; β), where x is a vector with our observed (input) variables, and β is a vector with the parameters of our model. Since, in practice, our data (contained in x) is not perfect, there is always a bit of noise and/or unobserved data, which is usually represented by the error term, ϵ. Because we cannot really know the true value of ϵ, it is itself a random variable. Of course, as a consequence, y is a random variable as well.

    It is almost certain that you are familiar with the most basic form of f, the linear model:

    y = β0 + β1x1 + β2x2 + ⋯ + βnxn + ϵ

    So, if you have a dataset with pairs of (x, y), the task is to estimate the values of β that best reproduce the linear relationship. You do this using optimization, sometimes called the training process, which is the minimization of the difference between the true values of y and the model's predictions, also known as a loss function. By now, you are bored? Great, let us now imagine that f(x) is instead defined as

    f(x) = fK(fK−1(⋯f2(f1(x))⋯))

    where each function fk transforms its input and gives its result to the following one. Or, before doing that, it processes its data in a distributed fashion, for example, fK1(x1, …, xj), fK2(xj+1, …, xk), …, fKL(xk+1, …, xr), where we have L subfunctions, each processing a subpart of the data. Imagine you have really a lot (thousands!) of such functions. Congrats, you have just discovered a deep neural network (DNN) (or, in fact, many other types of models, including PGMs, another popular one these days). Of course, it can become much more complicated, but the principle is precisely the same. You have a sequence of functions, each one with its own set of parameters (sometimes shared among many different functions), where the output of one is the input of the next. Due to their large number, the estimation of these parameters may require a large amount of data and computation. And that is where it becomes distinct from classical statistics.
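    To make the above concrete, here is a minimal sketch (our own illustration, not code from the book or its companion website; the synthetic data and all variable names are assumptions) that first estimates the β of a linear model by minimizing a squared loss, and then evaluates a small chain of composed functions of the kind that deep networks stack by the thousands:

import numpy as np

# Synthetic data: y = 2 + 3*x + noise, so the "true" beta is [2, 3]
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, size=(100, 1))
y = 2 + 3 * x[:, 0] + rng.normal(0, 0.1, size=100)

# Linear model: estimate beta by minimizing the squared loss (least squares)
X = np.hstack([np.ones((100, 1)), x])              # prepend an intercept column
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
print("estimated beta:", beta)                      # close to [2, 3]

# A composed model f(x) = f3(f2(f1(x))): each fk transforms its input
# and hands the result to the next function, as described in the text
f1 = lambda v: np.tanh(v @ np.array([[1.0, -0.5]]))            # 1 -> 2 features
f2 = lambda v: np.maximum(0.0, v @ np.array([[0.7], [1.2]]))   # 2 -> 1 feature
f3 = lambda v: 2.0 * v + 1.0                                   # final transformation
y_hat = f3(f2(f1(x)))
print("composed model output shape:", y_hat.shape)             # (100, 1)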

    In fact, there are several ways to categorize machine learning tasks. The most common way depends on whether the target variable (our y above) is present in the problem itself, leading to the supervised and unsupervised learning paradigms. Supervised learning implies the presence of the target variable we would like to predict, so the model receives feedback from the teacher on how close its prediction is to the ground truth. It also includes semi-supervised learning (when the target values are partially missing), active learning (the number of available target values is limited, so the model should decide which data samples to use first), and reinforcement learning (y is given in the form of a reward for a set of actions performed by the algorithm).

    Sometimes, the target variable is not present, you only have data points x, and thus the problem formulation becomes slightly different. In such a case, the objective typically becomes to find patterns in the data or clusters of data points that seem to fit together. This is an example of unsupervised learning. From a statistical point of view, it is an example of generative learning of the data's joint distribution p(x), while the most well-known supervised learning algorithms are regarded as discriminative, that is, we estimate the conditional distribution p(y | x). Some supervised learning algorithms exist, though, that are also generative (i.e., we estimate p(y, x)), as is the case with PGMs.

    Depending on their output, machine learning models can be classified as regression (the target variable is continuous), classification (the target variable is categorical), and others like clustering (often unsupervised), probability density estimation, and dimensionality reduction (e.g., the Principal Component Analysis algorithm).
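    As a quick, hedged illustration of the supervised/unsupervised split (a sketch of ours assuming scikit-learn is available; the toy data and all names are invented), the same cloud of points can be fed to a discriminative classifier when the labels y are present, or to a clustering algorithm when only x is observed:

import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.linear_model import LogisticRegression

# Two well-separated groups of 2-D points; y holds the group labels
X, y = make_blobs(n_samples=200, centers=2, random_state=42)

# Supervised (discriminative): learn p(y | x) from labeled pairs (x, y)
clf = LogisticRegression().fit(X, y)
print("classification accuracy:", clf.score(X, y))

# Unsupervised: ignore y and look for structure in x alone
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
print("cluster sizes:", np.bincount(labels))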

    Although it would be almost impossible to provide a complete overview of this highly dynamic field, this chapter will present several major forms of what is known today as machine learning and provide some tips about where to go next, if the reader wants to be part of this revolution or simply to understand more about what it is. But first, it is worthwhile to give a historical perspective.

    2 A Little Bit of History

    Machine learning was born as one branch within the major field of artificial intelligence, which also includes others such as Knowledge Representation, Perception, and Creativity (Russell and Norvig, 2009; Boden, 2006). The term machine learning was coined by Arthur Samuel, who as early as 1952 created the first program that could play and learn the game of checkers (Samuel, 1959). The learning process here corresponded to incrementally updating a database with moves (board positions) and their scores, according to their probability of later leading to a win or a loss. As the computer played more, it improved its ability to win the game. This is probably the earliest version of reinforcement learning,¹ today an established sub-field for scenarios where the whole data set is only incrementally available and/or the label of each data point is not directly observable (e.g., the value of a game move may only be observable later in the game or even at its very end, in the binary form of win or lose).

    During the 1960s and 1970s, many researchers were enchanted by the concept of a machine that is pure logic, while memory and computer processing limitations were extremely tight compared with today. More than that, the belief that human intelligence could all be represented through logic (a computationalist point of view) was widespread and exciting. This naturally led to an emphasis on rule-based systems, representing knowledge through logic (e.g., logical rules, facts, symbols), and natural language processing.

    In parallel, other researchers believed that we should instead study the neurobiology of our brain and replicate it (a connectionist point of view), in what became known as artificial neural networks (ANN). The first well-known example is the perceptron (Rosenblatt, 1957), which applies a threshold rule to a linear function to discriminate a binary output. It is depicted in Fig. 1.

    Fig. 1 Perceptron.

    The perceptron was quite popular for a time, as it can represent some logical gates (AND and OR), but not all (XOR). Due to the latter limitation, such a simple ANN is limited to linearly separable problems (Fig. 2, left), whereas real problems quite often are inherently nonlinear in nature (Fig. 2, right). This strong criticism, pointed out by Minsky and Papert (1969), led to a decade with virtually no research on Neural Networks (NN), also known as the first AI winter.
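    A perceptron is small enough to write out in a few lines. The sketch below (our own minimal implementation, not code from the book) applies the threshold rule to a linear function and uses the classic perceptron update; it learns AND perfectly but, as Minsky and Papert observed, cannot learn XOR no matter how long it trains:

import numpy as np

def train_perceptron(X, t, epochs=50, lr=0.1):
    # Threshold unit: predict 1 if w.x + b > 0, else 0
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(epochs):
        for xi, ti in zip(X, t):
            o = 1 if xi @ w + b > 0 else 0
            w += lr * (ti - o) * xi      # classic perceptron update
            b += lr * (ti - o)
    return w, b

def accuracy(w, b, X, t):
    pred = (X @ w + b > 0).astype(int)
    return (pred == t).mean()

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
t_and = np.array([0, 0, 0, 1])   # linearly separable
t_xor = np.array([0, 1, 1, 0])   # not linearly separable

print("AND accuracy:", accuracy(*train_perceptron(X, t_and), X, t_and))  # 1.0
print("XOR accuracy:", accuracy(*train_perceptron(X, t_xor), X, t_xor))  # at most 0.75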

    Fig. 2 Linear separation. There are two classes (+ and o), and we want a rule (a classifier) that discriminates between them.

    The third relevant research thread from the 1960s and 1970s relates to the nearest neighbor concept. Cover and Hart (1967) published Nearest neighbor pattern classification, which effectively marks the creation of the pattern recognition field and the birth of the well-known K-nearest neighbor algorithm (K-NN). Its underlying principle is simple: If we have a problem to solve (e.g., given an input vector x, classify it into a class, so the target variable y becomes categorical), we can look for the K most similar situations in our database. Of course, the key research question is how to define similarity, which boils down to the comparison of x vectors. The typical option is the Euclidean distance, but there are many other metrics that may be much more suitable given the nature of the data at hand (e.g., categorical, textual, temporal). Another challenge in K-NN is to define K. What would be the best number of similar examples from the data to use in the algorithm? This introduces the concept of a hyperparameter, which should be defined a priori, in contrast to the parameters β, which are determined during training. Of course, in those days, data complexity, computational power, and memory were so limited that good answers were found only much later.
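    The K-NN idea also fits in a few lines. The sketch below (our own illustration; K, the toy database, and the class labels are arbitrary choices) classifies a query point by a majority vote among its K nearest neighbors under the Euclidean distance:

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_query, k=3):
    # Classify x_query by majority vote among its k nearest neighbors
    dists = np.linalg.norm(X_train - x_query, axis=1)   # Euclidean distances
    nearest = np.argsort(dists)[:k]                      # indices of the k closest points
    votes = Counter(y_train[nearest])
    return votes.most_common(1)[0][0]

# Toy database: two classes in 2-D
X_train = np.array([[0, 0], [1, 0], [0, 1], [5, 5], [6, 5], [5, 6]])
y_train = np.array(["o", "o", "o", "+", "+", "+"])

print(knn_predict(X_train, y_train, np.array([0.5, 0.5]), k=3))  # -> "o"
print(knn_predict(X_train, y_train, np.array([5.5, 5.5]), k=3))  # -> "+"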

    The most basic unsupervised clustering algorithm, known as k-Means, was first proposed by Lloyd (1957) at Bell Labs (but published only in 1982) and independently by Forgy (1965). Its main steps for finding clusters in the data are, having k cluster centroids randomly initialized (k is another hyperparameter!), to assign each data point to the cluster with the nearest mean (step 1) and to update these centroids using the data points assigned to them (step 2). These steps are repeated iteratively until convergence, and the task is solved. Later, this procedure was rethought and generalized as the expectation-maximization algorithm for a Gaussian mixture model (Dempster et al., 1977).
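    The two alternating steps of k-Means translate almost literally into code. Below is a minimal sketch (ours, on random toy data; k, the seed, and the iteration cap are arbitrary choices, and empty-cluster handling is omitted) of the assign-then-update loop just described:

import numpy as np

def k_means(X, k=2, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]   # random initial centroids
    for _ in range(n_iter):
        # Step 1: assign each point to the cluster with the nearest mean
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 2: move each centroid to the mean of the points assigned to it
        new_centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])
        if np.allclose(new_centroids, centroids):               # converged
            break
        centroids = new_centroids
    return centroids, labels

rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 0.5, (50, 2)), rng.normal(4, 0.5, (50, 2))])
centroids, labels = k_means(X, k=2)
print("centroids:\n", centroids)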

    The 1980s saw the rebirth of ANNs, with enough computing power to allow for multilayer networks and nonlinear functions. Notice that, if we replace the function f(x) in Fig. 1 with a logistic sigmoid function (as opposed to the 0/1 step function), we essentially obtain a binary logit model (or logistic regression), and this is the basis for the multilayer perceptron algorithm (MLP) summarized in Fig. 3.

    Fig. 3 Multilayer perceptron.

    Notice that now there are two sets of weights (w(1) and w(2)). The first one associates a vector of weights (wj(1)) to each of the fj functions in the hidden layer. As mentioned, the typical form of these functions is either the logistic sigmoid (Fig. 4A) or the hyperbolic tangent. Each one of these then provides the input to the final (output) layer, weighted by the vector w(2) = [w1(2), w2(2), …, wm(2)]T. The function σ is typically another sigmoid (if we are doing classification) or the identity function (if we are doing regression).

    Fig. 4 (A) Sigmoid function and (B) rectified linear unit (ReLU).
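    Putting Fig. 3 into code, the forward pass below (a minimal sketch with made-up dimensions and random weights; it is not the book's companion code) combines a hidden layer of logistic sigmoids weighted by w(1) with an output layer weighted by w(2), using the identity for regression or another sigmoid for classification:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def mlp_forward(x, W1, b1, w2, b2, task="regression"):
    # One hidden layer of sigmoids, then a weighted combination in the output layer
    hidden = sigmoid(W1 @ x + b1)       # the f_j functions of Fig. 3, weights w(1)
    z = w2 @ hidden + b2                # weighted by w(2)
    return z if task == "regression" else sigmoid(z)   # identity or sigmoid output

rng = np.random.default_rng(0)
x = rng.normal(size=3)          # 3 input variables
W1 = rng.normal(size=(4, 3))    # 4 hidden units, one weight vector w_j(1) each
b1 = rng.normal(size=4)
w2 = rng.normal(size=4)         # output weights w(2) = [w_1(2), ..., w_m(2)]
b2 = 0.0

print("regression output:", mlp_forward(x, W1, b1, w2, b2))
print("classification output:", mlp_forward(x, W1, b1, w2, b2, task="classification"))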

    From a statistical modeling perspective, an MLP is simply a model with m latent variables that are combined in the output layer. The development of the backpropagation algorithm (Rumelhart et al., 1986), which allows the MLP to be efficiently estimated from data, was fundamental for its popularity. Furthermore, Hornik et al. (1989) demonstrated that one layer of suitable nonlinear neurons followed by a linear layer can approximate any nonlinear function with arbitrary accuracy, given enough nonlinear neurons. This means that an MLP network is a universal function approximator, which motivated plenty of excitement. However, it was also verified during the 1990s that the MLP often runs into overfitting problems, and its opaqueness (neither the weights nor the hidden layer have a clear interpretation) often limits its application.

    Thus, after the MLP, we saw relatively low investment in NN research, marking what became known (guess what…) as the second AI winter, until the boom of DNN models that started around 2010. The key ingredients for their success were new sophisticated architectures and the computational resources and amounts of data that became available. Unlike an MLP, which always has one hidden layer (Hastie et al., 2009), a DNN may have multiple layers, sometimes dozens or hundreds. While the MLP is always fully connected (all elements, or neurons, in one layer are connected with all elements of the subsequent layer), a DNN is often very selective, with different sets of neurons connected to different parts of the subsequent layer. The most popular activation function is no longer the sigmoid; instead, it is the rectified linear unit (ReLU), shown in Fig. 4B. The ReLU function became popular as it helps to mitigate the vanishing gradient problem (the exponential decrease of the feedback signal as the errors propagate to the bottom layers), which hindered the training of NNs with many layers.
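    The vanishing gradient problem can be seen numerically: the derivative of the sigmoid never exceeds 0.25, so multiplying it across many layers shrinks the feedback signal exponentially, whereas the ReLU derivative is exactly 1 for positive inputs. The tiny sketch below (ours, purely illustrative) makes this concrete:

import numpy as np

def sigmoid_grad(z):
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1.0 - s)            # never exceeds 0.25 (its value at z = 0)

def relu_grad(z):
    return 1.0 if z > 0 else 0.0    # exactly 1 for any positive input

layers = 20
# Product of per-layer derivatives, evaluated at the sigmoid's best case (z = 0)
print("sigmoid chain over 20 layers:", sigmoid_grad(0.0) ** layers)  # ~9e-13, the signal vanishes
print("ReLU chain over 20 layers:", relu_grad(1.0) ** layers)        # 1.0, the signal survives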

    Back in the 1980s, it was also the golden age of Expert Systems. These usually relied on the explicit representation of domain knowledge in the form of logical rules and facts, obtained by interviewing experts. The new profession of Knowledge Engineer would typically combine the ability to conduct such interviews and to code the knowledge into systems such as FRED (Freeway Realtime Expert System Demonstration) or FASTBRID (Fatigue Assessment of Steel Bridges), to mention just two from the transportation domain. For a notion of the expectations and opportunities of such systems for transportation, seen through the eyes of the early 1990s, the paper by Wentworth (1993) is an interesting read.

    The major problem with Expert Systems, and related rule-based paradigms, has always been the costly process of collecting and translating such knowledge into a machine. To make things more complicated, such knowledge is often not deterministic or objective, which eventually led to the development of fuzzy logic as a way to improve the potential of Expert Systems, which remained huge collections of hand-coded rules. On the other hand, one should ask: is this machine learning at all? Aren't Expert Systems instead an attempt to mechanize human knowledge, explicitly extracted and hand-crafted by humans (the coders)?
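    To give a flavor of what such hand-coded knowledge looked like, here is a deliberately simplified, hypothetical rule set in the spirit of a freeway expert system (the rules, thresholds, and names are invented for illustration only; they are not taken from FRED or any real system):

# Hypothetical expert-system-style rules for flagging a freeway incident
# (all thresholds and conclusions are invented for illustration only)
def incident_advice(occupancy_pct, speed_kmh, downstream_speed_kmh):
    rules = [
        (lambda: occupancy_pct > 35 and speed_kmh < 30,
         "Possible incident: high occupancy with very low speed."),
        (lambda: speed_kmh - downstream_speed_kmh > 40,
         "Possible incident downstream: sharp speed drop ahead."),
        (lambda: occupancy_pct < 10 and speed_kmh > 80,
         "Free-flow conditions: no action needed."),
    ]
    # Fire the first rule whose IF-part holds, as a simple inference engine would
    for condition, conclusion in rules:
        if condition():
            return conclusion
    return "No rule fired: conditions unclassified."

print(incident_advice(occupancy_pct=40, speed_kmh=20, downstream_speed_kmh=18))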

    A different approach to this problem comes from probability theory. Between 1982 and 1985, Judea Pearl created the concept of the Bayesian Network, where domain knowledge is also important for structuring relationships between variables, but no longer as long, complex rules. Instead, he proposed to decompose knowledge into individual relationships, inspired by the concepts of causality and evidence. More importantly, he developed the belief propagation (BP) method for learning (or making inferences) from data (Pearl, 1982, 1985). Such networks were successful, particularly in classification problems, and later the challenge of automatically generating their structure from data (as opposed to domain expertise) was proposed by Spirtes and Glymour (1991) and remains a relevant topic to this day.
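    Pearl's decomposition can be illustrated on a toy network. In the hypothetical three-variable example below (our own, with invented probabilities), the joint distribution factorizes as P(Rain) P(Incident) P(Congestion | Rain, Incident), and a simple query is answered by enumeration rather than by the full belief propagation machinery:

# Toy Bayesian network: Rain -> Congestion <- Incident (probabilities are invented)
P_rain = {True: 0.3, False: 0.7}
P_incident = {True: 0.1, False: 0.9}
P_congestion = {  # P(Congestion = True | Rain, Incident)
    (True, True): 0.95, (True, False): 0.6,
    (False, True): 0.8, (False, False): 0.2,
}

def joint(rain, incident, congestion):
    # The factorized joint distribution of the network
    p_c = P_congestion[(rain, incident)]
    return P_rain[rain] * P_incident[incident] * (p_c if congestion else 1 - p_c)

# Query by enumeration: P(Rain = True | Congestion = True)
num = sum(joint(True, i, True) for i in (True, False))
den = sum(joint(r, i, True) for r in (True, False) for i in (True, False))
print("P(Rain | Congestion) =", round(num / den, 3))   # about 0.51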

    But perhaps the hottest trend in the 1980s and 1990s was in fact kernel methods, and support vector machines (SVM) in particular. We come back to the problem of linear separation (Fig. 2), where reality tends to be nonlinearly separable. If we can find a function mapping that, when applied to a data set, makes it linearly separable, then we solve the problem. Typically, this implies mapping the data to a higher dimension. Fig. 5 shows an example. On the left, we have the original data set (class o has points {− 7, − 5, − 4, 2, 3, 4}, while class + is {− 2, − 1, 0, 1}). This is one-dimensional, so let us add a new dimension by simply squaring the values. Each point is now (x, x²). On the right, we plot those points. There it is: we can now draw a straight line to separate the classes!

    Fig. 5 Mapping 1-D data points, which are nonseparable linearly, to a higher dimensional (2-D) space makes them linearly separable.

    Vapnik and others discovered that there is always one (higher) dimension where a data set is linearly separable. More importantly, they found a process, called the kernel trick, where such mapping is done without the need to explicitly project to the higher dimension (Boser et al., 1992) as we did above. Using convex optimization to train prediction models on data, the SVM obtains global optimality guarantees for the model/parameters learned. The SVM algorithm learns the separating line (a straight line in our example, or a hyperplane in higher dimensions) that has the largest distance to the nearest training-data point of any class. In other words, it finds the dividing line where each side has the maximum margin to the closest data points. The larger this margin, the lower the error. Fig. 6 illustrates the concept. Notice that the maximum margin always coincides with vectors from each class. These are the closest data points, the support vectors. Of course, all this happens at a potentially very high dimensionality and the 2-D example is only provided for illustration purposes.

    Fig. 6 SVM finding the best hyperplane. There are many possible choices of linear separation (left). Support vectors are found that maximize the margin (right).
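    The 1-D example of Fig. 5 can be reproduced directly: mapping each point x to (x, x²) makes the two classes linearly separable, and a linear SVM then finds the maximum-margin line and its support vectors. The sketch below assumes scikit-learn is available; apart from the class encoding, the data points are exactly those listed above:

import numpy as np
from sklearn.svm import SVC

# 1-D points from the example: class "o" and class "+"
x_o = np.array([-7, -5, -4, 2, 3, 4], dtype=float)
x_p = np.array([-2, -1, 0, 1], dtype=float)

x = np.concatenate([x_o, x_p])
y = np.array([0] * len(x_o) + [1] * len(x_p))   # 0 = "o", 1 = "+"

# Explicit mapping to a higher dimension: each point becomes (x, x^2)
X = np.column_stack([x, x ** 2])

# A linear SVM in the mapped space finds the maximum-margin separating line
svm = SVC(kernel="linear", C=1e6).fit(X, y)      # large C approximates a hard margin
print("training accuracy:", svm.score(X, y))      # 1.0: the classes are now separable
print("support vectors:\n", svm.support_vectors_)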
