Learning Bayesian Statistics

Alexandre Andorra

A podcast on Bayesian inference - the methods, the projects and the people who make it possible!

  • 58 minutes 15 seconds
    #125 Bayesian Sports Analytics & The Future of PyMC, with Chris Fonnesbeck

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström, Stefan, Corey Abshire and Mike Loncaric.

    Takeaways:

    • The evolution of sports modeling is tied to the availability of high-frequency data.
    • Bayesian methods are valuable in handling messy, hierarchical data.
    • Communication between data scientists and decision-makers is crucial for effective model use.
    • Models are often wrong, and learning from mistakes is part of the process.
    • Simplicity in models can sometimes yield better results than complexity.
    • The integration of analytics in sports is still developing, with opportunities in various sports.
    • Transparency in research and development teams enhances decision-making.
    • Understanding uncertainty in models is essential for informed decisions.
    • The balance between point estimates and full distributions is a challenge.
    • Iterative model development is key to improving analytics in sports.
    • It's important to avoid falling in love with a single model.
    • Data simulation can validate model structures before real data is used.
    • Gaussian processes offer flexibility in modeling without strict functional forms.
    • Structural time series help separate projection from observation noise.
    • Transitioning from sports analytics to consulting opens new opportunities.
    • Continuous learning is essential in the field of statistics.
    • The demand for Bayesian methods is growing across various industries.
    • Community-driven projects can lead to innovative solutions.

    Chapters:

    03:07 The Evolution of Modeling in Sports Analytics

    06:03 Transitioning from Academia to Sports Modeling

    08:56 The Role of Bayesian Methods in Sports Analytics

    11:49 Communicating Models and Insights to Decision Makers

    15:12 Learning from Mistakes in Model Development

    18:06 The Importance of Model Flexibility and Iteration

    21:02 Utilizing Simulation for Model Validation

    23:50 Choosing the Right Model Structure for Data

    27:04 Starting with Simple Models and Building Complexity

    29:29 Advancements in Gaussian Processes and PyMC

    31:54 Exploring Structural Time Series and GPs

    37:34 Transitioning to PyMC Labs and New Opportunities

    42:40 Innovations in Variational Inference Methods

    48:50 Future Vision for PyMC and Community Engagement

    50:43 Surprises in Bayesian Methods Adoption

    54:08 Reflections on Problem Solving and Influential Figures

    Links from the show:


    Transcript

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    5 February 2025, 11:00 am
  • 1 hour 35 minutes
    #124 State Space Models & Structural Time Series, with Jesse Grabowski

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Bayesian statistics offers a robust framework for econometric modeling.
    • State space models provide a comprehensive way to understand time series data.
    • Gaussian random walks serve as a foundational model in time series analysis.
    • Innovations represent external shocks that can significantly impact forecasts.
    • Understanding the assumptions behind models is key to effective forecasting.
    • Complex models are not always better; simplicity can be powerful.
    • Forecasting requires careful consideration of potential disruptions. Understanding observed and hidden states is crucial in modeling.
    • Latent abilities can be modeled as Gaussian random walks.
    • State space models can be highly flexible and diverse.
    • Composability allows for the integration of different model components.
    • Trends in time series should reflect real-world dynamics.
    • Seasonality can be captured through Fourier bases.
    • AR components help model residuals in time series data.
    • Exogenous regression components can enhance state space models.
    • Causal analysis in time series often involves interventions and counterfactuals.
    • Time-varying regression allows for dynamic relationships between variables.
    • Kalman filters were originally developed for tracking rockets in space.
    • The Kalman filter iteratively updates beliefs based on new data.
    • Missing data can be treated as hidden states in the Kalman filter framework.
    • The Kalman filter is a practical application of Bayes' theorem in a sequential context.
    • Understanding the dynamics of systems is crucial for effective modeling.
    • The state space module in PyMC simplifies complex time series modeling tasks.

    Chapters:

    00:00 Introduction to Jesse Krabowski and Time Series Analysis

    04:33 Jesse's Journey into Bayesian Statistics

    10:51 Exploring State Space Models

    18:28 Understanding State Space Models and Their Components

    40:39 Composability of State Space Models

    48:36 Understanding Trends and Derivatives

    52:35 The Importance of Seasonality in Time Series

    56:41 Components of Time Series Analysis

    01:00:46 Exogenous Regression in State Space Models

    01:06:41 Impulse Response Functions and Causality

    01:11:30 Why Kalman Filter Is So Powerful

    01:24:28 Future Directions and Applications

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström, Stefan, Corey Abshire and Mike Loncaric.

    Links from the show:


    Transcript

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    22 January 2025, 11:00 am
  • 1 hour 32 minutes
    #123 BART & The Future of Bayesian Tools, with Osvaldo Martin

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • BART models are non-parametric Bayesian models that approximate functions by summing trees.
    • BART is recommended for quick modeling without extensive domain knowledge.
    • PyMC-BART allows mixing BART models with various likelihoods and other models.
    • Variable importance can be easily interpreted using BART models.
    • PreliZ aims to provide better tools for prior elicitation in Bayesian statistics.
    • The integration of BART with Bambi could enhance exploratory modeling.
    • Teaching Bayesian statistics involves practical problem-solving approaches.
    • Future developments in PyMC-BART include significant speed improvements.
    • Prior predictive distributions can aid in understanding model behavior.
    • Interactive learning tools can enhance understanding of statistical concepts.
    • Integrating PreliZ with PyMC improves workflow transparency.
    • Arviz 1.0 is being completely rewritten for better usability.
    • Prior elicitation is crucial in Bayesian modeling.
    • Point intervals and forest plots are effective for visualizing complex data.

    Chapters:

    00:00 Introduction to Osvaldo Martin and Bayesian Statistics

    08:12 Exploring Bayesian Additive Regression Trees (BART)

    18:45 Prior Elicitation and the PreliZ Package

    29:56 Teaching Bayesian Statistics and Future Directions

    45:59 Exploring Prior Predictive Distributions

    52:08 Interactive Modeling with PreliZ

    54:06 The Evolution of ArviZ

    01:01:23 Advancements in ArviZ 1.0

    01:06:20 Educational Initiatives in Bayesian Statistics

    01:12:33 The Future of Bayesian Methods

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström, Stefan, Corey Abshire and Mike Loncaric.

    Links from the show:


    Transcript

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    10 January 2025, 11:00 am
  • 1 hour 23 minutes
    #122 Learning and Teaching in the Age of AI, with Hugo Bowne-Anderson

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Effective data science education requires feedback and rapid iteration.
    • Building LLM applications presents unique challenges and opportunities.
    • The software development lifecycle for AI differs from traditional methods.
    • Collaboration between data scientists and software engineers is crucial.
    • Hugo's new course focuses on practical applications of LLMs.
    • Continuous learning is essential in the fast-evolving tech landscape.
    • Engaging learners through practical exercises enhances education.
    • POC purgatory refers to the challenges faced in deploying LLM-powered software.
    • Focusing on first principles can help overcome integration issues in AI.
    • Aspiring data scientists should prioritize problem-solving over specific tools.
    • Engagement with different parts of an organization is crucial for data scientists.
    • Quick paths to value generation can help gain buy-in for data projects.
    • Multimodal models are an exciting trend in AI development.
    • Probabilistic programming has potential for future growth in data science.
    • Continuous learning and curiosity are vital in the evolving field of data science.

    Chapters:

    09:13 Hugo's Journey in Data Science and Education

    14:57 The Appeal of Bayesian Statistics

    19:36 Learning and Teaching in Data Science

    24:53 Key Ingredients for Effective Data Science Education

    28:44 Podcasting Journey and Insights

    36:10 Building LLM Applications: Course Overview

    42:08 Navigating the Software Development Lifecycle

    48:06 Overcoming Proof of Concept Purgatory

    55:35 Guidance for Aspiring Data Scientists

    01:03:25 Exciting Trends in Data Science and AI

    01:10:51 Balancing Multiple Roles in Data Science

    01:15:23 Envisioning Accessible Data Science for All

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström, Stefan, Corey Abshire and Mike Loncaric.

    Links from the show:


    Transcript:

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    26 December 2024, 11:00 am
  • 1 hour 8 minutes
    #121 Exploring Bayesian Structural Equation Modeling, with Nathaniel Forde

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • CFA is commonly used in psychometrics to validate theoretical constructs.
    • Theoretical structure is crucial in confirmatory factor analysis.
    • Bayesian approaches offer flexibility in modeling complex relationships.
    • Model validation involves both global and local fit measures.
    • Sensitivity analysis is vital in Bayesian modeling to avoid skewed results.
    • Complex models should be justified by their ability to answer specific questions.
    • The choice of model complexity should balance fit and theoretical relevance. Fitting models to real data builds confidence in their validity.
    • Divergences in model fitting indicate potential issues with model specification.
    • Factor analysis can help clarify causal relationships between variables.
    • Survey data is a valuable resource for understanding complex phenomena.
    • Philosophical training enhances logical reasoning in data science.
    • Causal inference is increasingly recognized in industry applications.
    • Effective communication is essential for data scientists.
    • Understanding confounding is crucial for accurate modeling.

    Chapters:

    10:11 Understanding Structural Equation Modeling (SEM) and Confirmatory Factor Analysis (CFA)

    20:11 Application of SEM and CFA in HR Analytics

    30:10 Challenges and Advantages of Bayesian Approaches in SEM and CFA

    33:58 Evaluating Bayesian Models

    39:50 Challenges in Model Building

    44:15 Causal Relationships in SEM and CFA

    49:01 Practical Applications of SEM and CFA

    51:47 Influence of Philosophy on Data Science

    54:51 Designing Models with Confounding in Mind

    57:39 Future Trends in Causal Inference

    01:00:03 Advice for Aspiring Data Scientists

    01:02:48 Future Research Directions

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström and Stefan.

    Links from the show:


    Transcript:

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    11 December 2024, 11:00 am
  • 1 hour 1 minute
    #120 Innovations in Infectious Disease Modeling, with Liza Semenova & Chris Wymant

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    -------------------------

    Love the insights from this episode? Make sure you never miss a beat with Chatpods! Whether you're commuting, working out, or just on the go, Chatpods lets you capture and summarize key takeaways effortlessly.

    Save time, stay organized, and keep your thoughts at your fingertips.

    Download Chatpods directly from App Store or Google Play and use it to listen to this podcast today!

    https://www.chatpods.com/?fr=LearningBayesianStatistics

    -------------------------

    Takeaways:

    • Epidemiology focuses on health at various scales, while biology often looks at micro-level details.
    • Bayesian statistics helps connect models to data and quantify uncertainty.
    • Recent advancements in data collection have improved the quality of epidemiological research.
    • Collaboration between domain experts and statisticians is essential for effective research.
    • The COVID-19 pandemic has led to increased data availability and international cooperation.
    • Modeling infectious diseases requires understanding complex dynamics and statistical methods.
    • Challenges in coding and communication between disciplines can hinder progress.
    • Innovations in machine learning and neural networks are shaping the future of epidemiology.
    • The importance of understanding the context and limitations of data in research. 

    Chapters:

    00:00 Introduction to Bayesian Statistics and Epidemiology

    03:35 Guest Backgrounds and Their Journey

    10:04 Understanding Computational Biology vs. Epidemiology

    16:11 The Role of Bayesian Statistics in Epidemiology

    21:40 Recent Projects and Applications in Epidemiology

    31:30 Sampling Challenges in Health Surveys

    34:22 Model Development and Computational Challenges

    36:43 Navigating Different Jargons in Survey Design

    39:35 Post-COVID Trends in Epidemiology

    42:49 Funding and Data Availability in Epidemiology

    45:05 Collaboration Across Disciplines

    48:21 Using Neural Networks in Bayesian Modeling

    51:42 Model Diagnostics in Epidemiology

    55:38 Parameter Estimation in Compartmental Models

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström and Stefan.

    Links from the show:


    Transcript

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    27 November 2024, 11:00 am
  • 1 hour 25 minutes
    #119 Causal Inference, Fiction Writing and Career Changes, with Robert Kubinec

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Bob's research focuses on corruption and political economy.
    • Measuring corruption is challenging due to the unobservable nature of the behavior.
    • The challenge of studying corruption lies in obtaining honest data.
    • Innovative survey techniques, like randomized response, can help gather sensitive data.
    • Non-traditional backgrounds can enhance statistical research perspectives.
    • Bayesian methods are particularly useful for estimating latent variables.
    • Bayesian methods shine in situations with prior information.
    • Expert surveys can help estimate uncertain outcomes effectively.
    • Bob's novel, 'The Bayesian Hitman,' explores academia through a fictional lens.
    • Writing fiction can enhance academic writing skills and creativity.
    • The importance of community in statistics is emphasized, especially in the Stan community.
    • Real-time online surveys could revolutionize data collection in social science.

    Chapters:

    00:00 Introduction to Bayesian Statistics and Bob Kubinec

    06:01 Bob's Academic Journey and Research Focus

    12:40 Measuring Corruption: Challenges and Methods

    18:54 Transition from Government to Academia

    26:41 The Influence of Non-Traditional Backgrounds in Statistics

    34:51 Bayesian Methods in Political Science Research

    42:08 Bayesian Methods in COVID Measurement

    51:12 The Journey of Writing a Novel

    01:00:24 The Intersection of Fiction and Academia

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke, Robert Flannery, Rasmus Hindström and Stefan.

    Links from the show:


    Transcript

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    13 November 2024, 11:00 am
  • 58 minutes 51 seconds
    #118 Exploring the Future of Stan, with Charles Margossian & Brian Ward

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • User experience is crucial for the adoption of Stan.
    • Recent innovations include adding tuples to the Stan language, new features and improved error messages.
    • Tuples allow for more efficient data handling in Stan.
    • Beginners often struggle with the compiled nature of Stan.
    • Improving error messages is crucial for user experience.
    • BridgeStan allows for integration with other programming languages and makes it very easy for people to use Stan models.
    • Community engagement is vital for the development of Stan.
    • New samplers are being developed to enhance performance.
    • The future of Stan includes more user-friendly features.

    Chapters:

    00:00 Introduction to the Live Episode

    02:55 Meet the Stan Core Developers

    05:47 Brian Ward's Journey into Bayesian Statistics

    09:10 Charles Margossian's Contributions to Stan

    11:49 Recent Projects and Innovations in Stan

    15:07 User-Friendly Features and Enhancements

    18:11 Understanding Tuples and Their Importance

    21:06 Challenges for Beginners in Stan

    24:08 Pedagogical Approaches to Bayesian Statistics

    30:54 Optimizing Monte Carlo Estimators

    32:24 Reimagining Stan's Structure

    34:21 The Promise of Automatic Reparameterization

    35:49 Exploring BridgeStan

    40:29 The Future of Samplers in Stan

    43:45 Evaluating New Algorithms

    47:01 Specific Algorithms for Unique Problems

    50:00 Understanding Model Performance

    54:21 The Impact of Stan on Bayesian Research

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang, Gary Clarke and Robert Flannery.

    Links from the show:


    Transcript

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    30 October 2024, 9:00 am
  • 1 hour 13 minutes
    #117 Unveiling the Power of Bayesian Experimental Design, with Desi Ivanova

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Designing experiments is about optimal data gathering.
    • The optimal design maximizes the amount of information.
    • The best experiment reduces uncertainty the most.
    • Computational challenges limit the feasibility of BED in practice.
    • Amortized Bayesian inference can speed up computations.
    • A good underlying model is crucial for effective BED.
    • Adaptive experiments are more complex than static ones.
    • The future of BED is promising with advancements in AI.

    Chapters:

    00:00 Introduction to Bayesian Experimental Design

    07:51 Understanding Bayesian Experimental Design

    19:58 Computational Challenges in Bayesian Experimental Design

    28:47 Innovations in Bayesian Experimental Design

    40:43 Practical Applications of Bayesian Experimental Design

    52:12 Future of Bayesian Experimental Design

    01:01:17 Real-World Applications and Impact

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang and Gary Clarke.

    Links from the show:


    Transcript

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    15 October 2024, 11:00 am
  • 1 hour 32 minutes
    #116 Mastering Soccer Analytics, with Ravi Ramineni

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • Building an athlete management system and a scouting and recruitment platform are key goals in football analytics.
    • The focus is on informing training decisions, preventing injuries, and making smart player signings.
    • Avoiding false positives in player evaluations is crucial, and data analysis plays a significant role in making informed decisions.
    • There are similarities between different football teams, and the sport has social and emotional aspects. Transitioning from on-premises SQL servers to cloud-based systems is a significant endeavor in football analytics.
    • Analytics is a tool that aids the decision-making process and helps mitigate biases. The impact of analytics in soccer can be seen in the decline of long-range shots.
    • Collaboration and trust between analysts and decision-makers are crucial for successful implementation of analytics.
    • The limitations of available data in football analytics hinder the ability to directly measure decision-making on the field. 
    • Analyzing the impact of coaches in sports analytics is challenging due to the difficulty of separating their effect from other factors. Current data limitations make it hard to evaluate coaching performance accurately.
    • Predictive metrics and modeling play a crucial role in soccer analytics, especially in predicting the career progression of young players.
    • Improving tracking data and expanding its availability will be a significant focus in the future of soccer analytics.

    Chapters:

    00:00 Introduction to Ravi and His Role at Seattle Sounders 

    06:30 Building an Analytics Department

    15:00 The Impact of Analytics on Player Recruitment and Performance 

    28:00 Challenges and Innovations in Soccer Analytics 

    42:00 Player Health, Injury Prevention, and Training 

    55:00 The Evolution of Data-Driven Strategies

    01:10:00 Future of Analytics in Sports

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan, Francesco Madrisotti, Ivy Huang and Gary Clarke.

    Links from the show:


    Transcript

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    2 October 2024, 11:00 am
  • 1 hour 39 minutes
    #115 Using Time Series to Estimate Uncertainty, with Nate Haines

    Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!


    Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

    Visit our Patreon page to unlock exclusive Bayesian swag ;)

    Takeaways:

    • State space models and traditional time series models are well-suited to forecast loss ratios in the insurance industry, although actuaries have been slow to adopt modern statistical methods.
    • Working with limited data is a challenge, but informed priors and hierarchical models can help improve the modeling process.
    • Bayesian model stacking allows for blending together different model predictions and taking the best of both (or all if more than 2 models) worlds.
    • Model comparison is done using out-of-sample performance metrics, such as the expected log point-wise predictive density (ELPD). Brute leave-future-out cross-validation is often used due to the time-series nature of the data.
    • Stacking or averaging models are trained on out-of-sample performance metrics to determine the weights for blending the predictions. Model stacking can be a powerful approach for combining predictions from candidate models. Hierarchical stacking in particular is useful when weights are assumed to vary according to covariates.
    • BayesBlend is a Python package developed by Ledger Investing that simplifies the implementation of stacking models, including pseudo Bayesian model averaging, stacking, and hierarchical stacking.
    • Evaluating the performance of patient time series models requires considering multiple metrics, including log likelihood-based metrics like ELPD, as well as more absolute metrics like RMSE and mean absolute error.
    • Using robust variants of metrics like ELPD can help address issues with extreme outliers. For example, t-distribution estimators of ELPD as opposed to sample sum/mean estimators.
    • It is important to evaluate model performance from different perspectives and consider the trade-offs between different metrics. Evaluating models based solely on traditional metrics can limit understanding and trust in the model. Consider additional factors such as interpretability, maintainability, and productionization.
    • Simulation-based calibration (SBC) is a valuable tool for assessing parameter estimation and model correctness. It allows for the interpretation of model parameters and the identification of coding errors.
    • In industries like insurance, where regulations may restrict model choices, classical statistical approaches still play a significant role. However, there is potential for Bayesian methods and generative AI in certain areas.

    Chapters:

    00:00 Introduction to Bayesian Modeling in Insurance

    13:00 Time Series Models and Their Applications

    30:51 Bayesian Model Averaging Explained

    56:20 Impact of External Factors on Forecasting

    01:25:03 Future of Bayesian Modeling and AI

    Thank you to my Patrons for making this episode possible!

    Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan and Francesco Madrisotti.

    Links from the show:


    Transcript

    This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

    17 September 2024, 11:00 am
  • More Episodes? Get the App
© MoonFM 2025. All rights reserved.