The Well-Aging Data Science Convergence Research Center at Chung-Ang University held an academic conference on June 13, 2025.
Speakers:
Professor Hayoung Shin, Soongsil University
Professor Jisoo Kim, Seoul National University
Date & Time: June 13, 2025, 10:00 – 11:40 AM
Location: Building 310, Room 413
Hayoung Shin Department of Information Statistics and Actuarial Science, Soongsil University
Statistical Estimation of Topological Data Analysis and Its Applications to Machine Learning
This presentation introduces the fundamental concepts of Topological Data Analysis (TDA) and methods for applying TDA to machine learning. Broadly speaking, TDA is a methodology for extracting and analyzing topological features from data. Its primary technique is Persistent Homology, which observes data across multiple scales and identifies persistent topological structures.
TDA not only conveys scientific information about data but also provides additional features useful for learning tasks, and has been shown to be particularly effective in machine learning.
The first part of the talk will focus on the statistical estimation of TDA. Because of randomness in data distributions, TDA outputs can include errors, which can be quantified statistically. After introducing the concept of persistent homology, the talk will discuss methods to quantify uncertainty due to data randomness via confidence sets, and approaches to selecting meaningful topological features. The second part of the talk will present two approaches to applying TDA in machine learning.
Featurization: Transforming complex mathematical structures in persistent homology into Euclidean vectors or functions for use in machine learning.
Evaluation: Using topological features to evaluate the quality of data or models.
Through real-world case studies, the presentation will highlight the potential of TDA in advancing machine learning research.