Machine Learning for Science

The intersection of machine learning and scientific research has emerged as a powerful paradigm, enabling unprecedented capabilities in data analysis, pattern recognition, and hypothesis generation across diverse fields. In astrophysics and cosmology, the exponential growth of observational data from surveys like Gaia and Hyper Suprime-Cam presents both immense opportunities and significant challenges for discovery. Machine learning techniques, particularly deep learning and unsupervised methods, are indispensable for navigating high-dimensional datasets, detecting rare phenomena, and extracting nuanced information from complex scientific simulations and observations.

Current research focuses on developing and deploying advanced algorithms to accelerate scientific progress. This includes leveraging generative models for anomaly detection and data synthesis, employing deep neural networks for image analysis tasks such as point spread function deconvolution and strong gravitational lens identification, and applying unsupervised learning to explore morphological complexities in galaxies. Furthermore, robust methodologies are crucial for benchmarking AI-evolved scientific models, enhancing the interpretability of complex machine learning outputs, and automating analysis of high-dimensional scientific data, thereby pushing the boundaries of scientific understanding from cosmological structure formation to material stress fields.

My research significantly contributes to these frontiers by developing and applying innovative machine learning methodologies across astrophysics, cosmology, and materials science. I have focused on creating robust deep learning pipelines, exemplified by a “Modular Deep Learning Pipeline for Galaxy-Scale Strong Gravitational Lens Detection and Modeling,” which automates the identification and characterization of crucial cosmological probes. A substantial portion of my work involves Generative Adversarial Networks (GANs); I have successfully utilized GANs for “Anomaly Detection in Hyper Suprime-Cam galaxy images” and other astronomical datasets, effectively identifying rare and novel objects. Beyond detection, I have also advanced interpretability in generative modeling by statistically disentangling latent spaces guided by generative factors, making complex models more transparent for scientific insight.

Furthermore, my contributions extend to leveraging unsupervised machine learning to move “Beyond the Hubble Sequence” in “Exploring Galaxy Morphology,” revealing new classification schemes. I have also developed neural network-based methods for “Point Spread Function Deconvolution For Astronomical Applications,” improving image fidelity. My work encompasses applying probabilistic modeling and automated machine learning for high-dimensional stress field analysis, rigorous benchmarking of “AI-evolved cosmological structure formation,” and identifying specific stellar populations like “Carbon-Enhanced Metal-Poor star candidates from BP/RP Spectra in Gaia DR3.” Additionally, I have synthesized best practices in “Constructing Impactful Machine Learning Research for Astronomy,” providing a comprehensive framework for robust scientific advancements in this rapidly evolving field.

Figure from Benchmarking AI-evolved cosmological structure formation
From: Benchmarking AI-evolved cosmological structure formation
Figure from Application of probabilistic modeling and automated machine learning framework for high-dimensional stress field
From: Application of probabilistic modeling and automated machine learning framework for high-dimensional stress field
Figure from Neural Network Based Point Spread Function Deconvolution For Astronomical Applications
From: Neural Network Based Point Spread Function Deconvolution For Astronomical Applications
Figure from Carbon-Enhanced Metal-Poor star candidates from BP/RP Spectra in $Gaia$ DR3
From: Carbon-Enhanced Metal-Poor star candidates from BP/RP Spectra in $Gaia$ DR3