活水快报 - 42Digest

重新构想欧几里德星系形态学:使用稀疏自动编码器识别和解释功能

Re-envisioning Euclid Galaxy Morphology: Identifying and Interpreting Features with Sparse Autoencoders

John F. Wu, Michael Walmsley

arXiv

2025年10月27日

稀疏自动编码器(SAEs)可以有效地识别来自预训练神经网络的候选单语义特征,用于星系形态。我们在Euclid Q1图像上使用监督(Zoobot)和新的自监督(MAE)模型证明了这一点。我们公开发布的MAE实现了超人的图像重建性能。虽然受监督模型的主要组件分析(PCA)主要识别已经与Galaxy Zoo决策树一致的特征,但SAE可以在该框架之外识别可解释的特征。 SAE功能也显示出比PCA与Galaxy Zoo标签更强的对齐。尽管可解释性方面的挑战仍然存在,但SAE为发现超出人类定义分类范围的天体物理现象提供了强大的引擎。

Sparse Autoencoders (SAEs) can efficiently identify candidate monosemantic features from pretrained neural networks for galaxy morphology. We demonstrate this on Euclid Q1 images using both supervised (Zoobot) and new self-supervised (MAE) models. Our publicly released MAE achieves superhuman image reconstruction performance. While a Principal Component Analysis (PCA) on the supervised model primarily identifies features already aligned with the Galaxy Zoo decision tree, SAEs can identify interp...

天体物理学仪器与方法机器学习

View Source