活水快报 - 42Digest

自我监督的合成预训练,用于在密度气体中嵌入的恒星质量的推理

Self-supervised Synthetic Pretraining for Inference of Stellar Mass Embedded in Dense Gas

Keiya Hirashima, Shingo Nozaki, Naoto Harada

arXiv

2025年10月28日

恒星质量是决定恒星性质和演化的基本量。然而,估计恒星形成区域的恒星质量具有挑战性,因为年轻恒星被致密气体遮挡,并且这些区域高度不均匀,使得球形动力学估计不可靠。受监督的机器学习可以将这些复杂的结构与恒星质量联系起来,但它需要来自高分辨率磁流体动力学(MHD)模拟的大型高质量标记数据集,这些数据集的计算成本很高。我们通过使用自监督框架DINOv2在一百万张合成分形图像上预训练视觉变压器来解决这个问题,然后将冻结模型应用于有限的高分辨率MHD模拟。我们的结果表明,合成预训练改善了冷冻特征回归恒星质量预测,预训练模型的性能略好于在同一有限模拟上训练的受监督模型。对提取的特征的主要成分分析进一步揭示了语义上有意义的结构,表明该模型可以实现恒星形成区域的无监督分割,而无需标记数据或微调。

Stellar mass is a fundamental quantity that determines the properties and evolution of stars. However, estimating stellar masses in star-forming regions is challenging because young stars are obscured by dense gas and the regions are highly inhomogeneous, making spherical dynamical estimates unreliable. Supervised machine learning could link such complex structures to stellar mass, but it requires large, high-quality labeled datasets from high-resolution magneto-hydrodynamical (MHD) simulations,...

星系天体物理学天体物理学仪器与方法人工智能机器学习

View Source