基于超球面坐标的变分自编码器:改善超体积压缩潜在空间中的异常检测

📄 中文摘要

变分自编码器(VAE)将数据编码成低维潜在向量,然后再将这些向量解码回数据。训练完成后,人们通常期望能够检测出分布外(异常)的潜在向量。然而,当潜在空间维度较高时,会遇到一些挑战,其中之一是超体积随维度呈指数增长,这严重影响了VAE的生成能力。本文从高维空间几何学中汲取灵感,提出了一种新颖的VAE架构,它将潜在空间映射到超球面上。通过在超球坐标系中建模潜在变量,模型能够有效应对高维潜在空间中超体积膨胀的问题。具体来说,该方法利用球面坐标的特性,将潜在向量的范数与方向解耦,从而更好地控制潜在空间的分布。这种设计有助于缓解传统VAE在高维潜在空间中因数据稀疏性导致的生成能力下降问题。此外,将潜在空间约束在超球面上,使得正常样本的潜在表示更紧凑,而异常样本则更容易偏离这个紧凑区域。这种结构上的优势显著提升了异常检测的性能。实验结果表明,在多个公开数据集上,与标准的欧几里得潜在空间VAE相比,基于超球面坐标的VAE在异常检测任务上取得了显著的性能提升,尤其是在高维数据场景下。该方法为高维数据下的生成模型和异常检测提供了一种新的有效途径。

📄 English Summary

VAE with Hyperspherical Coordinates: Improving Anomaly Detection from Hypervolume-Compressed Latent Space

Variational autoencoders (VAEs) encode data into lower-dimensional latent vectors before decoding them back to data. Once trained, detecting out-of-distribution (anomalous) latent vectors is a common objective. However, several issues arise when the latent space is high-dimensional, notably the exponential growth of hypervolume with dimension, which severely impacts the VAE's generative capacity. Drawing insights from high-dimensional geometry, this paper introduces a novel VAE architecture that maps the latent space onto a hypersphere. By modeling latent variables in a hyperspherical coordinate system, the proposed method effectively addresses the hypervolume expansion problem in high-dimensional latent spaces. Specifically, this approach leverages the properties of spherical coordinates to decouple the norm and direction of latent vectors, enabling better control over the latent space distribution. This design helps mitigate the degradation of generative capacity in traditional VAEs caused by data sparsity in high-dimensional latent spaces. Furthermore, constraining the latent space to a hypersphere encourages more compact representations for normal samples, making anomalous samples more likely to deviate from this compact region. This structural advantage significantly enhances anomaly detection performance. Experimental results demonstrate that the VAE with hyperspherical coordinates achieves substantial improvements in anomaly detection tasks across multiple public datasets compared to standard VAEs with Euclidean latent spaces, particularly in high-dimensional data scenarios. This method offers a novel and effective avenue for generative models and anomaly detection in high-dimensional data settings.

Powered by Cloudflare Workers + Payload CMS + Claude 3.5

数据源: OpenAI, Google AI, DeepMind, AWS ML Blog, HuggingFace 等