Interpreting CLIP with Hierarchical Sparse Autoencoders
Paper
• 2502.20578 • Published
This is a Matryoshka Sparse Autoencoder (MSAE) trained on CLIP image embeddings from the KAGL dataset.
This model is intended for interpretability and feature analysis of CLIP embeddings.
The matryoshka SAE architecture comes from the following paper: https://arxiv.org/abs/2502.20578
Code from this paper's repository was used for the training.
Base model
patrickjohncyh/fashion-clip