DeepMind's new sparse autoencoder technique helps researchers better understand and control the behavior of LLMs.
DeepMind's JumpReLU SAE peers inside the LLM black box