The Eiffel Tower Llama 📝 • Explore the Eiffel Tower Llama experiment with open-source models • Running • 42
The Eiffel Tower Llama Demo 💬 • Steering a large language model using sparse autoencoders • Running on Zero • 6
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability • Collection • A compilation of sparse auto-encoders trained on large language models. • 34 items • Updated Oct 10 • 4