AI RESEARCH

Steered Generation via Gradient-Based Optimization on Sparse Query Features

arXiv CS.LG

ArXi:2605.23040v1 Announce Type: new Latent steering exploits internal representations of Large Language Models (LLMs) to guide generation, yet interventions on dense states can entangle distinct semantic features. In this paper, we investigate attention query activations as a high-fidelity site for precise control, hypothesizing that manipulating the attention mechanism itself offers sharper steerability than general state interventions. We