---
license: mit
---

Created using the `standard_plus` dataset, which contains 106 rows in the cautious and incautious datasets. The difference of means was calculated using 150 chain-of-thought (CoT) activations taken at layer 17.

Paper: arxiv.org/abs/2507.03167

Code: https://github.com/ky295/reasoning-manipulation
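
For readers unfamiliar with the term, a difference-of-means vector is typically the mean activation of one set minus the mean activation of the other. The sketch below illustrates that computation on synthetic data only; it is not taken from the linked repository. The function name `difference_of_means`, the sign convention (cautious minus incautious), the hidden size of 8, and the use of 150 samples per set are all assumptions made for illustration.

```python
import numpy as np


def difference_of_means(cautious_acts, incautious_acts):
    """Return mean(cautious) - mean(incautious), averaged over the sample axis.

    Both inputs are arrays of shape (n_samples, hidden_size). The sign
    convention (cautious minus incautious) is an assumption, not something
    stated in this card.
    """
    cautious_acts = np.asarray(cautious_acts, dtype=np.float64)
    incautious_acts = np.asarray(incautious_acts, dtype=np.float64)
    if cautious_acts.ndim != 2 or incautious_acts.ndim != 2:
        raise ValueError("expected 2-D arrays of shape (n_samples, hidden_size)")
    if cautious_acts.shape[1] != incautious_acts.shape[1]:
        raise ValueError("both sets must share the same hidden size")
    return cautious_acts.mean(axis=0) - incautious_acts.mean(axis=0)


# Synthetic stand-in for activations collected at a single layer.
# hidden_size and the per-set sample count are hypothetical values.
rng = np.random.default_rng(0)
hidden_size = 8
n_samples = 150
cautious = rng.normal(loc=1.0, size=(n_samples, hidden_size))
incautious = rng.normal(loc=0.0, size=(n_samples, hidden_size))

direction = difference_of_means(cautious, incautious)

# Normalising is optional; it makes a later scaling coefficient easier to interpret.
unit_direction = direction / np.linalg.norm(direction)
```

In practice the two input arrays would hold real activations extracted from the model rather than random draws; how those are collected and pooled over tokens is outside the scope of this sketch.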