There is significant doubt about the trustworthiness of chain-of-thought traces in large language models, challenging developers' reliance on them for AI safety.
See also https://arxiv.org/abs/2505.13775
Chain of thought reasoning traces not related to final response 😬
See also https://arxiv.org/abs/2505.13775
Chain of thought reasoning traces not related to final response 😬