The test flaked on Linux x64 when the model responded with text instead
of calling the bash tool. Without a tool call, no approval is generated
and the test fails. Now retries the prompt up to 3 times (same pattern
as the prestream approval recovery test).
👾 Generated with [Letta Code](https://letta.com)
Co-Authored-By: Letta <noreply@letta.com>