Request to stream tool call arguments in the specified file to prevent timeouts when using Claude Code.
# Environment

* Ollama 0.17.7 + a patch for https://github.com/ollama/ollama/issues/14816
* Claude Code 2.1.70
* OS/hardware: Ubuntu 24.04, AMD Ryzen 5, GeForce RTX 3060

If it matters, I'm running everything locally, ethernet cable unplugged.

```
$ ollama ps
NAME                    ID              SIZE     PROCESSOR          CONTEXT    UNTIL
glm-4.7-flash:latest    d1a8a26252f1    21 GB    46%/54% CPU/GPU    65536      59 minutes from now
```

# Reproduction

## Disclaimer

I don't have a good, simple repro case - I'm not great at getting LLMs to do exactly what I want, so provoking the problematic behavior is hit or miss.

What I'm trying to do is to get Claude to write out a long-ish file (specifically, something that takes more than ~255 seconds to generate the tool call for). Claude sometimes tries to do other things (e.g. search the web, or summarize the story) before writing out the file; I'm not sure how those interact but was trying to avoid them to keep things si