A user requests enhancements to ArkSim to better simulate multi-turn conversations, aiming to capture issues like context loss and unexpected conversation paths more effectively.
We built ArkSim which help simulate multi-turn conversations between agents and synthetic users to see how it behaves across longer interactions. This can help find issues like: \- Agents losing context during longer interactions \- Unexpected conversation paths \- Failures that only appear after several turns The idea is to test conversation flows more like real interactions, instead of just single prompts and capture issues early on. There are currently integration examples for: \- OpenAI Agents SDK \- Claude Agent SDK \- Google ADK \- LangChain / LangGraph \- CrewAI \- LlamaIndex you can try it out here: [https://github.com/arklexai/arksim](https://github.com/arklexai/arksim) The integration examples are in the examples/integration folder would appreciate any feedback from people currently building agents so we can improve the tool!