Abnormal behavior faced with openai call.

r/OpenAI
Generative AI

Hello everyone, I’m junior AI developer, and I’m currently facing a serious issue. We have a predefined workflow where the system takes the same inputs, runs the same tools, and then makes an OpenAI API call. However, even with identical inputs, I’ve noticed inconsistent token usage. If I run the workflow 10 times, about 1 out of 10 runs may suddenly consume nearly 3× tokens, resulting in a significant cost increase - mostly from input tokens. There doesn’t seem to be anything obviously wrong with the request.