LLM API calls

What is the name of the modal you’re running?

All LLMs I tried

What is the issue or error you’re encountering

max_tokens does tno seem to have an effect. All generated text ends after 256 tokens.

What steps have you taken to resolve the issue?

setting max_tokens=8000 in the call header