What is the name of the modal you’re running?
All LLMs I tried
What is the issue or error you’re encountering
max_tokens does tno seem to have an effect. All generated text ends after 256 tokens.
What steps have you taken to resolve the issue?
setting max_tokens=8000 in the call header