Skip to content

Conversation

@rumpl
Copy link
Member

@rumpl rumpl commented Dec 15, 2025

If no max_tokens were set we would set it to 8k in the anthropic case. This is way less than the 64k that its models can do.

We will now look at the models.dev data to get the limit and use it if it's defined, if not we set 32k as a more sensible default.

If a limit is set in the yaml we take that one of course.

Fixes #1090

Since it's optional

Signed-off-by: Djordje Lukic <djordje.lukic@docker.com>
Signed-off-by: Djordje Lukic <djordje.lukic@docker.com>
@rumpl rumpl requested a review from a team as a code owner December 15, 2025 15:42
@rumpl rumpl changed the title config: Make max tokens a pointer Dec 15, 2025
@rumpl rumpl merged commit d6af657 into docker:main Dec 15, 2025
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants