Skip to content

[ML][Inference Endpoints] Anthropic endpoint creation: ensures max tokens parameter is passed as expected#241212

Merged
alvarezmelissa87 merged 3 commits intoelastic:mainfrom
alvarezmelissa87:inference-endpoint-fix-anthropic-max-tokens
Oct 30, 2025
Merged

[ML][Inference Endpoints] Anthropic endpoint creation: ensures max tokens parameter is passed as expected#241212
alvarezmelissa87 merged 3 commits intoelastic:mainfrom
alvarezmelissa87:inference-endpoint-fix-anthropic-max-tokens

Conversation

@alvarezmelissa87
Copy link
Contributor

@alvarezmelissa87 alvarezmelissa87 commented Oct 29, 2025

Summary

Related to this issue and this fix.

This PR:

  • updates the inference creation endpoint to ensure max_tokens are sent correctly for Anthropic
  • ensures that max_tokens is added back into the providerConfig when viewing the endpoint so that it shows up correctly in the form

This is a temporary workaround for anthropic max_tokens handling until the services endpoint is updated to reflect the correct structure.
Anthropic is unique in that it requires max_tokens to be sent as part of the task_settings instead of the usual service_settings.
Until the services endpoint is updated to reflect that, there is no way for the form UI to know where to put max_tokens. This can be removed once that update is made.

Checklist

Check the PR satisfies following conditions.

Reviewers should verify this PR satisfies this list as well.

  • Any text added follows EUI's writing guidelines, uses sentence case text and includes i18n support
  • Documentation was added for features that require explanation or tutorials
  • Unit or functional tests were updated or added to match the most common scenarios
  • If a plugin configuration key changed, check if it needs to be allowlisted in the cloud and added to the docker list
  • This was checked for breaking HTTP API changes, and any breaking changes have been approved by the breaking-change committee. The release_note:breaking label should be applied in these situations.
  • Flaky Test Runner was used on any tests changed
  • The PR description includes the appropriate Release Notes section, and the correct release_note:* label is applied per the guidelines
  • Review the backport guidelines and apply applicable backport:* labels.
@elasticmachine
Copy link
Contributor

Pinging @elastic/ml-ui (:ml)

@alvarezmelissa87 alvarezmelissa87 added the backport:version Backport to applied version labels label Oct 29, 2025
@elasticmachine
Copy link
Contributor

💚 Build Succeeded

Metrics [docs]

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id before after diff
searchInferenceEndpoints 114.5KB 114.6KB +170.0B

cc @alvarezmelissa87

Copy link
Contributor

@Samiul-TheSoccerFan Samiul-TheSoccerFan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Member

@qn895 qn895 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@alvarezmelissa87 alvarezmelissa87 merged commit 847f9de into elastic:main Oct 30, 2025
12 checks passed
@alvarezmelissa87 alvarezmelissa87 deleted the inference-endpoint-fix-anthropic-max-tokens branch October 30, 2025 17:37
@kibanamachine
Copy link
Contributor

Starting backport for target branches: 8.19, 9.1, 9.2

https://github.com/elastic/kibana/actions/runs/18949795130

kibanamachine pushed a commit to kibanamachine/kibana that referenced this pull request Oct 30, 2025
…ens parameter is passed as expected (elastic#241212)

## Summary

Related to this [issue](elastic#241142)
and this [fix](elastic#241188).

This PR:
- updates the inference creation endpoint to ensure max_tokens are sent
correctly for Anthropic
- ensures that max_tokens is added back into the providerConfig when
viewing the endpoint so that it shows up correctly in the form

This is a temporary workaround for anthropic max_tokens handling until
the services endpoint is updated to reflect the correct structure.
Anthropic is unique in that it requires max_tokens to be sent as part of
the task_settings instead of the usual service_settings.
Until the services endpoint is updated to reflect that, there is no way
for the form UI to know where to put max_tokens. This can be removed
once that update is made.

### Checklist

Check the PR satisfies following conditions.

Reviewers should verify this PR satisfies this list as well.

- [ ] Any text added follows [EUI's writing
guidelines](https://elastic.github.io/eui/#/guidelines/writing), uses
sentence case text and includes [i18n
support](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)
- [ ]
[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)
was added for features that require explanation or tutorials
- [ ] [Unit or functional
tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)
were updated or added to match the most common scenarios
- [ ] If a plugin configuration key changed, check if it needs to be
allowlisted in the cloud and added to the [docker
list](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)
- [ ] This was checked for breaking HTTP API changes, and any breaking
changes have been approved by the breaking-change committee. The
`release_note:breaking` label should be applied in these situations.
- [ ] [Flaky Test
Runner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1) was
used on any tests changed
- [ ] The PR description includes the appropriate Release Notes section,
and the correct `release_note:*` label is applied per the
[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)
- [ ] Review the [backport
guidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)
and apply applicable `backport:*` labels.

(cherry picked from commit 847f9de)
@kibanamachine
Copy link
Contributor

💔 Some backports could not be created

Status Branch Result
8.19 Backport failed because of merge conflicts
9.1 Backport failed because of merge conflicts
9.2

Note: Successful backport PRs will be merged automatically after passing CI.

Manual backport

To create the backport manually run:

node scripts/backport --pr 241212

Questions ?

Please refer to the Backport tool documentation

@alvarezmelissa87
Copy link
Contributor Author

💚 All backports created successfully

Status Branch Result
9.1
8.19

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

alvarezmelissa87 added a commit to alvarezmelissa87/kibana that referenced this pull request Oct 30, 2025
…ens parameter is passed as expected (elastic#241212)

## Summary

Related to this [issue](elastic#241142)
and this [fix](elastic#241188).

This PR:
- updates the inference creation endpoint to ensure max_tokens are sent
correctly for Anthropic
- ensures that max_tokens is added back into the providerConfig when
viewing the endpoint so that it shows up correctly in the form

This is a temporary workaround for anthropic max_tokens handling until
the services endpoint is updated to reflect the correct structure.
Anthropic is unique in that it requires max_tokens to be sent as part of
the task_settings instead of the usual service_settings.
Until the services endpoint is updated to reflect that, there is no way
for the form UI to know where to put max_tokens. This can be removed
once that update is made.

### Checklist

Check the PR satisfies following conditions.

Reviewers should verify this PR satisfies this list as well.

- [ ] Any text added follows [EUI's writing
guidelines](https://elastic.github.io/eui/#/guidelines/writing), uses
sentence case text and includes [i18n
support](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)
- [ ]
[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)
was added for features that require explanation or tutorials
- [ ] [Unit or functional
tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)
were updated or added to match the most common scenarios
- [ ] If a plugin configuration key changed, check if it needs to be
allowlisted in the cloud and added to the [docker
list](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)
- [ ] This was checked for breaking HTTP API changes, and any breaking
changes have been approved by the breaking-change committee. The
`release_note:breaking` label should be applied in these situations.
- [ ] [Flaky Test
Runner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1) was
used on any tests changed
- [ ] The PR description includes the appropriate Release Notes section,
and the correct `release_note:*` label is applied per the
[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)
- [ ] Review the [backport
guidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)
and apply applicable `backport:*` labels.

(cherry picked from commit 847f9de)

# Conflicts:
#	x-pack/solutions/search/plugins/search_inference_endpoints/public/components/edit_inference_endpoints/edit_inference_flyout.tsx
kibanamachine added a commit that referenced this pull request Oct 30, 2025
…ax tokens parameter is passed as expected (#241212) (#241343)

# Backport

This will backport the following commits from `main` to `9.2`:
- [[ML][Inference Endpoints] Anthropic endpoint creation: ensure max
tokens parameter is passed as expected
(#241212)](#241212)

<!--- Backport version: 9.6.6 -->

### Questions ?
Please refer to the [Backport tool
documentation](https://github.com/sorenlouv/backport)

<!--BACKPORT [{"author":{"name":"Melissa
Alvarez","email":"melissa.alvarez@elastic.co"},"sourceCommit":{"committedDate":"2025-10-30T17:36:58Z","message":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as expected (#241212)\n\n## Summary\n\nRelated to this
[issue](https://github.com/elastic/kibana/issues/241142)\nand this
[fix](https://github.com/elastic/kibana/pull/241188).\n\nThis PR:\n-
updates the inference creation endpoint to ensure max_tokens are
sent\ncorrectly for Anthropic\n- ensures that max_tokens is added back
into the providerConfig when\nviewing the endpoint so that it shows up
correctly in the form\n\nThis is a temporary workaround for anthropic
max_tokens handling until\nthe services endpoint is updated to reflect
the correct structure.\nAnthropic is unique in that it requires
max_tokens to be sent as part of\nthe task_settings instead of the usual
service_settings.\nUntil the services endpoint is updated to reflect
that, there is no way\nfor the form UI to know where to put max_tokens.
This can be removed\nonce that update is made.\n\n\n###
Checklist\n\nCheck the PR satisfies following conditions. \n\nReviewers
should verify this PR satisfies this list as well.\n\n- [ ] Any text
added follows [EUI's
writing\nguidelines](https://elastic.github.io/eui/#/guidelines/writing),
uses\nsentence case text and includes
[i18n\nsupport](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)\n-
[
]\n[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)\nwas
added for features that require explanation or tutorials\n- [ ] [Unit or
functional\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\nwere
updated or added to match the most common scenarios\n- [ ] If a plugin
configuration key changed, check if it needs to be\nallowlisted in the
cloud and added to the
[docker\nlist](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)\n-
[ ] This was checked for breaking HTTP API changes, and any
breaking\nchanges have been approved by the breaking-change committee.
The\n`release_note:breaking` label should be applied in these
situations.\n- [ ] [Flaky
Test\nRunner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1)
was\nused on any tests changed\n- [ ] The PR description includes the
appropriate Release Notes section,\nand the correct `release_note:*`
label is applied per
the\n[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)\n-
[ ] Review the
[backport\nguidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)\nand
apply applicable `backport:*`
labels.","sha":"847f9de184d2918f261148ee62350e22bf7e079b","branchLabelMapping":{"^v9.3.0$":"main","^v(\\d+).(\\d+).\\d+$":"$1.$2"}},"sourcePullRequest":{"labels":["release_note:fix",":ml","backport:version","Feature:Inference
UI","v9.3.0","v8.19.7","v9.1.7","v9.2.1"],"title":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as
expected","number":241212,"url":"https://github.com/elastic/kibana/pull/241212","mergeCommit":{"message":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as expected (#241212)\n\n## Summary\n\nRelated to this
[issue](https://github.com/elastic/kibana/issues/241142)\nand this
[fix](https://github.com/elastic/kibana/pull/241188).\n\nThis PR:\n-
updates the inference creation endpoint to ensure max_tokens are
sent\ncorrectly for Anthropic\n- ensures that max_tokens is added back
into the providerConfig when\nviewing the endpoint so that it shows up
correctly in the form\n\nThis is a temporary workaround for anthropic
max_tokens handling until\nthe services endpoint is updated to reflect
the correct structure.\nAnthropic is unique in that it requires
max_tokens to be sent as part of\nthe task_settings instead of the usual
service_settings.\nUntil the services endpoint is updated to reflect
that, there is no way\nfor the form UI to know where to put max_tokens.
This can be removed\nonce that update is made.\n\n\n###
Checklist\n\nCheck the PR satisfies following conditions. \n\nReviewers
should verify this PR satisfies this list as well.\n\n- [ ] Any text
added follows [EUI's
writing\nguidelines](https://elastic.github.io/eui/#/guidelines/writing),
uses\nsentence case text and includes
[i18n\nsupport](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)\n-
[
]\n[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)\nwas
added for features that require explanation or tutorials\n- [ ] [Unit or
functional\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\nwere
updated or added to match the most common scenarios\n- [ ] If a plugin
configuration key changed, check if it needs to be\nallowlisted in the
cloud and added to the
[docker\nlist](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)\n-
[ ] This was checked for breaking HTTP API changes, and any
breaking\nchanges have been approved by the breaking-change committee.
The\n`release_note:breaking` label should be applied in these
situations.\n- [ ] [Flaky
Test\nRunner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1)
was\nused on any tests changed\n- [ ] The PR description includes the
appropriate Release Notes section,\nand the correct `release_note:*`
label is applied per
the\n[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)\n-
[ ] Review the
[backport\nguidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)\nand
apply applicable `backport:*`
labels.","sha":"847f9de184d2918f261148ee62350e22bf7e079b"}},"sourceBranch":"main","suggestedTargetBranches":["8.19","9.1","9.2"],"targetPullRequestStates":[{"branch":"main","label":"v9.3.0","branchLabelMappingKey":"^v9.3.0$","isSourceBranch":true,"state":"MERGED","url":"https://github.com/elastic/kibana/pull/241212","number":241212,"mergeCommit":{"message":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as expected (#241212)\n\n## Summary\n\nRelated to this
[issue](https://github.com/elastic/kibana/issues/241142)\nand this
[fix](https://github.com/elastic/kibana/pull/241188).\n\nThis PR:\n-
updates the inference creation endpoint to ensure max_tokens are
sent\ncorrectly for Anthropic\n- ensures that max_tokens is added back
into the providerConfig when\nviewing the endpoint so that it shows up
correctly in the form\n\nThis is a temporary workaround for anthropic
max_tokens handling until\nthe services endpoint is updated to reflect
the correct structure.\nAnthropic is unique in that it requires
max_tokens to be sent as part of\nthe task_settings instead of the usual
service_settings.\nUntil the services endpoint is updated to reflect
that, there is no way\nfor the form UI to know where to put max_tokens.
This can be removed\nonce that update is made.\n\n\n###
Checklist\n\nCheck the PR satisfies following conditions. \n\nReviewers
should verify this PR satisfies this list as well.\n\n- [ ] Any text
added follows [EUI's
writing\nguidelines](https://elastic.github.io/eui/#/guidelines/writing),
uses\nsentence case text and includes
[i18n\nsupport](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)\n-
[
]\n[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)\nwas
added for features that require explanation or tutorials\n- [ ] [Unit or
functional\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\nwere
updated or added to match the most common scenarios\n- [ ] If a plugin
configuration key changed, check if it needs to be\nallowlisted in the
cloud and added to the
[docker\nlist](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)\n-
[ ] This was checked for breaking HTTP API changes, and any
breaking\nchanges have been approved by the breaking-change committee.
The\n`release_note:breaking` label should be applied in these
situations.\n- [ ] [Flaky
Test\nRunner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1)
was\nused on any tests changed\n- [ ] The PR description includes the
appropriate Release Notes section,\nand the correct `release_note:*`
label is applied per
the\n[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)\n-
[ ] Review the
[backport\nguidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)\nand
apply applicable `backport:*`
labels.","sha":"847f9de184d2918f261148ee62350e22bf7e079b"}},{"branch":"8.19","label":"v8.19.7","branchLabelMappingKey":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"state":"NOT_CREATED"},{"branch":"9.1","label":"v9.1.7","branchLabelMappingKey":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"state":"NOT_CREATED"},{"branch":"9.2","label":"v9.2.1","branchLabelMappingKey":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"state":"NOT_CREATED"}]}]
BACKPORT-->

Co-authored-by: Melissa Alvarez <melissa.alvarez@elastic.co>
alvarezmelissa87 added a commit that referenced this pull request Oct 30, 2025
…ax tokens parameter is passed as expected (#241212) (#241352)

# Backport

This will backport the following commits from `main` to `9.1`:
- [[ML][Inference Endpoints] Anthropic endpoint creation: ensure max
tokens parameter is passed as expected
(#241212)](#241212)

<!--- Backport version: 10.1.0 -->

### Questions ?
Please refer to the [Backport tool
documentation](https://github.com/sorenlouv/backport)

<!--BACKPORT [{"author":{"name":"Melissa
Alvarez","email":"melissa.alvarez@elastic.co"},"sourceCommit":{"committedDate":"2025-10-30T17:36:58Z","message":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as expected (#241212)\n\n## Summary\n\nRelated to this
[issue](https://github.com/elastic/kibana/issues/241142)\nand this
[fix](https://github.com/elastic/kibana/pull/241188).\n\nThis PR:\n-
updates the inference creation endpoint to ensure max_tokens are
sent\ncorrectly for Anthropic\n- ensures that max_tokens is added back
into the providerConfig when\nviewing the endpoint so that it shows up
correctly in the form\n\nThis is a temporary workaround for anthropic
max_tokens handling until\nthe services endpoint is updated to reflect
the correct structure.\nAnthropic is unique in that it requires
max_tokens to be sent as part of\nthe task_settings instead of the usual
service_settings.\nUntil the services endpoint is updated to reflect
that, there is no way\nfor the form UI to know where to put max_tokens.
This can be removed\nonce that update is made.\n\n\n###
Checklist\n\nCheck the PR satisfies following conditions. \n\nReviewers
should verify this PR satisfies this list as well.\n\n- [ ] Any text
added follows [EUI's
writing\nguidelines](https://elastic.github.io/eui/#/guidelines/writing),
uses\nsentence case text and includes
[i18n\nsupport](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)\n-
[
]\n[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)\nwas
added for features that require explanation or tutorials\n- [ ] [Unit or
functional\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\nwere
updated or added to match the most common scenarios\n- [ ] If a plugin
configuration key changed, check if it needs to be\nallowlisted in the
cloud and added to the
[docker\nlist](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)\n-
[ ] This was checked for breaking HTTP API changes, and any
breaking\nchanges have been approved by the breaking-change committee.
The\n`release_note:breaking` label should be applied in these
situations.\n- [ ] [Flaky
Test\nRunner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1)
was\nused on any tests changed\n- [ ] The PR description includes the
appropriate Release Notes section,\nand the correct `release_note:*`
label is applied per
the\n[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)\n-
[ ] Review the
[backport\nguidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)\nand
apply applicable `backport:*`
labels.","sha":"847f9de184d2918f261148ee62350e22bf7e079b","branchLabelMapping":{"^v9.3.0$":"main","^v(\\d+).(\\d+).\\d+$":"$1.$2"}},"sourcePullRequest":{"labels":["release_note:fix",":ml","backport:version","Feature:Inference
UI","v9.3.0","v8.19.7","v9.1.7","v9.2.1"],"title":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as
expected","number":241212,"url":"https://github.com/elastic/kibana/pull/241212","mergeCommit":{"message":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as expected (#241212)\n\n## Summary\n\nRelated to this
[issue](https://github.com/elastic/kibana/issues/241142)\nand this
[fix](https://github.com/elastic/kibana/pull/241188).\n\nThis PR:\n-
updates the inference creation endpoint to ensure max_tokens are
sent\ncorrectly for Anthropic\n- ensures that max_tokens is added back
into the providerConfig when\nviewing the endpoint so that it shows up
correctly in the form\n\nThis is a temporary workaround for anthropic
max_tokens handling until\nthe services endpoint is updated to reflect
the correct structure.\nAnthropic is unique in that it requires
max_tokens to be sent as part of\nthe task_settings instead of the usual
service_settings.\nUntil the services endpoint is updated to reflect
that, there is no way\nfor the form UI to know where to put max_tokens.
This can be removed\nonce that update is made.\n\n\n###
Checklist\n\nCheck the PR satisfies following conditions. \n\nReviewers
should verify this PR satisfies this list as well.\n\n- [ ] Any text
added follows [EUI's
writing\nguidelines](https://elastic.github.io/eui/#/guidelines/writing),
uses\nsentence case text and includes
[i18n\nsupport](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)\n-
[
]\n[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)\nwas
added for features that require explanation or tutorials\n- [ ] [Unit or
functional\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\nwere
updated or added to match the most common scenarios\n- [ ] If a plugin
configuration key changed, check if it needs to be\nallowlisted in the
cloud and added to the
[docker\nlist](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)\n-
[ ] This was checked for breaking HTTP API changes, and any
breaking\nchanges have been approved by the breaking-change committee.
The\n`release_note:breaking` label should be applied in these
situations.\n- [ ] [Flaky
Test\nRunner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1)
was\nused on any tests changed\n- [ ] The PR description includes the
appropriate Release Notes section,\nand the correct `release_note:*`
label is applied per
the\n[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)\n-
[ ] Review the
[backport\nguidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)\nand
apply applicable `backport:*`
labels.","sha":"847f9de184d2918f261148ee62350e22bf7e079b"}},"sourceBranch":"main","suggestedTargetBranches":["8.19","9.1"],"targetPullRequestStates":[{"branch":"main","label":"v9.3.0","branchLabelMappingKey":"^v9.3.0$","isSourceBranch":true,"state":"MERGED","url":"https://github.com/elastic/kibana/pull/241212","number":241212,"mergeCommit":{"message":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as expected (#241212)\n\n## Summary\n\nRelated to this
[issue](https://github.com/elastic/kibana/issues/241142)\nand this
[fix](https://github.com/elastic/kibana/pull/241188).\n\nThis PR:\n-
updates the inference creation endpoint to ensure max_tokens are
sent\ncorrectly for Anthropic\n- ensures that max_tokens is added back
into the providerConfig when\nviewing the endpoint so that it shows up
correctly in the form\n\nThis is a temporary workaround for anthropic
max_tokens handling until\nthe services endpoint is updated to reflect
the correct structure.\nAnthropic is unique in that it requires
max_tokens to be sent as part of\nthe task_settings instead of the usual
service_settings.\nUntil the services endpoint is updated to reflect
that, there is no way\nfor the form UI to know where to put max_tokens.
This can be removed\nonce that update is made.\n\n\n###
Checklist\n\nCheck the PR satisfies following conditions. \n\nReviewers
should verify this PR satisfies this list as well.\n\n- [ ] Any text
added follows [EUI's
writing\nguidelines](https://elastic.github.io/eui/#/guidelines/writing),
uses\nsentence case text and includes
[i18n\nsupport](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)\n-
[
]\n[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)\nwas
added for features that require explanation or tutorials\n- [ ] [Unit or
functional\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\nwere
updated or added to match the most common scenarios\n- [ ] If a plugin
configuration key changed, check if it needs to be\nallowlisted in the
cloud and added to the
[docker\nlist](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)\n-
[ ] This was checked for breaking HTTP API changes, and any
breaking\nchanges have been approved by the breaking-change committee.
The\n`release_note:breaking` label should be applied in these
situations.\n- [ ] [Flaky
Test\nRunner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1)
was\nused on any tests changed\n- [ ] The PR description includes the
appropriate Release Notes section,\nand the correct `release_note:*`
label is applied per
the\n[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)\n-
[ ] Review the
[backport\nguidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)\nand
apply applicable `backport:*`
labels.","sha":"847f9de184d2918f261148ee62350e22bf7e079b"}},{"branch":"8.19","label":"v8.19.7","branchLabelMappingKey":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"state":"NOT_CREATED"},{"branch":"9.1","label":"v9.1.7","branchLabelMappingKey":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"state":"NOT_CREATED"},{"branch":"9.2","label":"v9.2.1","branchLabelMappingKey":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"url":"https://github.com/elastic/kibana/pull/241343","number":241343,"state":"OPEN"}]}]
BACKPORT-->
alvarezmelissa87 added a commit that referenced this pull request Oct 30, 2025
…max tokens parameter is passed as expected (#241212) (#241353)

# Backport

This will backport the following commits from `main` to `8.19`:
- [[ML][Inference Endpoints] Anthropic endpoint creation: ensure max
tokens parameter is passed as expected
(#241212)](#241212)

<!--- Backport version: 10.1.0 -->

### Questions ?
Please refer to the [Backport tool
documentation](https://github.com/sorenlouv/backport)

<!--BACKPORT [{"author":{"name":"Melissa
Alvarez","email":"melissa.alvarez@elastic.co"},"sourceCommit":{"committedDate":"2025-10-30T17:36:58Z","message":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as expected (#241212)\n\n## Summary\n\nRelated to this
[issue](https://github.com/elastic/kibana/issues/241142)\nand this
[fix](https://github.com/elastic/kibana/pull/241188).\n\nThis PR:\n-
updates the inference creation endpoint to ensure max_tokens are
sent\ncorrectly for Anthropic\n- ensures that max_tokens is added back
into the providerConfig when\nviewing the endpoint so that it shows up
correctly in the form\n\nThis is a temporary workaround for anthropic
max_tokens handling until\nthe services endpoint is updated to reflect
the correct structure.\nAnthropic is unique in that it requires
max_tokens to be sent as part of\nthe task_settings instead of the usual
service_settings.\nUntil the services endpoint is updated to reflect
that, there is no way\nfor the form UI to know where to put max_tokens.
This can be removed\nonce that update is made.\n\n\n###
Checklist\n\nCheck the PR satisfies following conditions. \n\nReviewers
should verify this PR satisfies this list as well.\n\n- [ ] Any text
added follows [EUI's
writing\nguidelines](https://elastic.github.io/eui/#/guidelines/writing),
uses\nsentence case text and includes
[i18n\nsupport](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)\n-
[
]\n[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)\nwas
added for features that require explanation or tutorials\n- [ ] [Unit or
functional\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\nwere
updated or added to match the most common scenarios\n- [ ] If a plugin
configuration key changed, check if it needs to be\nallowlisted in the
cloud and added to the
[docker\nlist](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)\n-
[ ] This was checked for breaking HTTP API changes, and any
breaking\nchanges have been approved by the breaking-change committee.
The\n`release_note:breaking` label should be applied in these
situations.\n- [ ] [Flaky
Test\nRunner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1)
was\nused on any tests changed\n- [ ] The PR description includes the
appropriate Release Notes section,\nand the correct `release_note:*`
label is applied per
the\n[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)\n-
[ ] Review the
[backport\nguidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)\nand
apply applicable `backport:*`
labels.","sha":"847f9de184d2918f261148ee62350e22bf7e079b","branchLabelMapping":{"^v9.3.0$":"main","^v(\\d+).(\\d+).\\d+$":"$1.$2"}},"sourcePullRequest":{"labels":["release_note:fix",":ml","backport:version","Feature:Inference
UI","v9.3.0","v8.19.7","v9.1.7","v9.2.1"],"title":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as
expected","number":241212,"url":"https://github.com/elastic/kibana/pull/241212","mergeCommit":{"message":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as expected (#241212)\n\n## Summary\n\nRelated to this
[issue](https://github.com/elastic/kibana/issues/241142)\nand this
[fix](https://github.com/elastic/kibana/pull/241188).\n\nThis PR:\n-
updates the inference creation endpoint to ensure max_tokens are
sent\ncorrectly for Anthropic\n- ensures that max_tokens is added back
into the providerConfig when\nviewing the endpoint so that it shows up
correctly in the form\n\nThis is a temporary workaround for anthropic
max_tokens handling until\nthe services endpoint is updated to reflect
the correct structure.\nAnthropic is unique in that it requires
max_tokens to be sent as part of\nthe task_settings instead of the usual
service_settings.\nUntil the services endpoint is updated to reflect
that, there is no way\nfor the form UI to know where to put max_tokens.
This can be removed\nonce that update is made.\n\n\n###
Checklist\n\nCheck the PR satisfies following conditions. \n\nReviewers
should verify this PR satisfies this list as well.\n\n- [ ] Any text
added follows [EUI's
writing\nguidelines](https://elastic.github.io/eui/#/guidelines/writing),
uses\nsentence case text and includes
[i18n\nsupport](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)\n-
[
]\n[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)\nwas
added for features that require explanation or tutorials\n- [ ] [Unit or
functional\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\nwere
updated or added to match the most common scenarios\n- [ ] If a plugin
configuration key changed, check if it needs to be\nallowlisted in the
cloud and added to the
[docker\nlist](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)\n-
[ ] This was checked for breaking HTTP API changes, and any
breaking\nchanges have been approved by the breaking-change committee.
The\n`release_note:breaking` label should be applied in these
situations.\n- [ ] [Flaky
Test\nRunner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1)
was\nused on any tests changed\n- [ ] The PR description includes the
appropriate Release Notes section,\nand the correct `release_note:*`
label is applied per
the\n[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)\n-
[ ] Review the
[backport\nguidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)\nand
apply applicable `backport:*`
labels.","sha":"847f9de184d2918f261148ee62350e22bf7e079b"}},"sourceBranch":"main","suggestedTargetBranches":["8.19","9.1"],"targetPullRequestStates":[{"branch":"main","label":"v9.3.0","branchLabelMappingKey":"^v9.3.0$","isSourceBranch":true,"state":"MERGED","url":"https://github.com/elastic/kibana/pull/241212","number":241212,"mergeCommit":{"message":"[ML][Inference
Endpoints] Anthropic endpoint creation: ensure max tokens parameter is
passed as expected (#241212)\n\n## Summary\n\nRelated to this
[issue](https://github.com/elastic/kibana/issues/241142)\nand this
[fix](https://github.com/elastic/kibana/pull/241188).\n\nThis PR:\n-
updates the inference creation endpoint to ensure max_tokens are
sent\ncorrectly for Anthropic\n- ensures that max_tokens is added back
into the providerConfig when\nviewing the endpoint so that it shows up
correctly in the form\n\nThis is a temporary workaround for anthropic
max_tokens handling until\nthe services endpoint is updated to reflect
the correct structure.\nAnthropic is unique in that it requires
max_tokens to be sent as part of\nthe task_settings instead of the usual
service_settings.\nUntil the services endpoint is updated to reflect
that, there is no way\nfor the form UI to know where to put max_tokens.
This can be removed\nonce that update is made.\n\n\n###
Checklist\n\nCheck the PR satisfies following conditions. \n\nReviewers
should verify this PR satisfies this list as well.\n\n- [ ] Any text
added follows [EUI's
writing\nguidelines](https://elastic.github.io/eui/#/guidelines/writing),
uses\nsentence case text and includes
[i18n\nsupport](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)\n-
[
]\n[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)\nwas
added for features that require explanation or tutorials\n- [ ] [Unit or
functional\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\nwere
updated or added to match the most common scenarios\n- [ ] If a plugin
configuration key changed, check if it needs to be\nallowlisted in the
cloud and added to the
[docker\nlist](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)\n-
[ ] This was checked for breaking HTTP API changes, and any
breaking\nchanges have been approved by the breaking-change committee.
The\n`release_note:breaking` label should be applied in these
situations.\n- [ ] [Flaky
Test\nRunner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1)
was\nused on any tests changed\n- [ ] The PR description includes the
appropriate Release Notes section,\nand the correct `release_note:*`
label is applied per
the\n[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)\n-
[ ] Review the
[backport\nguidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)\nand
apply applicable `backport:*`
labels.","sha":"847f9de184d2918f261148ee62350e22bf7e079b"}},{"branch":"8.19","label":"v8.19.7","branchLabelMappingKey":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"state":"NOT_CREATED"},{"branch":"9.1","label":"v9.1.7","branchLabelMappingKey":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"state":"NOT_CREATED"},{"branch":"9.2","label":"v9.2.1","branchLabelMappingKey":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"url":"https://github.com/elastic/kibana/pull/241343","number":241343,"state":"OPEN"}]}]
BACKPORT-->
ana-davydova pushed a commit to ana-davydova/kibana that referenced this pull request Nov 3, 2025
…ens parameter is passed as expected (elastic#241212)

## Summary

Related to this [issue](elastic#241142)
and this [fix](elastic#241188).

This PR:
- updates the inference creation endpoint to ensure max_tokens are sent
correctly for Anthropic
- ensures that max_tokens is added back into the providerConfig when
viewing the endpoint so that it shows up correctly in the form

This is a temporary workaround for anthropic max_tokens handling until
the services endpoint is updated to reflect the correct structure.
Anthropic is unique in that it requires max_tokens to be sent as part of
the task_settings instead of the usual service_settings.
Until the services endpoint is updated to reflect that, there is no way
for the form UI to know where to put max_tokens. This can be removed
once that update is made.


### Checklist

Check the PR satisfies following conditions. 

Reviewers should verify this PR satisfies this list as well.

- [ ] Any text added follows [EUI's writing
guidelines](https://elastic.github.io/eui/#/guidelines/writing), uses
sentence case text and includes [i18n
support](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)
- [ ]
[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)
was added for features that require explanation or tutorials
- [ ] [Unit or functional
tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)
were updated or added to match the most common scenarios
- [ ] If a plugin configuration key changed, check if it needs to be
allowlisted in the cloud and added to the [docker
list](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)
- [ ] This was checked for breaking HTTP API changes, and any breaking
changes have been approved by the breaking-change committee. The
`release_note:breaking` label should be applied in these situations.
- [ ] [Flaky Test
Runner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1) was
used on any tests changed
- [ ] The PR description includes the appropriate Release Notes section,
and the correct `release_note:*` label is applied per the
[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)
- [ ] Review the [backport
guidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)
and apply applicable `backport:*` labels.
albertoblaz pushed a commit to albertoblaz/kibana that referenced this pull request Nov 4, 2025
…ens parameter is passed as expected (elastic#241212)

## Summary

Related to this [issue](elastic#241142)
and this [fix](elastic#241188).

This PR:
- updates the inference creation endpoint to ensure max_tokens are sent
correctly for Anthropic
- ensures that max_tokens is added back into the providerConfig when
viewing the endpoint so that it shows up correctly in the form

This is a temporary workaround for anthropic max_tokens handling until
the services endpoint is updated to reflect the correct structure.
Anthropic is unique in that it requires max_tokens to be sent as part of
the task_settings instead of the usual service_settings.
Until the services endpoint is updated to reflect that, there is no way
for the form UI to know where to put max_tokens. This can be removed
once that update is made.


### Checklist

Check the PR satisfies following conditions. 

Reviewers should verify this PR satisfies this list as well.

- [ ] Any text added follows [EUI's writing
guidelines](https://elastic.github.io/eui/#/guidelines/writing), uses
sentence case text and includes [i18n
support](https://github.com/elastic/kibana/blob/main/src/platform/packages/shared/kbn-i18n/README.md)
- [ ]
[Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html)
was added for features that require explanation or tutorials
- [ ] [Unit or functional
tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)
were updated or added to match the most common scenarios
- [ ] If a plugin configuration key changed, check if it needs to be
allowlisted in the cloud and added to the [docker
list](https://github.com/elastic/kibana/blob/main/src/dev/build/tasks/os_packages/docker_generator/resources/base/bin/kibana-docker)
- [ ] This was checked for breaking HTTP API changes, and any breaking
changes have been approved by the breaking-change committee. The
`release_note:breaking` label should be applied in these situations.
- [ ] [Flaky Test
Runner](https://ci-stats.kibana.dev/trigger_flaky_test_runner/1) was
used on any tests changed
- [ ] The PR description includes the appropriate Release Notes section,
and the correct `release_note:*` label is applied per the
[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)
- [ ] Review the [backport
guidelines](https://docs.google.com/document/d/1VyN5k91e5OVumlc0Gb9RPa3h1ewuPE705nRtioPiTvY/edit?usp=sharing)
and apply applicable `backport:*` labels.
@peteharverson peteharverson changed the title [ML][Inference Endpoints] Anthropic endpoint creation: ensure max tokens parameter is passed as expected Dec 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

backport:version Backport to applied version labels Feature:Inference UI ML Inference endpoints UI and AI connector :ml release_note:fix v8.19.7 v9.1.7 v9.2.1 v9.3.0

7 participants