[ML] Integrate SageMaker with OpenAI Embeddings (#126856)#127610

Merged
elasticsearchmachine merged 1 commit into elastic:8.19 from prwhelan:backport/8.19/126856
May 1, 2025
Conversation


@prwhelan prwhelan commented May 1, 2025

Integrating with SageMaker.

Current design:
- SageMaker accepts any byte payload, which can be text, CSV, or JSON. `api` represents the structure of the payload that we will send: for example `openai`, `elastic`, or `common`, and probably `cohere` or `huggingface` as well.
- `api` implementations are extensions of `SageMakerSchemaPayload`, which supports:
  - "extra" service and task settings specific to the payload structure, so `cohere` would require `embedding_type` and `openai` would require `dimensions` in the `service_settings`
  - conversion logic from model, service settings, task settings, and input to `SdkBytes`
  - conversion logic from responding `SdkBytes` to `InferenceServiceResults`
- Everything else is tunneling: there are a number of base `service_settings` and `task_settings`, independent of the `api` format, that we will store and set.
- We let the SDK do the bulk of the work in terms of connection details, rate limiting, retries, etc.
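The payload abstraction above can be sketched roughly as follows. This is an illustrative sketch only, not the actual Elasticsearch classes: names, signatures, and the use of `byte[]` in place of the AWS SDK's `SdkBytes` (and `List<float[]>` in place of `InferenceServiceResults`) are assumptions made to keep the example self-contained.

```java
import java.nio.charset.StandardCharsets;
import java.util.List;

// Hypothetical sketch of the SageMakerSchemaPayload idea described above.
// The real implementation converts to/from the AWS SDK's SdkBytes and
// returns InferenceServiceResults; both are simplified here.
interface SageMakerSchemaPayload {
    // Which payload structure this implements, e.g. "openai" or "elastic".
    String api();

    // Build the request body the SageMaker endpoint expects.
    // (Model, service settings, and task settings are omitted for brevity.)
    byte[] requestBytes(List<String> input);

    // Parse the endpoint's response bytes back into embeddings.
    List<float[]> parseResponse(byte[] response);
}

// An OpenAI-style payload: wraps the inputs in the JSON body an
// OpenAI-compatible SageMaker endpoint would expect.
class OpenAiPayload implements SageMakerSchemaPayload {
    @Override
    public String api() {
        return "openai";
    }

    @Override
    public byte[] requestBytes(List<String> input) {
        StringBuilder sb = new StringBuilder("{\"input\":[");
        for (int i = 0; i < input.size(); i++) {
            if (i > 0) sb.append(',');
            sb.append('"').append(input.get(i)).append('"');
        }
        sb.append("]}");
        return sb.toString().getBytes(StandardCharsets.UTF_8);
    }

    @Override
    public List<float[]> parseResponse(byte[] response) {
        // The real code would parse the endpoint's JSON response; elided here.
        return List.of();
    }
}
```

Under this shape, adding a new `api` format is a matter of implementing one more payload class; everything around it (connection handling, retries, rate limiting) stays in the SDK layer.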
@prwhelan prwhelan added the >enhancement, :ml (Machine learning), backport, Team:ML (meta label for the ML team), auto-merge-without-approval (automatically merge pull request when CI checks pass; NB doesn't wait for reviews!), and v8.19.0 labels on May 1, 2025
@elasticsearchmachine elasticsearchmachine merged commit 577a6f8 into elastic:8.19 May 1, 2025
15 checks passed
@prwhelan prwhelan deleted the backport/8.19/126856 branch May 1, 2025 18:37
