
CEL input: Add OTel tracing#48440

Open
chrisberkhout wants to merge 50 commits into elastic:main from chrisberkhout:cel-otel-tracing

Conversation

@chrisberkhout
Contributor

@chrisberkhout chrisberkhout commented Jan 16, 2026

Proposed commit message

CEL input: Add OTel tracing (#)

Instruments the CEL input with OpenTelemetry tracing. Sampling is 100%,
so all operations are covered. By default no exporter is set up and
traces are not exported. Export can be configured to go to the console
or to an OTLP endpoint using the `grpc` (default) or `http/protobuf`
protocols.

Typically, OTel tracing treats the whole process as the "resource".
In this case, however, the resource is the input instance. For that
reason a tracer provider is created specifically for the input instance,
and it is not set as the global tracer provider.

There is an extra environment variable to override any other
configuration and disable export for a specific input:
`BEATS_OTEL_TRACES_DISABLE=cel`.

Spans covering HTTP requests are enriched with attributes for request
and response headers, with values automatically (but configurably)
redacted to protect sensitive data.
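The redaction decision can be sketched roughly as follows. The function name, parameters, and the default word list are illustrative, not the exact implementation; the real defaults and the `otel.trace.redacted` / `otel.trace.unredacted` overrides live in the input's configuration:

```go
package main

import (
	"fmt"
	"strings"
)

// sensitiveWords is an illustrative default list; the input ships its own
// defaults, extensible via otel.trace.redacted / otel.trace.unredacted.
var sensitiveWords = []string{"auth", "token", "secret", "passwd", "password", "key"}

// shouldRedact reports whether a header (or query parameter) name looks
// sensitive. An explicit unredacted entry wins over everything else.
func shouldRedact(name string, redacted, unredacted []string) bool {
	for _, n := range unredacted {
		if strings.EqualFold(n, name) {
			return false
		}
	}
	for _, n := range redacted {
		if strings.EqualFold(n, name) {
			return true
		}
	}
	lower := strings.ToLower(name)
	for _, w := range sensitiveWords {
		if strings.Contains(lower, w) {
			return true
		}
	}
	return false
}

func main() {
	fmt.Println(shouldRedact("Authorization", nil, nil))                       // true: contains "auth"
	fmt.Println(shouldRedact("Authorization", nil, []string{"Authorization"})) // false: explicitly unredacted
	fmt.Println(shouldRedact("Content-Type", nil, nil))                        // false
}
```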

Normal request logging and Filebeat logs will include span and trace IDs
that allow correlation with the OTel data. This is done wherever we can
pass a logger from the trace creation site; other Filebeat logging will
lack the IDs. Because logger attributes are append-only, we pass around
a logger with the added attributes rather than modifying attributes on a
global logger.

Normal request logging had unused functionality for including a
`trace.id` field. That has been removed in favor of an OTel-specific
implementation that adds `trace.id` and `span.id` if there is a current,
valid span.

Requests initiated by CEL will have spans added by `otelhttp` and will
identify the correct parent span using trace data from the request
context. Since the relevant eval-time context is not propagated to those
requests by mito, cel-go[1] or oauth2[2], `ContextInjector` is used to
rewrite each request to include the current context as it is processed.

[1]: https://github.com/google/cel-go/issues/557
[2]: https://github.com/golang/oauth2/issues/262

There were a couple of points where the initial approach changed:

  • Use of https://pkg.go.dev/go.opentelemetry.io/contrib/exporters/autoexport to interpret OTel environment variables and set up the exporter was removed in favor of manual handling, which seems to be standard when using the Go SDK (unlike implementations in some other languages).
  • The context with OTel tracing data needs to be propagated to the HTTP client used by CEL so that HTTP spans are attached to the correct parent span. That was initially done with a change in Mito: Add HTTPWithContextFnOpts so requests can have eval-time context mito#118. That has been closed to avoid changing Mito. Now it is done in the CEL Input by having ContextInjector rewrite requests in the client used by CEL, which also solves the problem for OAuth2 requests.

There are some differences from the attribute and other names given in the planning document:

  • cel.periodic.program_count
    → Changed to cel.periodic.execution_count to match cel.program.execution.
  • cel.program.batch_count
    → Removed. It would only indicate whether an execution returned any events or not. Any other batching is internal to the CEL evaluation.
  • cel.{periodic,program}.success
    → Removed, in favor of span status.
  • cel.program.error_message
    → Not set. Uses SetStatus and RecordError instead.
  • BEATS_OTEL_TRACING_DISABLE
    → Changed to BEATS_OTEL_TRACES_DISABLE to match OTEL_TRACES_EXPORTER and OTEL_EXPORTER_OTLP_TRACES_*.

Handling of span-specific context and loggers is somewhat cumbersome. Refactoring to extract separate functions from `run` for the separate stages of processing will help to tidy this up and is planned as follow-up work: #48464.

Checklist

  • My code follows the style guidelines of this project
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have made corresponding changes to the default configuration files
  • I have added tests that prove my fix is effective or that my feature works. Where relevant, I have used the stresstest.sh script to run them under stress conditions and race detector to verify their stability.
  • I have added an entry in ./changelog/fragments using the changelog tool.

How to test this PR locally

You can use otel-desktop-viewer as a simple receiver and viewer of OTel traces:

# Install it
go install github.com/CtrlSpice/otel-desktop-viewer@latest

# Run it. It will open its web UI
otel-desktop-viewer

# In another terminal, set it as the destination for OTel traces
export OTEL_TRACES_EXPORTER=otlp
export OTEL_EXPORTER_OTLP_TRACES_PROTOCOL=grpc
export OTEL_EXPORTER_OTLP_TRACES_ENDPOINT=http://localhost:4317

In the terminal with those environment variables set, you can run the input with an example that includes OAuth2 and multiple requests per period, like this:

(cd x-pack/filebeat && go build) && ./x-pack/filebeat/filebeat run -c <(echo '
filebeat.inputs:
- type: cel
  enabled: true
  id: cel-1
  interval: 5s
  resource.url: https://api.ipify.org/?format=json&passwd=mysecretword
  program: |
    get(state.url).Body.as(body, state.with({
        "events": [body.decode_json()],
        "want_more": int(state.?runcount.orValue(1)) % 3 != 0,
        "runcount": int(state.?runcount.orValue(1)) + 1,
    }))
  resource.tracer.enable: true
  resource.tracer.filename: "x-pack/filebeat/logs/cel/http-request-trace-cel-*.ndjson"
  auth.oauth2.enabled: true
  auth.oauth2.client.id: someclientid
  auth.oauth2.client.secret: someclientsecret
  auth.oauth2.scopes: scope.me
  auth.oauth2.token_url: https://oauth-mock.mock.beeceptor.com/oauth/token/github
  auth.oauth2.endpoint_params:
    grant_type: client_credentials
  otel.trace.redacted:
    - User-Agent
  otel.trace.unredacted:
    - Authorization
output.elasticsearch:
  hosts: ["https://elasticsearch:9200"]
  username: "elastic"
  password: "changeme"
  protocol: "https"
  ssl.verification_mode: "none"
  preset: balanced
logging.level: debug
logging.to_stderr: true
')

You can also use Elastic Observability to receive and view OTel traces, but it involves a bit more setup.

Bring up the Elastic Stack:

elastic-package stack up -v

In Kibana, go to "Management > Integrations" and open the "APM" integration page. Then:

  • Click "Manage APM integration in Fleet", then "Add Elastic APM".
  • Under "Configure integration > Integration settings > General > Server configuration", change the Host and URL settings to use '0.0.0.0' instead of 'localhost'.
  • Under "Where to add this integration?", choose "Existing hosts > Elastic Agent (elastic-package)".
  • Click "Save and continue".

Now, back in the terminal, find the IP address of the agent container.

docker ps # confirm the agent container name is elastic-package-stack-elastic-agent-1
AGENT="elastic-package-stack-elastic-agent-1"
AGENT_IP=$(docker inspect "$AGENT" \
  --format '{{ (index .NetworkSettings.Networks "elastic-package-stack_default").IPAddress }}')
echo "$AGENT_IP" # confirm the IP was found

Use that as the destination for OTel traces:

export OTEL_TRACES_EXPORTER=otlp
export OTEL_EXPORTER_OTLP_TRACES_PROTOCOL=grpc
export OTEL_EXPORTER_OTLP_TRACES_ENDPOINT="http://$AGENT_IP:8200"

Then, from the terminal with those settings, you can run the input using the example Filebeat configuration above.

To view the exported traces in Kibana, go to "Observability > Applications > Traces".

Related

Use cases

This tracing is to be used for troubleshooting, particularly for Agentless.

Screenshots

OTel traces for the CEL Input in Elastic Observability:
[Screenshot: "cel.periodic.run" transactions in the APM Traces view in Elastic Observability]

@chrisberkhout chrisberkhout self-assigned this Jan 16, 2026
@chrisberkhout chrisberkhout added enhancement Filebeat Filebeat Team:Security-Service Integrations Security Service Integrations Team labels Jan 16, 2026
@botelastic botelastic bot added needs_team Indicates that the issue/PR needs a Team:* label and removed needs_team Indicates that the issue/PR needs a Team:* label labels Jan 16, 2026
@github-actions
Contributor

🤖 GitHub comments

Just comment with:

  • run docs-build : Re-trigger the docs validation. (use unformatted text in the comment!)
@mergify
Contributor

mergify bot commented Jan 16, 2026

This pull request is now in conflicts. Could you fix it? 🙏
To fix up this pull request, you can check it out locally. See documentation: https://help.github.com/articles/checking-out-pull-requests-locally/

git fetch upstream
git checkout -b cel-otel-tracing upstream/cel-otel-tracing
git merge upstream/main
git push upstream cel-otel-tracing
@mergify
Contributor

mergify bot commented Jan 16, 2026

This pull request does not have a backport label.
If this is a bug or security fix, could you label this PR, @chrisberkhout? 🙏
To do so, you'll need to label your PR with:

  • The upcoming major version of the Elastic Stack
  • The upcoming minor version of the Elastic Stack (if you're not pushing a breaking change)

To fix up this pull request, you need to add the backport labels for the needed
branches, such as:

  • backport-8.\d is the label to automatically backport to the 8.\d branch (\d is a digit)
  • backport-active-all is the label that automatically backports to all active branches.
  • backport-active-8 is the label that automatically backports to all active minor branches for the 8 major.
  • backport-active-9 is the label that automatically backports to all active minor branches for the 9 major.
@mergify
Contributor

mergify bot commented Jan 23, 2026

This pull request is now in conflicts. Could you fix it? 🙏
To fix up this pull request, you can check it out locally. See documentation: https://help.github.com/articles/checking-out-pull-requests-locally/

git fetch upstream
git checkout -b cel-otel-tracing upstream/cel-otel-tracing
git merge upstream/main
git push upstream cel-otel-tracing
@github-actions
Contributor

github-actions bot commented Jan 30, 2026

🔍 Preview links for changed docs

@elasticmachine
Contributor

Pinging @elastic/security-service-integrations (Team:Security-Service Integrations)

@pierrehilbert pierrehilbert added the Team:Elastic-Agent-Data-Plane Label for the Agent Data Plane team label Feb 3, 2026
chrisberkhout and others added 23 commits February 9, 2026 11:41
Co-authored-by: Janeen Mikell Roberts <57149392+jmikell821@users.noreply.github.com>
Co-authored-by: Janeen Mikell Roberts <57149392+jmikell821@users.noreply.github.com>
Co-authored-by: Dan Kortschak <dan.kortschak@elastic.co>
Contributor

@orestisfl orestisfl left a comment


Use of https://pkg.go.dev/go.opentelemetry.io/contrib/exporters/autoexport to interpret OTel environment variables and set up the exporter was removed in favor of manual handling, which seems to be standard when using the Go SDK (unlike implementations in some other languages).

I was not aware that this is the accepted standard. What is that based on?

Typically OTel tracing considers the whole process to be the "resource".
However, in this case the resource is the input instance. For that
reason a trace provider is created specifically for the input instance
and it is not explicitly set as the global tracer provider.

I would perhaps expect "filebeat" to be the resource; otherwise I would be concerned that we would spam the service inventory with every single input name.

OTel traces for the CEL Input in Elastic Observability

Any idea why it's listed as "unknown" on the path above?

metrics, reg := newInputMetrics(env.MetricsRegistry, env.Logger)

ctx := ctxtool.FromCanceller(env.Cancelation)
otelTracerProvider, err := otel.NewTracerProvider(ctx, getResourceAttributes(env, cfg), i.Name())
Contributor


The Shutdown method is never called on the provider. Could that lead to unexpected data loss?

metrics, reg := newInputMetrics(env.MetricsRegistry, env.Logger)

ctx := ctxtool.FromCanceller(env.Cancelation)
otelTracerProvider, err := otel.NewTracerProvider(ctx, getResourceAttributes(env, cfg), i.Name())
Contributor


Q: Could this lead to significant overhead for multiple inputs? Could we instead make this call once and set any run-specific attributes in the span level?

@efd6
Contributor

efd6 commented Feb 9, 2026

I do see one span per trace without a parent ID, and other parts of the UI identify the root as a root span.

Yes, that's what I'm seeing.

Member

@andrewkroh andrewkroh left a comment


I ran the PR locally. Works as I expected, minus the few things I commented on. 👍

  • URL query parameter redaction works
  • Header redaction works
  • Default sensitive-word detection works
  • File tracer (resource.tracer) includes trace.id and span.id
  • Filebeat debug logs include trace.id and span.id fields
  • Resource attributes are populated
  • BEATS_OTEL_TRACES_DISABLE=cel disables trace export as expected

trace.json

Comment on lines +329 to +331
case <-waitCtx.Done():
runSpan.SetStatus(codes.Unset, waitCtx.Err().Error())
return waitCtx.Err()
Member


When <-waitCtx.Done() fires, the function returns without calling waitSpan.End(). Please add waitSpan.End() before the return.

Comment on lines 478 to 480
if !ok {
metricsRecorder.AddProgramRunDuration(ctx, time.Since(start))
metricsRecorder.AddProgramRunDuration(execCtx, time.Since(start))
continue
Member


The loop continues without ending execSpan. It looks like we are leaking the span?

Comment on lines +492 to 493
errorSpans(err, end{execSpan}, runSpan)
return errors.New("unexpected missing events array from evaluation")
Member


IIUC, at this point, err may be nil here. A fresh error should be used instead:

err := errors.New("unexpected missing events array from evaluation")
errorSpans(err, end{execSpan}, runSpan)
return err
err := fmt.Errorf("unexpected type returned for evaluation cursor element: %T", cursors[0])
metricsRecorder.AddProgramRunDuration(pubCtx, time.Since(start))
errorSpans(err, end{pubSpan}, end{execSpan}, runSpan)
return fmt.Errorf("unexpected type returned for evaluation cursor element: %T", cursors[0])
Member


Duplicated error.

Suggested change
return fmt.Errorf("unexpected type returned for evaluation cursor element: %T", cursors[0])
return err
}

func (rt *ExtraSpanAttribsRoundTripper) RoundTrip(r *http.Request) (*http.Response, error) {

Member


I think this file wants to be gofumpt -w -extraed. 😄

Comment on lines +392 to +394
span.SetAttributes(attribute.StringSlice(
"url.full",
[]string{sanitizedURLString(r.URL, rt.shouldRedact)},
Member


Per OTel semantic conventions, url.full is a string type, not an array. This should be

attribute.String("url.full", sanitizedURLString(r.URL, rt.shouldRedact)).

https://opentelemetry.io/docs/specs/semconv/registry/attributes/url/#url-full

func (rt *ExtraSpanAttribsRoundTripper) RoundTrip(r *http.Request) (*http.Response, error) {

span := trace.SpanFromContext(r.Context())
if span != nil && span.SpanContext().IsValid() {
Member


trace.SpanFromContext never returns nil. It may return a noop, but never nil.

Suggested change
if span != nil && span.SpanContext().IsValid() {
if span.SpanContext().IsValid() {
return resp, err
}

if span != nil && span.SpanContext().IsValid() {
Member


Suggested change
if span != nil && span.SpanContext().IsValid() {
if span.SpanContext().IsValid() {
Comment on lines 33 to 35
// TraceIDKey is key used to add a trace.id value to the context of HTTP
// requests. The value will be logged by LoggingRoundTripper.
const TraceIDKey = contextKey("trace.id")
Member


Dead code?

return false
}

var sensitiveWords = map[string]struct{}{
Member


The word "credentials" in Access-Control-Allow-Credentials is a CORS concept, not a secret. This is a common header, and we should avoid redacting it all the time. Users / developers would need to add it to otel.trace.unredacted to see the value, which is unlikely to occur to them.

Maybe we need a set of known safe headers...?

var knownSafeNames = map[string]struct{}{
      "access-control-allow-credentials": {},
      // etc.
}

We will have to be vigilant on our code reviews for packages to make sure that we are setting the unredacted for things like sort_key, country_code, etc. We can probably put something about this into our code review wiki page for developing packages, and hopefully AI tools can help keep us straight.

@leehinman leehinman removed their request for review February 13, 2026 19:02