Parallel GPU buffer writes #22314

aevyrie · 2025-12-30T05:22:13Z

Objective

After a series of optimizations making render and postupdate more parallel, write_batched_instance_buffers was regularly one of the largest spans with very low thread use, sitting at 4ms in 1 4ms frame. This makes it an ideal target to improve throughput. Note this screenshot doesn't include some visibility system optimizations:

Solution

Spawn tasks for writing buffers to the GPU. This is especially helpful for current_input_buffer and previous_input_buffer, which take about the same time and are the longest buffer writes - moving these to tasks effectively halves the time spent in the system.

In the 250k bevymark_3d stress test, this saves 1.7ms in the system, and 2.8ms in frame time

frametime

system

Testing

cargo rer bevymark_3d --features=debug,trace_tracy -- --benchmark --waves 250 --per-wave 1000

Write buffers in parallel

0940eca

alice-i-cecile added A-Rendering Drawing game state to the screen C-Performance A change motivated by improving speed, memory usage or compile times S-Needs-Review Needs reviewer attention (from anyone!) to move forward labels Dec 30, 2025

github-project-automation bot added this to Rendering Dec 30, 2025

james7132 self-requested a review December 30, 2025 19:37

james7132 approved these changes Dec 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Parallel GPU buffer writes #22314

Parallel GPU buffer writes #22314

aevyrie commented Dec 30, 2025 •

edited

Loading

Labels

3 participants

Uh oh!

Parallel GPU buffer writes #22314

Are you sure you want to change the base?

Parallel GPU buffer writes #22314

Conversation

aevyrie commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Objective

Solution

Testing

Labels

3 participants

aevyrie commented Dec 30, 2025 •

edited

Loading