
fix: use local token_hop_len in streaming loop to avoid concurrent state mutation #1849

Open
Caxson wants to merge 1 commit into FunAudioLLM:main from Caxson:fix_streaming_bug

Conversation


@Caxson Caxson commented Mar 12, 2026

fix: use local token_hop_len in streaming loop to avoid concurrent state mutation

The streaming loop in CosyVoice2Model.tts() mutates self.token_hop_len on each iteration (scaling it by stream_scale_factor). When multiple requests share the same model instance, that shared state is corrupted across concurrent inferences.

Use a local variable token_hop_len initialized from self.token_hop_len and update only the local copy inside the loop, so each streaming session has its own hop length progression. Behavior is unchanged for single-request usage.
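
For context, a minimal sketch of the pattern the fix uses. Only CosyVoice2Model.tts(), token_hop_len, and stream_scale_factor come from the actual code; the default values, the chunking loop, and the yielded output are illustrative stand-ins, not the real implementation:

```python
class CosyVoice2Model:
    def __init__(self, token_hop_len=25, stream_scale_factor=2):
        # shared across all requests served by this model instance
        self.token_hop_len = token_hop_len
        self.stream_scale_factor = stream_scale_factor

    def tts(self, speech_tokens):
        # Before the fix, the loop did
        #     self.token_hop_len = int(self.token_hop_len * self.stream_scale_factor)
        # so concurrent streaming sessions overwrote each other's hop length.
        #
        # After the fix: start from the shared default, then advance a local copy.
        token_hop_len = self.token_hop_len
        offset = 0
        while offset < len(speech_tokens):
            chunk = speech_tokens[offset:offset + token_hop_len]
            yield chunk  # stand-in for the real token2wav synthesis step
            offset += token_hop_len
            # grow only this session's hop length
            token_hop_len = int(token_hop_len * self.stream_scale_factor)
```

With the local copy, two generators created from the same model instance advance their hop lengths independently, while a single-request caller sees the same progression as before.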

