-
Notifications
You must be signed in to change notification settings - Fork 182
Pull requests: lightseekorg/tokenspeed
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(attention): treat window_left as exclusive
#578
opened Jul 1, 2026 by
Yu-Zhewen
Contributor
Loading…
Security: Potential race condition in global backend initialization
#575
opened Jul 1, 2026 by
tomaioo
Loading…
perf(deepseek-v4): accelerate indexer Q and packed FP8 quantization
#563
opened Jun 30, 2026 by
dongjiyingdjy
Contributor
Loading…
perf(multimodal): Prewarm Kimi vision encoder before readiness
#556
opened Jun 30, 2026 by
qimcis
Contributor
Loading…
perf(scheduler): Split mixed prefill/decode forwards to preserve decode graphs
#555
opened Jun 30, 2026 by
qimcis
Contributor
Loading…
feat(serve): export Prometheus metrics via /metrics endpoint
#553
opened Jun 29, 2026 by
ilyaters
Loading…
feat(platform): model the Blackwell family — datacenter / Thor / consumer
#551
opened Jun 28, 2026 by
jasl
Contributor
Loading…
ci(disaggregation): add Qwen3.5 EPD encode coverage
#549
opened Jun 28, 2026 by
chenht2022
Contributor
•
Draft
feat(disaggregation): add EPD encode pipeline
#548
opened Jun 28, 2026 by
chenht2022
Contributor
Loading…
perf(deepseek-v4): integrate TRT-LLM mHC kernels
#547
opened Jun 28, 2026 by
dongjiyingdjy
Contributor
•
Draft
feat(rl): weight-transfer control plane for RL online weight sync
#546
opened Jun 28, 2026 by
qywu
Collaborator
Loading…
refactor(spec-decode): simplify deepseekV3/GLM attention path for #217 (3/3)
#544
opened Jun 28, 2026 by
rjzhb
Collaborator
Loading…
test(agentic): add EvalScope trie benchmark protocol
#466
opened Jun 17, 2026 by
Xiangyi1996
Collaborator
•
Draft
test(ci): add DeepSeek-V4-Flash MTP AIME25 eval
#461
opened Jun 16, 2026 by
dongjiyingdjy
Contributor
Loading…
fix(scheduler): release paged-cache snapshots in ~HybridPrefixCache to avoid teardown use-after-free
inactive
#455
opened Jun 15, 2026 by
Sunt-ing
Loading…
Fix EP8 DP/TP RSAG init and empty LM head
#416
opened Jun 11, 2026 by
yubofredwang
Contributor
Loading…
perf(gdn): fuse causal_conv1d and QKV split for GDN prefill
#382
opened Jun 8, 2026 by
elwhyjay
Contributor
Loading…
fix(cache): Coarsely fence the compute stream behind the host loadback stream on.
#370
opened Jun 6, 2026 by
LorrinWWW
Contributor
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-06-01.