Skip to content
View BobQC's full-sized avatar

Highlights

  • Pro

Block or report BobQC

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. SageAttention SageAttention Public

    Forked from thu-ml/SageAttention

    Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

    Cuda 1