amd-hhashemi

Popular repositories

  1. hipBLASLt (Public)

    Forked from ROCm/hipBLASLt

    hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

    Assembly

  2. composable_kernel (Public)

    Forked from ROCm/composable_kernel

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    C++

  3. stndln_prfll (Public)

    A standalone prefill based on flashinfer

    Cuda

  4. aiter (Public)

    Forked from ROCm/aiter

    AI Tensor Engine for ROCm

    Python

  5. pytorch (Public)

    Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python

  6. clr (Public)

    Forked from ROCm/clr

    C++