Skip to content
View lienz's full-sized avatar

Block or report lienz

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. google-research/leaf-audio google-research/leaf-audio Public

    LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a ve…

    Python 515 53

  2. google-research/diffstride google-research/diffstride Public

    TF/Keras code for DiffStride, a pooling layer with learnable strides.

    Python 124 7

  3. facebookresearch/tdfbanks facebookresearch/tdfbanks Public archive

    Pytorch implementation of time-domain filterbanks

    Python 112 20

  4. kyutai-labs/moshi kyutai-labs/moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python 9.1k 827

  5. kyutai-labs/hibiki kyutai-labs/hibiki Public

    Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

    Rust 1.3k 105

  6. kyutai-labs/dactory kyutai-labs/dactory Public

    Python 43 5