- Seattle
- www.cmwilhelm.com/
Stars
⚡ Boost inference speed of T5 models by 5x and reduce the model size by 3x.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing, and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
This repo is for demonstration purposes only.



