Sponsoring

@jart
@Blaizzy

Organizations

@huggingface


Every Eval Ever is a shared schema and crowdsourced eval database. It defines a standardized metadata format for storing AI evaluation results — from leaderboard scrapes and research papers to loca…

Python 87 42 Updated Jul 1, 2026

Un-0: an image generator powered by a simulated system of coupled oscillators, an example of an emerging physical computing substrate.

Python 276 40 Updated Jun 26, 2026

a minimalist agent that teaches you to create coding agents

Python 451 52 Updated Jul 1, 2026

WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

TypeScript 1,130 103 Updated Jun 17, 2026
Swift 8 Updated Jun 22, 2026

Local-first healthcare AI: clinical NER & HIPAA PII de-identification that runs 100% on-device. 1,000+ medical models, 12 languages, Apple MLX + Python, no cloud, no patient data leaving your netwo…

Python 4,010 449 Updated Jul 1, 2026

Model export recipes, Python primitives, and Swift runtime utilities for on-device AI

Swift 1,275 101 Updated Jul 1, 2026

How many experts do we need to serve a model?

Python 152 15 Updated Mar 18, 2026

llama.cpp's official website

Svelte 15 7 Updated Jun 30, 2026

An interface library for RL post training with environments.

Python 2,385 404 Updated Jul 1, 2026

Build and install script for llama.app

Python 15 3 Updated Jun 29, 2026

GEMMs with Metal

Python 21 5 Updated Jun 13, 2026

One hub for all LLMs. Lives on your macOS menu bar.

Swift 117 12 Updated Jun 1, 2026

Secure the tools you `brew install`

Rust 36 2 Updated Jun 23, 2026

Pi coding agent extension: llama.cpp provider with dynamic model + context window discovery

TypeScript 40 8 Updated Jun 14, 2026

Generate, format and mask agent traces with ease.

Python 94 11 Updated Jun 23, 2026

AssetOpsBench - Industry 4.0: A unified benchmark and framework for building, orchestrating, and evaluating domain-specific AI agents for Industry 4.0 asset operations and maintenance, with 460+ sc…

Python 1,954 287 Updated Jul 1, 2026
Shell 1 Updated May 28, 2026

LLM inference in C/C++

C++ 118,920 20,142 Updated Jul 1, 2026

Python bindings for llama.cpp

Python 10,454 1,421 Updated Jun 29, 2026

A utility script to upload PyTorch traces to a Hugging Face Bucket and then build a shareable trace URL

Python 10 Updated Jun 23, 2026

Objective-C port of the tokenizer in HuggingFace's swift-transformers

Objective-C 2 Updated Jun 24, 2026
Python 1 2 Updated May 7, 2026

How fast can you pull from Hugging Face?

Python 25 Updated May 12, 2026

DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm

C 17,189 1,455 Updated Jun 17, 2026

A course on context engineering with code agents.

Python 57 15 Updated May 26, 2026

Opinionated Configuration Files

Shell 5 Updated Jun 22, 2026

Backup and restore OpenClaw agent workspaces to HF Buckets

Shell 2 Updated Mar 17, 2026

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Python 17,361 1,472 Updated Jul 1, 2026