Skip to content
View jiweibo's full-sized avatar
:octocat:
I may be slow to respond.
:octocat:
I may be slow to respond.

Block or report jiweibo

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Lightweight coding agent that runs in your terminal

Rust 79,380 11,373 Updated May 1, 2026

Zstandard - Fast real-time compression algorithm

C 27,046 2,467 Updated May 1, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,978 540 Updated Apr 30, 2026

Lightweight Kubernetes

Go 32,895 2,648 Updated Apr 30, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,109 447 Updated May 1, 2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Python 941 76 Updated Mar 4, 2026

📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程

Python 42,112 5,088 Updated Apr 29, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,596 4,095 Updated May 1, 2026

A list of free LLM inference resources accessible via API.

Python 19,576 1,974 Updated May 1, 2026

Write scalable load tests in plain Python 🚗💨

Python 27,748 3,208 Updated Apr 28, 2026

The open source coding agent.

TypeScript 153,019 17,667 Updated May 1, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 91,542 10,420 Updated Apr 26, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,091 607 Updated Mar 13, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,393 7,521 Updated May 1, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,036 134 Updated Apr 28, 2026

A book for Learning the Foundations of LLMs

16,104 1,537 Updated Dec 12, 2025

LLM inference in C/C++

C++ 107,789 17,659 Updated May 1, 2026

🌸 A command-line fuzzy finder

Go 79,931 2,796 Updated Apr 27, 2026

微舆:人人可用的多Agent舆情分析助手,打破信息��房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。

Python 40,700 7,521 Updated Mar 13, 2026

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,312 272 Updated Feb 20, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,710 1,076 Updated May 1, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,099 1,748 Updated Jan 30, 2026

Move and resize windows on macOS with keyboard shortcuts and snap areas

Swift 28,969 921 Updated Apr 29, 2026
323 29 Updated Apr 6, 2026

The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…

29,286 5,148 Updated Jan 13, 2026

Infisical is the open-source platform for secrets, certificates, and privileged access management.

TypeScript 26,423 1,851 Updated May 1, 2026

Sync notes between local and cloud with smart conflict: S3 (Amazon S3/Cloudflare R2/Backblaze B2/...), Dropbox, webdav (NextCloud/InfiniCLOUD/Synology/...), OneDrive, Google Drive (GDrive), Box, pC…

TypeScript 7,351 377 Updated Nov 10, 2024

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 48,733 10,398 Updated Apr 30, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,187 1,983 Updated Jan 9, 2026
Next