kebe7jun

Kebe kebe7jun

Focus on Istio, Kubernetes, eBPF, WASM, Envoy.

190 followers · 97 following

DaoCloud
Shanghai
UTC +08:00

Achievements

x3 x3 x4

Stars

matrixhub-ai / matrixhub

An Open-source, self-hosted AI model hub with Hugging Face compatibility, accelerating vLLM/SGLang performance.

Go 56 12 Updated Mar 2, 2026

tw93 / Mole

🐹 Deep clean and optimize your Mac.

Shell 37,578 1,031 Updated Mar 2, 2026

samzong / moltbot-channel-feishu

A production-grade Feishu/Lark channel plugin for Moltbot(Clawdbot).

TypeScript 7 2 Updated Feb 3, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 244,324 47,238 Updated Mar 2, 2026

anomalyco / opencode

The open source coding agent.

TypeScript 113,795 11,455 Updated Mar 2, 2026

infinigence / FUSCO

High-performance distributed data shuffling (all-to-all) library for MoE training and inference

Python 112 11 Updated Feb 28, 2026

Tongyi-MAI / MAI-UI

MAI-UI: Real-World Centric Foundation GUI Agents ranging from 2B to 235B

Jupyter Notebook 1,703 170 Updated Feb 10, 2026

gpustack / gpustack

Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang.

Python 4,559 461 Updated Mar 2, 2026

NVIDIA / NVSentinel

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 192 49 Updated Mar 2, 2026

hypertrons / hypertrons-crx

A browser extension for insights into GitHub, Gitee projects and developers.

TypeScript 399 103 Updated Feb 28, 2026

bobeff / open-source-games

A list of open source games.

Python 12,147 955 Updated Feb 25, 2026

photoprism / photoprism

AI-Powered Photos App for the Decentralized Web 🌈💎✨

Go 39,407 2,210 Updated Mar 1, 2026

snowflakedb / ArcticInference

ArcticInference: vLLM plugin for high-throughput, low-latency inference

Python 403 50 Updated Feb 24, 2026

vipshop / cache-dit

🤗 A PyTorch-native and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for DiTs.

Python 1,059 63 Updated Mar 2, 2026

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,772 977 Updated Feb 25, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 2,863 470 Updated Mar 2, 2026

jukofyork / transplant-vocab

Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.

Python 49 8 Updated Oct 29, 2025

samzong / modelfs

Go 2 Updated Nov 27, 2025

deepseek-ai / LPLB

An early research stage expert-parallel load balancer for MoE models based on linear programming.

Python 499 33 Updated Nov 19, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 4,508 585 Updated Mar 2, 2026

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,030 346 Updated Feb 27, 2026

langfuse / langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 22,482 2,245 Updated Feb 28, 2026

flagos-ai / FlagTree

FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang/triton.

C++ 213 40 Updated Mar 1, 2026

ovg-project / kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 797 87 Updated Feb 27, 2026

ai-dynamo / aiconfigurator

Offline optimization of your disaggregated Dynamo graph

Python 195 67 Updated Mar 2, 2026

anthropics / skills

Public repository for Agent Skills

Python 80,585 8,446 Updated Feb 25, 2026

ImagineAILab / ai-by-hand-excel

5,957 746 Updated Jan 28, 2025

scraly / developers-conferences-agenda

developers.events is a community-driven platform listing developer/tech conferences and Calls for Papers (CFPs) worldwide with a list, a calendar and a map view. It helps organizers, speakers, spon…

JavaScript 1,939 493 Updated Mar 2, 2026

jinbooooom / ai-infra-hpc

HPC tutorial covering collective communication (MPI, NCCL), CUDA programming, SIMD vectorization, RDMA communication, and more.

Cuda 261 25 Updated Feb 14, 2026

QwenLM / Qwen3Guard

Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.

Python 428 30 Updated Oct 21, 2025
