mateiz

Organizations

@mesos @radlab


Large World Model -- Modeling Text and Video with Millions Context

Python 7,409 556 Updated Oct 19, 2024

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,923 184 Updated Feb 24, 2024

A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.

Python 1,408 87 Updated Feb 7, 2025

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,793 1,146 Updated Jun 30, 2023
Python 1,563 229 Updated Mar 25, 2026

DSPy: The framework for programming—not prompting—language models

Python 34,137 2,863 Updated Apr 30, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,767 2,085 Updated Apr 30, 2026

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Java 1,192 164 Updated Apr 30, 2026

Sample base images for Databricks Container Services

Jupyter Notebook 213 129 Updated Apr 27, 2026

An open protocol for secure data sharing

Scala 938 224 Updated May 1, 2026

Offload IoT computation to local hardware while justifying any network accesses.

Rust 7 2 Updated May 31, 2023

A native Rust library for Delta Lake, with bindings into Python

Rust 3,206 615 Updated Apr 30, 2026

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

Jupyter Notebook 341 59 Updated Apr 1, 2026

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,854 469 Updated Oct 14, 2025

The library for web and native user interfaces.

JavaScript 244,788 51,036 Updated Apr 29, 2026

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python 28,549 4,588 Updated May 1, 2026

Joblib Apache Spark Backend

Python 249 24 Updated Mar 24, 2026

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 741 93 Updated Jan 26, 2023
Python 392 115 Updated Nov 4, 2022

An open-source toolkit for large-scale genomic analysis

Scala 299 118 Updated Apr 19, 2026

Puffer is a free live TV streaming website and a research study at Stanford using machine learning to improve video streaming

C++ 909 140 Updated Nov 7, 2025

Koalas: pandas API on Apache Spark

Python 3,373 368 Updated Mar 20, 2024

A Python-embedded modeling language for convex optimization problems.

C++ 6,197 1,174 Updated Apr 30, 2026

The Legion Parallel Programming System

C++ 757 151 Updated Mar 28, 2026

GoCD plugins to work with MLFlow as model repository in a CD flow

Java 32 4 Updated Nov 1, 2023

MLflow App Library

Python 79 35 Updated Dec 25, 2018

Intellij Jsonnet Plugin

Java 90 17 Updated Mar 9, 2024

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…

Python 25,672 5,671 Updated May 1, 2026

The "Command Line Interactive Controller for Kubernetes"

Rust 1,508 92 Updated Mar 27, 2026

Accelerating network inference over video

Python 437 121 Updated Mar 6, 2020