Plugins Ecosystem
Welcome to the FiftyOne Plugins ecosystem! 🚀
Here you’ll discover cutting-edge research, state-of-the-art models, and powerful add-ons that unlock new FiftyOne workflows.
FiftyOne plugins allow you to extend and customize the functionality of the core tool to suit your specific needs. From advanced computer vision models to integrations with other popular AI tools, this curated collection of plugins will transform FiftyOne into your bespoke visual AI development workbench.
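
As a rough illustration of how a plugin from this collection might be installed, the sketch below assembles a `fiftyone plugins download` command and runs it only if the `fiftyone` CLI is available. The repository URL and the plugin name `@voxel51/io` are assumptions chosen for illustration and do not come from this page, and the helper functions are hypothetical. Check each plugin's own README for its actual install instructions.

```python
import shutil
import subprocess


def build_download_command(repo_url, plugin_names=None):
    """Return the argv list for a `fiftyone plugins download` call.

    `repo_url` is the GitHub repository hosting the plugin(s).
    `plugin_names` optionally restricts the download to specific plugins.
    """
    cmd = ["fiftyone", "plugins", "download", repo_url]
    if plugin_names:
        cmd += ["--plugin-names", *plugin_names]
    return cmd


def download_plugins(repo_url, plugin_names=None):
    """Run the download command if the `fiftyone` CLI is on PATH."""
    cmd = build_download_command(repo_url, plugin_names)
    if shutil.which("fiftyone") is None:
        # FiftyOne is not installed here, so only report the command
        print("fiftyone CLI not found; would run:", " ".join(cmd))
        return None
    return subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Assumed example values, not taken from this page
    print(" ".join(build_download_command(
        "https://github.com/voxel51/fiftyone-plugins",
        plugin_names=["@voxel51/io"],
    )))
```

Keeping the command construction separate from its execution makes it easy to inspect what would be installed before anything is downloaded.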

by voxel51
Utilities for integrating FiftyOne with annotation tools

by voxel51
Utilities for working with the FiftyOne Brain

by voxel51
Create your own custom dashboards from within the App

by voxel51
A collection of import/export utilities

by voxel51
Utilities for working with FiftyOne database indexes
by voxel51
Utilities for managing and building FiftyOne plugins

by voxel51
Utilities for managing your delegated operations

by voxel51
Utilities for managing your custom runs

by voxel51
Call your favorite SDK utilities from the App

by voxel51
Download datasets and run inference with models from the FiftyOne Zoo, all without leaving the App

by harpreetsahota
Implementing the NVLabs C-RADIOv3 Embeddings Model as a Remotely Sourced Zoo Model for FiftyOne

by harpreetsahota
Nomic Embed Multimodal is a family of vision-language models built on Qwen2.5-VL that generates high-dimensional embeddings for both images and text in a shared vector space.

by harpreetsahota
BiModernVBert is a vision-language model built on the ModernVBert architecture that generates embeddings for both images and text in a shared 768-dimensional vector space.

by harpreetsahota
ColModernVBert is a multi-vector vision-language model built on the ModernVBert architecture that generates ColBERT-style embeddings for both images and text.

by harpreetsahota
DeepSeek-OCR is a vision-language model designed for optical character recognition with a focus on "contextual optical compression."

by harpreetsahota
olmOCR-2 is a state-of-the-art OCR model built on Qwen2.5-VL architecture that extracts text from document images with high accuracy.

by harpreetsahota
Jina Embeddings v4 is a state-of-the-art Vision Language Model that generates embeddings for both images and text in a shared vector space.

by harpreetsahota
ColQwen2.5 is a Vision Language Model based on Qwen2.5-VL-3B-Instruct that generates ColBERT-style multi-vector representations for efficient document retrieval. This version takes dynamic image resolutions (up to 768 image patches) and doesn't resize them, preserving aspect ratios for better accuracy.

by voxel51
An AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions

by harpreetsahota
This plugin connects FiftyOne datasets with Weights & Biases to enable reproducible, data-centric ML workflows.
by AdonaiVera
Load and explore the BDDOIA Safe/Unsafe Action dataset via the FiftyOne Zoo

by harpreetsahota
A plugin that intelligently displays and formats VLM (Vision Language Model) outputs and text fields. Perfect for viewing OCR results, receipt analysis, document processing, and any text-heavy computer vision workflows.

by harpreetsahota
Nanonets-OCR2 transforms documents into structured markdown with intelligent content recognition and semantic tagging, making it ideal for downstream processing by Large Language Models (LLMs).

by vlm-run
Extract structured data from visual and audio sources including documents, images, and videos
by jacobmarks
Accelerate your data labeling with Active Learning!

by harpreetsahota
Implementing UI-TARS-1.5 as a Remote Zoo Model for FiftyOne
by AdonaiVera
A comprehensive FiftyOne plugin for testing and evaluating multiple Vision-Language Models (VLMs) with dynamic prompts and built-in evaluation capabilities

by danielgural
Search through your video datasets using FiftyOne Brain and Twelve Labs!

by harpreetsahota
Implementing Microsoft's GUI Actor as a Remote Zoo Model for FiftyOne
by AdonaiVera
This plugin integrates Google Gemini's multimodal Vision models (e.g., gemini-2.5-flash) into your FiftyOne workflows. Prompt with text and one or more images; receive a text response grounded in visual inputs.
by harpreetsahota
Isaac-0.1 is the first in Perceptron AI's family of models built to be the intelligence layer for the physical world. This integration supports various computer vision tasks including object detection, classification, OCR, visual question answering, and more.

by harpreetsahota
ColPali is a Vision Language Model based on PaliGemma-3B that generates ColBERT-style multi-vector representations for efficient document retrieval.

by harpreetsahota
Moondream 3 (Preview) is a vision language model with a mixture-of-experts architecture (9B total parameters, 2B active). This model makes no compromises, delivering state-of-the-art visual reasoning while still retaining our efficient and deployment-friendly ethos.

by harpreetsahota
Implementing PaliGemma-2-Mix as a Remote Zoo Model for FiftyOne

by harpreetsahota
Integrating MiniCPM-V 4.5 as a Remote Source Zoo Model in FiftyOne

by harpreetsahota
Kosmos-2.5 excels at two core tasks: generating spatially-aware text blocks (OCR) and producing structured markdown output from images.

by jacobmarks
Create and test multimodal RAG pipelines with LlamaIndex, Milvus, and FiftyOne!

by jacobmarks
Find the images in your dataset most similar to an audio file!

by harpreetsahota
Integrating FastVLM as a Remote Source Zoo Model for FiftyOne

by harpreetsahota
Implementing NVIDIA NeMo Retriever Parse as a FiftyOne Plugin

by brimoor
Load your PDF documents into FiftyOne as per-page images
by jacobmarks
Cluster your images using embeddings with FiftyOne and scikit-learn!

by danielgural
Find the clusters in your data using some of the best algorithms available!

by harpreetsahota
Implementing Meta AI's VGGT as a FiftyOne Remote Zoo Model

by harpreetsahota
Run ViTPose Models from Hugging Face on your FiftyOne Dataset

by harpreetsahota
Moondream2 implementation as a remotely sourced zoo model for FiftyOne

by harpreetsahota
Implementing Florence2 as a Remote Zoo Model for FiftyOne

by madave94
Tackle noisy annotation! Find and analyze annotation issues in datasets with multiple annotators per image.
by jacobmarks
Run zero-shot (open vocabulary) prediction on your data!

by harpreetsahota
A FiftyOne plugin for generating synthetic samples for datasets in COCO4GUI format

by harpreetsahota
FiftyOne Remotely Sourced Zoo Model integration for Moonshot AI's Kimi-VL-A3B models enabling object detection, keypoint localization, and image classification with strong GUI and document understanding capabilities.

by harpreetsahota
Integrating ShowUI into FiftyOne as a Remote Source Zoo Model

by harpreetsahota
Implementing MiMo-VL as a Remote Zoo Model for FiftyOne

by harpreetsahota
Integrating OS-Atlas Base into FiftyOne as a Remote Source Zoo Model

by harpreetsahota
A FiftyOne Remotely Sourced Zoo Model integration for Google's SigLIP2 model enabling natural language search across images in your FiftyOne Dataset
by jacobmarks
Ask (and answer) open-ended visual questions about your images!

by harpreetsahota
Import your LeRobot format dataset into FiftyOne format

by harpreetsahota
Implementing the COCO4GUI dataset type in FiftyOne with importers and exporters

by harpreetsahota
Implementing MedGemma as a Remote Zoo Model for FiftyOne

by segmentsai
Integrate FiftyOne with the Segments.ai annotation tool!
by jacobmarks
Play YouTube videos in the FiftyOne App!

by voxel51
Track model training experiments on your FiftyOne datasets with MLflow!
by jacobmarks
Find common image quality issues in your datasets
by ehofesmann
Edit attributes of your labels directly in the FiftyOne App!

by mmoollllee
Tile your high resolution images to squares for training small object detection models
by harpreetsahota
Implementing MedSigLIP as a Remote Zoo Model for FiftyOne

by harpreetsahota
Compute embeddings for video using Facebook Hiera Models
by swheaton
Anonymize/blur images based on a FiftyOne Detections field.

by harpreetsahota
Implementing Qwen2.5-VL as a Remote Zoo Model for FiftyOne

by harpreetsahota
Implementing Llama-3.1-Nemotron-Nano-VL-8B-V1 as a Remote Zoo Model for FiftyOne

by jacobmarks
Run optical character recognition with PyTesseract!
by jacobmarks
Find the images in your dataset most similar to an image from your filesystem or the internet!

by danielgural
Import your audio datasets as spectrograms into FiftyOne!

by harpreetsahota
A FiftyOne Remotely Sourced Zoo Model integration for LlamaIndex's VDR model enabling natural language search across document images, screenshots, and charts in your datasets.

by AdonaiVera
Improve VLM training data quality with state-of-the-art dataset pruning and quality techniques
by jacobmarks
Caption all your images with state-of-the-art vision-language models!
by jacobmarks
Test out any Albumentations data augmentation transform with FiftyOne!

by jacobmarks
Find exact and approximate duplicates in your dataset!
by jacobmarks
Semantically search emojis and copy to clipboard!

by harpreetsahota
Run the Janus Pro Models from DeepSeek on your FiftyOne Dataset

by allenleetc
Compare two object detection models!

by harpreetsahota
Perform zero-shot metric monocular depth estimation using the Apple Depth Pro model

by danielgural
Find the optimal confidence threshold for your detection models automatically!

by danielgural
Find those troublesome outliers in your dataset automatically!
by jacobmarks
Add synthetic data from prompts with text-to-image models and FiftyOne!
by jacobmarks
Perform semantic search on text in your documents!
by allenleetc
Plotly-based Map Panel with adjustable marker cosmetics!
by jacobmarks
Navigate concept space with CLIP, vector search, and FiftyOne!
by jacobmarks
Find images that best interpolate between two text-based extremes!
by jacobmarks
Chat with your images using GPT-4 Vision!

by voxel51
Push FiftyOne datasets to the Hugging Face Hub, and load datasets from the Hub into FiftyOne!

by voxel51
Run inference on your datasets using Hugging Face Transformers models!

by mmoollllee
Compute datetime-related fields (sunrise, dawn, evening, weekday, ...) from your samples' filenames or creation dates
by jacobmarks
Perform keyword search on a specified field!

by danielgural
Bring images to life with image to video!
by jacobmarks
Filter on two numeric ranges simultaneously!
by ehofesmann
Filter a field of your FiftyOne dataset by one or more values.
by wayofsamu
Visualize x,y points as a line chart.
by jacobmarks
Automate data ingestion with Twilio!
Note
Community plugins are external projects maintained by their respective authors. They are not part of FiftyOne core and may change independently. Please review each plugin’s documentation and license before use.