MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
-
Updated
Nov 5, 2025 - Python
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
A Rust library integrated with ONNXRuntime, providing a collection of Computer Vison and Vision-Language models such as YOLO, FastVLM, and more.
Unofficial repository for building Florence-2 in Microsoft Azure
Florence-2 for object detection in Python
This project applies a modular generative AI pipeline to perform virtual staging on empty room images. It synthesizes realistic, high-quality interior furnishings while rigorously preserving the original room’s geometry, structure, and spatial consistency.
This repository provides a powerful AI-driven solution for removing objects from videos using text prompts. By integrating SAM2, Florence2, and ProPainter, the model enables precise and seamless object removal. Simply describe the objects to remove (e.g., "man, car, cap, basket"), and the AI will handle the rest with high accuracy.
Sample: Object Detection over a Video Stream using Microsoft's Florence-2 Model
Add a description, image, and links to the florence2 topic page so that developers can more easily learn about it.
To associate your repository with the florence2 topic, visit your repo's landing page and select "manage topics."