yolo-tfjs-vision

# YOLO-TFJS-VISION [![npm version](https://img.shields.io/npm/v/yolo-tfjs-vision.svg)](https://www.npmjs.com/package/yolo-tfjs-vision) [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE) YOLO-TFJS-VISION is a **TypeScript-based YOLO object detection library** powered by [TensorFlow.js](https://www.tensorflow.org/js). It enables real-time object detection on images, videos, and live webcam streams directly in the browser. ## Features - **Easy Integration:** Simple API to quickly add object detection to your projects. - **Real-Time Detection:** Process images, videos, or live webcam feeds in real time. - **Customizable:** Configure labels, detection thresholds, and even a custom color palette. - **Lightweight & Modular:** Written in TypeScript for robust type-checking and maintainability. - **CDN-Ready:** Publish on npm and serve via CDNs like [jsDelivr](https://www.jsdelivr.com/) or [unpkg](https://unpkg.com/). - **Tested with YOLO Models:** Compatible with YOLOv8 and YOLO11. ## Live Examples Check out the following demos to see YOLO-TFJS-VISION in action: ![Image Detection](doc/room.jpg) ![Image Detection](doc/girl.jpg) - 🖼️ **Image Detection:** [Live Demo](https://yolots-examples.vercel.app/) - 🎞️ **Video Detection:** [Live Demo](https://yolots-examples.vercel.app/video.html) - 📷 **Webcam Detection:** [Live Demo](https://yolots-examples.vercel.app/webcam.html) ## Installation Install via npm: ```bash npm install yolo-tfjs-vision ``` Or load directly from a CDN (UMD build): ```html <script src="https://cdn.jsdelivr.net/npm/yolo-tfjs-vision@latest/dist/yolo.umd.js"></script> ``` **Note**: YOLO-TFJS-VISION has peer dependencies on TensorFlow.js and its WebGL backend. When using the UMD build, load these libraries before your YOLO-TFJS-VISION bundle: ```html <script src="https://cdn.jsdelivr.net/npm/@tensorflow/tfjs@4.22.0/dist/tf.min.js"></script> <script src="https://cdn.jsdelivr.net/npm/@tensorflow/tfjs-backend-webgl@4.22.0/dist/tf-backend-webgl.min.js"></script> <script src="https://cdn.jsdelivr.net/npm/yolo-tfjs-vision@latest/dist/yolo.umd.js"></script> ``` ## Initial Configuration The `setup` method in YOLO-TFJS-VISION allows for customization using the following configuration options: ```typescript export interface YOLOConfig { modelUrl: string; labels?: string[]; // Optional, defaulting to COCO categories colors?: string[]; // Optional, custom colors for label display displayLabels?: Set<string> | null; // Optional, filter specific labels to be displayed scoreThreshold?: number; boxLineWidth?: number; boxLabels?: boolean; } ``` ## API Overview YOLO-TFJS-VISION exposes a single class, YOLO, with the following primary methods: **setup(options)** Configure the model with custom settings (e.g., model URL, labels, colors, display filters, and score thresholds). ```javascript yolo.setup({ modelUrl: "model/model.json", labels: ["person", "car", "dog"], colors: ["#FF0000", "#00FF00"], displayLabels: new Set(["person", "dog"]), scoreThreshold: 0.3, boxLineWidth: 10, boxLabels: true, }); ``` **loadModel()** Loads the YOLO model from the specified URL. Returns a promise that resolves to the loaded model. ```javascript yolo.loadModel().then((model) => { console.log("Model loaded!", model) }); ``` **detect(source, model, canvasRef, callback)** Processes an image, video, or canvas element for object detection using the YOLO model. It performs the following steps: - **Preprocessing:** Resizes and pads the source element (using letterboxing) to match the model's input dimensions. - **Inference:** Runs the YOLO model on the preprocessed input to generate detection predictions. - **Postprocessing:** Applies non-max suppression, adjusts the detection coordinates back to the original image dimensions, and maps numeric class indices to human-readable labels. - **Rendering:** Draws bounding boxes (with optional labels and scores) on the provided canvas element. - **Callback:** Invokes the supplied callback with a detection object that includes: - **boxes:** A flattened array of bounding box coordinates in the format `[y1, x1, y2, x2, ...]`. - **scores:** An array of confidence scores for each detection. - **classes:** An array of numeric class indices. - **labels:** An array of human-readable label strings corresponding to the detected classes. ```javascript yolo.detect(imageElement, model, canvas, (detections) => { console.log(detections); // Example output: // { // boxes: [50, 30, 200, 180, 210, 45, 350, 300], // scores: [0.95, 0.87], // classes: [0, 3], // labels: ["person", "dog"] // } }); ``` **detectVideo(videoSource, model, canvasRef)** Continuously processes video frames for real-time detection. ```javascript yolo.detectVideo(videoElement, model, canvas); ``` --- ## Exporting a YOLO Model for TensorFlow.js If you have a trained YOLO model and want to use it with YOLO-TFJS-VISION, you need to export it in TensorFlow.js format. Here's how you can do it: ```python from ultralytics import YOLO # Load the YOLO model model = YOLO("yolo11n.pt") # Export the model to TensorFlow.js format model.export(format="tfjs") ``` ## Contributing Contributions are welcome! If you’d like to improve YOLO-TFJS-VISION, please fork the repository and submit a pull request. - Repository: https://github.com/josueggh/yolo-tfjs-vision - Issues: https://github.com/josueggh/yolo-tfjs/issues