Hardware-agnostic edge robot vision framework with world models for predictive intelligence. Runs on Jetson, Pi, Intel, Hailo & Qualcomm.
- Detect 80+ object classes in real time
- Measure distance to everything in view
- Identify and track faces
- Recognize stop, wave, and point gestures
- Detect body positions (pose estimation)
- Follow objects across frames
- Build maps and navigate autonomously
- Vision-Language-Action (VLA) policies
- Point-cloud obstacle detection
- Camera + depth + LIDAR fusion
- Multiple-camera support
- 10-30 Hz real-time control
- Robot manipulation AI
- E-STOP and velocity limits
- 24/7 operation with auto-recovery
- Model updates with rollback
- Predictive planning at 200 Hz
- Spatiotemporal perception
- Depth accuracy 35.7% better than MiDaS
- Industry templates: warehouse, QA, agriculture, retail
- Multi-device deployment
- Backends: TensorRT, OpenVINO, TVM, Hailo, QNN
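Several of these capabilities are classic per-frame vision steps. As one concrete illustration, "follow objects across frames" can be approximated by greedy IoU matching between consecutive detections — a generic sketch, not the OpenEyes tracker:

```python
def iou(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    union = area(a) + area(b) - inter
    return inter / union if union else 0.0

def match_tracks(prev_boxes, new_boxes, threshold=0.3):
    """Greedily pair previous-frame track boxes with new detections by best IoU."""
    matches, used = {}, set()
    for track_id, pbox in prev_boxes.items():
        best = max(((iou(pbox, nb), j) for j, nb in enumerate(new_boxes) if j not in used),
                   default=(0.0, None))
        if best[0] >= threshold:
            matches[track_id] = best[1]
            used.add(best[1])
    return matches

# Track 1 moved slightly between frames; it is re-associated with detection 0.
print(match_tracks({1: (0, 0, 10, 10)}, [(1, 1, 11, 11)]))  # → {1: 0}
```

Real trackers (e.g. SORT-style) add motion models and track lifecycle management on top of this matching step.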
| Configuration | Throughput | Notes |
|---|---|---|
| Detection only (TensorRT) | 35-40 | YOLO11n FP16 |
| Full pipeline (default) | 4-6 | All models enabled |
| Full pipeline + turbo | 8-12 | Aggressive frame skipping |
| Minimal | 15-20 | Detection + depth + tracking |
| World model (LeWM 15M) | 100-200 Hz | Planning only, <10ms |
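Throughput figures like these can be reproduced with a simple wall-clock loop. The sketch below times an arbitrary per-frame callable; the `infer` stub stands in for the real pipeline and is not part of OpenEyes:

```python
import time

def measure_fps(process_frame, n_frames=100):
    """Call process_frame n_frames times and return average frames per second."""
    start = time.perf_counter()
    for _ in range(n_frames):
        process_frame()
    elapsed = time.perf_counter() - start
    return n_frames / elapsed

# Stand-in inference call taking roughly 5 ms per frame.
def infer():
    time.sleep(0.005)

print(f"{measure_fps(infer, n_frames=50):.1f} FPS")
```

Averaging over many frames smooths out per-frame jitter from frame skipping and model scheduling.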
| Platform | Compute |
|---|---|
| NVIDIA Jetson | 40 TOPS, 5-15 W |
| Raspberry Pi | + AI HAT, 26 TOPS |
| Intel | 48 TOPS NPU |
| Hailo | 26 TOPS, 3.5 W |
| Qualcomm | 15-30 TOPS |

Cameras: CSI, USB, RealSense
```bash
git clone https://github.com/mandarwagh9/openeyes.git
cd openeyes
pip install -r requirements.txt
python src/main.py --debug
```
```bash
# World model with predictive tracking
python src/main.py --world-model lewm --follow

# With safety evaluation
python src/main.py --world-model lewm --safety-predict

# Industry template (warehouse)
python src/main.py --template warehouse --debug

# Turbo mode for maximum FPS
python src/main.py --turbo --world-model lewm

# Process video files
python src/main.py --video input.mp4 --output output.mp4
```
```bash
# Multi-Modal Sensing (v0.7.0)
python src/main.py --lidar --lidar-topic /scan
python src/main.py --realsense
python src/main.py --multi-camera
```
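LIDAR obstacle gating ultimately reduces to finding the nearest valid return in a scan. A library-free sketch — field names follow ROS `sensor_msgs/LaserScan` conventions, but nothing here is OpenEyes-specific:

```python
import math

def nearest_obstacle(ranges, range_min=0.05, range_max=12.0):
    """Return (distance, beam_index) of the closest valid LIDAR return, or None."""
    valid = [(r, i) for i, r in enumerate(ranges)
             if math.isfinite(r) and range_min <= r <= range_max]
    return min(valid) if valid else None

# inf = no return; 0.02 m is below range_min and treated as sensor noise.
scan = [float("inf"), 3.2, 0.8, 5.1, 0.02]
print(nearest_obstacle(scan))  # → (0.8, 2)
```

Filtering out sub-minimum and non-finite ranges first matters: raw scans routinely contain `inf`/`NaN` beams and near-zero noise hits.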
```bash
# VLA & Performance (v0.8.0)
python src/main.py --int8 --dla
python src/main.py --diffusion-policy
python src/main.py --action-chunking --control-freq 20
```
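Action chunking lets a slow policy drive a fast control loop: each inference call yields a short horizon of actions, and the loop consumes them one per control tick, querying the policy only when the queue runs dry. An illustrative sketch, not the OpenEyes implementation:

```python
import time
from collections import deque

def control_loop(policy, steps=8, control_freq=20):
    """Execute `steps` control ticks, refilling the action queue from the
    policy (one slow inference returns a chunk of actions) only when empty."""
    period = 1.0 / control_freq
    queue, executed = deque(), []
    for _ in range(steps):
        if not queue:
            queue.extend(policy())  # amortize inference over the whole chunk
        executed.append(queue.popleft())
        time.sleep(period)  # hold the fixed control rate
    return executed

# A policy emitting 4-action chunks serves 8 ticks with only 2 inference calls.
actions = control_loop(lambda: [0.1, 0.2, 0.3, 0.4], steps=8)
```

With 4-action chunks at 20 Hz control, the policy only needs to run at 5 Hz — which is how large VLA models stay inside the real-time budget.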
```bash
# Safety & Reliability (v1.0.0)
python src/main.py --safety --max-velocity 1.0 --min-distance 0.3
python src/main.py --health-monitor
python src/main.py --ota-update
```
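The `--max-velocity` and `--min-distance` limits amount to a clamp-and-gate check applied before any command reaches the robot. A minimal sketch — the function name and signature are illustrative, not the OpenEyes API:

```python
def safe_command(velocity, obstacle_distance, max_velocity=1.0, min_distance=0.3):
    """Clamp velocity to the configured limit; stop entirely when an obstacle
    is inside the minimum safe distance (E-STOP behavior)."""
    if obstacle_distance < min_distance:
        return 0.0  # keep-out zone violated: halt
    return max(-max_velocity, min(velocity, max_velocity))

print(safe_command(2.5, 1.0))  # → 1.0 (clamped to max_velocity)
print(safe_command(0.5, 0.2))  # → 0.0 (obstacle too close)
```

Putting this gate last in the command path ensures no upstream planner output, however aggressive, can exceed the configured envelope.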
| Model | Type | Size / Params |
|---|---|---|
| YOLO11n/12n/26n | Detection | 2.6-5.4MB |
| MiDaS + Depth Anything V3 | Depth | 350MB |
| MediaPipe | Face/Gesture/Pose | ~20MB |
| LeWM | World Model | 15M params |
| V-JEPA 2 | Perception | 80-600M params |
| SmolVLA | VLA | 450M params |
| OpenVLA | VLA | 7B params |