Platform

The complete video intelligence stack.
From raw feed to structured insight.

VII is a full pipeline: ingest any video source, analyze with vision language models, detect events across time, and deliver structured intelligence via API. Deploy anywhere — cloud, on-prem, or air-gapped.

10+VLM Providers
<2sChunk Analysis
7Industry Domains
99.9%Schema Validation Rate
Core Capabilities

Every layer, purpose-built.

Each component of the pipeline is independently configurable, testable, and replaceable. No black boxes.

[INGEST]

Multi-Source Video Ingest

Connect any video source — IP cameras via RTSP/ONVIF, drone feeds, body-worn cameras, uploaded files. Automatic format detection, adaptive frame sampling from 1-10 FPS, and intelligent chunking with configurable overlap.

  • MP4 · MKV · RTSP · HLS · ONVIF
  • Adaptive 1–10 FPS sampling
  • 30s chunks with 5s overlap
[VLM]

VLM Orchestration Engine

Route video chunks to the best-suited vision language model. Automatic failover between providers. Every response validated against strict Zod schemas — no hallucination passes through.

  • Gemini · GPT-4V · Claude
  • Automatic failover chains
  • Zod schema validation on every output
[EVENT]

Temporal Event Detection

Cross-chunk event correlation detects events that span multiple segments. Deduplication ensures each event is reported once, not per-chunk. Temporal anchoring for precise timeline placement.

  • Cross-chunk correlation
  • Deduplication engine
  • Temporal anchoring (timestamp → event)
[SEARCH]

Semantic Search

Ask questions in natural language across hours of footage. "Show me every time someone enters the loading dock after 9pm." The search index is built automatically during analysis.

  • Natural language queries
  • Auto-built search index
  • Time-range filtering
[ALERT]

Configurable Alert Engine

Define alert rules per domain. "Notify if a person is detected in Zone A after hours." Alerts fire over webhooks, WebSocket, or push notifications with full event context.

  • Rule-based alert definitions
  • Webhook + WebSocket delivery
  • Full event context payload
[OUTPUT]

Structured JSON Output

Every analysis produces strongly-typed JSON. Events, timelines, summaries, objects, persons — all validated against TypeScript types and Zod schemas. No free-text guessing.

  • TypeScript-first types
  • Zod runtime validation
  • Narrative summaries + structured data
Technology
What makes it different

Not just models. A pipeline.

Anyone can call an AI model. The hard part is making it reliable, accurate, and production-ready at scale. These are the engineering decisions that make VII different.

[FUSION]

Multi-Model Cross-Validation

Vision language models understand context and meaning. Object detection models catch precise locations. VII runs both on every chunk — YOLO enriches VLM output with bounding boxes and catches objects the VLM missed. Fewer false positives. Fewer missed events.

  • VLM + YOLO cross-validation
  • Overlapping chunks prevent boundary loss
  • Cross-chunk event deduplication
[VIDEORAG]

VideoRAG: Ask Your Archive Anything

"Show me every red truck near Gate 3 this week." Not keyword search — semantic understanding over your entire video archive. Every answer cites specific timestamps and source footage.

  • Natural language queries
  • Timestamp-accurate citations
  • Scales to thousands of hours
[PRIVACY]

Privacy-First Architecture

Cross-camera person tracking uses text descriptions — not facial recognition, not biometric data. Raw video can stay inside your environment, and the platform is designed to support tightly governed deployments.

  • No biometric storage
  • Edge processing by default
  • Built for regulated environments
[RELIABLE]

Production-Grade Reliability

Every AI output is validated against strict schemas before it reaches your systems. Automatic failover between model providers. If one model is down, the next takes over without interruption.

  • Schema validation on every output
  • Multi-provider failover chains
  • 99.9% structured output rate
Deployment
Deployment

Your infrastructure, your rules.

[MANAGED]

Cloud

Fully managed by VII. Zero infrastructure to maintain. Automatic scaling, updates, and monitoring. Data encrypted at rest and in transit.

[AIR-GAP]

On-Premise

Deploy inside your network perimeter. Air-gapped mode available — video never leaves your environment. Full control over data residency and retention.

[HYBRID]

Hybrid

Process video on-site, sync intelligence to the cloud. Keep raw video local while enabling cross-site dashboards, unified alerting, and central administration.

Integrations
Integrations

Plugs into what you already use.

RTSP / ONVIFDirect IP camera feeds
AWS S3 / GCSCloud storage buckets
SlackAlert notifications
PagerDutyIncident escalation
Splunk / SIEMSecurity event forwarding
WebhooksCustom integrations

See the platform in action.

Request a walkthrough with our team. We'll run your own footage through the pipeline live.

Request Early Access