22 documented requirements, 33 formal correctness properties, and a full analyst workflow, from raw STANAG 4609 ingestion to reviewed, exported georeferenced intelligence.
Robust local and live video playback using embedded open-source media engines; no OS-level codec packs required.
Open .ts, .mp4, .mxf files with embedded KLV metadata. Playback starts within 3 seconds.
Connect to multicast URIs like udp://@:5000. Stream connects and plays within 5 seconds.
Play, Pause, Stop, Seek, Fast-Forward (2×/4×/8×), Rewind (2×/4×/8×). Frame-accurate seeking within 1 video frame.
Video renders in a dockable panel within the host GIS; no separate window. QGIS: QgsDockWidget. ArcGIS: WPF DockPane.
from PyQt6.QtMultimedia import QMediaPlayer, QAudioOutput
from PyQt6.QtMultimediaWidgets import QVideoWidget

# Qt6 backend: WMF (Windows), GStreamer (Linux), AVFoundation (macOS)
player = QMediaPlayer()
video_widget = QVideoWidget()
audio = QAudioOutput()
player.setVideoOutput(video_widget)
player.setAudioOutput(audio)

# Frame extraction: tap the widget's own QVideoSink. Calling
# player.setVideoSink() would replace the widget as the video
# output, so connect to the existing sink's frame signal instead.
video_widget.videoSink().videoFrameChanged.connect(on_frame)  # on_frame: app callback
// LibVLCSharp plays every format VLC supports
using LibVLCSharp.Shared;
using LibVLCSharp.WPF;

Core.Initialize();                 // locate the native libvlc before use
var libVlc = new LibVLC();
var mediaPlayer = new MediaPlayer(libVlc);

// Attach to the WPF VideoView declared in the DockPane XAML
videoView.MediaPlayer = mediaPlayer;

// Open a UDP multicast stream
var media = new Media(libVlc, "udp://@:5000", FromType.FromLocation);
mediaPlayer.Play(media);
All 146 MISB ST 0601 tags extracted synchronously with video frames. The telemetry panel updates within 100 ms; seek operations resolve within 200 ms.
| Standard | Description | Key Tags | QGIS | ArcGIS |
|---|---|---|---|---|
| MISB ST 0601 | UAS Datalink Local Set (primary standard, 146 tags) | Sensor Lat/Lon/Alt, Heading, FOV, Corner Coords 82–89 | ✓ | ✓ |
| MISB ST 0102 | Security Metadata (classification & releasability) | Tag 1: Security Class, Tag 12: Country | ✓ | ✓ |
| MISB EG 0104 | Predator UAV Basic Universal Metadata Set (legacy) | Mapped to ST 0601 equivalents | ✓ | ✓ |
| MISB ST 0903 | VMTI (AI detection output encoding) | VTarget Pack: location, bbox, class, confidence | ✓ | ✓ |
// Locate the MISB ST 0601 Universal Label via a 16-byte sliding-window search:
//   06 0E 2B 34 02 0B 01 01 0E 01 03 01 01 00 00 00
// Decode the BER-encoded payload length that follows the key,
// then iterate the TLV entries into named MisbTelemetry fields.
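The BER length decoding and TLV iteration above can be sketched in a few lines of Python. This is a minimal sketch: `ber_length` and `iter_tlv` are hypothetical helper names, and BER-OID tags above 127 plus the per-tag value mappings are omitted.

```python
def ber_length(buf, i):
    """Decode a BER length at buf[i]; return (length, next_index)."""
    b = buf[i]
    if b < 0x80:                 # short form: one byte holds the length
        return b, i + 1
    n = b & 0x7F                 # long form: next n bytes hold the length
    return int.from_bytes(buf[i + 1:i + 1 + n], "big"), i + 1 + n

def iter_tlv(payload):
    """Yield (tag, value) pairs from an ST 0601 local-set payload.
    Sketch only: assumes single-byte tags (1..127)."""
    i = 0
    while i < len(payload):
        tag = payload[i]
        length, i = ber_length(payload, i + 1)
        yield tag, payload[i:i + length]
        i += length
```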
Combine a raw DJI video file with its CSV flight log to produce a STANAG 4609-compliant MPEG-TS with embedded MISB ST 0601 KLV. Linear interpolation fills timestamp gaps >1 second.
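The gap-filling interpolation can be sketched as follows. A minimal sketch, assuming the CSV log has been parsed into time-sorted `(t, lat, lon, alt)` tuples; `interpolate_position` is a hypothetical helper name, and real flight logs carry more channels.

```python
import bisect

def interpolate_position(t, samples):
    """Linearly interpolate (lat, lon, alt) at time t from sorted
    flight-log samples [(t, lat, lon, alt), ...]; clamps at the ends."""
    times = [s[0] for s in samples]
    j = bisect.bisect_left(times, t)
    if j == 0:
        return samples[0][1:]
    if j == len(samples):
        return samples[-1][1:]
    (t0, *p0), (t1, *p1) = samples[j - 1], samples[j]
    f = (t - t0) / (t1 - t0)          # fraction of the way into the gap
    return tuple(a + f * (b - a) for a, b in zip(p0, p1))
```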
Live sensor footprint, platform tracking, and bidirectional synchronization between the video timeline and the map canvas.
Dynamic filled polygon on the map canvas updated at ≥5 Hz. Computed from KLV Corner Coordinates (Tags 82–89) when present, or via ray-casting from sensor pose + DEM when absent.
Real-time platform position point with heading indicator. A cumulative flight-trajectory polyline builds over the full session. The frame-centre crosshair updates synchronously.
Map → Video: click a location in a historical footprint to seek to that timestamp. Video → Map: scrubbing the seek bar updates footprint, platform, and crosshair within 200 ms.
When MISB Corner Coordinates are present, a perspective transform matrix is computed from the four corner pixels to four WGS 84 geographic positions. Achieves <15 m CEP at 100–5,000 m AGL.
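The Map → Video lookup can be sketched as a point-in-polygon search over the session's stored footprints. A minimal sketch, assuming footprints are kept as `(timestamp_us, polygon)` records; the function names are hypothetical.

```python
def point_in_poly(pt, poly):
    """Even-odd ray-casting point-in-polygon test on (lon, lat) pairs."""
    x, y = pt
    inside = False
    for i in range(len(poly)):
        (x1, y1), (x2, y2) = poly[i], poly[(i + 1) % len(poly)]
        # Edge straddles the horizontal ray and crossing is right of pt?
        if (y1 > y) != (y2 > y) and x < (x2 - x1) * (y - y1) / (y2 - y1) + x1:
            inside = not inside
    return inside

def seek_to_click(click, footprints):
    """Return the timestamp_us of the first stored footprint polygon
    containing the clicked map point, or None if no footprint matches."""
    for ts, poly in footprints:
        if point_in_poly(click, poly):
            return ts
    return None
```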
# QGIS Python
import cv2, numpy as np
# Four corner pixels → four WGS 84 corner positions (float32, OpenCV shapes)
M = cv2.getPerspectiveTransform(np.float32(src_px), np.float32(dst_geo))
geo_pt = cv2.perspectiveTransform(np.float32([[px_pt]]), M)  # shape (1, 1, 2)
When corner coordinates are absent: a unit look vector from Sensor Lat/Lon/Alt, rotated by Heading/Pitch/Roll, is intersected with the DEM (or the WGS 84 ellipsoid). The output is converted ECEF → WGS 84.
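A minimal flat-plane sketch of that ray-cast: the shipped engine intersects a DEM or the ellipsoid in ECEF, whereas here the boresight is intersected with a horizontal ground plane in a local ENU frame, roll is ignored, and the function name is hypothetical.

```python
import math

def ground_intersect(lat, lon, alt_m, heading, pitch, roll=0.0, ground_m=0.0):
    """Intersect the sensor boresight with a horizontal plane at ground_m.
    Angles in degrees; pitch < 0 looks down.  Roll is accepted but
    ignored in this boresight-only sketch."""
    h, p = math.radians(heading), math.radians(pitch)
    d_e = math.cos(p) * math.sin(h)   # boresight unit vector, local ENU
    d_n = math.cos(p) * math.cos(h)
    d_u = math.sin(p)
    if d_u >= 0:
        return None                   # looking at or above the horizon
    t = (ground_m - alt_m) / d_u      # metres along the ray to the plane
    dlat = t * d_n / 111_320.0        # small-offset metres → degrees
    dlon = t * d_e / (111_320.0 * math.cos(math.radians(lat)))
    return lat + dlat, lon + dlon
```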
Both platforms support rendering footprint, platform position, and trajectory on a 3D scene layer alongside the 2D map canvas.
When a DEM raster layer is active in the project, the georeferencing engine applies terrain-corrected ray-casting to account for elevation relief in footprint vertex computation.
Automated frame extraction, dual-mode inference, and georeferenced detection output, all running on a background thread so the GIS UI stays responsive.
Time-based (0.1–30 fps), telemetry-triggered (FOV-change threshold), or manual ROI. Each frame is associated with its synchronous KLV packet.
YOLOv11 .onnx model. Pipeline: pre-process → letterbox to 640×640 → normalize to [0, 1] → run inference → NMS → filter by confidence threshold. Supports user-supplied custom models.
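The letterbox step can be sketched in NumPy. A minimal sketch using nearest-neighbour resize for self-containment; production code would use `cv2.resize` with bilinear interpolation, and the grey value 114 follows the common YOLO padding convention.

```python
import numpy as np

def letterbox(img, size=640, pad=114):
    """Scale the longest side to `size`, pad the rest with grey, and
    return the canvas plus the scale and offsets needed to map
    detection boxes back to original-frame pixels."""
    h, w = img.shape[:2]
    s = size / max(h, w)
    nh, nw = max(1, round(h * s)), max(1, round(w * s))
    ys = (np.arange(nh) / s).astype(int).clip(0, h - 1)  # nearest-neighbour
    xs = (np.arange(nw) / s).astype(int).clip(0, w - 1)
    resized = img[ys][:, xs]
    canvas = np.full((size, size) + img.shape[2:], pad, dtype=img.dtype)
    top, left = (size - nh) // 2, (size - nw) // 2
    canvas[top:top + nh, left:left + nw] = resized
    return canvas, s, (left, top)
```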
JPEG encode → Base64 → OpenAI Chat Completions JSON → POST. 3-attempt exponential back-off (1 s / 2 s / 4 s). Supports QWEN3-VL and Llama-3.2-Vision.
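The retry policy can be sketched as a generic wrapper. A minimal sketch: `with_backoff` is a hypothetical helper that would wrap the HTTPS POST of the Base64-encoded JPEG payload, and the `sleep` parameter exists so the schedule is testable.

```python
import time

def with_backoff(fn, attempts=3, base=1.0, sleep=time.sleep):
    """Retry fn() up to `attempts` times; the wait doubles after each
    failure (base * 2**n seconds: 1 s, 2 s, 4 s, ...)."""
    for n in range(attempts):
        try:
            return fn()
        except Exception:
            if n == attempts - 1:
                raise              # all attempts exhausted
            sleep(base * 2 ** n)
```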
AI detections are proposals, not facts. GeoFMV gives analysts full control over what enters the final intelligence product.
Cropped bounding box image, class label, confidence score, geographic coordinates
Marks the detection ACCEPTED; included in all exports
Marks the detection REJECTED; excluded from exports, but the record is kept
Update the class label; the detection is marked RECLASSIFIED with analyst attribution
Detection points on the map use visually distinct symbols based on review status, making the spatial distribution of accepted, pending, and rejected detections immediately readable.
| Field | Type | Description |
|---|---|---|
| detection_id | UUID | Unique identifier |
| class_label | String(64) | YOLO class / OpenRouter label |
| confidence | Float | 0.0–1.0 |
| timestamp_us | Int64 | Unix microseconds |
| sensor_alt_m | Float | Platform altitude at detection |
| bbox_x/y/w/h | Int | Pixel bounding box |
| geo_point | Point | WGS 84 georeferenced location |
| review_status | String(16) | PENDING / ACCEPTED / … |
| analyst_label | String(64) | Nullable |
Export filtered detection layers in standard geospatial formats, embed AI results as MISB ST 0903 VMTI metadata, and generate georeferenced frame mosaics as GeoTIFFs.
| Format | Extension | Use Case |
|---|---|---|
| GeoPackage | .gpkg | OGC standard, all attributes, portable |
| Shapefile | .shp | Legacy enterprise GIS compatibility |
| GeoJSON | .geojson | Web interchange, JavaScript tooling |
| KML | .kml | Google Earth, legacy GIS systems |
| OGC Moving Features | .mf-json | Temporal trajectory export (OGC 20-036) |
Each accepted AI detection is encoded as a MISB ST 0903 VTarget Pack and embedded into the output MPEG-TS alongside the original video and MISB ST 0601 KLV stream.
VTargetPack {
target_id: uint16 // unique
target_lat: float64 // Tag 17
target_lon: float64 // Tag 18
bbox_tl_x: uint16 // Tag 10
bbox_tl_y: uint16 // Tag 11
bbox_width: uint16 // Tag 12
bbox_height: uint16 // Tag 13
confidence: uint8 // Tag 23 (0–255)
classification: str // Tag 102
}
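Serialising a pack reduces to local-set TLV encoding, sketched below. A minimal sketch: real ST 0903 values use the standard's IMAPB/offset mappings rather than raw bytes, the example values are made up, and `tlv` is a hypothetical helper name.

```python
def tlv(tag, value):
    """Encode one local-set entry: tag byte, short-form BER length, value.
    Sketch only: assumes tag < 128 and len(value) < 128."""
    assert tag < 128 and len(value) < 128
    return bytes([tag, len(value)]) + value

# e.g. confidence (Tag 23) of 200 on the 0-255 scale,
# followed by a hypothetical bbox_tl_x (Tag 10) of 412 px:
packet = tlv(23, bytes([200])) + tlv(10, (412).to_bytes(2, "big"))
```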
Extract frames at a configurable interval, georeference each via the KLV corner coordinates, and assemble into a single GeoTIFF mosaic (nearest-neighbour blending, user-specified CRS). Each exported frame embeds MISB timestamp, sensor altitude, and platform heading as GeoTIFF metadata tags.
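Placing each georeferenced frame into the mosaic grid comes down to an offset computation. A minimal sketch assuming a north-up geographic CRS, square pixels, and `(min_lon, min_lat, max_lon, max_lat)` bounds tuples; the function name is hypothetical.

```python
def mosaic_offset(frame_bounds, mosaic_bounds, px_deg):
    """Row/column of a frame's top-left corner inside the mosaic raster.
    Nearest-neighbour blending then pastes the frame at that offset,
    letting later frames overwrite earlier ones where they overlap."""
    col = round((frame_bounds[0] - mosaic_bounds[0]) / px_deg)  # from min_lon
    row = round((mosaic_bounds[3] - frame_bounds[3]) / px_deg)  # from max_lat
    return row, col
```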
Video filters, object tracking, mosaic generation, and measurement tools, all georeferenced and integrated with the host GIS.
| Filter | Description |
|---|---|
| Greyscale | Converts the display to greyscale |
| Edge Detection | OpenCV Canny algorithm highlights structural edges |
| CLAHE | Contrast Limited Adaptive Histogram Equalisation |
| Mirror | Flips the video horizontally |
| NDVI | Normalised Difference Vegetation Index (requires a NIR channel) |
Active filters are applied to both the display output and frames sent to the AI inference engine.
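The shared filter path can be sketched as an ordered chain of callables applied identically to both consumers. A minimal sketch: `apply_filters` and the BT.601 greyscale are illustrative, with the real filters backed by OpenCV.

```python
import numpy as np

def apply_filters(frame, filters):
    """Run the same ordered filter chain on the display frame and on
    frames handed to the AI inference engine."""
    for f in filters:
        frame = f(frame)
    return frame

def greyscale(frame):
    """Luma-weighted greyscale (BT.601, BGR order), kept 3-channel."""
    g = (frame @ np.array([0.114, 0.587, 0.299])).astype(frame.dtype)
    return np.repeat(g[..., None], 3, axis=2)
```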
Select a bounding box in the video frame to initialise a tracker. The tracked position is georeferenced on each frame and appended to a track polyline on the map canvas.
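Accumulating the track polyline can be sketched as below. A minimal sketch: the tracker itself (e.g. an OpenCV CSRT tracker supplying a bbox per frame) and the `georef` pixel-to-WGS 84 callable are assumed to exist, and the class name is hypothetical.

```python
class TrackPolyline:
    """Each frame, georeference the tracker's bbox centre and append
    it to the polyline rendered on the map canvas."""
    def __init__(self, georef):
        self.georef = georef            # (px, py) -> (lon, lat)
        self.points = []

    def update(self, bbox):
        x, y, w, h = bbox               # tracker output in frame pixels
        self.points.append(self.georef((x + w / 2, y + h / 2)))
        return self.points
```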