Model Gallery

Discover and install AI models from our curated collection

10 models available
1 repositories
Documentation

Find Your Perfect Model

Filter by Model Type

Browse by Tags

depth-anything-3-base
Depth Anything 3 (base) monocular metric depth + camera pose, served via the native depth-anything.cpp backend (C++/ggml + purego, no Python at inference). Given an image it returns a dense depth map plus the recovered camera extrinsics (3x4) and intrinsics (3x3). Use GenerateImage (src -> normalized depth PNG at dst) or Predict (JSON depth stats + pose). q4_k is the recommended CPU default.

Repository: localaiLicense: apache-2.0

depth-anything-3-base-q8_0
Depth Anything 3 (base), q8_0 — near-lossless 8-bit quant (~149 MB). Same depth + camera pose output as the q4_k default at higher fidelity.

Repository: localaiLicense: apache-2.0

depth-anything-3-base-f16
Depth Anything 3 (base), f16 — half precision (~233 MB), no measurable accuracy loss vs f32. Depth + camera pose.

Repository: localaiLicense: apache-2.0

depth-anything-3-base-f32
Depth Anything 3 (base), f32 — maximum fidelity (~412 MB). Reference-parity depth + camera pose.

Repository: localaiLicense: apache-2.0

depth-anything-3-giant
Depth Anything 3 (giant / vitg), f32 — the large backbone (~4.9 GB) for maximum quality depth + camera pose. GPU recommended.

Repository: localaiLicense: apache-2.0

depth-anything-3-small
Depth Anything 3 (small / vits), f32 — the smallest backbone (~131 MB) for fast CPU depth + camera pose. Same output as base at lower latency.

Repository: localaiLicense: apache-2.0

depth-anything-3-large
Depth Anything 3 (large / vitl), f32 (~1.6 GB) — higher quality depth + camera pose than base. GPU recommended for interactive use.

Repository: localaiLicense: apache-2.0

depth-anything-3-mono-large
Depth Anything 3 (monocular large / vitl), f32 (~1.3 GB) — single-image monocular depth + a sky mask (no camera pose). DPT single-head variant; use GenerateImage (src -> normalized depth PNG) or Predict (JSON depth stats).

Repository: localaiLicense: apache-2.0

depth-anything-3-metric-large
Depth Anything 3 (metric large / vitl), f32 (~1.3 GB) — single-image metric-scale depth (meters) + a sky mask. DPT single-head metric variant; use GenerateImage (src -> normalized depth PNG) or Predict (JSON metric depth stats, is_metric=true).

Repository: localaiLicense: apache-2.0

depth-anything-3-nested
Depth Anything 3 (nested giant+large), f32 — the recommended metric model. A two-branch pipeline: the anyview GIANT (vitg) branch and a metric ViT-L branch are run and aligned to recover true metric-scale depth (meters) + scaled camera pose from a single image. Downloads both branches (~6 GB total); GPU strongly recommended. Predict returns metric depth stats + pose (is_metric=true).

Repository: localaiLicense: apache-2.0