How CaveFinder ranks the terrain
CaveFinder is a terrain-analysis instrument. It reads public LiDAR elevation data, scans for the morphological signatures of cave entrances, scores every candidate through a calibrated multi-method pipeline, and ranks the results by how cave-like the surrounding terrain looks. This page documents the components, the calibration set, and what the published metrics mean.
Calibrated against over 10,000 documented caves
The current scoring weights were tuned in the April 2026 calibration run. Each candidate the system produces is matched against a held-out set of documented cave entrances to measure how often a known cave lands in the top N candidates for the area being scanned.
What "match" means here. A candidate is counted as a match for a documented cave if it appears in the top N of the candidates returned for a scan area and its centroid is within 50 metres of the documented entrance. The 50 m matching radius is a calibration parameter chosen to be tighter than ridgewalk-scale uncertainty but loose enough to absorb GPS error in the validation records.
What these numbers do not promise. Hit rate measures how often a known cave is in the top of the list, not the false-positive rate among other candidates. CaveFinder is a ranking instrument, not a classifier — it sorts candidates so the strongest leads cluster at the top. Final confirmation still happens with boots on the ground.
Over 20 scoring components, combined deliberately
The pipeline runs 12 detection methods against the elevation surface, applies 14 scoring formulas as multiplicative modifiers, and combines the per-method scores through a calibrated fusion model. Two auxiliary context detectors contribute bonuses without producing candidates of their own.
The plain-English inventory below describes what each component looks for. Specific weights, thresholds, and code-level identifiers are not published.
What we look for
Each detection method scans the elevation surface for a different morphological signature. The methods are grouped into four families — fill-and-diff, surround-context, pattern-recognition, and frequency-domain — plus a handful of newer detectors that don't fit those families cleanly. Multiple methods agreeing on the same location is a stronger signal than any single method on its own.
How we weight what we find
A raw detection is just a yes/no signal. The scoring formulas turn each detection into a number between 0 and 100 by combining geometric properties (depth, area, shape), local context (slope, surrounding terrain, hydrology), regional context (karst region, nearby caves), and agreement between methods. Each formula is calibrated independently and then composed into the final ranking score.
Combining method scores into one ranking
After every detection method has scored every candidate, a fusion model combines the per-method scores into a single 0–100 confidence value used for ranking. The live fusion is a calibrated weighted-sum ensemble that incorporates the multi-method consensus bonus and the method-family consensus bonus. Three alternative fusion schemes were evaluated during calibration and held in reserve as A/B alternates — the live ensemble was selected because it produced the best top-N hit rate against the validation set.
Two detectors that nudge, not produce
Two auxiliary detectors run alongside the main pipeline. They don't produce candidates of their own — they contribute bonuses to candidates produced by the primary detection methods.
- Collapse-chain detector — identifies linear sequences of depressions that align along the same axis, suggesting a roof-collapse trace over a cave passage. Candidates that sit on a collapse chain receive a moderate bonus.
- Lineament detector — identifies long linear surface features (faults, fractures, joints) that often align with subsurface drainage. Candidates that sit on a lineament receive a small bonus, since karst conduits frequently develop along structural weaknesses.
What goes in
Every elevation number CaveFinder uses traces back to a public source. The analysis math is proprietary; the inputs are not.
- USGS 3D Elevation Program (3DEP) — primary source. 1-meter LiDAR coverage across most of the continental US, fetched on demand for each scan area.
- Copernicus and SRTM global DEMs — fallback for areas outside US LiDAR coverage. 30-meter satellite-derived elevation with reduced resolution.
- OpenStreetMap and Wikidata — publicly documented cave records. Used only as an overlay (the "Known Caves" map layer) and for the proximity-boost scoring formula. Never used as training data, never written to user-facing outputs as training references.
- Bedrock-geology rasters — public state and federal geological maps used to define the karst-region gating mask.
What CaveFinder is not
- Not a substitute for ground confirmation. The system ranks candidates by the strength of their cave-like signature. A high-ranked candidate is a strong lead, not a confirmed cave. Final verification always requires boots on the ground.
- Not a guarantee that every depression is a cave. Ranking is a probability surface. Even a top-10 candidate can be a sediment-filled doline with no opening, or an old quarry pit, or a tree-fall pit. The hit-rate numbers above quantify exactly how often the ranking gets it right.
- Not a tool for publishing cave coordinates. CaveFinder displays candidates and known-cave overlays inside the user's own session. It does not export, share, or aggregate cave locations across users. Documented-cave overlays come from public records (OSM/Wikidata) only — never from private cave surveys.
- Not a research dataset. The validation set used for calibration is internal and not redistributed. Researchers who want to use CaveFinder for academic work should contact zach@cavefinder.app for an academic licence and methodology supplement.
Questions about methodology
Karst researchers, NSS members, and land managers with technical questions about the pipeline, validation set construction, or calibration procedure can reach the author at zach@cavefinder.app. Reasonable methodology questions get a real reply, not a form letter.