countrycheck: which country is this point in?

An offline lookup library built on the Trifold grid. The level-10 grid (~7 km cells) classified against GADM-derived country polygons — extended with coastal waters and including X-coded territories like Kosovo and the Caspian Sea — collapses into a 323 KB dataset that names the country anywhere on Earth in microseconds, with a confidence value for every answer. Python and JavaScript give identical results. This page runs the real JS library in your browser; the dataset is embedded right in this HTML file.

Try it on the map

Interactive demo

Load sample points or your own file (CSV lon,lat or GeoJSON points), and every point is resolved to a country in your browser by the bundled library, with no server and no network call per lookup. Each dot is coloured by the country it lands in; open ocean stays grey. The lookups-per-second figure is measured tightly around the classification loop on your machine (map rendering and file parsing excluded), so it is the real library throughput. Use the 100k-random button for a stable number.

Controls
CSV: lon,lat[,name] per line (or a header naming lat/lon columns in either order). GeoJSON: any FeatureCollection of Points. Files stay on your machine.
Off: border cells use the bundled best-call with its area share as confidence. On: exact country/country and coastline borders. Watch how the counts, confidence and lookup rate change.
Click anywhere on the map to see the level-10 triangle and its country answer.
one hue per country (256 total)
no country: open ocean (confidence 1.0)
border cell (mixed)
answer changed by refinement
Loading dataset…

Click any classified point for its full answer: country code (GADM gid_0), ISO 2, name, kind, confidence, area share and cell address (computed on the fly for open-ocean points, whose cells are not stored). Switching on exact border refinement makes the source polygons authoritative in every cell a border crosses. Caveats inherited from the source data: coastal waters are an approximate distance-based assignment, not legal EEZ; disputed territories follow GADM (Crimea, Western Sahara…); lakes belong to their surrounding country, except the Caspian Sea, which is its own XCA entry.

User guide

JavaScript (browser or Node)

import { CountryCheck } from "./countrycheck.mjs";

// Node: bundled file · browser: fetch the 323 KB dataset
const cc = await CountryCheck.fromFile();                  // Node
const cc = await CountryCheck.fromUrl("countries_L10.tfcs"); // browser

cc.country(24.7536, 59.437);   // 'EST'  (lon, lat)
cc.check(-0.1276, 51.5072);
// { country: 'GBR', iso2: 'GB', name: 'United Kingdom',
//   kind: 'country', confidence: 1, share: 1,
//   cell: 'TFA95BM', refined: false }

Python (stdlib only)

from countrycheck import CountryCheck

cc = CountryCheck()                    # bundled data
cc.country(24.7536, 59.4370)           # 'EST'
cc.check(-0.1276, 51.5072)
# CountryResult(country='GBR', iso2='GB',
#   name='United Kingdom', kind='country',
#   confidence=1.0, share=1.0, cell='TFA95BM',
#   refined=False)

What the answer means

kindmeaningcountryconfidence
countrycell wholly inside one countrythat country1.0
nonecell absent from the dataset (international waters) null1.0
bordermixed cell; bundled best call decides (may be none) best callarea share
border + refineddecided by the exact source polygon exact0.99

Measured accuracy: 99.82% agreement with exact polygon containment on 30,000 uniform random points. The country and none answers were 100% correct; all residual error lives in border answers, which self-report lower confidence. With the border refinement loaded, agreement reaches 100.0% on the same sample.

Command line

$ python countrycheck/python/countrycheck.py 24.7536 59.4370
EST  iso2=EE  name='Estonia'  kind=country  confidence=1.000  share=1.0  cell=TFAVKGR  refined=False

The CLI loads the border refinement automatically when borders_L10.tfcr is present. Install with pip install countrycheck or npm install countrycheck, or run straight from a repo checkout.

Technical info

Canonical index

Any Trifold cell at level ≤ 10 maps to a contiguous range in the level-10 index space (face·410 + path): a level-l cell covers exactly 410−l consecutive indices. The whole country classification becomes run-length intervals.

TFCS format · 323 KB

222,403 runs as varint(gap), varint(len·2|border) — interior runs carry a country id, border runs a best call plus a 4-bit area share — over a country table of 256 code/iso2/name strings, all zlib-compressed. Level-agnostic.

Lookup path

Pure-float point location (no dependencies, bit-identical to the SDK) descends 10 subdivision levels, then one binary search over the run starts. ~0.6 µs in Node, ~13 µs in pure Python, ~3 µs batched with numpy.

Border refinement · TFCR

Source country polygons clipped to every border cell, quantized to a cell-local 16-bit grid (~0.1 m), one zone per country present with zigzag-varint rings and the even-odd rule. A point-in-polygon test then decides the exact country (or none) in those cells.

The data is built against GADM-derived country polygons extended with coastal waters: 256 countries and territories, 7.17M level-10 cells belonging to some country, of which 195,062 are border cells. Full documentation, the one-pass build.py and the cross-language test suite live in countrycheck/ on GitHub. Roadmap: an L12 (~1.8 km) variant and timezone detection from the same source data.

Accuracy: tested on 57,501 real airports

Ground truth is the OurAirports dump — 57,501 points, each tagged with an ISO country code from an unrelated source. countrycheck places 99.49% of them in the correct country from the bundled 323 KB data, and 99.66% with the border refinement loaded. Interior-country answers are 99.94% correct; the refinement works only on the 768 airports that fall in a border cell, and there it lifts agreement from 78.65% to 91.54%.

99.49% → 99.66%

overall agreement with airport country codes, bundled vs. border-refined. The residual is mostly disputed/border territory GADM maps differently, dependencies coded to a parent state, and offshore or placeholder coordinates.

100.000% refined

against exact SQL point-in-polygon containment over the same source polygons (100,000 random points): the refinement resolved every one of the 160 base-mode border disagreements. Base mode: 99.84%.

Reproduce with scripts/accuracy_countrycheck_airports.py (airports) and scripts/benchmark_countrycheck.py (vs. SQL containment).

Benchmark: 12–65× faster than SQL spatial engines

One workload, four engines: assign a country (gid_0) to 100,000 sphere-uniform random points against the same GADM country polygons. Median of seven warm runs, Apple M5 Pro, June 2026. The refined Trifold answers reproduce exact polygon containment (see above) while running an order of magnitude faster — true to the name, never less than a three-fold margin. Called one point at a time, the gap widens further.

Batch · 100,000 points per call

Trifold base
405,255 pts/s
Trifold + refine
383,804
PostGIS 3.6
32,329
DuckDB Spatial
5,909

Singular · one point per call

Trifold base
79,849 q/s
Trifold + refine
71,341
DuckDB Spatial
2,415
PostGIS 3.6
1,153

DuckDB 1.5.3 and PostGIS 3.6.3 compute exact containment and returned byte-identical answers; BigQuery is documented but was not run in this pass. PostGIS singular includes localhost TCP + Docker transport; DuckDB runs embedded in-process. The live figure in the demo panel is the same throughput measured on your own device. Full methodology, dataset manifest, the airport test and the BigQuery procedure: countrycheck_benchmark.md.