01 · The capability gap
Why a general model alone falls short on this domain.
A useful rule of thumb for where AI gets dramatically better, fast.
capability spike ≈ verifiability × training attention × data coverage × economic value
Migrant worker exploitation scores low on every factor. Outcomes are hard to verify. Training corpora rarely emphasize labour law or recruitment-fee scams. Public data is thin and scattered across regulators, NGOs, and court filings. The economic incentive to optimize a frontier model for this domain is small relative to coding or search.
Stock models, no matter how large, give plausible-sounding answers that miss the cited statute, under-estimate corridor risk, or paraphrase a recruiter euphemism instead of flagging it.
Until inherent capability arrives, the gap has to be closed by structure: deterministic grep rules that fire before the model speaks, retrieval against vetted corridor packs, tool calls that ground claims in verified sources, and a harness ecosystem that can refuse, narrow, anonymize, evaluate, or train around an answer when needed. Gemma 4 makes that structure practical: it ships with native tool-calling and supports local, edge, and on-device-oriented deployment paths.