Open provenance infrastructure for the machine-readable web. Every entity found, measured, structured, and made traversable — according to a fixed set of governing principles.
The Global Data Registry is the open provenance substrate of the AI era. Every domain the crawler discovers enters the intake pipeline and emerges as a structured, machine-readable entity record — UUID, timestamp, source URL, content hash. Provenance on everything. Inference fills nothing.
The registry operates continuously. Layer 1 profiles are generated automatically and made publicly accessible. Every record is citable, traversable, and permanent. The substrate exists independent of any single intelligence querying it.
The pipeline runs on a fixed set of constitutional principles derived from research and operational practice. Those principles govern every record the system produces.
The registry operates according to a constitutional architecture — a set of governing laws derived from research into provenance, semantic structure, and machine-readable graph design. The constitution is not policy. It is the structural logic the pipeline enforces at every layer.