The White House press secretary said government data that DOGE has collected isn’t being used to train Musk’s AI models, despite Elon Musk’s control over DOGE. However, evidence has emerged that DOGE personnel simultaneously hold positions with at least one of Musk’s companies.
At the Federal Aviation Administration, SpaceX employees have government email addresses. This dual employment creates a conduit for federal data to potentially be siphoned to Musk-owned enterprises, including xAI. The company’s latest Grok AI chatbot model conspicuously refuses to give a clear denial about using such data.
As a political scientist and technologist who is intimately acquainted with public sources of government data, I believe this potential transmission of government data to private companies presents far greater privacy and power implications than most reporting identifies. A private entity with the capacity to develop artificial intelligence technologies could use government data to leapfrog its competitors and wield massive influence over society.
For AI developers, government databases represent something akin to finding the Holy Grail. While companies such as OpenAI, Google and xAI currently rely on information scraped from the public internet, nonpublic government repositories offer something much more valuable: verified records of actual human behavior across entire populations.
This isn’t merely more data – it’s fundamentally different data. Social media posts and web browsing histories show curated or intended behaviors, but government databases capture real decisions and their consequences. For example, Medicare records reveal health care choices and outcomes. IRS and Treasury data reveal financial decisions and long-term impacts. And federal employment and education statistics reveal education paths and career trajectories.
What makes this data particularly valuable for AI training is its longitudinal nature and reliability. Unlike the disordered information available online, government records follow standardized protocols, undergo regular audits and must meet legal requirements for accuracy.
Most critically, government databases track entire populations over time, not just digitally active users.
Treasury data represents perhaps the most valuable prize. Government financial databases contain granular details about how money flows through the economy. This includes real-time transaction data across federal payment systems, complete records of tax payments and refunds, detailed patterns of benefit distributions, and government contractor payments with performance metrics.
Lambert here: Would Willie Sutton please pick up the white courtesy phone?
An AI company with access to this data could develop extraordinary capabilities for economic forecasting and market prediction. It could model the cascading effects of regulatory changes, predict economic vulnerabilities before they become crises, and optimize investment strategies with precision impossible through traditional methods.
Lambert here: It’s a good thing our overlords are benevolent.
The threat of a private company accessing government data transcends individual privacy concerns. Even with personal identifiers removed, an AI system that analyzes patterns across millions of government records could enable surprising capabilities for making predictions and influencing behavior at the population level. The threat is AI systems that leverage government data to influence society, including electoral outcomes.
Since information is power, concentrating unprecedented data in the hands of a private entity with an explicit political agenda represents a profound challenge to the republic. I believe that the question is whether the American people can stand up to the potentially democracy-shattering corruption such a concentration would enable. If not, Americans should prepare to become digital subjects rather than human citizens.

Add new comment