Without defining standards, “the process can be politicized,” Kreps said. That risks creating a system where “whoever holds power gets to shape how the vetting works.”
So far, neither the Biden nor the Trump administration has figured out how to avoid that, Kreps said.
Fears of government controlling AI outputs
Microsoft’s blog said that “CAISI, Microsoft and NIST will collaborate on improving methodologies for adversarial assessments,” which suggests that the plan is to develop these standards on the fly. According to Microsoft, “testing AI systems in ways that probe unexpected behaviors, misuse pathways, and failure modes” is “much like stress-testing whether airbags, seatbelts, and braking systems work effectively and reliably in safety-critical driving scenarios.”
But Gregory Falco, an assistant professor of mechanical and aerospace engineering at Cornell University and an expert in AI governance, insists that there’s a better way.
“Government oversight of AI cannot simply mean political review of model outputs, nor should it become a mechanism for deciding whether a model says favorable or unfavorable things about a president or administration,” Falco said.
Rather than relying on a politicized government wielding evaluations to control the AI systems that the public uses, the US could build “some form of independent audit,” Falco said.
Imagine, Falco suggested, how much more accountability and discipline AI firms might show if they knew their models could be audited at any point. Operating similarly to the Internal Revenue Service (IRS), a rigorous AI audit system could create “real consequences for reckless deployments,” Falco said, putting pressure on firms to ramp up their internal AI safety testing.
That seems like the “only viable path,” Falco said, since “the federal government does not currently have the in-house technical expertise, infrastructure, or day-to-day insight needed to directly evaluate these systems on its own.”

