GSA and NIST Partner to Boost AI Evaluation Science in Federal Procurement
WASHINGTON – Today, the U.S. General Services Administration and the National Institute of Standards and Technology’s Center for AI Standards and Innovation announced a joint effort to strengthen how the federal government evaluates artificial intelligence models and services.
The partnership will support USAi, GSA’s secure AI platform for the federal workforce, by developing clearer, more consistent ways to test and measure AI systems before agencies use them in real-world settings. This work supports the White House’s America’s AI Action Plan, which calls for stronger AI evaluation practices and expanded AI adoption across government.
The goal is simple: help agencies adopt AI faster, more securely, and with greater confidence.
“This partnership gives agencies the tools they need to adopt AI with confidence and advances President Trump’s vision for U.S. leadership in the global AI race,” said GSA Administrator Edward C. Forst. “By combining GSA’s government-wide reach with NIST’s AI evaluation expertise, we’re strengthening how the federal government deploys AI.”
Through the collaboration, CAISI will provide tooling and methodological guidance to help GSA evaluate advanced AI models, select and interpret benchmarks, and conduct hands-on testing within real federal workflows. GSA and NIST will also create practical resources, including clear evaluation guidelines and checklists, that other agencies can use to assess AI tools for their own missions.
“We’re at a pivotal time in the AI revolution and this partnership between CAISI and GSA will enable federal agencies to adopt AI in ways that help the American people,” said Deputy Under Secretary of Commerce for Standards and Technology and Acting NIST Director Craig Burkhardt. “We are excited to help advance AI in support of the Trump Administration’s vision for innovation.”
By working together, GSA and CAISI will help agencies avoid duplicating efforts, lower costs, and move more quickly from experimentation to full deployment. The collaboration reflects both agencies’ shared commitment to transparency, strong standards, and building trust in emerging technologies.
U.S. General Services Administration