Gemini 3.5 Flash is out, and it's a major jump over Gemini 3 Flash in model capability for knowledge work. We've been evaluating it on our Box AI Complex Work Eval in early release, and the model delivers a 12 percentage point jump on complex document tasks.
For testing this model, we give the Box AI Agent (using Gemini 3.5) complex problems to solve that represent common but difficult knowledge worker tasks in banking, consulting, public sector, healthcare, and other industries. These tasks can be things like drafting reports, doing due diligence, and more, given a set of relevant documents.
In our tests, Gemini 3.5 Flash delivered jumps across every industry, including:
* Financial services: 81% vs 73% (+8pp)
* Public sector: 76% vs 59%, (+17pp)
* Healthcare: 73% vs 51%, (+22pp)
* Life Sciences: 67% vs 47%, (+20pp)
Incredible to see the continued performance gains.
Gemini 3.5 Flash will be available soon in Box AI Studio and through the Box API. The Box MCP Server will soon be available in the Gemini app with more details to come.