GPT-5.6 leaks: coming in June
- OpenAI researchers hinted that the model behind a recent major math breakthrough is already being used internally as a daily driver for debugging and technical work
- Internal testing tags iris-alpha, ember-alpha, and beacon-alpha were spotted during development, potentially pointing toward multiple GPT-5.6 variants being tested
- GPT-5.6 seems heavily focused on stronger multi-step reasoning, better agentic workflows, and improved frontend generation capabilities
- Canary testing references are already appearing in developer environments, the same quiet rollout pattern seen before GPT-5.5 launched
- Current leaks point toward two models arriving: GPT-5.6 and GPT-5.6 Pro
- GPT-5.6, Sonnet 4.8, and Gemini 3.5 Pro are all expected in June, so next month is looking like an AI festival
Google I/O leaks: Gemini desktop app is becoming a real AI agent
- The new Gemini desktop app is split into two workspaces: a normal Chat mode and a dedicated Spark mode for local agentic tasks
- Gemini Spark can connect to local folders, analyze code files, run scripts, organize files, and even sync workflows directly with Google Drive
- A new “Stream to Cursor” feature acts like Google’s Magic Pointer idea, where Gemini understands the context of whatever app or window your cursor is hovering over
- The floating overlay can instantly share screen, window, or camera context while also allowing rapid switching between models like Gemini 3 Flash and Gemini 3.1 Pro
- Google is also preparing local “Skills” support, letting users attach custom scripts or capability folders directly into the agent workflow
- Internally, Gemini Omni is referred to as “Veo4 Omni”, hinting at deep Veo 4 integration inside the desktop experience
- Gemini Live is also present as a persistent voice overlay, though it still appears to be a work in progress internally
Gemini 3.2 Flash leaks: fast and cheap seems to be the focus
- Gemini 3.2 Flash looks focused on making AI much faster and cheaper without sacrificing too much quality
- According to my sources, Google may rename it to Gemini 3.5 Flash
- It may perform close to Gemini 3.1 Pro level while keeping very low latency, with sub-200ms responses rumored for many queries
- Pricing leaks point to around $0.25 input / $2 output per 1M tokens, though honestly that still feels too cheap to fully trust right now
- Google is using stronger distillation and sparsity techniques to compress larger model capabilities into a lightweight version
- Knowledge cutoff is said to be updated to January 2026
- Google also seems focused on grounding + search reliability to reduce hallucinations in real-world workflows
- Expected around Google I/O, possibly 1-2 days before the keynote
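
To get a feel for what the rumored pricing would mean in practice, here is a rough back-of-the-envelope sketch. It only uses the leaked, unconfirmed figures ($0.25 input / $2 output per 1M tokens); the function name `estimate_cost` and the example workload are made up for illustration and are not tied to any real API or official price list.

```python
# Rumored, unconfirmed prices in USD per 1M tokens (taken from the leak, not official).
RUMORED_INPUT_PER_M = 0.25
RUMORED_OUTPUT_PER_M = 2.00


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_per_m: float = RUMORED_INPUT_PER_M,
    output_per_m: float = RUMORED_OUTPUT_PER_M,
) -> float:
    """Return the estimated USD cost of one request at per-1M-token prices."""
    input_cost = (input_tokens / 1_000_000) * input_per_m
    output_cost = (output_tokens / 1_000_000) * output_per_m
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical workload: 10,000 requests, each with 2,000 input and 500 output tokens.
    per_request = estimate_cost(2_000, 500)
    print(f"per request: ${per_request:.6f}")
    print(f"10,000 requests: ${per_request * 10_000:.2f}")
```

If the final pricing lands somewhere else, swapping the two constants is enough to redo the estimate.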