Briefing Archive
Every briefing we have published, organised by topic, company, and business relevance. Structured analysis of major AI developments for founders and operators.
Browse by Company
Browse by Business Relevance
All Briefings
AI Agents Can Now Create Accounts, Buy Services, and Deploy Code
Cloudflare and Stripe launched an open protocol on 30 April 2026 that allows AI agents to autonomously create cloud accounts, register domains, start paid subscriptions, and deploy applications to production without any human completing those steps. Initial integrations include Vercel, Supabase, Clerk, PostHog, Sentry, PlanetScale, and Inngest, with a default $100 per month spending cap per provider.
OpenAI urges all macOS users to update ChatGPT, Codex and Atlas after Axios library compromise
OpenAI issued an urgent security alert on 29 April 2026 after a compromised third-party JavaScript library, Axios, was used to push a remote access trojan into its desktop apps. All macOS users must update before 8 May 2026 or risk credential theft.
Google Cloud Next 2026: Agents Are Now the Enterprise Architecture
Google Cloud Next 2026 delivered the biggest enterprise AI announcement of the year: a unified Gemini Enterprise Agent Platform that lets organisations build, govern, and optimise AI agents in a single environment. Paired with 8th-generation TPU chips, an open Agent-to-Agent (A2A) protocol now in production at 150 organisations, and a $750 million partner fund, Google has signalled that agents are no longer a feature of its cloud platform. They are the architecture.
OpenAI Launches GPT-5.5 with Stronger Agentic and Computer-Use Capabilities
OpenAI released GPT-5.5 on April 23, 2026, with significant advances in agentic coding, computer use, and long-horizon task execution. Available to Plus, Pro, Business, and Enterprise users, it carries a 1 million-token context window and is priced at $5 per million input tokens in the API. OpenAI describes it as its smartest and most intuitive model to date.
Google Launches Workspace Studio: No-Code AI Agent Builder for Business Users
Google announced Workspace Studio on April 22, 2026, a no-code platform allowing business users to build and deploy AI agents across Gmail, Docs, Sheets, Drive, Meet, and Chat using plain-language descriptions. The launch signals that enterprise AI agent creation is moving from engineering teams to operations and business users.
Anthropic Pledges $100B to AWS as Amazon Doubles Down on Claude
Amazon has invested an additional $5 billion into Anthropic, with up to $25 billion available in the current funding round, while Anthropic has pledged to spend more than $100 billion on AWS infrastructure over the next decade. The deal will see the full Claude Platform embedded directly within AWS with integrated billing and security controls, making Claude native infrastructure for the businesses already running on Amazon's cloud. For operators, this signals that enterprise AI is consolidating inside major cloud providers rather than remaining a standalone procurement category.
Mozilla Thunderbolt Gives Businesses a Self-Hosted AI Alternative
Mozilla's for-profit subsidiary MZLA Technologies launched Thunderbolt on 16 April 2026, an open-source, self-hostable enterprise AI client designed to replace Microsoft Copilot, ChatGPT Enterprise, and Claude Enterprise for organisations that want full control over their data. Thunderbolt supports any AI model, integrates with MCP servers and the Agent Client Protocol, and includes optional end-to-end encryption with device-level access controls. It is available on GitHub now, with a managed hosted version for smaller teams currently accepting signups.
PwC: 74% of AI's Economic Value Goes to Just 20% of Firms
PwC's 2026 AI Performance Study, drawing on surveys of 1,217 senior executives across 25 sectors worldwide, finds that 74% of AI's financial gains are captured by just 20% of companies. The leading firms generate 7.2 times more AI-driven revenue and efficiency gains than the average competitor. The differentiating factor is not technology access but strategic intent: leaders use AI to reinvent how they generate revenue, not merely to reduce costs.
Anthropic Releases Claude Opus 4.7 with Stronger Agent and Vision Capabilities
Anthropic released Claude Opus 4.7 on April 16, 2026, its most capable commercial model to date. The release delivers significant gains in software engineering, vision, and long-running agent workflows at unchanged pricing of $5 per million input tokens and $25 per million output tokens. It is positioned just below the restricted Mythos Preview model.
Stanford AI Index 2026: Agent Task Success Rate Jumps from 20% to 77% in One Year
The 2026 Stanford AI Index Report reveals that AI agent task completion rates on real-world benchmarks improved from 20% in 2025 to 77.3% in 2026. Generative AI reached 53% population adoption within three years, faster than the personal computer or the internet. As of March 2026, Anthropic's top model leads the frontier by just 2.7%.
Google AI Mode Cutting Organic Traffic as Users Get Answers Without Clicking
Google's AI Mode is changing what happens after someone searches, with many users getting what they need without ever clicking through to a website. Most brands have not adjusted their SEO strategy to account for this shift. Early data suggests significant drops in organic click-through rates for informational queries.
Google Integrates NotebookLM Into Gemini, Creating a Unified AI Research Layer
Google has fully integrated NotebookLM into the Gemini app, allowing users to create research notebooks directly inside the chatbot. Users can upload PDFs, documents, website URLs, YouTube videos, and text, with notebooks syncing across both apps. This merges Google's conversational AI and structured research tools into a single knowledge layer for enterprise teams.
Agentic AI Prompt Injection Confirmed as Primary Enterprise Security Threat
Security researchers have confirmed that prompt injection via malicious instructions embedded in GitHub issues, documentation, and email is the leading attack vector against AI agents. In some enterprise environments, machine-to-machine interactions now outnumber human logins 100-to-1, creating a largely ungoverned attack surface.
DeepSeek V4 Achieves Near-Frontier Performance at $5.2M Training Cost
DeepSeek released V4, a one-trillion-parameter Mixture-of-Experts open-weights model achieving near-frontier performance for an estimated $5.2 million training cost. At $0.28 per million input tokens versus $2+ for Western flagships, it is reshaping cost assumptions for enterprise AI procurement.
Anthropic Withholds Mythos From Public Over Cyberattack Risk
Anthropic has officially launched Project Glasswing, a tightly controlled release programme for its most powerful model, Claude Mythos Preview. The model, capable of finding tens of thousands of zero-day vulnerabilities and exploiting them autonomously, is being restricted to approximately 40 vetted organisations for defensive security work only. Anthropic describes it as the first AI model capable of bringing down a Fortune 100 company or penetrating critical national defence systems.
OpenAI GPT-5.4 Fully Deployed Across All Surfaces With Native Computer-Use
GPT-5.4 is now fully deployed across ChatGPT, Codex, and the OpenAI API, completing a rollout that began in March. The model introduces native computer-use capabilities, enabling agents to interact directly with desktop applications and browsers without custom integrations.
Shopify Launches AI Toolkit, Letting Coding Agents Run Your Store
Shopify released a free, open-source AI Toolkit that connects coding agents like Claude Code, OpenAI Codex, Cursor, and Gemini CLI directly to the Shopify platform. Merchants can now manage products, inventory, and store operations in plain English without logging into the dashboard. The toolkit provides live API schema validation and real-time store execution through MCP servers.
Meta Launches Muse Spark, Its First Proprietary Model From Superintelligence Labs
Meta released Muse Spark, the first model from its new Superintelligence Labs, marking a sharp pivot from open-source Llama to proprietary AI. The multimodal reasoning model uses 'thought compression' to achieve frontier performance at a fraction of the compute cost, processing text and images natively. Meta AI app downloads jumped 87% on launch day.
70% of Organisations Have AI-Generated Code Vulnerabilities in Production
A new industry report reveals that 70.4% of organisations have confirmed or suspected security vulnerabilities in production systems introduced by AI-generated code. Despite this, 92% express confidence in their detection capabilities, revealing a dangerous confidence gap. Service principals and autonomous agents now outnumber human users 100-to-1 in enterprise environments, creating a largely ungoverned attack surface.
OpenAI, Anthropic, and Google Unite to Fight Chinese Model Distillation
OpenAI, Anthropic, and Google announced a joint intelligence-sharing operation through the Frontier Model Forum to detect and counter adversarial distillation attacks from Chinese AI labs. Anthropic reported that DeepSeek, Moonshot AI, and MiniMax collectively generated over 16 million exchanges with Claude via roughly 24,000 fraudulent accounts. This is the first time the Forum has been activated as an active threat-intelligence operation.
Anthropic Leaks Claude Code Source via npm Packaging Error
On 31 March 2026, Anthropic accidentally exposed the full source code of Claude Code through a 59.8 MB source map file bundled in npm package version 2.1.88. The leak revealed 513,000 lines of unobfuscated TypeScript across 1,906 files, including 44 unreleased feature flags and the complete agent orchestration logic. Within hours, the code was mirrored to GitHub and forked tens of thousands of times.
Microsoft Ships Three Enterprise AI Models Through Foundry
Microsoft launched MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 on 3 April 2026 through Microsoft Foundry. The three models cover speech-to-text, voice generation, and image creation at commercially competitive pricing, and are available immediately to enterprise developers. All three already power Microsoft's own products including Copilot, Bing, and Azure Speech.
OpenAI Closes $122B Round as Enterprise Tops 40% of Revenue
OpenAI closed a record $122 billion funding round on 31 March 2026 at an $852 billion valuation, with Amazon committing $50 billion and Nvidia and SoftBank each contributing $30 billion. Enterprise customers now account for more than 40% of OpenAI's $2 billion monthly revenue, and the company's APIs process over 15 billion tokens per minute. The round signals that OpenAI is cementing its position as the foundational AI infrastructure layer for business, not merely a consumer chatbot.
AI Agent-Level Exploits Emerge as Top Enterprise Security Threat
Security researchers are flagging agent-level exploits as one of the fastest-growing attack vectors of 2026, as enterprises roll out agentic AI systems with write access to databases, APIs, and financial systems. Legacy security platforms cannot address AI-to-AI interaction monitoring, creating a new class of tooling requirement.
Google Launches Gemini 3.1 Flash-Lite at $0.25 Per Million Tokens
Google has released Gemini 3.1 Flash-Lite, its most cost-efficient AI model to date, priced at $0.25 per million input tokens, one-eighth the cost of Gemini 3.1 Pro. The model delivers 2.5 times faster responses and 45% higher output speeds than its predecessor, while supporting a one-million-token context window and multimodal inputs including text, images, audio, video, and PDFs. For operators running high-volume AI workflows, the pricing shift opens use cases that were previously too expensive to sustain.
Microsoft Releases Open-Source Agent Governance Toolkit Addressing All 10 OWASP Agentic AI Risks
Microsoft released the Agent Governance Toolkit on April 2, 2026, a free seven-package open-source system providing runtime security governance for autonomous AI agents. It covers all 10 OWASP agentic AI risks with deterministic, sub-millisecond policy enforcement and integrates directly with LangChain, CrewAI, Google ADK, and Microsoft Agent Framework without requiring code rewrites.
OpenAI's GPT-5.4 Surpasses Humans at Autonomous Desktop Tasks
OpenAI launched GPT-5.4 on 5 March 2026, the company's first general-purpose model with native computer-use capabilities. The model scored 75% on the OSWorld-V benchmark, outperforming the human baseline of 72.4%, and 83% on the GDPVal benchmark for economically valuable knowledge work. It marks the clearest shift yet from AI as a conversational tool to AI as an autonomous digital coworker capable of executing multi-step tasks across software environments.
Anthropic Mythos Leaked: A Step-Change Model Above Opus
A misconfigured content management system exposed internal Anthropic documents on 27 March 2026, revealing a new model called Claude Mythos, described as a step change above the existing Opus tier. The leaked draft blog warns that Mythos poses unprecedented cybersecurity risks and is far ahead of any other AI model in cyber capabilities. Anthropic has confirmed the model exists and is restricting early access to cyber defence organisations while it improves efficiency before a general release.
GPT-5.4 Turns ChatGPT into an Autonomous Digital Coworker
OpenAI released GPT-5.4 and GPT-5.4 Pro across ChatGPT, the API, and Codex on 17 March 2026. The model features a 1-million-token context window and can autonomously execute multi-step workflows across documents, spreadsheets, and software environments. A new Skills feature lets teams build and share reusable automations, marking a practical shift from AI as a chat assistant to AI as an autonomous digital coworker.
Tech Sector Cuts 59,000 Jobs in 2026, AI Agents Cited
The global tech sector has eliminated nearly 60,000 jobs since January 2026, with Amazon leading at 16,000 cuts and a reported second wave of 14,000 more in preparation. Amazon CEO Andy Jassy explicitly cited AI agents as a driver of reduced workforce needs, stating that billions of agents are coming fast. AI was formally cited in over 12,000 US job cuts in the first two months of the year alone.
MCP Hits 97 Million Installs and Becomes the AI Standard
The Model Context Protocol reached 97 million installs in March 2026, with every major AI provider now shipping MCP-compatible tooling. MCP has become the foundational standard for connecting AI agents to external tools, databases, and APIs. Operators building AI workflows on proprietary integration approaches are creating technical debt that will be expensive to unwind.
GitHub Copilot Will Train on Your Code from April 24
GitHub has announced that from April 24, 2026, interaction data from Copilot Free, Pro, and Pro+ users will be used to train AI models by default. The data collected includes code snippets, accepted outputs, repository structure, and chat interactions. Users must actively opt out via Privacy settings before the deadline.
Microsoft Copilot Cowork Launches as Enterprise AI Agent for Files and Workflows
Microsoft launched Copilot Cowork, an enterprise AI agent designed to read, analyse, and manipulate files across an organisation. Built on Anthropic technology, it automatically selects the best AI model for each task and is targeted at business teams managing complex document and workflow operations.
NVIDIA Agent Toolkit Puts AI Agents Inside Your Business Software
NVIDIA launched the Agent Toolkit at GTC 2026, an open source platform for deploying autonomous AI agents across enterprise software. More than 20 platform partners including Salesforce, SAP, ServiceNow, Adobe, and Cisco committed to building on the shared foundation. For operators already running these platforms, agentic AI capabilities are about to become native to tools they already pay for.
Gemini 3.1 Flash-Lite Makes Powerful AI 8x Cheaper to Run
Google launched Gemini 3.1 Flash-Lite on 3 March 2026, pricing it at $0.25 per million input tokens, one-eighth the cost of Gemini 3.1 Pro. The model is 2.5 times faster than its predecessor and outperforms rival efficiency models from OpenAI and Anthropic across most benchmarks. For operators building or buying AI-powered tools, the cost of running capable AI at scale has dropped significantly.
HiddenLayer: 1 in 8 Companies Reporting AI Breaches Linked to Agentic Systems
HiddenLayer has released its 2026 AI Threat Landscape Report, finding that 1 in 8 companies have experienced AI breaches tied to agentic systems. 73% of organisations report internal conflict over who owns AI security, and 31% do not know if they have been breached.
U.S. AI Accountability Act Requires Mandatory Bias Audits
The U.S. AI Accountability Act has passed, requiring companies that use AI in hiring, lending, healthcare, and criminal justice to conduct and publish regular bias audits. This ends the era of voluntary self-regulation and introduces binding compliance obligations for any organisation using AI in high-stakes decision-making.
Anthropic Launches Enterprise Marketplace for Claude with Zero Commission
Anthropic opened an enterprise marketplace allowing businesses to purchase third-party Claude-powered applications against existing spend commitments, with launch partners including Snowflake, Harvey, and Replit. Anthropic is taking no commission at launch, making it a low-friction entry point for enterprise procurement. Claude Opus 4.6 and Sonnet 4.6 also launched with 1 million token context windows in beta.
Meta's Llama 4 Brings Frontier AI to Self-Hosted Deployments
Meta's Llama 4 family delivers frontier-class AI capability at roughly one-ninth the per-token cost of GPT-4o, with full self-hosting support for organisations that cannot send data to third-party cloud providers. Scout and Maverick are available across AWS, Azure, and Snowflake, with dedicated deployment guides for regulated industries including finance, healthcare, and defence.
Snowflake Launches Agentic AI That Executes Work on Your Data
Snowflake announced Project SnowWork on 18 March 2026, a new agentic AI platform that autonomously completes multi-step business workflows from plain-language prompts. Built on a company's own governed data, it handles tasks like pulling figures, building analysis, generating deliverables, and drafting follow-up communications without human hand-holding. The platform enters research preview with a limited set of customers and no disclosed pricing.
McKinsey Now Runs 25,000 AI Agents Alongside Its Staff
McKinsey CEO Bob Sternfels has confirmed the firm operates 25,000 AI agents working alongside its 40,000 human employees, growing from just 3,000 agents 18 months ago. The deployment has saved 1.5 million hours of work in a single year and prompted McKinsey to introduce an AI collaboration test as a formal stage in its graduate hiring process. The announcement signals that agentic AI has moved from competitive advantage to operational standard at the world's largest management consultancy.
US AI Accountability Act Passes, Mandating Bias Audits for Consequential AI
The US AI Accountability Act passed in March 2026, requiring companies deploying AI in hiring, lending, healthcare, and criminal justice to conduct and publish regular bias audits. It ends years of voluntary self-regulation and creates binding obligations for any organisation using AI in decisions that affect individuals.
GPT-5.4 Beats the Human Baseline on Real Desktop Work
OpenAI's GPT-5.4 has become the first general-purpose AI model to score above the human baseline on OSWorld-V, a benchmark that simulates real desktop productivity tasks. Released on 5 March 2026, the model introduces native computer-use capabilities, a 1-million-token context window, and autonomous multi-step workflow execution across software environments. It is available through ChatGPT, the API, and Codex, with enterprise-grade security controls for business accounts.
Cisco and NVIDIA Bring Secure AI to the Enterprise Edge
Cisco announced a major expansion of its Secure AI Factory with NVIDIA at GTC 2026 on 17 March, extending AI deployment capabilities from central data centres to edge locations including warehouses, hospitals, and vehicles. The platform compresses enterprise AI deployment timelines from months to weeks, with zero-trust security and agent-level guardrails built in from the start. AT&T is the first service provider to bring these capabilities to market.
Perplexity's 'Computer' Agent Targets Enterprise Workflows
Perplexity has launched its multi-model AI agent, Computer, for enterprise customers, positioning itself as a direct competitor to Microsoft Copilot and Salesforce. The platform orchestrates 20 frontier AI models inside an isolated cloud environment to execute complex, multi-step workflows autonomously. The enterprise launch adds SOC 2 compliance, SAML single sign-on, native Slack integration, and connectors for Snowflake, Salesforce, and HubSpot.
NVIDIA GTC 2026: NemoClaw Brings Enterprise AI Agents to Every Business
NVIDIA launched NemoClaw at GTC 2026 today, an open-source platform that lets businesses deploy AI agents without proprietary lock-in. Paired with the Vera Rubin chip platform, which delivers up to 10 times cheaper AI inference than its predecessor, NVIDIA has made a clear push to become the foundational layer for the agentic AI era. For operators, this means the infrastructure for autonomous AI workflows is becoming faster, cheaper, and more accessible.
Anthropic Launches a Marketplace to Simplify Enterprise AI Buying
Anthropic launched the Claude Marketplace on 6 March 2026, allowing enterprise customers to apply existing Claude API spending commitments toward third-party applications built on Claude. Launch partners include Snowflake, GitLab, Harvey, Replit, and Lovable Labs. Anthropic is taking no commission at launch, positioning itself as an enterprise procurement layer rather than just a model provider.
Perplexity's Computer Agent Enters Enterprise at $200 Per Month
Perplexity has launched Computer for Enterprise, making its multi-model AI agent available to business customers at $200 per month. The platform connects natively to Snowflake, Salesforce, HubSpot, and Slack, and an internal study claims it saved the equivalent of 3.2 years of work in just four weeks. The launch places a $20 billion AI startup in direct competition with Microsoft and Salesforce for enterprise software budgets.
Microsoft Launches Copilot Cowork: AI Agent That Operates Files on Employee Computers
Microsoft entered the AI coworker category with Copilot Cowork, an enterprise agent that reads, analyses, and manipulates files directly on employee computers. Built using both Anthropic and OpenAI models, it selects the best model per task. For businesses already in the Microsoft 365 ecosystem, this offers a direct path to file-level automation without additional third-party tools.
GPT-5.4 Can Now Control Your Computer Autonomously
OpenAI released GPT-5.4 on 5 March 2026, the first general-use AI model with native computer-use capabilities. The model surpasses the human benchmark for real-world computer tasks and embeds directly into Excel and Google Sheets, bringing autonomous workflow execution to everyday business tools.
GPT-5.4 Launches with Native Computer Use and 1M Token Context
OpenAI launched GPT-5.4 on 5 March 2026, its most capable general-purpose frontier model to date. The release combines native computer-use capabilities with a 1-million-token context window and 33% fewer factual errors than its predecessor, and is available immediately to API developers and ChatGPT paid subscribers.
Microsoft Copilot Cowork Turns Requests into Automated Workflows
Microsoft introduced Copilot Cowork on 9 March 2026, an AI execution layer inside Microsoft 365 that converts plain-language requests into multi-step automated task plans. Grounded in a team's real Outlook, Teams, Excel, and Files data, it runs tasks in the background and waits for approval at checkpoints before applying changes. The feature launches in limited Research Preview now, with broader access and a new $99 per user per month Microsoft 365 E7 plan from May 2026.
Enterprise Connect 2026 Opens with Agentic AI as the Headline Theme
Enterprise Connect 2026 has opened in Las Vegas with agentic AI dominating the agenda. Amazon, Zoom, RingCentral, Dialpad, and Genesys are all launching autonomous agent platforms, marking the shift from pilot projects to production deployments. The focus has moved from what AI agents can do to how organisations govern, measure, and scale them.
Anthropic Launches Claude Agent SDK for Production Deployments
Anthropic has released its official Claude Agent SDK, providing a standardised framework for building, testing, and deploying autonomous AI agents in enterprise environments. The SDK includes built-in tool orchestration, memory management, and safety guardrails designed for production workloads.
Google Gemini in Workspace Now Generates Documents From Email, Chat, and Files
Google updated Gemini in Workspace to generate complete documents, spreadsheets, and presentations by pulling from a company's emails, chats, and Drive files. This transforms Google Drive into an active AI knowledge base capable of producing finished deliverables from existing organisational context.
Each briefing includes structured summaries, sources, and operational analysis to help founders and operators understand the implications of major AI developments.
Looking for shorter updates? Browse AI Signals for fast, structured coverage of the latest AI developments.
Evaluating AI consulting options? Compare AI consulting approaches to see how different models fit different organisations.
Want to discuss what you have read?
Every briefing connects back to real systems we build. If something resonates, let us show you what it looks like in practice.
Book a Strategy Call