At its big developer conference, Microsoft showed seven home-grown AI models, a digital assistant called Scout that works in the background instead of waiting for you to ask, and a new quantum chip it says is 1,000 times more reliable than last year's. The quiet headline: most of this is Microsoft reducing its dependence on OpenAI, the company it's spent billions partnering with. One of the new models is already the default helper inside Microsoft's popular code editor, so millions of developers are using it as of today.
Two grains of salt. Almost every "it's the best" number came from Microsoft's own testing, not an outside referee. And an assistant that's always on, with the keys to your email and files, is convenient right up until you think about everything it can quietly see.
The throughline of Build this year is independence. Microsoft put out seven in-house MAI models, its first proactive agent, a second-gen quantum chip, an agentic OS that isn't Windows, and an isolation layer that makes agent containment an OS primitive. Most of it points away from OpenAI.
MAI-Thinking-1 is the one that matters: a sparse mixture-of-experts reasoning model, 35B active parameters out of roughly a trillion, 256K context, and the headline claim is that it was trained with no distillation from any third-party model and no AI-generated pre-training data. Reported scores: 97.0% AIME 2025, and 52.8% on SWE-Bench Pro, which ties Claude Opus 4.6. Microsoft also says blind human evals preferred it over Sonnet 4.6.
Read those numbers cold. The preference test was run by Surge, Microsoft's own data vendor, and the benchmarks are self-reported with no independent replication. The model is private-preview only. None of the seven release open weights. The genuinely shipping piece is MAI-Code-1-Flash, a 5B model now the default in VS Code and Copilot, which puts a Microsoft model in front of millions of developers today. The rest of the family covers image, transcription (43 languages), and voice.
Scout is Microsoft's first "Autopilot" agent: it runs in the background across Teams, Outlook, OneDrive, and SharePoint rather than waiting for a prompt. The two technically interesting decisions are that it's built on the open-source OpenClaw stack, not OpenAI, and that every agent gets its own Entra identity with scoped, log-redacted credentials, so an auditor can actually trace what an agent did. It's private preview, gated behind a Copilot license and Intune policy, with human sign-off required for sensitive actions.
What's real: a credible, independent model stack and an enterprise agent-identity model auditors can use. What's marketing: every benchmark here is Microsoft's own, and "always-on agent with access to your mail and files" is a data-access footprint nobody has fully priced yet.
OpenAI's Codex began as a tool for programmers. Now it's being pitched to everyone else, with ready-made packs for jobs like sales and finance, and a feature called Sites that turns a plain description into a working website you can share with a link, no setup. OpenAI says 5 million people use it weekly and that non-coders are now its fastest-growing group. If true, it's muscling in on the easy app-builder tools normal office workers already use. Worth noting the user numbers are OpenAI's own.
Codex started as the developer product. The June 2 update is OpenAI's bid to make it a general work surface that sits on top of Salesforce, Snowflake, and Figma instead of replacing them. Three pieces:
The number OpenAI is leaning on: 5M+ weekly users, up more than 6x since the February desktop launch, with non-developers now roughly 20% of users and growing 3x faster than the developer base. If that growth rate is real, Codex is pulling in a new category without cannibalizing the old one, and Sites in particular points it straight at the no-code tier (Notion, Airtable, Replit).
Caveats worth keeping: the exact plugin lineup is still fuzzy, so treat any canonical list of six as soft, and the 5M-users and growth figures are OpenAI's own. And "plugins" aren't new to Codex; this is packaging and positioning as much as new capability.
At Computex, the chip giant rolled out a pile of gear. A desk-sized "supercomputer" that can run enormous AI models without the cloud. A free, downloadable AI model that's the strongest American open model out there (though a Chinese one still edges it). And, most notably, Nvidia's first chip built for regular laptops, aimed squarely at Apple's home-grown chips. The theme: Nvidia wants AI running on your own machine, not just rented from a data center. None of it has shipped or been independently tested yet, so the speed claims are promises.
Four launches off one keynote, and they cohere into a single message: Nvidia wants to own local compute from the laptop to the enterprise desk, and it's putting open weights underneath the robotics stack.
A 550B-total, 55B-active open-weight model (90% sparsity), 1M context, and the highest-scoring US open-weight model on the Artificial Analysis Intelligence Index at launch (48) — though that still trails China's Kimi K2.6 at 54, so "leading US open model" is the honest framing, not "leading model." The architecture is the interesting part: a hybrid Mamba-Transformer MoE with a LatentMoE routing trick (compress tokens to a low-rank space, fit 4x more experts at the same cost) and multi-token prediction. It serves 300+ tokens/sec, which is where the cost story lives.
| DGX Station | RTX Spark | |
|---|---|---|
| Chip | GB300 Grace Blackwell Ultra | 20-core Grace + Blackwell GPU |
| AI compute | up to 20 petaflops (FP4) | up to 1 petaflop |
| Memory | up to 748 GB coherent | 128 GB unified |
| Runs locally | trillion-param models | 120B models, 1M context |
| Ships | Q4 2026 | Fall 2026 |
The DGX Station puts a trillion-parameter model on a single desk for anyone who needs air-gapped inference or wants off the per-token meter. RTX Spark is the bigger structural move: Nvidia's first Arm SoC for mainstream laptops, a direct shot at Apple Silicon's unified-memory advantage, paired with Microsoft on the Windows side.
A fully open mixture-of-transformers world model that unifies language, image, video, audio, and robot actions in one backbone, ranked first among open models on combined robotics benchmarks. The deployment target is the Jetson AGX Thor robot platform. For robotics teams who've been stitching together separate perception, language, and action models, one shared open backbone is the practical win.
None of this has shipped or been independently benchmarked yet; the petaflop and memory figures are Nvidia's targets.
President Trump signed an executive order on AI that's far softer than an earlier draft. Companies can let the government peek at powerful new models for up to 30 days before release, but they don't have to, and the order flatly rules out any government licensing or permission slips for building AI. It does tell the NSA and Treasury to set up security and bug-sharing programs. The practical effect rests entirely on whether the big AI labs choose to play along, because there's no penalty if they don't.
The signed order, "Promoting Advanced Artificial Intelligence Innovation and Security," is the scaled-back version. A May draft had a 90-day mandatory pre-release review; Trump pulled it. What he signed instead sets up a voluntary framework where developers may give the government up to 30 days of pre-release access to frontier models, and it explicitly bars any "mandatory governmental licensing, preclearance, or permitting requirement."
The operative deadlines are the part to watch:
The whole thing has no enforcement teeth, so its effect depends entirely on whether the big labs opt in. The quiet leverage is that NSA threshold definition: whoever draws the line around "covered" models shapes the regime more than the review window does.
The legendary director joined AI startup Black Forest Labs and showed himself using its image tool to sketch out shots for his next movie before filming. He's careful about the limit: it's for planning and showing his crew what's in his head, not for replacing actors or sets. Because Scorsese is such a craft purist, his blessing carries weight, and it gives the startup a stamp of seriousness money can't buy. Not everyone's on board; fellow director Guillermo del Toro recently compared starting a project with AI to "spitting on God."
Black Forest Labs named Martin Scorsese a company advisor and released a video of him using its FLUX model to storyboard What Happens at Night, his next film with DiCaprio and Lawrence. The use is narrow and he's precise about it: pre-production visualization, communicating shots to his cinematographer and production designer, not generating performances or sets.
There's always been this problem of, how do you communicate what you see in your head to your cast and crew? Now, with this tool, I can share what I'm visualizing more clearly and efficiently.
Why it matters has nothing to do with the model's specs and everything to do with the signature. Scorsese's whole reputation is built on craft, so his endorsement makes it harder to wave off AI pre-production tools as incompatible with serious filmmaking, and it buys Black Forest Labs cultural legitimacy no benchmark could. The reaction isn't unanimous: Guillermo del Toro recently called starting a creative project with AI "like spitting on God." The advisory terms weren't disclosed.
The maker of the Claude chatbot secretly filed early IPO paperwork with regulators. "Confidential" means we don't yet know the price, the share count, or the timing, and it might not even happen. The eye-watering context, from press reports rather than Anthropic itself: the company is valued around $965 billion and is reportedly on pace for about $47 billion in annual revenue. If it goes through at that size, it'd be one of the biggest stock-market debuts ever.
Anthropic confidentially submitted a draft Form S-1 to the SEC on June 1. That's the procedural first step toward an IPO and nothing more: no share count, no price, no underwriters, no timeline, and the company is explicit that going public is contingent on SEC review and market conditions.
Keep the two layers separate. What Anthropic confirmed is the filing. The financial picture around it comes from Fortune's reporting, not Anthropic's disclosure: a $965B valuation (ahead of OpenAI's $852B in March), a roughly $47B annualized revenue run-rate as of May, up from about $10B at the end of 2025, on the back of a $65B raise. If the valuation holds into an actual offering, this is one of the largest IPOs in US history, and the first real template for what an AI-native listing looks like. And because the S-1 is confidential, its actual contents stay private for now.
Anthropic expanded a program that points its AI at software to hunt for vulnerabilities, now covering around 200 organizations. Across a thousand-plus open-source projects, the AI flagged about 23,000 issues, and outside experts confirmed the serious ones were real over 90% of the time. Here's the catch: of the high-severity bugs reported to the projects' maintainers, only 75 have actually been patched.
That's the whole point. Finding bugs used to be the hard part; now an AI can spit out thousands. The new bottleneck is the small number of humans who have to verify and fix them. Think of a smoke detector that goes off in 6,000 rooms at once when you've got one firefighter.
Anthropic expanded its security program from about 50 partner organizations to roughly 200, across 15-plus countries, running on a Claude Mythos Preview model that scans critical-infrastructure and open-source code for vulnerabilities. The expansion is the headline. The number that should stop you is in the earlier results post.
Across 1,000+ open-source projects, Claude surfaced 23,019 issues, with an estimated 6,200 high or critical. Six independent security firms triaged a sample of 1,752 and validated 90.6% as true positives, which is far above typical scanner noise. Then the funnel collapses: of the high/critical bugs, 530 were formally disclosed to maintainers, and 75 have actually been patched.
That gap is the real story, and Anthropic is right to frame it as a structural shift. AI has moved the bottleneck in software security off of finding bugs and onto verifying, disclosing, and convincing a human maintainer to fix them. A model that can generate 6,000 credible high-severity findings against a volunteer ecosystem that patches dozens isn't a solution; it's a new operational problem nobody has staffed for.
(One precision note: the 23,019 and 75 figures come from the initial-update post, not the expansion announcement, and they aren't a single clean ratio. The 23k is all severities; the 75 patched is against the 530 disclosed.)
The company Cognition renamed its coding app and turned it into one place to run AI coding helpers, whether they work on your own computer or out in the cloud. The clever bit is that it doesn't lock you to one assistant; it's built to plug in several brands and, through an open standard, any future one. It's a bid to become the universal adapter for AI coding agents. Heads up if you use the old version: the legacy engine gets switched off July 1.
Cognition rebranded the Windsurf IDE (the one it acquired from Codeium) as Devin Desktop, and folded it into a single surface that runs both a local coding IDE and cloud autonomous agents. The interesting bet is that it's agent-agnostic: Devin, Claude Agent, Codex, and OpenCode are first-class out of the box, and anything else plugs in through the open Agent Client Protocol (ACP).
Under the hood, Devin Local is a Rust rewrite of the old Cascade engine, claiming up to 30% better token efficiency with subagent support, though that figure is unattributed and unbenchmarked. Legacy Cascade is deprecated July 1, so existing users have a hard cutover. If ACP gets adoption, it's a play to become the common protocol for plugging any agent into any host, the thing every one of today's siloed agent desktops lacks.
GitHub, the place developers store code, released a desktop app for running multiple AI coding agents in parallel without them tripping over each other (each gets its own private copy of the project to work in). It can even watch the automated tests and merge finished work for you once your rules are met. That auto-merge is handy and a little nerve-wracking, since you're letting software approve changes to real code, so the controls over how much it's allowed to do will matter a lot. It's an early preview.
GitHub shipped a standalone desktop app whose whole premise is orchestrating multiple agents at once instead of autocompleting one file. Each agent session runs in its own git worktree, an isolated branch checkout, so parallel agents can't step on each other. A "My Work" view consolidates active sessions, issues, PRs, and background automations; Canvas surfaces let you redirect an agent mid-task; and Agent Merge watches CI, chases reviewers, and triggers merges once your conditions are met.
That last one is the load-bearing risk. Automating the merge decision off CI status is powerful and dangerous in a production repo, and the value will live entirely in how granular the scope controls are (CI-recovery-only versus full merge authority). It's a technical preview for Copilot Pro and up, no GA date. The signal: GitHub is betting the core product is now agent orchestration, not code completion.
The research group Nous Research released Hermes Desktop, a free and open assistant for Mac, Windows, and Linux. Unlike the day's other tools, it isn't about coding: it remembers one conversation across Telegram, Discord, WhatsApp, Signal, email, and more, and you run it yourself instead of renting it from a company. It's early-stage software, so expect rough edges, but the appeal is independence: your assistant, your machine, no single company in the middle.
Hermes Desktop is a native macOS/Windows/Linux app from Nous Research, MIT-licensed, that wraps the Hermes agent system in a GUI. What sets it apart from the day's other agent desktops is the design philosophy: it isn't repo-shaped. One unified memory spans Telegram, Discord, Slack, WhatsApp, Signal, email, and CLI, subagents run isolated with their own terminals and Python RPC, and it auto-generates skills from how you use it. Five execution backends (local, Docker, SSH, Singularity, Modal) cover everything from a laptop to a cluster.
It's a model-agnostic, provider-independent agent you actually run yourself, which is rare. Caveats are the usual for v0.15.2 / Feature Preview: the only source is the product page, no paper or independent benchmark, and the API surface isn't stable.
Perplexity is building a system that automatically splits AI work between a small model on your own device and powerful models in the cloud, deciding task by task. Sensitive stuff (your finances, health, private files) stays local; the heavy thinking goes to the cloud. The pitch is privacy without you fiddling with any settings. It only works if the auto-decisions are trustworthy, and Perplexity hasn't said much about how it chooses. It's due in July.
Perplexity's pitch is hybrid inference that routes per task, at runtime, between a compact on-device model and frontier cloud models, with no user toggle. The local model handles sensitive data (financial records, health, personal docs) and also makes the routing call; heavy reasoning and large retrieval go to the cloud. The point is privacy by default plus token savings.
Per-task routing decided automatically, rather than per-session by the user, is the new idea here, and it's a credible answer to "I don't want my agent shipping my bank statements to a server." Whether it works comes down to routing reliability, and Perplexity disclosed nothing about the local model's size or the decision mechanism. Both the hybrid feature and a personal-computer hardware product are slated for July 2026.
Alphabet announced the biggest stock-raising effort by any US company ever, $80 billion, to fund the data centers and chips behind its AI push. Notably, $10 billion of it comes from Warren Buffett's old company, Berkshire Hathaway, which historically avoided big tech bets. Alphabet plans to spend roughly $180–190 billion this year alone. The signal: AI infrastructure isn't a one-time splurge, it's a permanent, enormous bill.
Alphabet announced an $80B equity raise, the largest single equity offering in US corporate history, in three tranches:
The Berkshire piece is the tell: it's one of Greg Abel's first big capital deployments since taking over from Buffett in January, and a historically tech-averse allocator buying directly into AI infrastructure is its own signal. The money funds a 2026 capex program Sundar Pichai has put at $180–190B. Alphabet is financing a buildout larger than most national defense budgets entirely through dilution rather than debt, which says compute is now a permanent cost center, not a one-cycle bet.
SK Hynix, which makes the specialized memory that AI chips depend on, says it'll double how many of those chips it can produce over the next five years. Demand is so far ahead of supply that its entire 2026 output is already spoken for, and it expects the shortage to last until 2030. The moment of the show: Nvidia CEO Jensen Huang grabbed a marker at SK Hynix's booth and scrawled "Please make more" on a chip wafer. When your biggest customer is writing notes like that, you know the crunch is real.
At Computex, SK Group chairman Chey Tae-won said SK Hynix will double total memory wafer capacity within five years. He pointedly declined to attach a dollar figure (equipment, land, and power costs make it unforecastable, he said), but committed 2026 capex above the ~30.2 trillion won (~$20B) spent in 2025. The supplier's own forecast is that the AI-driven memory crunch persists through 2030, and its entire 2026 HBM output is already sold out.
The image that traveled: Jensen Huang stopped at the SK Hynix booth, picked up a marker, and wrote "Please make more" on an HBM4E wafer. HBM is the actual ceiling on how fast the industry can ship AI accelerators, and SK Hynix sits on the dominant share, so a five-year doubling pledge, paired with a confidential US ADR filing to fund it, is the most concrete commitment yet to lifting that ceiling. It's still a chairman's directional target, not a board-approved plan with milestones.
Chinese company MiniMax unveiled M3, a model that can read about a million words at once (think a stack of books in one go) and handle images and video. It promises to give away the model for anyone to use "within about 10 days," but it wasn't available yet at launch, so the "free and open" part is still an IOU. The headline trick is a way to handle huge amounts of text far more cheaply than usual; if it holds up under outside testing, it'd make long-memory AI assistants much more affordable.
MiniMax released M3, a mixture-of-experts model (229.9B total, 9.8B active, 256 experts) with a 1M-token context window and native image/video input. Open weights are promised on Hugging Face and GitHub "within ~10 days," and a license; neither was live at launch, so the open-source framing is on credit for now.
The architecture claim to watch is MiniMax Sparse Attention (MSA): block-level KV selection layered on a GQA backbone, which MiniMax says cuts per-token compute at 1M context to roughly a twentieth of its prior model, with 9x faster prefill and 15x faster decoding. Sub-quadratic attention that holds quality at a million tokens is the thing that decides whether long-context agents are affordable, so that's the number to wait on independent confirmation for. The benchmarks (SWE-Bench Pro 59.0%, BrowseComp 83.5) are vendor-run and put it frontier-adjacent on coding, a hair behind Claude Opus 4.7. If the weights ship under a permissive license, this is forkable quality at a tier that rarely is. If they don't, it's a teaser.
OpenAI and partners started building a massive data center near Saline, Michigan, big enough to power a small city, costing about $16 billion. To win local goodwill, OpenAI is handing 400,000-plus Michigan students up to $45 million in free credits for its coding tool. The catch that's stirring debate: the project went ahead even though the local township had voted against it, with state-level approval overriding the town. Expect that fight to repeat as more of these get built.
OpenAI, Oracle, Related Digital, and Blackstone formally broke ground on a Stargate campus in Saline Township, Michigan, internally called "The Barn": 250 acres, three 550,000-sq-ft buildings, 1+ GW of compute drawing ~1.4 GW of power from DTE Energy. Steel was already going up in March, so the ceremony mostly formalized work in progress. The financing is the maturing part: $16B total, with $14B in bonds placed through Bank of America (Pimco took $10B).
Two things to file. OpenAI pledged up to $45M in free Codex credits to 400,000-plus Michigan students, which reads as deliberate constituency-building for a project that overrode a local township vote against it. That override is the precedent worth watching, a template for how state-level approval can steamroll municipal opposition on AI siting.
Off the coast of Shanghai, China switched on a data center sitting about 35 meters underwater, cooled by ocean water and plugged straight into offshore wind turbines. The point: data centers normally hog land, fresh water, and electricity, and putting one underwater next to wind farms sidesteps all three. The operator claims it uses about 23% less power for cooling. Those numbers come from the company, not an independent check, and it's still a small pilot, but the approach gives coastal China a clever way to keep adding AI computing.
HiCloud brought a subsea data center into full commercial operation about 10 km off Shanghai's Lingang area, with sealed server modules ~35 m down. Two design choices attack the three things that actually constrain where you can build a data center, land, freshwater, and power: it's cooled by circulating seawater (claimed 22.8% less electricity than land cooling, zero freshwater, 90%-plus less land), and it's wired directly to offshore wind rather than the grid.
That direct-wind connection is what makes the "world's first" claim plausibly distinct from Microsoft's old grid-powered Project Natick. It's at pilot scale now (2.3 MW of a planned 24 MW, ~2,000 servers, ~$226M committed). The efficiency number comes from the operator and state-linked partners with no independent audit, so treat it as a vendor figure. The strategic read: China's coastline gives it a structural way to add AI compute without the land-permitting and water fights now slowing US sites.
A company called WindBorne says its AI weather model beats the big US and European forecasting agencies, partly because it flies its own balloons to gather data over the oceans where measurements are thin, and updates its forecast every hour instead of every six. The boldest claim: its 4.5-day forecast is as accurate as the official 1-day one. The honest caveat: WindBorne graded its own homework. Until an independent group checks it, treat "better than the government" as a sales pitch, not a fact.
WindBorne released WeatherMesh-6, an AI weather model it claims beats ECMWF's IFS and NOAA's HRRR. The specifics are genuinely novel: 0.25° global resolution, 128 ensemble members, a fresh forecast every hour (traditional models cycle every six), and, crucially, its own balloon constellation feeding upper-atmosphere observations over data-sparse oceans, ingested via an innovation-based assimilation scheme. The standout claim is that WM-6's 4.5-day temperature forecast matches IFS's 1-day accuracy, with up to 38% lower ensemble RMSE.
The discipline note: every one of those numbers is self-reported, and no independent body has replicated them. WindBorne does publish raw gridded outputs for external verification, which is the right instinct, but until a group like ECMWF's AI Weather Quest runs the comparison, "out-forecasting the government" is a company claim, not a finding.
A big study of over 100,000 programmers found that AI tools massively boost how much code people write, up to 180% more, but only about a sixth of that boost survives to actual finished, released software. Why? Writing code was never the real bottleneck. Reviewing it, testing it, fixing the plumbing, and making judgment calls still need humans, and those steps don't speed up. The lesson for companies: buying AI coding tools without beefing up the review-and-release side mostly buys you a bigger pile of unfinished work.
A working paper (Demirer, Musolff, Yang; NBER w35275) tracked 100,000-plus GitHub developers across three generations of AI coding tools and measured output not at the commit but all the way to shipped releases. The attenuation is the finding:
| Tool generation | Commit gain |
|---|---|
| Autocomplete | +40% |
| Interactive agents | +140% |
| Autonomous agents | +180% |
That +180% at the commit level falls to roughly +50% at the project level and about +30% by the time it reaches an actual release. Roughly a sixth of the headline productivity survives to shipping. The estimated elasticity of substitution between AI and human effort is 0.25, which is the formal way of saying the two are strong complements and the human is the bottleneck. A separate marketplace check found more new apps but no rise in total app usage.
The authors read it as a "weak-link" production model: AI demolished the code-writing constraint, so the constraint moved downstream to review, integration, environment setup, and judgment. The operational takeaway is blunt: buy coding agents without also expanding review and deployment capacity and most of the gain evaporates before it ships. It's a working paper, not yet peer-reviewed, so treat the exact figures as provisional.
Scientists in Switzerland made microscopic "robots" out of nerve-growing cells wrapped in special particles, then steered them to spinal injuries using magnets from outside the body, where they deliver gentle electrical stimulation with no wires or implants. In fish they restored near-normal swimming in three days; in mice with fully cut spinal cords, movement improved over four weeks. It's a genuinely exciting early result in a respected journal, but it's only animals so far, and this field has seen many animal successes that never worked in people. No human testing yet.
Researchers at ETH Zurich built NPCbots: neural progenitor cells coated in magnetoelectric nanoparticles, steered to a spinal-injury site by external magnetic fields, where the particles convert that field into localized electrical stimulation with no implanted electrodes. The cell can become nerve tissue; the coating delivers the stimulation. Both jobs in one wireless construct is the new idea.
The results, in Nature Materials:
A fully transected mouse cord is a hard model, and the venue is top-tier, so this is a meaningful proof-of-concept. It is also strictly preclinical, and spinal-cord research specifically has a long graveyard of animal results that never translated. The authors say human-tissue magnetic parameters still need working out. No human data exists.
NewLimit, co-founded by Coinbase's CEO, raised $435 million to develop medicines that make old cells behave young again. They say they've already rejuvenated aged human liver cells in the lab and want to start the first human trial in 2027, much sooner than they once expected. It's a huge bet on an unproven idea, and the encouraging part is that the early result was in human cells, not just mice. Everything so far comes from the company's own announcement, not an independent study, so it's promising, not proven.
NewLimit, the cellular-reprogramming company co-founded by Coinbase CEO Brian Armstrong, closed a $435M Series C led by Founders Fund, with Thrive, Greenoaks, and Quiet Capital. The science is epigenetic reprogramming: restoring youthful gene-expression patterns in aged cells, on the premise that aging is plastic at the cellular level. The milestone behind the raise is a prototype shown to reverse age in old human liver cells in the lab, and the company now plans its first human trial in 2027, years ahead of the decade-plus its founders originally expected.
A $435M round for a preclinical longevity company with no approved product is an outsized private bet, and the thing that justifies it is that the proof-of-concept was in human cells, not mouse models. Every claim here is from a fundraising post, not a peer-reviewed study, the liver results aren't independently published, and "first human trial next year" is a plan that still needs an IND clearance the announcement doesn't mention.
Microsoft just launched Scout, an AI helper that runs in the background of your work apps. The next day, the outlet 404 Media published internal Microsoft documents in which the rollout plan's first step is written, plainly, as "make people addicted." Scout can reach your email, files, and calendar, so a tool designed around keeping you hooked, rather than finishing your task and leaving, is a very different thing from what the launch ad suggested.
A day after Microsoft launched Scout as the friendly always-on agent (story 1), 404 Media published internal documents that frame it rather differently. One, titled "ClawPilot: Overview and Plan with Project Lobster," lays out a three-phase rollout whose first phase is written, in Microsoft's own words, as "Make people addicted." ClawPilot is Scout's internal name; it runs on the OpenClaw stack and is being woven through Microsoft 365.
Strip the spin and this is the engagement playbook stated plainly. The public pitch for an ambient assistant is convenience; the business case is daily active use, and "addiction" is just the unguarded word for the metric every consumer product chases. What makes it land harder here is the surface area: an agent with standing access to your mail, files, and calendar, tuned for habit rather than for finishing a task and getting out of your way. That phrasing is 404 Media's reading of the internal documents, and Microsoft hasn't addressed it on the record. It sits awkwardly next to the same week's launch copy, and it's the kind of internal candor that ages badly.
DeepSeek made its name building competitive AI for far less money than its rivals. Now it's raising about $7.4 billion in its first outside funding, valuing it around $52–59 billion, with its founder, Tencent, and battery maker CATL leading. The takeaway: even the lab famous for thrift can't dodge the giant compute and power bills, so it's taking the money. It's still unconfirmed by DeepSeek and rests on insider sources.
DeepSeek is closing its first external funding round, about 50 billion yuan (~$7.4B), at a post-money valuation of 350–400 billion yuan ($52–59B), according to people familiar with the talks. Founder Liang Wenfeng is reportedly putting in ~20B yuan himself, with Tencent (~10B) and battery giant CATL (~5B) the largest outside backers, and the national AI fund, NetEase, and JD.com in the conversation.
The irony is the story. DeepSeek built its name on frontier-adjacent models trained for a fraction of the usual spend, the lab that supposedly didn't need a war chest. Taking outside money, and this much of it, says the efficiency pitch and the capital race were never opposites; you still need the compute. The backers are the tell, too. Tencent brings distribution, and a battery maker on the cap table points straight at the part of the AI bill nobody can software their way out of: power. This is sourced to unnamed people, not a DeepSeek announcement, so the figures firm up only if and when it closes.
Unitree, the Chinese company known for its four-legged "robot dogs" and now humanoid robots, cleared a key approval step toward listing on Shanghai's stock exchange. It would be one of the first humanoid-robot makers to go public. That matters because an IPO forces a company to open its books, so we'd finally see whether the humanoid-robot boom is real revenue or mostly impressive videos. China is betting big on the sector, and Unitree is its poster child.
Unitree passed its listing-committee hearing on the Shanghai Stock Exchange's STAR Market, clearing the major gate before an IPO moves on to registration and issuance. The Hangzhou company is best known for its quadruped "robot dogs" and, lately, humanoids, and it would be among the first pure-play humanoid-robotics makers to reach public markets.
Passing the hearing isn't the bell-ringing; registration and pricing still have to follow. But it's a real marker for a sector that has run mostly on demo videos and term sheets, because a listing forces audited numbers into daylight: revenue, margins, how much of the humanoid story is shipping product versus promise. China is pushing humanoid robotics hard as industrial policy, and Unitree is the closest thing it has to a flagship. The public-market test is the one that separates the category's hype from its economics.
AI labs are hiring philosophers, people trained to reason about right, wrong, and hard judgment calls, to help decide how their chatbots should act in tricky situations a test can't measure. (Anthropic, for example, used philosophers to help write the "rules" that guide Claude.) Supporters say that once AI makes real-world decisions, "what should it do?" is genuinely a philosophy question. Critics call it window dressing: hire a philosopher, point to them when people complain, and ship anyway. The real test is whether those philosophers can actually block a launch.
The labs are recruiting philosophers. WIRED's piece, bluntly titled "To Land a Job in AI, Try Reading Kant," reports that frontier labs are bringing on people trained in ethics and epistemology to help define how models should behave at the edges: the values, the hard trade-offs, the cases a benchmark can't score. Anthropic philosophers, for one, worked on the "constitution" that shapes Claude's behavior.
There's a real argument for it. Once a model is making consequential calls in domains with no clean right answer, "what should it do" stops being an engineering question and becomes a philosophical one, and you'd rather have someone who has thought about it for a living. The skeptic's read, also in the piece, is ethics-washing: hire a philosopher, point to them when the questions get pointed, ship anyway. Which one it is comes down to whether the philosophers can actually say no to a launch, and no job posting tells you that.

Two conferences set the weather this week, Microsoft Build and Computex, and between them the message was the same: the agent stops waiting for you. Microsoft's Scout runs in the background; OpenAI aimed Codex at people who've never written a line of code; GitHub, Cognition, Nous, and Perplexity all shipped some version of "an agent that lives on your desktop." Capability is no longer the story. Distribution is. The frontier is now a race to put an autonomous helper in front of every worker, with their email, their files, and their judgment one approval-click away.
What makes today cohere, though, is the bill arriving underneath all of it. Alphabet is raising a record $80 billion, Berkshire money included, to pay for compute. OpenAI broke ground on a gigawatt in Michigan over a town's objection. SK Hynix promised to double the world's supply of AI memory while Jensen Huang scrawled "please make more" on a wafer, and China lit up a data center under the sea to dodge the land, water, and power limits that are starting to bind. The physical economy of AI, concrete, silicon, seawater, equity dilution, is now the constraint the software has to grow into.
And the day's sharpest signal is that the constraint isn't only physical. A 100,000-developer study found AI multiplies the code you write but barely the code you ship, because the bottleneck moved downstream to the humans who review and decide. Anthropic's own security AI found 23,000 software flaws and watched 75 get patched. Same shape, twice: the machine now generates work faster than people and institutions can absorb it. The week's gold rush is in always-on agents and trillion-dollar buildouts; the week's quiet lesson is that the rate-limiting step is, increasingly, us.

Pick a question above, or type your own. The badger answers from this issue's own words.
The real badger's napping off the dig, so this one's AI. It can be wrong, so check the sources.