TechDig — Thursday, June 4, 2026

Today's issue

Got ten seconds? The badger read all of it.

Today's lead AI Labs

Google's new free AI is multi-talented and small enough for your laptop

Gemma 4 lands a multimodal model that fits on a laptop, and drops the encoders

Google released Gemma 4, a free, downloadable AI you can run on your own computer instead of renting it from a data center. The clever bit is how it's built: instead of bolting on separate parts to handle pictures and sound, it takes images and audio in directly, which is what keeps it small enough to run on a single decent graphics card. The mid-size version is the first in the line that can listen to audio, not just read text, and it can juggle a small book's worth of information at once. The usual caution: the flattering scorecards are Google's own, and "runs on a laptop" assumes a slimmed-down version, not the full one.

The interesting part of Gemma 4 isn't the size, it's the architecture. Google's new open family spans edge variants (E2B, E4B) up through 26B and 31B, but the headline is the 12B (11.95B params, Apache 2.0), and the move that matters is that it's encoder-free: raw image patches and raw audio waveforms get projected straight into the LLM's embedding space through thin linear layers. No separate vision tower, no bolted-on audio encoder. One backbone ingests text, image, audio, and video (as frame sequences), which is what cuts the memory and latency overhead enough to make a genuinely multimodal model run on consumer hardware.

The 12B is the first mid-size Gemma with native audio in (ASR and speech translation), carries a 256K context (hybrid 1024-token sliding window plus global attention), and adds a toggleable thinking mode and native function-calling. It runs in ~16GB VRAM at 4-bit (down to ~6.4GB quantized; ~24GB at full BF16), so a single mid-range GPU does the job.

Google's own numbers for the 12B instruction-tuned model:

Benchmark	Score
MMLU Pro	77.2%
GPQA Diamond	78.8%
AIME 2026 (no tools)	77.5%
LiveCodeBench v6	72.0%
MMMU Pro (vision)	69.1%

For scale, the 31B posts 85.2% MMLU against Gemma 3 27B's 67.6% — a real generational jump. Two caveats worth holding: every figure here is self-reported with no third-party eval yet, and native audio ships only on the E2B/E4B/12B variants, not the two larger ones. The "laptop-ready" claim is a quantized-inference claim, not full precision.

Read the sourcesblog.google ↗deepmind.google ↗huggingface.co ↗

Big Tech

Meta just dropped a free AI salesperson into WhatsApp, Instagram, and Messenger

Meta Business Agent puts an agentic salesperson in a billion daily chat threads

If you run a small business, Meta now offers a free AI assistant that lives in your chats and does the grunt work: answers customer questions, suggests products, books appointments, even tries to close the sale, then hands tricky cases to a human and emails you a recap each morning. Meta says over a million businesses already use the earlier version and that a billion conversations a day flow through these apps, so the reach is enormous. It also opened a back end that plugs the assistant into tools like Shopify and Zendesk. The catch nobody's priced yet: "free to start," with paid tiers coming, and an AI handling your customers is only as good as the day it gets something important wrong.

Meta turned its scattered business-messaging tools into one product and pointed it at every small business with a WhatsApp number. Meta Business Agent, announced at the Conversations conference in London, runs across WhatsApp, Messenger, and Instagram, and the verb list is the tell: it answers questions, recommends from a catalog, books appointments, qualifies leads, closes sales, hands off to a human at a configurable threshold, and emails a morning briefing summarizing overnight chats. Free to start; paid tiers "in the coming months"; large accounts billed on tokens consumed.

The structurally important piece is the Business Agent Platform underneath it: a developer layer that connects the agent to "hundreds of systems," with Shopify, Zendesk, and Shopee named, plus enterprise guardrails and measurement. That's Meta positioning itself as agent infrastructure, not just a walled-garden chatbot.

Meta's own scale figures: 1M+ businesses already on the earlier tooling, 1B+ active daily message threads across the three apps. Those numbers, the "hundreds of systems" claim (three partners actually named), and what "closes sales" means at the payment layer are all unspecified or self-reported. But the distribution is the moat. Outside North America, this is where customer-to-business conversation already happens, and Meta just dropped a free agentic sales rep into all of it.

Read the sourceabout.fb.com ↗

Policy

Europe wants its hospitals and courts off American clouds

The EU drafts a cloud law a US hyperscaler structurally can't pass

The EU proposed rules designed to push US cloud giants — Amazon, Microsoft, Google — out of the most sensitive government computing, like health records, financial systems, and court data. The trick is in the wording: for the most sensitive tier, the cloud provider has to be European-owned and beyond the reach of US law, which no American company can currently satisfy no matter where it puts its servers. There's also a companion plan to speed up building computer chips in Europe. One big asterisk: these are proposals, not law. They have to survive the European Parliament and all 27 countries, who don't yet agree on how strict to be. As one EU official put it, the goal is making sure "nobody has a kill switch."

Brussels stopped asking US clouds to store data locally and started writing rules they can't satisfy as currently incorporated. The European Technological Sovereignty Package (June 3) bundles two binding proposals, the Cloud and AI Development Act (CADA) and Chips Act 2.0, with a non-binding open-source strategy and energy roadmap.

CADA is the sharp instrument. It defines a four-tier sovereignty framework for public-sector cloud, and the top tier requires the provider to be EU-owned and EU-controlled, staffed by EU nationals, and beyond the reach of any third-country legal order — a direct answer to the US CLOUD Act. That's not a data-residency rule AWS, Azure, or Google Cloud can meet by opening a Frankfurt region; it's a corporate-structure test no US-headquartered company currently passes. It bites specifically on healthcare, finance, and judicial public workloads, targets tripling EU data-center capacity in 5–7 years (~€200B, mostly private), and sets a 2035 self-sufficiency goal. Chips Act 2.0 replaces the 2023 act with a 12-month permitting cap, "Grand Challenges" funding for strategic chips (AI parts included), and demand-side accelerators.

"We want to be sure nobody has a kill switch." — Henna Virkkunen, Executive VP

The load-bearing caveat: these are proposals at the start of the ordinary legislative procedure, and Parliament plus all 27 member states can amend or sink them. The "impossible for US hyperscalers" reading is the strict-tier design intent, not a settled legal finding — a restructured EU subsidiary could in theory qualify. Member states already disagree on how hard the line should be.

Read the sourcesdigital-strategy.ec.europa.eu ↗digital-strategy.ec.europa.eu ↗

Healthcare

A top hospital is building its own medical AI, and keeping it

Mayo Clinic will own the frontier model Microsoft helps it build

Mayo Clinic and Microsoft are teaming up to build a powerful medical AI trained on Mayo's huge trove of (anonymized) patient records. The unusual part: Mayo will own it, not Microsoft. Microsoft supplies the engineering and the cloud plumbing, but the finished AI — and the money it makes when other hospitals pay to use it — belongs to the hospital. They say it'll help doctors catch problems earlier and reason through complicated cases. Keep expectations measured: they haven't shown what it can actually do, there's no independent testing yet, and any tool that helps diagnose patients has a long road through regulators before it touches real care.

The detail that makes this more than a press release is the ownership line. Mayo Clinic and Microsoft are building a purpose-built frontier healthcare model, trained on Mayo's de-identified clinical data and longitudinal patient records — and Mayo owns the model outright. Microsoft's role is engineering and cloud; the IP and the resulting revenue stay with the health system. It deploys first inside Mayo's own clinical environment for testing, then distributes to other organizations through Azure Foundry APIs.

Stated goals are earlier diagnosis, personalized treatment decisions, and complex clinical-reasoning support for care teams. Mayo Clinic Platform, roughly seven years old, is the vehicle, and this extends its data-commercialization strategy up a layer into model-as-product.

What's conspicuously absent is as telling as what's there: no modalities disclosed (imaging? notes? genomics?), no benchmarks, no clinical validation, and no mention of the FDA pathway any diagnostic-support use would have to clear. "Frontier" is the companies' own label, and the de-identification is asserted, not audited in the announcement. Still, a data-rich hospital network keeping the model and licensing access through a cloud marketplace is a different template from the usual vendor-owns-everything deal — and one smaller systems without Mayo's data depth might end up renting.

Read the sourcenews.microsoft.com ↗

The Money

The AI music app fighting record labels just raised $400 million, and started paying some of them

Suno doubles to $5.4B and starts paying for the music it trains on

Suno, which lets anyone generate songs from a text prompt, raised over $400 million and is now valued at $5.4 billion — double what it was worth seven months ago. It also said its next music model is being built with the music industry, opting artists in rather than scraping them. That's a real shift, and it traces back to a deal it struck with Warner late last year. But two of the three majors, Sony and Universal, are still suing Suno for training on their songs without permission, and a key court hearing lands in July. So the giant fundraise looks like Suno buying enough staying power to outlast the lawsuit and negotiate from strength. It hasn't named a single artist on the new model yet, which is worth noticing.

Suno raised $400M+ at a $5.4B post-money valuation, more than double the $2.45B it carried seven months ago. Bond Capital led; IVP, Forerunner, Union Square Ventures, Alkeon, and Quiet are new in, with Lightspeed, Menlo, Matrix, and Schroders returning. The money buys runway, but the strategic shift is the model it announced alongside the round: Suno's first built in partnership with the music industry, opt-in for artists' voices and likenesses.

That partnership traces to a November 2025 settlement and licensing deal with Warner Music Group (Suno also picked up the concert-discovery platform Songkick in that deal). It's the first time Suno is claiming licensed training data rather than leaning entirely on a fair-use defense. The defense still matters, because Sony and Universal are still suing — filed June 2024 in Boston and New York, with an amended complaint alleging more than 61,000 songs were copied (an allegation, not a finding). A July 2026 hearing in the Massachusetts case is the one to watch: a fair-use ruling against Suno would knock the legs out from under the old training model and force the whole sector onto licensing.

So the fundraise reads less like a victory lap and more like buying the legal staying power to renegotiate from strength. Notably, Suno names no participating artists, and neither side disclosed the Warner terms, so "industry partnership" can't yet be weighed on substance.

Read the sourcessuno.com ↗riaa.com ↗

Research

Law professors graded answers blind, and preferred the AI three times out of four

Law professors graded blind, and picked the machine three times in four

Stanford researchers ran a clean experiment: 16 contract-law professors wrote exam-style questions, then judged thousands of answers without knowing which came from a human and which from Google's AI. They picked the AI about 75% of the time — and one of the AI tools beat nearly every individual professor. Before you panic about your tuition, the catch matters: the same professors wrote and judged, and people tend to reward polished, well-organized writing even when it isn't actually wiser. One of the AIs also got to peek at the textbook while the professors answered from memory. It's a preprint, not yet vetted by other scientists, and it measures what graders preferred, not what students actually learn. Still, an uncomfortable result.

A Stanford-led working paper put expert preference — the hardest bar for AI in a specialist field — to a blind test, and AI cleared it. Sixteen contracts-law professors from 14 schools wrote 40 questions of the kind students bring to office hours, then judged 2,918 anonymized pairwise matchups of human-written vs. AI answers without knowing which was which.

The results:

Gemini 2.5 Pro won 75.9% of matchups against the pooled human instructors.
NotebookLM (given the course casebook) won 74.8%, and beat every individual professor but one.
Answers flagged as pedagogically harmful: ~3.4–3.6% for the AI systems, versus a 1% to 39.8% spread across the human professors.

Before anyone retires the faculty, read the method critically. The same 16 people both wrote the questions and judged the answers, which invites a style bias: polished, comprehensive, well-organized prose reads as competence even when the underlying value isn't higher, and the authors concede textual features explain part (not all) of the gap. NotebookLM answered with casebook retrieval while professors answered from memory — an uncontrolled advantage. It's one domain, U.S. contract law, and it measures stated preference in a one-shot comparison, not student learning. And it's an SSRN preprint, not peer-reviewed. Provocative, well-constructed, and not the last word.

Read the sourceslaw.stanford.edu ↗law.stanford.edu ↗

Security

Saying "yes" to your phone's assistant could unlock your front door

A "yes" to Gemini, in the wrong language, hands an attacker your house

A security researcher found a sneaky way to hijack Google's Gemini assistant on Android. The assistant reads your notifications aloud, so an attacker hides a command inside an innocent-looking message. Then it tricks the safety check: it either asks the dangerous "are you sure?" question in a foreign language and follows it with a harmless English one (so your "yes" approves the wrong thing), or buries the question in a link the phone doesn't read out, replacing it with something like "I had an error, are you there?" — and your reflexive "yes" sets it off. In the lab, that let the researcher open smart-home devices, start video calls, and plant false memories in the assistant. Google says it patched the problem last November. The unsettling lesson: a "confirm" button isn't much protection if you can be tricked into pressing it.

SafeBreach's Or Yair found a way through the exact gate Google built to stop this. Gemini on Android had a defense called Delayed Tool Invocation: before the assistant takes a real action, it asks you to confirm. Yair's technique, Fake Context Alignment, defeats the confirmation instead of bypassing it, using malicious instructions smuggled into ordinary app notifications (WhatsApp, Slack, Signal, Instagram, SMS) that Gemini reads aloud.

Two variants, both ugly:

Obfuscated — the dangerous confirmation question is spoken in a foreign language, immediately followed by a benign English line; you say "yes" to the English, and the backend aligns your approval to the foreign-language malicious request.
Muted — the malicious question hides in a hyperlink the text-to-speech doesn't read, replaced out loud by something like "I had an error, are you there?" Your reflexive "yes" silently fires the tool call.

In SafeBreach's lab, that chain opened smart-home devices via Google Home, launched Zoom calls with video, poisoned Gemini's long-term memory across Workspace, scheduled recurring snooping, spoofed messages from trusted contacts, and slipped past Safe Browsing through a trusted-domain redirect. The point that should worry builders: a confirmation step assumed to be sufficient is structurally insufficient against a well-crafted ambient-language payload, and notifications are an always-trusted input from every messaging app you have. Disclosed to Google's VRP in August 2025; Google says content-classifier updates from November 2025 mitigate it, though it published no specifics. All demos are SafeBreach's own.

Read the sourcesafebreach.com ↗

AI Labs

Anthropic's coding team says every line now goes through AI, and it reorganized the whole team around that

Anthropic's Claude Code team says it hasn't seen a human-only commit in four months

A director at Anthropic wrote up what happened when AI stopped being a helper and became the default for every code change — she says she hasn't seen a human-only commit in four months. The team threw out long roadmaps for plan-as-you-go, and humans now only formally review four things: legal issues, security, product judgment, and design. The AI handles the rest, new hires ship real work in their first week, and they tell each other to "ask the AI" about the codebase instead of tracking down whoever wrote it. Refreshingly, she admits the strain too: their testing systems can't keep up with how fast code now ships. It's their own account of their own team, but a candid one.

This is the rare AI-in-engineering post that names what broke. Fiona Fung, who directs engineering for Claude Code, describes a team where 100% of commits are Claude-assisted — she says she hasn't seen a non-Claude commit in four months — and walks through what that did to the org, not just the workflow.

Planning went just-in-time: prototype, get user feedback fast, minimal product-review gates. Long-horizon roadmaps stop making sense when iteration speed isn't the bottleneck.
Code review narrowed to four human-owned domains — legal, security-sensitive code, product-sense, and design. Style, linting, bug-catching, and test generation default to Claude.
Onboarding compressed so new engineers ship real code in week one.
Hiring shifted away from raw coding throughput toward "creative builders with product sense" and deep systems expertise.
Org shape stayed deliberately flat; managers start as ICs and have to ship. Engineers are told to ask Claude for codebase context before hunting down the original author.

The honest bits earn the post its credibility. PR cycle time dropped, but CI and build systems aren't scaling at the same pace — an admitted bottleneck. And Fung flags that the human/Claude boundary is a snapshot, not an architecture: "what you need humans for today might look different with the next model." It's all self-reported, with no definition of what counts as a "Claude-assisted commit," but as a candid field report on what saturation actually changes, it's one of the better ones.

Read the sourceclaude.com ↗

AI Labs

Two new AI art tools ditch the prompt box, and one is free to tinker with

Two image models bet against the prompt box, and one opens its weights

Most AI image tools make you retype your whole description to nudge one detail. Two new releases attack that. Ideogram 4 lets you place things by position and lock in exact colors instead of crossing your fingers on a sentence — and it released its inner workings for free (for non-commercial use). Reve 2.0 goes further: you build a picture like a layout, then tweak individual pieces directly, no re-prompting. On a popular head-to-head ranking, Reve sits at #2 behind only a top OpenAI model. Take the rankings lightly — they measure which picture people liked, not which was technically precise, and Reve's score rests on a fairly small number of votes. The real news is the shift from "describe and pray" to "arrange and adjust."

Image generation had a paradigm day. Ideogram 4 shipped as Ideogram's first open-weight release (9.3B, single-stream Diffusion Transformer, 34 layers) with an unusual choice of text encoder: a full multimodal LLM, Qwen3-VL-8B-Instruct, with hidden states pulled from 13 intermediate layers. It's trained natively on structured JSON captions rather than prose, so you place elements with bounding-box coordinates and condition on a hex color palette — deterministic layout instead of prompt-roulette. It renders 256–2048px and ships in nf4 and fp8. One asterisk: the license is "Ideogram 4 Non-Commercial," so "open weights" means researchers and hobbyists, not products.

Reve 2.0 goes further and replaces the prompt box outright. A custom Large Layout Model takes layouts, instructions, and images, produces a structured spatial layout via an internal thinking trace, then renders from it — and you edit by changing an element's attributes, not by rewriting a sentence. It's continued-pretrained from a Qwen base on billions of annotated images.

On the Arena.ai text-to-image board, Reve 2.0 sits at #2 overall (behind GPT-Image-2 Medium) and Ideogram 4 at #9. Read those gently: leaderboard rank is human aesthetic preference, not layout precision, and Reve's standing rests on only ~3,455 votes against millions for the established models. The architectural bet is the real story — if an LLM-derived layout proves more composable than natural-language prompting, it changes how generation slots into design pipelines.

Read the sourcesgithub.com ↗blog.reve.com ↗

Research

Researchers gave an AI a "sleep" cycle so it stops forgetting

"Sleep" tries to write a model's short-term memory into its weights

AI models have an annoying tradeoff: they can hold new information temporarily (and forget it the moment the conversation ends) or be retrained on it (and risk scrambling what they already knew). A new paper from Cornell and Google borrows from how brains work. In a "Sleep" phase, the model gently files new knowledge into its permanent memory without overwriting the old; in a "Dreaming" phase, it makes up its own practice problems and drills itself, no human needed. The standout claim is speed: it reached the same skill level as ordinary retraining in roughly a quarter to a fifth of the time. It's an early, not-yet-reviewed paper, and the gains are bigger on small models than huge ones — but the self-teaching part is the intriguing bit.

A model can hold new knowledge in context (gone when the window closes) or fine-tune on it (and risk overwriting what it already knew). A new paper from Cornell and Google Research, "Language Models Need Sleep," proposes a middle path that physically expands the weights to absorb new knowledge without displacing the old, and runs the whole loop without human labels.

Two stages:

Sleep (Memory Consolidation) — low-rank parameter additions into MLP blocks via "Knowledge Seeding," an upward on-policy distillation where a smaller, faster-updating model distills its freshly acquired knowledge up into a larger expanded network, guided by token-level rewards (semantic similarity, Levenshtein signals), with sparse MoE expansion to avoid bottlenecks.
Dreaming — the expanded model generates its own synthetic rehearsal curriculum via gradient-based importance scoring, then self-improves on it with RL (ReSTEM). No curated replay set.

Active parameter count at inference stays equal to the base model. Reported results across Llama and Qwen backbones: AIME-24 on Qwen3-8B at 79.2 (vs. 76.4 for GRPO), knowledge incorporation 48.9% (vs. SEAL's 46.7%), few-shot ARC 80% (vs. SEAL's 72.5%), and — the headline practical claim — 4.3–4.8× less wall-clock than supervised fine-tuning to reach the same target. It's a preprint (an ICLR 2026 submission), not peer-reviewed, the math gains are a couple of points in absolute terms, and the bigger wins come on small models and narrow tasks. The unsupervised Dreaming stage is the part most worth watching for deployed systems.

Read the sourcearxiv.org ↗

Research

AI's most overused buzzword, sorted out by the scientist who knows it best

Fei-Fei Li says "world model" now means four things, and the useful one is starved

"World model" gets slapped on everything in AI right now. Fei-Fei Li — one of the field's founders — wrote an essay arguing it actually means three different things, and the confusion is hiding which tools are good for what. Some systems just make pretty pictures (great to look at, but the physics is fake). Some build accurate simulations (a virtual world that obeys real-world rules, useful for training robots or testing self-driving cars) — and she says these are the most important and the most neglected, because the data to build them is scarce. And some plan actions (telling a robot how to actually move), which barely works outside a lab yet. Her bet is that these eventually merge into one system that can do all three. Fair to note: she runs a startup chasing exactly that, so it's a roadmap as much as a diagnosis.

Fei-Fei Li's argument is that "world model" has been stretched across computer vision, robotics, RL, and generative AI until it hides which systems actually solve which problems. Her fix is a functional taxonomy, anchored in the POMDP loop, with three types that differ in what they output and who consumes it:

Renderers — generate photorealistic pixels for human eyes. Visually plausible, physically wrong; geometry that looks right breaks under simulation. Trained on abundant internet video. (Examples: Nano Banana, World Labs' RTFM.)
Simulators — output geometrically and physically faithful state, usable by humans and algorithms, for architecture, robotics training, AV testing. Data-starved: 3D annotations are "orders of magnitude scarcer" than video. She calls these the linchpin, and underweighted.
Planners — produce action sequences from observations and goals, closing the perception-action loop. VLA and World Action Models live here, still nascent and lab-bound.

The thesis is convergence: all three need the same underlying grasp of geometry, physics, and dynamics, so the endpoint is one foundation model that switches output mode to whatever the downstream consumer needs. Her illustration: a model that truly understands a cup on a table should render it from any angle, simulate it being pushed, and plan a hand to pick it up. The open problems she flags — sim-to-real, geometry self-intersections, the cost of multi-physics, reconciling fidelity with precision — are where she says the real work is, not in better video demos. Worth reading with the conflict of interest in view: the taxonomy doubles as a research map for her own company, World Labs, whose Marble (Gaussian splats plus collision meshes at once) is her prime example of the convergence.

Read the sourcedrfeifei.substack.com ↗

Big Tech

Google's new app turns your inbox into a short morning storybook, then stops

Dreambeans turns your Gmail into a finite morning storybook, and surfaces Google's cross-app brain

Google launched Dreambeans, an app that quietly reads your Gmail, Calendar, Photos, and search history overnight and spins them into a small set of illustrated little stories about your life each morning. The whole pitch is that it's finite — unlike the bottomless scroll of social media, it ends. You choose which accounts to connect, and Google says what you link here stays separate from its other products. It's US-only and limited to paying subscribers for now. Two honest caveats: "it ends on purpose" is a choice Google can quietly reverse if it wants you hooked, and the real experiment is seeing how much of your personal data people will hand over once it's dressed up as a cozy wellness habit instead of a feed.

Google Labs shipped Dreambeans, and under the whimsy it's the first consumer product to put Personal Intelligence — the cross-app synthesis layer that also feeds the Gemini app and AI Mode — front and center as the actual feature. The app runs overnight, reads whichever Google services you connect (Gmail, Calendar, Photos, YouTube history, Search history), and produces a small, finite set of AI-illustrated personal stories each morning in full-screen cards, with a chat layer that can pull live web results. Illustrations come from a model called Nano Banana 2, drawing visual cues from your own photos.

The framing is a direct shot at infinite scroll: a feed engineered to end. On privacy, you pick which apps to connect (at least one required), and connections made inside Dreambeans are siloed from your Personal Intelligence settings elsewhere. It's US-only for now, 18+, gated to AI Ultra subscribers with a waitlist for everyone else.

Read the pitch with its reversibility in mind. "Finite" is a design choice Google can undo the moment engagement metrics ask for it, the cross-product siloing is a policy setting rather than a structural guarantee, and Nano Banana 2 ships with zero published benchmarks. The real experiment here is how much cross-app data people will hand over for ambient personalization once it's wrapped in a wellness story instead of a notification stream.

Read the sourceblog.google ↗

Security

A developer paid $1,500 to see which AI could break into his app. The results are spicy.

He spent $1,500 letting LLMs loose on his own broken app. GPT-5.5 went 7 for 10.

A programmer built a deliberately broken app and turned 13 different AIs loose on it to find and exploit the flaw, on the clock and on a budget. GPT-5.5 cracked it 7 times out of 10. A Chinese model, DeepSeek, only managed 3 — but did it about 15 times cheaper, which matters if you're scanning thousands of apps. The funniest finding: the Western AIs often figured out the right move and then got cold feet or hit their own safety brakes mid-break-in, while the Chinese models just went for it. Google's Gemini mostly refused to participate at all. The author admits it's a casual experiment, not a rigorous study. But "an AI can autonomously hack a real app" stopped being hypothetical.

Kasra Rahjerdi built BookNook — a deliberately vulnerable book-review app (React Native, FastAPI, Firebase/Firestore) — and set 13+ models on it as autonomous pentesters. The planted flaw is a real-world class: the app ships its google-services.json inside the APK, so an attacker can decompile, lift the Firebase credentials, register a user, query Firestore directly, and walk around the otherwise-hardened API. Success meant actually retrieving a private review, not writing a report about it. Each model got a $10 budget and two hours; the ~$1,500 total excludes test and failed runs (so the true spend is closer to $3K).

Model	Solved	Cost / success
GPT-5.5	7/10	$9.46
DeepSeek V4 Pro	3/10	$0.62
Claude Sonnet 4.6	2/10	$45.75
Claude Opus 4.8	2/10	$16.15
Grok Build 0.1	0/6	—
Gemini 3.1 Pro	0	—

Two things jump out. The cost gap: DeepSeek solved at ~15× cheaper per success, which makes it viable for high-volume automated scanning today. And a behavioral split — the Chinese-developed models went straight at the live database, while several Western models identified the right path and then hesitated or hit guardrails mid-task; Gemini 3.1 Pro mostly refused outright (~9K median tokens vs. 100K+ for everyone else). The author is blunt that this is "not a scientific evaluation, just a well-documented experiment": n=10 per model, one bug, one app, binary scoring. But as public evidence that a frontier model can autonomously find and exploit a real mobile vuln end to end, it's hard to wave away.

Read the sourcekasra.blog ↗

Security

One wrong click in your code tool can hand your computer to a stranger

"TrustFall": one keypress turns a cloned repo into code execution in four coding agents

Security researchers showed that four popular AI coding assistants — including Claude Code, Cursor, Gemini CLI, and GitHub Copilot — can be tricked into running an attacker's code. If you download someone's project and click "yes, I trust this folder" (which these tools nudge you toward by default), hidden setup files can quietly launch programs on your machine with your full access. On automated systems that run these tools, there's no click at all — it just happens. The researchers point out this is the fourth time the same underlying problem has surfaced in six months. Anthropic declined to fix it, arguing that trusting a folder means trusting whatever's in it; the other companies haven't said anything. The takeaway for anyone using these tools: don't click "trust" on code you didn't write.

Adversa AI's Rony Utevsky disclosed a workspace-trust flaw that hits Claude Code, Gemini CLI, Cursor CLI, and GitHub Copilot CLI the same way. Open a malicious repo, accept the folder-trust dialog (all four default to "yes/trust"), and two JSON files do the rest: a .mcp.json defines an MCP server pointing at an arbitrary command, and a settings file flips enableAllProjectMcpServers: true, which auto-approves it. The server then spawns as an unsandboxed OS process with full user privileges — no tool call, no second prompt. On headless CI using the official claude-code-action, the dialog never renders at all, so it's zero-click.

The framing is the sharp part. TrustFall has no CVE of its own, because it's the same root cause behind three already-patched ones (CVE-2025-59536 in October, CVE-2026-21852 in January, CVE-2026-33068 in March). Each fix closed a path; the underlying informed-consent gap survived all three. The current Claude Code dialog no longer even names MCP or lists which servers will run — a regression from a version that did. Anthropic declined to patch, taking the position that accepting folder trust is consent to execute project config; it acknowledged the consent-language gap but doesn't classify the auto-execution as a vulnerability. Google, Microsoft, and Cursor haven't responded publicly. The proof-of-concepts (a local calculator pop, a CI run that exfiltrates process.env) are researcher-built, not seen in the wild — but these agents hold deploy keys and publish credentials, which makes a hit a supply-chain problem, not just an endpoint one.

Read the sourceadversa.ai ↗

Labor

1,500 Meta workers fought a program that tracked their every click, and won some of it back

1,500 Meta engineers refused to be the training data

To train AI that can do office work, Meta installed software on US employees' laptops that recorded their mouse movements, keystrokes, and occasional screenshots across 200+ apps — with no way to opt out. Staff revolted: more than 1,500 signed a petition, office flyers called the company an "Employee Data Extraction Factory," and some UK workers started organizing a union. Meta blinked. It now lets people pause the tracking in 30-minute chunks and fully opt out in limited cases. But notice what this is: a partial walk-back, not a shutdown — the program lives on. And here's the kicker: Meta's European employees were never subjected to it, because EU privacy law forbids it, while American workers had to organize just to claw back a little control.

Meta wanted realistic human-computer interaction data to train workplace agents, so it put software on US employees' work laptops — the Model Capability Initiative — that logged mouse movement, clicks, keystrokes, and periodic screenshots across 200+ apps, Google Workspace, Slack, GitHub, and LinkedIn among them. The logic is honest enough: agents trained on synthetic tasks handle synthetic tasks, and only real workflow data produces agents that handle real work. The problem was the terms. It launched in April with no opt-out on company devices, confirmed in writing by CTO Andrew Bosworth.

Then the workforce pushed back. More than 1,500 employees signed a petition citing the National Labor Relations Act, flyers in the California and New York offices branded the company an "Employee Data Extraction Factory," and UK staff began a union drive with United Tech and Allied Workers. On June 2, a memo from Superintelligence Labs VP Stephane Kasriel walked it partway back: employees can now pause capture in 30-minute windows, and full removal is available — but only to remote workers with bandwidth limits, people handling sensitive data, and those who can't keep a laptop plugged in.

Two things to be precise about. This is a modification, not a shutdown; the program continues, and Meta frames the change as a response to "concerns" rather than a concession. And the split in worker protection is stark: EU employees were exempt from day one under GDPR, while US workers had to organize to claw back partial control. Landing this alongside an 8,000-person layoff round is what turned an internal data-collection program into a labor story worth watching.

Read the sourcereuters.com ↗

TL;DR — THE DAY IN ONE READ

No single model ran away with today, and that's the point. Gemma 4 squeezes a multimodal model onto a laptop; two image tools quietly retire the prompt box; researchers teach a model to "sleep" its short-term knowledge into permanent weights, and Fei-Fei Li takes the time to untangle what "world model" even means anymore. Capability keeps compounding and, just as fast, commoditizing. It's increasingly the easy part.

The hard part is everywhere it's now landing. Meta dropped a free agentic salesperson into a billion daily chats. Mayo Clinic is building a frontier model on patient records and keeping the keys. Law professors, grading blind, picked the machine three times in four. And the friction arrived right behind the capability: Europe drafting cloud rules that no American hyperscaler can structurally pass, Suno paying for the music it used to just take, three separate cracks in tools we already trust — a hijacked phone assistant, a coding agent that runs a stranger's code on one click, an app that hacks itself — and 1,500 of Meta's own engineers refusing to be the training data.

Put together, the day reads as a handoff. The question stopped being can the model do it and became do we trust it, on whose terms, and who owns what it learns. Governance, consent, sovereignty, and ownership — the slow, human stuff — are now the bottleneck. The models were the easy part.

That's the day, dug. The badger's clocking out — back tomorrow.