Home
Posts
AI Reading for Wednesday August 27

AI Reading for Wednesday August 27

95% of statistics are made up

Aug 27, 2025

Here is my crack at explaining that viral MIT paper saying something about 95%/5% . I think this is the key chart:

I think what this is saying is, suppose your are a biglaw firm, just as an example. 60% of firms have looked at implementing a comprehensive tool like Harvey, 20% piloted it, 5% have measurable improvement in productivity or profitability KPIs. So actually 25% of these pilots are successful.

The dark blue is firms that rolled out e.g. ChatGPT Enterprise. So it turns out that when you do these big top-down projects, a lot of people say, why should I use Harvey, I have my own ChatGPT prompts that I can customize and improve, and workflows that work better.

Or in financial services, substitute BlueFlame or Auquan or Hebbia or Rogo although I mostly hear good things (but occasionally the narrative above)

The key thing is you have to customize the Harvey for those workflows, build in observability and see what works, and improve it for everyone over time, with agentic workflows feeding it the right context for what they actually do with it. Also good training on using the agentic multi-turn workflows so users get maximum benefit exceeding single-turn chat prompts or repeated cut and pastes.

If people get benefit from chatbots, they should get more benefit from longer scripted versions of what they do daily. But a lot of time there is lack of alignment, so they don’t.

The overall paper has a lot of nuance that resonates, AI is stochastic and evolving fast, you have to learn how to implement it in your context.

AI is hard to measure, you give coders Claude and it helps them and they use it, but exactly how much more productive are they? The chatbots help, shadow AI that people build in the trenches is aligned with their business needs, AI helps grunt work but less for highest-value work.

They put a clickbait headline on it and it went viral, which was the intent, but what went viral was, 95% of the time AI adds no value which is not what they said.

The eye sees what it wants to see, or brings the power to see … people don't do nuance, they want certainty. AI is either useless, or magic beans, there can be no middle ground. Like, it’s hard, it takes time, you have to learn how to do it right and adapt as it changes, and train people, and success is hard to measure.

Interoperability and adaptability are key to AI in the enterprise, “the AI revolution will not be won by technological advances alone.” - Fast Company

New technologies paradoxically decrease the effort required to perform tasks while increasing the effort required to remain competitive — we have to run to stand still - Indie Hackers

Nvidia earnings incoming after the close - Yahoo Finance

China aims to triple AI chip output, reducing Nvidia's dependency, FT says - Reuters

Wonks say US should maintain China chip restrictions: "A short-term profit grab risks eroding America’s biggest advantage in the AI race." - The Washington Post

Nvidia's new $3,500 robot 'brain' could herald the final form of AI - Fortune

The A.I. Spending Frenzy Is Propping Up the Real Economy - The New York Times

In Jackson Hole, AI and its economic impacts are on the agenda - Forbes

Microsoft talks set to push OpenAI’s restructure into next year, jeopardizing $10b of funding. - Financial Times

A family sued OpenAI for allegedly encouraging teen to commit suicide - The New York Times

A California teen sought advice from OpenAI's GPT-4o on how to end his life. The chatbot gave him explicit instructions and encouragement. His parents are suing the company and its CEO, Sam Altman, alleging “it was the predictable result of deliberate design choices." - Tech Policy Press

OpenAI Says It Will Update ChatGPT After Parents Sue Over Teen’s Suicide - Bloomberg

So, we want the ‘just kill yourself’ AI teaching the little children or nah? - The Guardian

Evaluating AI models on questions about suicide. Very high risk questions get refusals from all LLMs, other responses vary, sometimes benign questions are refused, more work on alignment with clinical practitioners recommended. - www.psychiatryonline.org

Some AI researchers left Meta shortly after joining to go back to OpenAI - WIRED

Q&A with General Catalyst's Hemant Taneja on the VC firm's “AI roll-up” strategy to buy service businesses and inject them with AI - Financial Times

Anthropic launches a Claude AI agent that lives in Chrome (has some of the same security issues as Comet) - TechCrunch

Anthropic announces a pilot test of a new Claude browser extension - Anthropic

Anthropic: ‘Vibe-hacking’ is now a top AI threat - The Verge

Anthropic settles copyright lawsuit - WIRED

A popular dev tool was compromised to look for Claude Code on users' systems and issue prompts to find and steal cryptocurrency credentials. - Semgrep

The Era of AI-Generated Ransomware Has Arrived - WIRED

Some notes on the insecurity baked into Perplexity's Comet "AI Browser" - the Brave security team reported serious prompt injection vulnerabilities in it, but Brave themselves are developing a similar feature that looks doomed to have similar problems […] - Mastodon

Google AI is hallucinating restaurant specials, making customers mad - Android Headlines

Which AI search should you trust for facts? Our librarians' best rankings. Google AI mode came out on top. - The Washington Post

Google's AI Weather Model Nailed Its First Major Storm Forecast - Gizmodo

Google Gemini's Flash 2.5 gets a 'bananas' image model upgrade - TechCrunch

Google is building Duolingo feature into Translate - Engadget

AI for academics: SciSpace for locating, summarizing and organizing papers; Jenni AI, Yomu and GenSpark for drafting; Thesify for clarity/logic feedback; and platforms like NotebookLM, Google Gemini and Deep Research for advanced research tasks - Geeky Gadgets

Doctors love ambient AI transcription because it gives them hours of their lives back - Fast Company

Designing more sustainable products with AI - MIT Technology Review

Netflix provides a triage matrix to use AI in production - (low‑risk: moodboards; high‑risk: character designs/digital doubles) and warns producers to "treat it like explosives – useful in the right hands, dangerous if mishandled." - CineD

Apple still debating potential Mistral and Perplexity M&A bids internally amid looming Google Search shakeup - 9to5Mac

AI skeptics are running out of rope to argue Apple isn’t behind - 9to5Mac

AlphaAgents: Large Language Model based Multi-Agents for Equity Portfolio Constructions - arXiv

The digital afterlife industry may near $80 billion in a decade, fueled by AI "deadbots." Tech firms see profit. But experts warn of troubling consequences. - NPR

Elon Musk can't stop posting anime gooner AI slop because of course he can't - Rolling Stone

Glass-hole controversies are about to make a comeback - Futurism

Spent 15 minutes writing a prompt, AI thought for 30 minutes, will take me a day or two to sort through all this - ChatGPT

Follow the latest AI headlines via SkynetAndChill.com on Bluesky

Keep Reading