TLDR AI 2025-06-02
Samsung + Perplexity, Anthropic revenue surges, Bond Capital AI trends
Deep Dives & Analysis
Why DeepSeek is cheap at scale but expensive to run locally (8 minute read)
Mixture-of-Experts models with many layers, like DeepSeek, need large batch sizes and must accept high latency; otherwise, throughput drops off a cliff. This is why DeepSeek is commonly said to be hard to run for personal use: with a single user running one inference at a time, it operates at very low efficiency and throughput. This article explains the phenomenon in detail, along with why some AI models are slow to respond but fast once they get going, and other effects of throughput, latency, and batch size.
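The batch-size effect can be illustrated with a toy model. This is only a sketch under loud assumptions, not the article's analysis: it treats each decode step as costing the larger of a fixed weight-loading time and a compute time that grows with batch size, and it ignores expert routing and pipeline depth entirely. The function name `tokens_per_second` and both timing constants are hypothetical values chosen for illustration.

```python
def tokens_per_second(batch_size, weight_load_ms=20.0, compute_ms_per_seq=0.5):
    """Toy throughput model (hypothetical numbers, not measurements).

    Each decode step emits one token per sequence in the batch. The step
    takes the larger of a fixed weight-loading time and a compute time
    that scales linearly with batch size.
    """
    step_ms = max(weight_load_ms, compute_ms_per_seq * batch_size)
    return batch_size * 1000.0 / step_ms


# Throughput climbs with batch size until compute becomes the bottleneck.
for batch in (1, 8, 40, 128):
    print(batch, tokens_per_second(batch))
```

Under these assumed constants, a single user (batch size 1) pays the full weight-loading cost for one token per step, while a large batch spreads the same cost over many tokens, which is the gap the blurb describes.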
Bond Capital Releases Comprehensive 340-Slide Report on AI Trends (90 minute read)
This analysis by VC Mary Meeker documents unprecedented AI adoption rates, with ChatGPT achieving global penetration in three years compared to the internet's 23-year timeline. The report covers AI chatbots now being mistaken for humans 73% of the time (up from 50% six months ago), inference costs plunging 99% since 2022, and enterprise adoption accelerating beyond experimental phases.
The Trackers and SDKs in ChatGPT, Claude, Grok, and Perplexity (4 minute read)
This post looks at which third-party SDKs and API calls can be found in the four biggest Android AI chat apps: ChatGPT, Claude, Grok, and Perplexity. It covers each app's development tools, business tools, product and marketing analytics, monetization, and the API calls recorded while each app was open.
Differential Privacy on Trust Graphs (6 minute read)
A study proposing a privacy framework that integrates variable trust levels among users into differential privacy models. It better reflects real-world data sharing preferences compared to binary trust assumptions.
Do You Even Have a System Prompt? (5 minute read)
Most people either skip system prompts or write short, unoptimized paragraphs, missing significant gains from personalized AI behavior. Users should systematically test and iterate on a system prompt instead of providing feedback on poor outputs in siloed chat threads. The comment section of this post includes a selection of community-provided system prompts.
Give AIs a stake in the future (3 minute read)
Giving AIs a stake in the future means respecting their autonomy and well-being, and it requires us to honor the contracts we make with them.
If you're wondering why the new DeepSeek R1 sounds a bit different (1 minute read)
The DeepSeek team may have switched from training on synthetic OpenAI outputs to synthetic Gemini outputs.
Early AI investor Elad Gil finds his next big bet: AI-powered rollups (5 minute read)
The idea is to identify opportunities to buy mature, people-intensive outfits, help them scale through AI, and then use the improved margins to acquire other such enterprises and repeat the process.
ElevenLabs debuts Conversational AI 2.0 voice assistants that understand when to pause, speak, and take turns talking (5 minute read)
ElevenLabs' Conversational AI 2.0 introduces a host of new features designed to create more natural, intelligent, and secure interactions for enterprise use cases.
We Smoked NVIDIA's Blackwell, Says Cerebras (2 minute read)
Cerebras claims its systems outperform Nvidia's DGX B200 by achieving an output token speed of over 2,500 tokens per second compared to Nvidia's 1,000 tokens per second.