TLDR AI 2025-06-06
Gemini 2.5 Pro upgrade 🤖, Eleven v3 🗣️, Cursor’s $9.9B valuation 💰
[RESEARCH] Zero-shot auto-labeling that's 100,000x cheaper and 5,000x faster with near-human performance (Sponsor)
Manual labeling has long been the most burdensome task in computer vision development.
Voxel51's latest ML research dives into the parameters, confidence thresholds, and configurations that unlock auto-labeling performance. These benchmarks show that zero-shot labeling with foundation models like YOLO-World and Grounding DINO achieves 95% of human performance, at 100,000× lower cost and 5,000× faster speed.
The cost difference: a human annotation service labeling 1.2 million objects would take ~2,470 hours and cost $45,725, versus 27 minutes and $0.42 with Voxel51's Verified Auto Labeling, with similar or better model mAP scores.
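As a quick sanity check, the headline multipliers can be recomputed from the figures quoted above. This is only a back-of-the-envelope sketch: the variable names are illustrative, and the inputs are the numbers from the blurb, not independently measured values.

```python
# Figures as quoted in the sponsor text (not independently verified).
human_hours = 2470        # approximate hours for a human annotation service
human_cost_usd = 45725    # quoted cost of human annotation
auto_minutes = 27         # quoted auto-labeling time
auto_cost_usd = 0.42      # quoted auto-labeling cost

# Convert hours to minutes so both durations share a unit.
speedup = human_hours * 60 / auto_minutes
cost_ratio = human_cost_usd / auto_cost_usd

print(f"speedup: about {speedup:,.0f}x")
print(f"cost ratio: about {cost_ratio:,.0f}x")
```

Both ratios land slightly above the rounded 5,000× and 100,000× figures, so the headline numbers are consistent with the quoted inputs.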
>> Read the research
>> Join the webinar for expert analysis
OpenAI Threat Intelligence Report: June 2025 (35 minute read)
LLMs aren't giving bad actors fundamentally new capabilities. OpenAI has published 10 examples of models accelerating existing hacking, fraud, and misinformation operations. This includes North Korean actors using GPT to scale fraudulent IT worker schemes, Russian groups developing sophisticated malware, and Cambodian scammers generating multilingual “task scams” promising victims $500/day for liking TikTok posts.
Latest Advancements in Search and Recommendation Systems (4 hour video)
This 4-hour session, presented during the AI Engineer World's Fair 2025, covers recent innovations in search and recommendation systems.
👨‍💻 Engineering & Research
LLM-Driven Data Annotation (14 minute read)
To address label uncertainty in LLM-based annotation, this paper introduces a method that captures multiple possible labels and uses a teacher-student framework called CanDist to distill them into a single output.
Apple Research Finds Critical Limitations in Reasoning Models (20 minute read)
When OpenAI's o3, Claude, and DeepSeek-R1 were tested in puzzle environments, their performance collapsed beyond certain complexity thresholds, despite generating detailed "thinking" processes. The models exhibit a counterintuitive scaling limit where their reasoning effort actually decreases as problems become more complex, and they fail to improve even when given explicit solution algorithms.
Claude Composer CLI (GitHub Repo)
Claude Composer CLI is a tool that adds automation, UX improvements, and configuration options to Claude Code. It reduces interruptions while giving users flexible control over how Claude is configured. Users can choose which permission dialogs are accepted automatically, and system notifications keep them informed.
Introducing Modify Video (3 minute read)
Modify Video allows professionals to reimagine environments, lighting, and textures in videos without altering motion or performance. It offers tools for restyling, retexturing, and editing specific elements like wardrobe and props. Outperforming competitors, Modify Video maintains motion consistency, offering multiple output variants and using advanced performance signals for high-fidelity creative control.
Portraits: personalized AI coaching built alongside real experts (2 minute read)
Google Labs launched Portraits, an AI coaching tool featuring experts like Kim Scott, to provide AI-driven guidance. The tool uses Gemini's capabilities to simulate expert advice through interactive avatars.