TLDR AI 2024-10-31

OpenAI hallucination benchmark 📚, Anthropic social bias study 📃, DeepMind Audio Generation 🔊

🚀

Headlines & Launches

Pushing the Frontiers of Audio Generation (6 minute read)

OpenAI's new hallucination benchmark (7 minute read)

Evaluating feature steering: A case study in mitigating social biases (17 minute read)

🧠

Research & Innovation

ThunderKittens 2 (17 minute read)

Realistic Motion Retargeting (2 minute read)

Better Generation with Self-Guidance Sampling (18 minute read)

👨‍💻

Engineering & Research

Speeding Up Transformers with Token Merging (GitHub Repo)

3D Reconstruction Without Pose Data (3 minute read)

A Benchmark for Evaluating Data Curation Methods (GitHub Repo)

🎁

Miscellaneous

How we saved hundreds of engineering hours by writing tests with LLMs (7 minute read)

25% of Smartphone Owners Don't Want AI as Apple Intelligence Debuts (6 minute read)

Fine-tuning LLMs to 1.58bit: extreme quantization made easy (24 minute read)

⚡️

Quick Links

Rime AI achieves 100% API uptime in 2024 with Baseten (Sponsor)

Google preps ‘Jarvis' AI agent that works in Chrome (2 minute read)

OpenAI's Whisper transcription tool has hallucination issues, researchers say (1 minute read)

Forerunner K2 humanoid robot can carry 33 lb in each dexterous hand (3 minute read)

Get the most interesting AI stories and breakthroughs delivered in a free daily email.

Join 920,000 readers for one daily email

Privacy Careers Advertise