TLDR AI 2024-10-31
OpenAI hallucination benchmark π, Anthropic social bias study π, DeepMind Audio Generation π
π§
Research & Innovation
ThunderKittens 2 (17 minute read)
Thunder Kittens is a framework for writing extremely performant GPU Kernels. It is built on the idea that GPUs actually want to operate on small 16x16 tiles of data. In turn, the useability is quite high, and 40% faster kernels only take a few hundred lines of code.
Realistic Motion Retargeting (2 minute read)
MeshRet has introduced a novel approach for improving motion retargeting for 3D characters that focuses on preserving body geometry interactions from the start.
Better Generation with Self-Guidance Sampling (18 minute read)
Researchers have enhanced Masked Generative Models (MGMs) with a new self-guidance sampling method, improving their image generation quality while maintaining diversity.
How we saved hundreds of engineering hours by writing tests with LLMs (7 minute read)
Assembled uses LLMs to accelerate and improve software testing, enabling test generation in minutes instead of hours. This approach increases engineering productivity, saving time and shifting focus to feature development. LLMs generate comprehensive and accurate tests that maintain code quality and development velocity.
25% of Smartphone Owners Don't Want AI as Apple Intelligence Debuts (6 minute read)
A CNET survey revealed that only 18% of smartphone users are motivated by AI features to upgrade their devices, with privacy and cost being significant concerns. Major manufacturers like Apple, Google, and Samsung are integrating more AI capabilities in their phones, yet many users prioritize battery life and storage over AI functions. AI subscriptions are set to become common, but nearly half of users are unwilling to pay for these features.
Fine-tuning LLMs to 1.58bit: extreme quantization made easy (24 minute read)
BitNet, developed by Microsoft Research, introduces a transformer architecture that reduces LLM computational and memory requirements by using ternary precision (-1, 0, 1) equating to 1.58 bits per parameter. Models are required to be trained from scratch. BitNet can also fine-tune existing models to this low-precision format, maintaining strong performance on downstream tasks. This approach significantly reduces energy consumption and improves inference speed using specialized kernels for efficient matrix multiplication.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email