TLDR AI 2024-01-23
Stability 1.6B LLM π, Tesla FSD v12 π, self-rewarding language models π
Stability AI Unveils Smaller, More Efficient 1.6B Language Model (3 minute read)
Stability AI's latest release, Stable LM 2 1.6B, is a compact yet powerful language model that supports seven languages. It is designed to outperform other models under 2 billion parameters, including its own previous 3B model. With its smaller size, it aims to lower barriers for developers, offering various versions including a unique "half-cooked" model for more customization flexibility.
Tesla finally releases FSD v12 (2 minute read)
Tesla has started rolling out its Full Self-Driving Beta v12 update, which shifts vehicle control from explicit C++ code to an AI-powered single neural network. The release marks a significant move towards fulfilling the company's self-driving ambitions, although the software is still labeled as beta. Skepticism will remain until tangible improvements in autonomous driving capabilities are observed as the software is rolled out to cautious select beta testers.
π§
Research & Innovation
Face Mixer Diffusion (18 minute read)
This work shows how you can use diffusion to clone faces in images. There are many ways to do this with deep fakes, but diffusion is interesting due to its ability to inpaint other pieces of the image as needed.
DeepDive: LoRA from Scratch (22 minute read)
LoRAs are Low Rank Adapters that allow you to fine-tune only a small amount of parameters in a language model. They can dramatically improve and alter the performance of these models.
GroupAnything (12 minute read)
Grouping in 3D is a challenging and ambiguous task because you donβt know what granularity youβll need for the grouping operation (e.g., keys on a keyboard vs the entire keyboard itself). This work uses multi-level masks and makes great progress on the problem of semantic 3D grouping.
My AI Timelines Have Sped Up (Again) (13 minute read)
This author revised their AI timeline predictions based on advancements in scaling up models. They are now estimating a 10% chance of achieving Artificial General Intelligence by 2028 and a 50% chance by 2045. These changes are attributed to the effectiveness of large language models and the realization that many intelligent capabilities may emerge at scale.
Interactive Control in Text Generation (7 minute read)
Researchers introduce the "Prompt Highlighter," a method that revolutionizes text generation in multi-modal language models by allowing users to highlight parts of prompts.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email