TLDR AI 2025-02-19
Humane’s AI Pin is dead ⚰️, Meta LlamaCon 🦙, Mira Murati’s Thinking Machines Lab 🧠
Humane's AI Pin is dead, as HP buys startup's assets for $116M (3 minute read)
HP acquired most of Humane's assets for $116M, leading to the discontinuation of Humane's AI Pins. The AI Pins will lose functionality by February 28. Customers are advised to transfer data. Humane's team will form HP's new AI innovation lab, HP IQ.
Mira announces Thinking Machine Labs (4 minute read)
The former CTO of OpenAI, along with many extremely talented scientists and engineers, have joined to make another AI company. The goals are somewhat vague, but it seems to be a product and foundation model company, with a focus on infrastructure.
Meta is Launching LlamaCon (1 minute read)
Meta is launching LlamaCon, an open-source AI developer conference, on April 29. The event will showcase advancements in the Llama AI model ecosystem, followed by Meta Connect in September for XR and metaverse developments.
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention (35 minute read)
DeepSeek has entered the Attention Alternative arena with this novel algorithmic method to speed up quadratic Attention. They get as much as an 11x speed up with no loss in overall performance.
SWE Lancer (15 minute read)
A great benchmark of over 1,400 freelance software engineering tasks from Upwork, valued at $1 million USD total in real-world payouts. SWE-Lancer encompasses both independent engineering tasks - ranging from $50 bug fixes to $32,000 feature implementations - and managerial tasks, where models choose between technical implementation proposals. Top models earned $400k.
Geometric Folding in ReLU Networks (22 minute read)
Researchers present a quantitative analysis of how ReLU neural networks fold input space, revealing self-similarity patterns. They introduce a novel metric to study these transformations and provide empirical results on benchmarks like CantorNet and MNIST.
👨💻
Engineering & Research
What happens when your ad lands in TLDR? (Sponsor)
Over 5 million tech pros start their day scanning TLDR for the latest news—so when an ad appears, it gets noticed.
That's why companies run TLDR ads to reach founders, engineers, and decision-makers without the noise of traditional ad platforms. Plus TLDR provides free ad copywriting and performance insights. Learn more.
Task Grouping for Better Multi-Task Learning (19 minute read)
A new approach to multi-task learning mitigates negative transfer by dynamically grouping tasks and updating them sequentially during training. The method, based on proximal inter-task affinity, significantly improves performance over existing multi-task optimization techniques.
LLM-Guided Reinforcement Learning (6 minute read)
CAMEL improves reinforcement learning efficiency by integrating LLM-generated suboptimal policies with dynamic action masking.
R1 1776 (Hugging Face Hub)
Perplexity has post-trained R1 to remove Chinese censorship. They do so in a way that doesn't harm underlying reasoning. It is Perplexity's first open weights release.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email