TLDR AI 2025-04-04
DeepMind AGI Safety 🦺, OpenAI Nonprofit Guidance Commission 👥, Hate Speech Benchmark 📚
DeepMind's Approach to AGI Safety (11 minute read)
Google DeepMind outlines its perspective on building safe and secure artificial general intelligence, emphasizing oversight and technical safeguards as AGI capabilities progress.
OpenAI Nonprofit Guidance Commission (3 minute read)
OpenAI is forming a new expert commission to shape how its philanthropic arm supports communities using AI, aiming to align AI innovation with real-world nonprofit needs.
OpenAI just made its first cybersecurity investment (3 minute read)
OpenAI invested in Adaptive Security, a startup using AI to simulate and train employees against social engineering hacks. Adaptive Security secured $43M in Series A funding, with plans to bolster its platform amid growing AI threats. Co-founded by veteran entrepreneur Brian Long, the startup has quickly amassed over 100 clients since launching in 2023.
Enhanced LoRA-based Fine Tuning (18 minute read)
MetaLoRA introduces dynamic parameter generation using meta-learning principles, enhancing the flexibility and task-awareness of LoRA-based fine-tuning strategies.
Backdoor Attacks in CLIP (19 minute read)
CLIP models are highly vulnerable to poisoning backdoor attacks, achieving nearly 100% attack success with minimal poisoned data. An efficient detection method is to use local outlier detection to uncover unintentional backdoors in existing datasets.
Articulated Kinematics Distillation from Video Diffusion Models (18 minute read)
This work introduces Articulated Kinematics Distillation (AKD), a framework that leverages skeleton-based animation and generative diffusion models to produce high-fidelity, physically plausible character motions with reduced complexity. It ensures structural consistency and outperforms existing methods in 3D coherence and expressive motion quality by using Score Distillation Sampling to guide joint-level control.
👨💻
Engineering & Research
Flash Attention reimagined with Kvax (Sponsor)
HateBench for Evaluating Hate Speech (6 minute read)
HateBench provides a framework for evaluating hate speech detection models on LLM-generated content along with a manually annotated dataset and code for analyzing adversarial and stealthy hate campaigns.
Large Small Net (GitHub Repo)
A new family of lightweight vision models inspired by the dynamic heteroscale capability of the human visual system, i.e., "See Large, Focus Small". LSNet achieves state-of-the-art performance and efficiency trade-offs across various vision tasks. It introduces a new type of convolution kernel.
Pplx Cuda Kernels (GitHub Repo)
Perplexity has released some of its MoE kernels, which outperform DeepSeek at scale while being slightly more flexible and less opinionated about the MoE architecture.
Hugging Face's AI Agents Course (14 minute read)
Hugging Face released an AI agents course today. This free course will take you on a journey, from beginner to expert, in understanding, using, and building AI agents.
Zonos TTS (8 minute read)
A compelling Apache 2.0 model for speech generation and voice cloning. It has multi language support and expressive real time generation.
Google is shipping Gemini models faster than its AI safety reports (4 minute read)
Google has launched an AI reasoning model, Gemini 2.5 Pro. The model leads in coding and math capabilities, but Google hasn't released safety reports yet. Google plans to publish these reports after collecting feedback from experimental releases, though this approach raises transparency concerns. Despite pledges for transparency, Google seems to prioritize speed in model deployments, diverging from industry norms for responsible AI practices.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email