TLDR AI 2024-04-04

Stable Audio 2.0 🎵, run LLMs locally 💻, boosting Transformer efficiency 🚀

🚀
Headlines & Launches

Stable Audio 2.0 (4 minute read)

Stability AI has announced the next generation of its music generation model. Trained on properly licensed music, the model can generate up to 3 minutes of high-quality music. It also supports audio-to-audio generation.

Scientists create AI models that can talk to each other and pass on skills with limited human input (4 minute read)

Researchers have developed an AI network where one AI can teach another to perform tasks using natural language processing, a capability not previously demonstrated. The system uses a model called S-Bert that allows AI to perform tasks given via instructions and then communicate that knowledge to another AI. This breakthrough has potential applications in robotics and could further understanding of human cognitive functions.

Opera Allows Users To Download And Use LLMs Locally (1 minute read)

Opera has launched a new feature that allows users to download and run large language models locally on their computers, with over 150 models from more than 50 families available.
🧠
Research & Innovation

Boosting Transformer Efficiency (16 minute read)

Researchers have developed DiJiang, a new approach that transforms existing Transformers into leaner, faster models without the heavy burden of retraining.

Autonomous Driving with World-Centric Diffusion Transformers (8 minute read)

This study introduces a new method for creating driving paths for autonomous vehicles that combines diffusion models and transformers in a system called "World-Centric Diffusion Transformer" (WcDT).

RealKIE: Five Novel Datasets for Enterprise Key Information Extraction (24 minute read)

Extracting information from datasets is critical for enterprise AI applications. These five new benchmark datasets can be used to measure general algorithmic performance for RAG applications.
👨‍💻
Engineering & Resources

3D Detection for Large Objects (GitHub Repo)

SeaBird is a new 3D detection method that excels at recognizing large objects where traditional monocular detectors falter.

Can AI Identify Unsolvable Problems? (GitHub Repo)

This project introduces the concept of Unsolvable Problem Detection (UPD) in Vision Language Models, a new test to see if AI can identify when a problem can't be solved.

Action Spotting in Soccer (GitHub Repo)

ASTRA is a Transformer-based model that can identify key moments during soccer matches and overcome challenges such as action localization and data imbalance.
🎁
Miscellaneous

Worldcoin Foundation open sources core components of the Orb’s software (1 minute read)

Tools for Humanity has developed a secure and powerful computing environment for the Worldcoin Orb that utilizes NVIDIA Jetson for processing and Arm Cortex-M4 microcontrollers for real-time functions. The Orb runs Rust applications and uses NVIDIA's TensorRT for neural network inference. It is powered by a custom security-focused GNU/Linux distribution called Orb OS. The system integrates a secure element for cryptography and supports trusted execution environments for backend authentication.

When Will The GenAI Bubble Burst? (5 minute read)

Generative AI may turn out to be a disappointment. There are concerns about the technology's lack of profitability, security issues, and the inherent problem of hallucinations in language models. Unless a groundbreaking model like GPT-5 is released by the end of 2024, addressing key issues and offering a killer application, the hype surrounding Generative AI may start to dissipate.

Inside the shadowy global battle to tame the world’s most dangerous technology (7 minute read)

This article delves into the complex international efforts to regulate AI, regarded as one of the most potent and risky technologies in modern times.
⚡️
Quick Links

Bluedot 1.1 (Product)

Generate follow-up emails from Google Meet.

Google Might Make SGE A Paid Feature (1 minute read)

Google is reportedly considering making the Search Generative Experience (SGE), which has been available through Search Labs for nearly a year, a paid feature as part of its Google One AI Premium subscription.

AI Infrastructure Explained (6 minute read)

AI infrastructure, underpinned by GPUs, specialized software, and cloud services, is essential for the deployment and scaling of AI technologies.