TLDR AI 2024-04-08

Cohere Command R+ ➕, xAI to raise $3B 💰, SWE-agent 💻

🚀
Headlines & Launches

xAI Looking To Raise $3B (1 minute read)

Elon Musk's xAI is in talks with investors to raise $3B in a funding round that would value the artificial intelligence startup at $18B.

AMD to open source MES firmware (6 minute read)

AMD has agreed to open source the firmware for its Radeon GPUs. This means the community can quickly improve AI frameworks and potentially AMD adoption.

Introducing Command R+: A Scalable LLM Built For Business (4 minute read)

Cohere has introduced Command R+, a powerful, scalable LLM designed for enterprise use cases, featuring advanced retrieval augmented generation with citation, multilingual coverage in 10 key languages, and tool use capabilities.
🧠
Research & Innovation

Protecting Images from Unauthorized Segmentation (20 minute read)

"Anything Unsegmentable" is a new approach designed to shield digital images from being segmented by powerful AI models, addressing potential copyright and privacy issues.

Qwen1.5-32B (18 minute read)

The Qwen team has trained and released a 32B parameter model that achieves strong performance. It can be fit in more modest memory systems.

A Benchmark for Detecting LLM Errors (18 minute read)

Researchers have launched ReaLMistake, a benchmark aimed at systematically detecting errors in large language model responses.
👨‍💻
Engineering & Resources

Schedule Free Optimization (GitHub Repo)

Researchers from Meta have been teasing a new optimizer on X. They have now released the code along with various integrations. The optimizer has no LR schedule, which means you don’t need to know the full number of training steps beforehand. It has been shown empirically to work on a wide variety of problems including language models.

SWE-agent (3 minute read)

SWE-agent turns LLMs (e.g. GPT-4) into software engineering agents that can fix bugs and issues in real GitHub repositories. SWE-agent sets the state-of-the-art performance on the full SWE-bench benchmark, resolving 12.29% of issues.

Representation Fine-Tuning (GitHub Repo)

ReFT is a new parameter-efficient method for fine-tuning language models. It is substantially cheaper than even PeFT while achieving strong performance.
🎁
Miscellaneous

Four Takeaways On The Race To Amass Data For AI (5 minute read)

The development of artificial intelligence heavily relies on vast amounts of data, which is being rapidly consumed by tech companies faster than it is being produced, leading to predictions that high-quality digital data may be exhausted by 2026. In response, companies like OpenAI, Google, and Meta are exploring new methods to obtain more data, including using YouTube video transcripts, revising privacy policies, considering the purchase of major publishers, and investigating the use of "synthetic" data generated by AI models, despite the risk of compounding errors.

Speed Tests for Llama 2, Stable Diffusion (6 minute read)

MLPerf has updated its inferencing benchmarks to include large language models like Llama 2 70B and Stable Diffusion XL, reflecting the industry's shift to massive generative AI. In the latest tests, Nvidia's systems, particularly those equipped with the H200 processor, outperformed competitors from Intel and Qualcomm in both speed and efficiency. The benchmarks highlight the importance of high-bandwidth memory in handling large AI models, with Intel's Gaudi 2 and Qualcomm's Cloud AI 100 Ultra also showcasing significant performances in the generative AI space.

Rabbit partners with ElevenLabs to power voice commands on its device (2 minute read)

Rabbit partners with ElevenLabs to incorporate voice command technology into its upcoming r1 devices, enhancing human-device interaction with low latency models for a more natural experience. The first batch of r1 devices, which feature functionalities like chatbot interaction and bi-directional translation, is set to ship by March 31. ElevenLabs has recently raised $80 million, despite facing challenges with misuse of its voice cloning technology.
⚡️
Quick Links

DALL-E now lets you edit images in ChatGPT (2 minute read)

OpenAI's DALL-E has been integrated with ChatGPT - users can now edit DALL-E images in ChatGPT across web, iOS, and Android.

The Top 100 AI For Work (20 minute read)

A curated list of the top 100 AI tools to boost your work productivity.

LLocalSearch (GitHub Repo)

A completely locally running search engine using LLM Agents.
The most important AI, ML, and data science news in a free daily email.
Join 500,000 readers for