TLDR AI 2024-04-18

Mistral $5B valuation šŸ’°, Stable Diffusion API šŸ’», Stable Audio Paper šŸ“œ

šŸš€
Headlines & Launches

Mistral seeking funding at $5B valuation (1 minute read)

There have been reports of the open source pioneer Mistral seeking several hundred million dollars of funding to train more models.

Stability AI Makes The Stable Diffusion API Available (1 minute read)

Stability AI has made its latest text-to-image AI model, Stable Diffusion 3, available to some developers via API and its new content creation platform called Stable Assistant Beta. The model is still in preview and not yet available to the general public.

European car manufacturer will pilot Sanctuary AI's humanoid robot (2 minute read)

Sanctuary AI will deliver its humanoid robot, Phoenix, to a Magna auto manufacturing facility, following Magna's 2021 investment in the company. Phoenix is notable for its walking capability and dexterous hands aimed at enhancing manufacturing efficiency. The specifics of the pilot, including the number of robots and deployment details, remain undisclosed.
šŸ§ 
Research & Innovation

How to improve 24 points in MMLU with Olmo (12 minute read)

The latest completely open 7B language model had an upgrade with a much-improved reasoning score on MMLU. Its development team found that changing the data mixture had a substantial effect on performance. They have provided the exact statistics of new data sources and percentages that led to this improvement.

Stable Audio Paper (22 minute read)

This paper describes Stability AI's diffusion transformer model for audio synthesis.

Mistral 8x22B Report and Instruction Model (4 minute read)

The new base Mistral report was released today. Additionally, their own instruction tuned model was released under a similar permissive license. They report a strong MMLU and HumanEval performance along with incredible multilingual English, French, Italian, German, and Spanish performance, function calling, and 64k native context length.
šŸ‘Øā€šŸ’»
Engineering & Resources

The Challenges of Getting ML Models to Production (37 minute read)

A deep dive interview from industry experts on the challenges and solutions of getting AI models into production and where MLOps differs from traditional engineering. They discuss why so few ML projects make it to production and how to focus as an organization to actually launch.

Effort (5 minute read)

The Effort library allows for real-time adjustment of the number of calculations performed during inference of an LLM model, resulting in significant speed improvements while retaining most of the quality. Despite some implementation overhead, initial results suggest that the Effort library has the potential to significantly improve LLM inference speed while maintaining quality. The author encourages others to test the 0.0.1B version and provide feedback to further improve the library.

Image Captioning with Diffusion Models (GitHub Repo)

This study revisits using diffusion models for image-to-text generation. It introduces the LaDiC architecture, which enhances diffusion models' performance in image captioning tasks.
šŸŽ
Miscellaneous

The Shifting Dynamics And Meta-Moats Of AI (14 minute read)

Building a successful AI company requires navigating complex short, mid, and long-term dynamics and maintaining elite speed and execution, owning more of the stack, gathering unique data, and leveraging synthetic data generation. As the AI industry matures, companies must adapt to shifting talent dynamics and understand the machine they are building and the axes of competition they are predicated on to create lasting moats and stand out amidst the noise.

Driving with Natural Language (7 minute read)

Wayve's Lingo-2 is a VLM train in simulation for autonomous driving tasks. It performs driving actions based on video input. Lingo-2 includes descriptions of reasoning for the driving actions it takes based on the scene.

Is Attention All You Need? (29 minute read)

Researchers are developing alternative architectures to address the limitations of Transformers in long-context learning, generation, and inference speed, showing competitive quality at smaller scales but uncertain scalability. The rapid progress in this subfield suggests that the Pareto frontier will continue to expand, enabling longer context modeling and greater inference throughput, ultimately increasing the number of use cases for AI.
āš”ļø
Quick Links

Verify weights DDP Pytorch (GitHub Gist)

Verifying training when sharding across devices can be challenging - this handy snippet verifies that things have been done and updated properly.

Luminal (GitHub Repo)

Luminal is a deep learning library that uses composable compilers to achieve high performance.

Google Maps will use AI to help you find out-of-the-way EV chargers (2 minute read)

Google Maps will use AI to summarize directions to EV chargers as well as provide reliability and wait times.
The most important AI, ML, and data science news in a free daily email.
Join 500,000 readers for