TLDR AI 2024-04-01

Grok 1.5 🤖, Microsoft & OpenAI $100B supercomputer 💻, US vs. China AI talent 🧑‍💻

Headlines & Launches

Grok 1.5 (3 minute read)

xAI announced its next model, with 128k context length and improved reasoning capabilities. It excels at retrieval and programming.

Microsoft And OpenAI Planning $100B Supercomputer (1 minute read)

Microsoft and OpenAI are reportedly planning a joint data center project that could reach $100 billion in cost, culminating in the launch of a massive AI supercomputer named “Stargate” by 2028.

In One Key A.I. Metric, China Pulls Ahead of the U.S.: Talent (5 minute read)

China is now producing almost half of the world’s top AI researchers, surpassing the US, with 18% coming from US undergrad institutions. Despite pioneering AI breakthroughs, the US relies heavily on Chinese-born researchers, with Chinese talent making up 38% of top US-based AI professionals. The trend of Chinese researchers staying in China rather than moving to the US could impact global AI leadership dynamics.
Research & Innovation

Building evals for business problems (25 minute read)

Data, evals, and compute are essential for strong performing AI. This is especially true in enterprises. Evals may be one key moat that allows organizations to improve their AI products.

Captioning Outdoor Scenes (29 minute read)

Researchers have introduced a new approach to understanding outdoor environments, overcoming obstacles like the varying conditions and lack of data that have previously limited advancements.

Making Lane-Changes Safer in Busy Traffic (19 minute read)

This paper introduces a control framework that combines AI and predictive models to facilitate smooth and safe lane changes in dense traffic, emphasizing cooperation with nearby drivers.
Engineering & Resources

VoiceCraft cloning and TTS (GitHub Repo)

Zero shot voice cloning and speech generation capabilities in a powerful 700m parameter model.

Testing the Coding Abilities of LLMs with a New Code Benchmark (GitHub Repo)

EvoEval is a new benchmark suite that tests the coding abilities of Large Language Models more rigorously than ever before.

Interrupting Cow (GitHub Repo)

In natural conversation, people sometimes interrupt or talk over one another. This can be key to quickly coming to a consensus. This AI assistant predicts tokens while the person is talking and if it predicts enough in a row it will interrupt.

“The king is dead” — Claude 3 surpasses GPT-4 on Chatbot Arena for the first time (4 minute read)

Anthropic's Claude 3 Opus has surpassed OpenAI's GPT-4 for the first time on Chatbot Arena. Chatbot Arena is a leaderboard run by the Large Model Systems Organization, a research organization dedicated to open models. Its site allows visitors to rate outputs from various models, enabling it to calculate the best models in aggregate. While Claude's rise is notable, GPT-4 is now over a year old.

How Autonomous Racing Is Pushing Self-Driving Cars Forward (4 minute read)

Autonomous racing is advancing AI and machine learning in high-stress conditions, with competitions like the Indy Autonomous Challenge accelerating innovation in vehicle safety. Researchers and students use platforms like F1tenth for algorithm development, pushing the limits of autonomous vehicle capabilities on actual racetracks. These high-speed challenges contribute to a better understanding of machine perception, decision-making, and control systems for real-world traffic applications.

Qwen MoE (8 minute read)

Qwen MoE is equivalent in performance to a strong 7B model with 1⁄3 of the activated parameters.
Quick Links

Does AI Need A “Body” To Become Truly Intelligent? (5 minute read)

The embodiment hypothesis argues true intelligence necessitates physical interaction, prompting advancements in AI through simulations and real-world testing, despite challenges like the "sim-to-real gap," leading to the cautious deployment of AI robots in industries.

Microsoft Copilot AI Will Soon Run Locally On PCs (2 minute read)

Microsoft's Copilot AI will soon run locally on PCs, requiring future AI PCs to have built-in neural processing units capable of over 40 TOPS.

Airtable AI (Product)

Harness the power of AI and incorporate it directly into your workflows.
The most important AI, ML, and data science news in a free daily email.
Join 500,000 readers for