TLDR AI 2023-08-14
Andrew Ng on LLM world models π, robotics startup Serve to go public π, MosaicML secure code evaluation π
Supabase Studio 3.0 (8 minute read)
Supabase Studio 3.0 is here, with some huge new features, including a brand new Supabase AI integrated right into the SQL Editor, Schema Diagrams, and new Wrappers.
Andrew Ng on LLM world models (3 minute read)
Recent studies suggest LLMs like Othello-GPT build world models, implying that they understand the world to an extent. The network discerned the underlying 8x8 board structure and game rules by training on game moves without explicit game rules. This challenges traditional notions, like John Searle's Chinese Room Argument, about machine understanding.
Uber, Nvidia-backed delivery robot startup Serve Robotics to go public (3 minute read)
Serve Robotics, the autonomous sidewalk delivery robot startup that spun out of Uberβs acquisition of Postmates, is going public via a reverse merger with a blank-check company.
π§
Research & Innovation
SDXL ControlNet (7 minute read)
One of the best ways to control image generation is to use edges to condition the final output. The best open source model, SDXL, has a controlnet variation trained and ready to use.
Secure system for evaluating code models (12 minute read)
One hard challenge when working with language models that write code is evaluating the quality of the generated code. This is difficult because running arbitrary code is dangerous for a system. The code could do anything, including deleting everything. MosaicML offers a secure environment for evaluating generated code.
Follow Anything: A Robot That Can Track and Follow Anything (12 minute read)
A robotic system called "Follow Anything" can spot, trace, and follow any object in real time, even if it has never seen it before. This smart robot can recognize objects from just a text description, a picture, or even a simple click - all powered by a regular laptop!
π¨βπ»
Engineering & Research
ChainML Releases Council (GitHub Repo)
ChainML recently launched Council β an open-source AI agent platform that makes it easy to rapidly develop and deploy customized generative AI applications using collaborative AI agents. Council integrates with a wide variety of LLMs, such as GPT-4, Llama 2, and Claude 2, to unlock generative AI for business applications, enabling use cases that were previously impossible through AI agents with sophisticated control flows.
World class text-to-3D now open sourced (GitHub Repo)
Mirage is a platform for generating 3D scenes. It is used by game developers to speed up the process. The team has been working internally on improving a variety of 3D generation methods. This open source push is designed to leverage the community to improve its models.
Open Source fork of HF text-generation-interface (GitHub Repo)
Hugging Face recently removed commercial license-ability from one of its previously open packages. The package has now been forked by Preemo and will continue to be developed as an open source package.
Wizard Math (6 minute read)
The WizardLM group from Microsoft has released the worldβs best open source math model. Itβs really good at reasoning through multi-step problems. The model hallucinates a bit, which is really fun for math, and offers explanations. I think this may be one of the best models for education for those interested in learning mathematics.
Do Machine Learning Models Memorize Or Generalize? (17 minute read)
This article examines the training dynamics of a tiny model and reverse engineers the solution it finds. It provides an illustration of the exciting emerging field of mechanistic interpretability.
Zooming in on images before classifying (8 minute read)
If you zoom in on salient pieces of an image before running classification, you can get 98% on ImageNet. Fascinating and fairly easy implementation for real world computer vision systems.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email