TLDR AI 2023-10-02

Llama 2 Long πŸ¦™, creating videos from sound 🎡, vision transformers πŸ‘“

Gretel trained an LLM on 500BN tokens of public tabular datasets to generate better synthetic data (Sponsor)

Gretel's novel tabular large language model reliably generates synthetic tabular data with zero-shot prompting. You can use it to create a new correlated dataset with the correct datatypes, based on an SQL schema or query; or to expand existing datasets by automatically adding columns and records, or filling in gaps and null values.

The LLM was trained on half a trillion tokens of public data and generates highly realistic tables while ensuring relationships between records remain consistent. It’s even aware of regional differences (company names, emails, etc.) Request early access here

πŸš€
Headlines & Launches

AI and jobs (1 minute read)

AI is great at doing tasks, but not so much doing full jobs. Sam Altman argues that AI will enable humans to focus on different work that AI can’t do.

Shopify AI for product photo creation (3 minute read)

Building an online store requires lots of high-quality product and marketing images. This is an early demo of a background replacement tool built with Stable Diffusion XL by the Shopify AI team that makes it easy to use your existing product images to make something new.

Meta Quietly Unveils Llama 2 Long (3 minute read)

Meta has released Llama 2 Long, a new AI model that outperforms GPT-3.5 Turbo and Claude 2 for long user prompts.
🧠
Research & Innovation

Transformer-VQ: Efficient Linear-Time Attention (17 minute read)

This study presents Transformer-VQ, a new design of transformer that processes attention faster thanks to unique vector-based keys and caching.

Identifying Real vs Fake Faces (2 minute read)

This study uses visual and language-based tools to improve how systems recognize real vs. fake faces.

Vision Transformers need registers (30 minute read)

One of the coolest and simplest vision papers in recent weeks. Vision Transformers use "useless" pixel values as places to store global information. This makes attention maps uninterpretable. However, if you add a simple [reg] token to the vocabulary, the model will use it and won't store information in pixel values.
πŸ‘¨β€πŸ’»
Engineering & Resources

Creating Videos from Sound (4 minute read)

Researchers have developed a method to create videos that closely match a sound they're paired with, both in overall theme and moment-by-moment details.

SapientML (GitHub Repo)

SapientML is an AutoML technology that can learn from a corpus of existing datasets and their human-written pipelines and efficiently generate a high-quality pipeline for a predictive task on a new dataset.

Local LLMs made easy (3 minute read)

With all the new models coming out, it can be somewhat challenging to find a way to pull and run them. It's also hard to remix and remake them easily. This neat little tool, Ollama, makes it easy to test the latest models.
🎁
Miscellaneous

Perhaps AI Is Modern Alchemy (5 minute read)

ChatGPT and other AI technologies, while critiqued as "unscientific," can be viewed as an "aspiring science" akin to how alchemy preceded chemistry.

Dataiku Unveils LLM Mesh and Announces LLM Mesh Launch Partners Snowflake, Pinecone, and AI21 Labs (5 minute read)

Dataiku has unveiled the LLM Mesh, addressing the critical need for an effective, scalable, and secure platform for integrating LLMs in the enterprise. With the LLM Mesh sitting between LLM service providers and end-user applications, companies can choose the most cost-effective models for their needs to ensure the safety of their data and responses.

Inside mind reading AI (24 minute video)

More startups are popping up to help us read and manipulate our mental states and help us relax, learn, and reduce pain. Neuralink, Mendi, and FocusCalm are just some of these startups. These companies will gather data from users if they are given access to their brains. Who owns this data?
⚑️
Quick Links

BulkCorrector (Product)

Never deal with ChatGPT’s character limit again, as BulkCorrector segments your text into manageable pieces for bulk corrections.

Audio AI subreddit (Subreddit)

A subreddit about AI-driven music, speech, audio production, and all other emerging AI audio technologies.

Getty Made An AI Generator Trained On Its Licensed Images (2 minute read)

Getty Images, in collaboration with Nvidia, has launched "Generative AI by Getty Images," a tool that allows users to create images using Getty's extensive photo library.
TLDR is a daily newsletter with links and TLDRs of the most interesting stories in startups πŸš€, tech πŸ“±, and programming πŸ’»!
Join 500,000 readers for one daily email