TLDR AI 2023-09-20

OpenAI’s Red Teaming Network 🚨, Anyscale Endpoints 🔚, 6T multilingual tokens 6️⃣

New analyst report compares the leading semantic layers and metrics stores (Sponsor)

Before going all-in on AI, you need data that’s AI-ready. The semantic layer has a big role to play here — providing a single view of critical business metrics (e.g. ARR, headcount) and common analytics vocabulary across various groups.

The new 2023 GigaOm Sonar Report for Semantic Layers and Metrics Stores compares the leading vendors in this space including AtScale, Microsoft, Google, and dbt. The report names AtScale as the leader in semantic layers, owing to consistently exceptional performance across several categories — user interface, data source and client connectivity, security, access controls, and native API support.

Download the full GigaOm report (free) ↗️

Headlines & Launches

OpenAI Launches A Red Teaming Network (2 minute read)

OpenAI has introduced the OpenAI Red Teaming Network, a group of experts that will assist in assessing and mitigating the risks associated with its AI models.

Bard Can Now Find Answers In Your Gmail, Docs, And Drive (2 minute read)

Google's Bard chatbot can now scan Gmail, Docs, and Drive to retrieve information and perform tasks. Google says these new integrations, called extensions, aim to save users time when searching for specific information within their documents and emails.

Anyscale optimizes open source AI deployments with Endpoints (5 minute read)

Anyscale has unveiled Endpoints, which allows developers to integrate fast, cost-efficient, and scalable LLMs into their applications using popular LLM APIs. The company also announced a new integration with NVIDIA to boost the performance and development of LLMs on Ray, Anyscale’s open-source framework for distributed machine learning.
Research & Innovation

Improving Text-to-Image Generation (13 minute read)

Turning long texts into images can be tough, especially when there are lots of items or details to show. This new method breaks down long descriptions into simpler steps and makes sure each detail fits perfectly in the picture, setting a new high standard for creating images from text.

Making Depth Estimation More Flexible (13 minute read)

Figuring out 3D depth from a single 2D image is tricky because many 3D scenes can look the same in 2D. This method separates camera details from the picture's content, making it easier to guess the depth in various situations.

6T multilingual tokens (22 minute read)

Hot on the heels of Google's new dataset comes CulturaX. A cleaned dataset of trillions of multilingual tokens will be a great boost for multilingual language models.
Engineering & Resources

DoLa: A Simple Way to Make Language Models Stick to the Facts (23 minute read)

Researchers have introduced a new method called DoLa that helps large language models (like the ones that chat with you or write articles) stop making stuff up and stick to the facts. It uses a smart way of comparing different layers inside the model to filter out false information.

Track hundreds of points in real time (GitHub Repo)

PIPS2 is a state-of-the-art point-tracking system that performs dense pixel-level tracking and is robust to basic occlusions and fast motion.

Using Language Models to Improve How Things Look Online (GitHub Repo)

Creating eye-catching designs online is important for getting users interested and helping them understand information. This new method, called LayoutNUWA, makes these designs smarter by using language models to turn coding instructions into awesome layouts.

Supercharge your app development with superhuman speech-to-text (Sponsor)

The world's most powerful speech-to-text (STT) model is here: Nova-2. Compared to alternatives, Deepgram’s Nova-2 offers next-level accuracy and blazing fast speed, to ship voice AI features faster. Learn more or sign up to start building with $200 in free credit.

Google’s AI protein folder IDs structure where none seemingly existed (4 minute read)

Google’s AlphaFold played a crucial role in determining the specific structures of two intrinsically disordered proteins that induce previously unidentified structures when they interact. The discovery sheds light on how such proteins function and brings valuable insights into complex biological questions that were previously challenging to answer.

State-of-the-art background removal in HF transformers (5 minute read)

Matting is the process of creating a black-and-white mask to segment out important parts of an image. It is usually used for foreground and background segmentation. The portrait mode on smartphones uses a method similar to ViTMatte described here.
Quick Links

ChatGPT Cut Off Date Now January 2022 (Hacker News Thread)

ChatGPT’s training cut-off date has been upgraded from September 2021 to January 2022.

TimeOS (Product)

Your personalized AI for meeting notes and scheduling.

ChartGenAI (Product)

Generate any chart you want using a single sentence.
TLDR is a daily newsletter with links and TLDRs of the most interesting stories in startups 🚀, tech 📱, and programming 💻!
Join 500,000 readers for one daily email