TLDR AI 2023-10-03

Humane Ai Pin 🧷, Rewind Pendant 📿, efficient video model training 📹

🚀
Headlines & Launches

Workers AI: serverless GPU-powered inference on Cloudflare’s global network (12 minute read)

Cloudflare unveils "Workers AI," a serverless AI inference platform on its global GPU network. Workers AI features popular open-source models and collaborates with Hugging Face.

Humane reveals first AI device - the Ai Pin (2 minute read)

Humane Inc unveiled the "Humane Ai Pin", a screenless wearable designed for AI integration, at Paris Fashion Week in collaboration with Coperni. The device offers AI-powered optical recognition, laser-projected displays, and prioritizes user privacy without requiring smartphone pairing.

Rewind Pendant (2 minute read)

Rewind Pendant is a wearable that captures what you say and hear in the real world and then transcribes, encrypts, and stores it entirely locally on your phone.
🧠
Research & Innovation

🤳After 15 years, is this the first product to disrupt the smartphone? (Sponsor)

Mode Mobile’s paradigm-changing EarnPhone has helped consumers earn $150M+. Now, at $50M+ in revenue and growing rapidly, all eyes are on Mode’s Pre-IPO Offering.

Want in? Invest at $0.16/Share*. Do it today to claim up to 90% bonus shares!

Disclosure: Please read the offering circular at invest.modemobile.com. This is a paid advertisement for Mode Mobile’s Regulation CF Offering.

StreamingLLM (Github Repo)

Language models are limited by their context length. The context length is usually limited by compute hardware and clever algorithmic updates. This is an algorithmic update that streams the tokens through the attention mechanism to allow theoretically infinite context window size. Usually, these claims fall over at scale, but this one seems robust as it can work on existing pre-trained models without fine-tuning. Will it make the forgotten middle issue worse though?

Improving Optical Flow with Gaussian Attention (GitHub Repo)

Optical flow helps figure out how things move in images. This study introduces new techniques using Gaussian Attention to focus on finer details and match them better, resulting in a model named GAFlow.
👨‍💻
Engineering & Resources

Efficient Video Model Training (16 minute read)

Training top-notch video models usually need huge resources, often beyond what academia can access. Researchers have found a way to train these models using just one machine with eight standard GPUs in a day.

Kosmos-2.5 (25 minute read)

The UniLM group at Microsoft has done some great work in the past few years around natural language. With the suite of Kosmos models, they’ve recently moved into images. This specific instantiation is for reading text-intensive documents from an image and generating the text or markdown for that document. It is similar to the recent Meta work for academic OCR.

Teaching Image-Recognizing Model Through Conversation (21 minute read)

Researchers have developed a new way to improve vision-language models, which are models that understand both text and images, using just natural language prompts.
🎁
Miscellaneous

8 Ways SaaS Companies Can Use AI To Get Ahead (12 minute read)

This blog post features several AI SaaS use cases for various departments, highlighting ways the technology can be used to gain a competitive edge.

Open-Source Evaluation Suite for Large Models (3 minute read)

OpenCompass is a free tool designed to test large models quickly and efficiently.

AI, Hardware, and Virtual Reality (11 minute read)

AI, combined with hardware, is going to enable groundbreaking forms of virtual and augmented reality. Here’s a breakdown of the latest news in each of those domains.
⚡️
Quick Links

How Much Can Artists Make From Generative AI? (5 minute read)

Despite some companies proposing "creators' funds," the specifics of how and how much artists will be paid remain unclear, with many details still under development or undisclosed.

Vespio (Product)

Vespio provides AI-powered sentiment analysis so your sales teams never waste leads.

Bestever (Product)

Create brilliant marketing visuals in minutes using AI.
TLDR is a daily newsletter with links and TLDRs of the most interesting stories in startups 🚀, tech 📱, and programming 💻!
Join 500,000 readers for one daily email