TLDR AI 2024-05-06

OpenAI may launch search šŸ”, Lamini AI raises $25m šŸ’°, generate synthetic instruction data šŸ“š

šŸš€
Headlines & Launches

From Baby Talk to Baby A.I. (6 minute read)

Dr. Brenden Lake of NYU aims to bridge human cognitive studies and AI by recording his daughter's POV to develop a language model, challenging the current AI approach that relies heavily on large datasets. This work explores how AI can learn from the sensory input akin to a toddler's experience. The research could lead to AI capable of conceptual mapping through association, similar to human language acquisition.

Lamini AI $25m Series A (4 minute read)

Lamini, an Enterprise AI platform, makes it possible for software teams within enterprises to develop new LLM capabilities that reduce hallucinations on proprietary data, run their LLMs securely from cloud VPCs to on-premise, and scale their infrastructure with model evaluations that prioritize ROI and business outcomes over hype. It raised a $25M Series A led by Amplify Partners.

Microsoft-backed OpenAI may launch search, taking on Google's 'biggest product' (2 minute read)

OpenAI is reportedly planning a major announcement on May 9, possibly unveiling a new search engine that could challenge Google. This engine may use Microsoft Bing's infrastructure, aligning with OpenAI CEO Sam Altman's vision of revolutionizing information discovery beyond the current Google model. If true, this could significantly disrupt the search engine market and how users find information online.
šŸ§ 
Research & Innovation

Molecular Simulations with Hybrid Neural Networks (26 minute read)

FeNNol is a cutting-edge library that simplifies the creation and deployment of hybrid neural network potentials for molecular simulations.

Context-Dependent Concept Understanding (16 minute read)

Spider is a novel unified model designed to enhance the understanding of context-dependent (CD) concepts such as camouflaged objects and medical lesions, which depend heavily on visual context.

Single and multi image instruction tuning (14 minute read)

A novel dataset and trained visual language model that enables higher quality instruction following over multiple images.
šŸ‘Øā€šŸ’»
Engineering & Resources

Enhancing Medical Imaging (GitHub Repo)

Researchers have developed a new algorithm called RaffeSDG that improves the accuracy of medical imaging models when analyzing data from different sources.

Generate synthetic instruction datasets from unstructured datasets (GitHub Repo)

Bonito is a model and toolkit designed to take unstructured text as input and create certain types of instruction datasets like question answering, instructions, and summarization.

A language model that evaluates language models (GitHub Repo)

Many modern performance benchmarks rely on GPT-4 as a judge of generation quality. Prometheus is a model built on top of Mistral that performs extremely well on this task.
šŸŽ
Miscellaneous

Tutorial: traffic analysis via video (21 minute read)

This deep dive tutorial walks through how to build a system that reports on car traffic density. It uses modern computer vision to count vehicles over time.

The AI Hardware Dilemma (6 minute read)

Recent AI-powered hardware launches, like the Humane Pin and Rabbit R1, have faced criticism, yet there's still significant venture capital and interest in the sector, with prominent figures like Sam Altman eyeing substantial investments. The allure lies in AI's potential to revolutionize consumer hardware by utilizing sensors, silicon, and interfaces creatively. However, the challenge to offer a compelling alternative to versatile smartphones remains, with AI still needing to mature and hardware startups struggling to compete with established tech giants.

Penzai (3 minute read)

Penzai, a JAX library, enables easy manipulation and understanding of trained models through legible, functional Pytree structures. It includes a diverse set of tools for model visualization, debugging, and component analysis. Installation and usage are straightforward, with detailed tutorials available for learning to build and manipulate neural networks using Penzai.
āš”ļø
Quick Links

15k extremely detailed fully labeled images (6 minute read)

A great new dataset from Google that contains detailed and comprehensive labels.

An AI-controlled fighter jet took the Air Force leader for a historic ride (4 minute read)

The U.S. Air Force is aggressively moving towards an AI-enabled fleet of over 1,000 unmanned warplanes, with operational capability expected by 2028.

PeopleGPT (Product)

AI-powered talent sourcing.
The most important AI, ML, and data science news in a free daily email.
Join 500,000 readers for