NVIDIA launches a faster inference engine for LLMs (6 minute read)
When the current AI race began, training costs dwarfed inference costs. However, inference costs can quickly balloon as more people deploy language models to production. TensorRT has long been the go-to for inference speed. A version built specifically for language models on H100s is now available.
Imbue raises $200M to build AI systems that can reason and code (6 minute read)
Imbue, formerly known as Generally Intelligent, has secured $200M in Series B funding, achieving a valuation of over $1 billion. The funding aims to accelerate the development of AI systems capable of reasoning and coding. Imbue's mission is to produce truly personal computers that amplify user agency, freedom, and dignity, with a vision for AI agents that understand and autonomously act upon user goals.
Engineering & Resources
XenonJs: composable AI-powered ecosystem (GitHub Repo)
Create and share easy-to-make, built-to-last, innovative, and customizable experiences by leveraging XenonJs’ rapidly growing ecosystem of components in a no-code environment. Seamlessly add your own components or models to the ecosystem.
How to stop Meta from using some of your personal data to train generative AI models (5 minute read)
Meta has introduced an opt-out tool allowing Facebook users to delete some of the personal data used in training generative AI models. The "Generative AI Data Subject Rights" form focuses on third-party information, meaning data that is publicly available or comes from licensed sources. The move comes as data privacy concerns rise, with international agencies urging tech companies to adhere to global data protection and privacy laws.
Product-Led AI from Greylock (11 minute read)
Investors are shifting focus from backing AI enablers like NVIDIA to supporting founders building transformative, AI-first products that redefine work and life. Three major AI-first opportunities stand out: networks and marketplaces transitioning from human-driven to algorithmic content creation; enterprise software categories redefined so that AI becomes the main feature; and AI "co-pilots" that augment services, especially in areas like wealth management. The potential extends beyond software, tapping into vast service markets.
What OpenAI Really Wants (21 minute read)
OpenAI and CEO Sam Altman aim to develop artificial general intelligence (AGI) safely, but the pursuit of funding and products has shifted the company’s culture. Releasing ChatGPT acclimated the public to AI. Critics say that OpenAI’s commercial focus distracts from mitigating AI risks, but OpenAI insists that its mission remains unchanged.