TLDR AI 2026-06-25
Jalapeño chip 🌶️, Anthropic accuses Alibaba ⚖️, Gemini computer use 🖥️
Learn how leaders from Prudential Insurance, Siemens, GAF, and HF Sinclair build resilient, scalable data foundations for AI in this virtual panel. (Sponsor)
- Ram Kumar, Group Leader, Data & AI, Centre of Excellence, Prudential Insurance
- Michael Taylor, Chief Data Scientist, Siemens
- Dhanya Nair, Director, Data and Analytics, GAF
- Madhu Bangalore, Head of Digital Solution, Data, Analytics, and AI, HF Sinclair
Register now
Explore how to:
Jalapeño: OpenAI's new Chip (7 minute read)
OpenAI and Broadcom unveiled Jalapeño, the first accelerator in a planned family of LLM inference chips optimized for performance per watt and rapid deployment. The companies said the processor was designed in nine months with AI-assisted development and is intended for gigawatt-scale data center deployments.
Gemini Researchers Join Anthropic (1 minute read)
Bloomberg reported that Gemini researchers Jonas Adler and Alexander Pritzel left Google for Anthropic, continuing a wave of high-profile AI talent departures. The trend followed recent exits by Noam Shazeer and DeepMind director John Jumper amid increasing competition between leading AI companies.
Introducing Computer Use on Gemini 3.5 Flash (3 minute read)
Google launched native computer-use capabilities for Gemini 3.5 Flash, allowing the lightweight model to interact directly with digital desktop interfaces. The model processes continuous screenshots to execute click, scroll, and typing actions seamlessly across varied software environments.
GLM-5.2 is the step change for open agents (11 minute read)
GLM-5.2 seemed like an incremental update, but the small change in benchmarks and training opened up a wide range of new use-cases. It feels right at home in coding harnesses as a general agent. Many in the AI community have praised the model after using it personally.
Notes on Amazon v. Perplexity (27 minute read)
Amazon is suing Perplexity for breaking the Amazon Store's Conditions of Use as Perplexity's Comet browser circumvents the requirement to clearly identify itself as an agent and instead identifies itself as Chrome. The idea that Perplexity's client needs to behave in a certain way goes against the basic principles of the open Web, which are about user control. The increased user agency of the open Web is what distinguishes it from downloadable apps. Sites have historically attempted all kinds of technical measures to prevent users from experiencing their content on their terms, but at the end of the day, the site is rendered on the client, so users mostly have the ability to download a client that renders the site in the way they prefer. Agentic browsing is just another browser feature that lets users engage with the Web on their terms.
Accelerating Transformers Fine-Tuning with NVIDIA NeMo AutoModel (4 minute read)
NVIDIA launched NeMo AutoModel on Hugging Face to optimize the fine-tuning pipelines of massive Mixture-of-Experts (MoE) architectures like Qwen3 and DeepSeek V3. The framework introduces Expert Parallelism and DeepEP fused communication kernels to distribute specialized expert weights dynamically across GPU clusters. Benchmark results demonstrate up to a 3.7x increase in training throughput alongside a 32% reduction in peak GPU memory usage compared to native Transformers v5 libraries.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 1,100,000 readers for
one daily email