TLDR AI 2026-01-05
Nano Banana 2 Flash 🍌, Grok Business 💼, Plaud AI notetaker ✏️
Google is testing a new image AI, and it's going to be its fastest model (1 minute read)
Nano Banana 2 Flash will be faster and more affordable than Nano Banana Pro, Google's top-end image generation and editing model, but it will not be as powerful. Nano Banana Pro is built for harder creative work that requires accuracy, a better understanding of intention, and cleaner results.
ICYMI: xAI launches Grok Business and Enterprise plans (1 minute read)
xAI has launched Grok Business and Grok Enterprise. Business usage comes with higher rate limits and assurances that customer data is not used for model training. Employees will have access to a dedicated team workspace. Enterprise offers additional features such as custom SSO, SCIM directory sync, centralized organization governance, and advanced audit controls.
Plaud launches a new AI pin and a desktop meeting notetaker (3 minute read)
Plaud launched the AI notetaker Plaud NotePin S and a desktop app for digital meetings ahead of CES. The $179 NotePin S features a physical button for recording control and includes accessories like a clip and lanyard, with Apple Find My support. The desktop app supports meeting transcription using system audio and AI structuring, challenging competitors like Granola and Fireflies.
Chinese AI models have lagged the US frontier by 7 months on average since 2023 (2 minute read)
Every model at the frontier of AI capabilities has been developed in the United States since 2023. Chinese models have trailed US capabilities by an average of seven months over that period. The gap resembles the broader gap between proprietary and open-weight models. All leading Chinese models are open-weight, while frontier US models remain closed.
LLMs as Judges (19 minute read)
This post investigates whether large language models are fair judges when evaluating other LLMs. Using a modified MT-Bench benchmark, it reveals the influence of vendor identity, model tier, and hinting on evaluation outcomes across domains like coding, reasoning, and writing.
Existential Risk and Growth (127 minute read)
Technological development raises consumption, but it may also pose an existential risk. It can lower risk as well, by speeding up technological solutions or by increasing a planner's willingness to pay for safety. The risk-minimizing rate of technological growth is typically positive and may well be high. Below this rate, technological development poses no tradeoff between consumption and cumulative risk.
Three GPU Markets, Three Volatility Regimes (9 minute read)
Spare capacity determines price volatility in commodity markets. The 'GPU shortage' doesn't accurately describe what's happening in the market: newer GPUs show more price volatility because they run at higher utilization, which leaves less spare capacity. GPU markets are differentiating by maturity. As each market continues to develop, it should move towards a pattern where high utilization is a signal of market health rather than market stress.
Anthropic's 'do more with less' bet has kept it at the AI frontier, co-founder Amodei tells CNBC (14 minute read)
'Do more with less' has become a sort of governing principle for Anthropic's entire strategy. It is a direct challenge to the rest of the industry, which is treating scale as destiny. Anthropic believes that disciplined spending, algorithmic efficiency, and smarter deployment can keep it at the frontier. The startup has always had a fraction of what its competitors have had in terms of compute and capital, yet it has consistently produced the most powerful and performant models.