TLDR AI 2023-11-06
xAIβs Grok beats ChatGPT π
, OpenAI Dev Day π
, NeurIPS 2023 papers digest π
Elon Musk's xAI beats ChatGPT on LLM benchmarks (4 minute read)
xAI trained an autoregressive language model at 34B params. It is quite performant and will power an AI system built into the X platform.
What To Expect At OpenAI Dev Day (3 minute read)
OpenAI Dev Day is today and it is rumored that we can expect enhancements to OpenAIβs developer tools, new pricing plans for ChatGPT, and a peek at Gizmo V8, a new and improved version of the ChatGPT iOS app.
Expanding VRP for Enhanced Generative AI Security (3 minute read)
Google is extending its Vulnerability Rewards Program (VRP) to include generative AI, encouraging research into AI safety and security. Simultaneously, it's expanding its open-source work to ensure better AI supply chain safety. It is also leveraging the Secure AI Framework (SAIF) and the Google Open Source Security Team (GOSST) to protect AI supply chains from threats like model tampering and data poisoning.
π§
Research & Innovation
Highlights from NeurIPS 2023 (45 minute read)
Paper digest generated summaries of many papers from NeurIPS 2023. Interestingly, much of this work is almost a year old at this point and has already been readily adopted by the community.
HelixNet is 3 models combined into one (4 minute read)
If you fine-tune three task-specific models from the Mistral base, one for generation, one for critique, and a final one for regeneration then the entire system shows dramatically improved generation performance. Synthetic data is used to tune these models.
Sketch what you want your robot to do (6 minute read)
Sketching a rough outline of what you want a robot to accomplish is a novel form of communication and turns out to work surprisingly well for standard pick and place tasks.
AI data pipeline attacks (6 minute read)
Poisoning the data well and other data pipeline attacks are a huge challenge for the cyber security community and often a blindspot for most AI organizations. This post outlines (with code) what the attacks are to help with future mitigation.
Could Cruise Be The Theranos Of AI? (4 minute read)
Cruise, GM's driverless car company, frequently relies on remote operators for its vehicles, challenging the company's claims of autonomy. The revelation raises concerns about the true capabilities of self-driving cars and prompts calls for transparency and investigation into the safety and autonomy of such vehicles.
Extensions prompt injection and data exfiltration (12 minute read)
A great deep dive into some novel vulnerabilities that come from generative AI plugins. In this case, the attack resembles SQL injection but came via an infected Google Doc. While Google has fixed the issue, the security researcher was able to exfiltrate other usersβ prompts.
Get the most interesting AI stories and breakthroughs delivered in a free daily email.
Join 920,000 readers for
one daily email