A Right To Warn About Advanced AI (3 minute read)
A group of current and former employees of AI companies is calling on advanced AI companies to commit to principles that ensure transparency and protect employees who raise risk-related concerns. They highlight the need for companies to avoid enforcing non-disparagement agreements, facilitate anonymous reporting processes, support open criticism, and prevent retaliation against whistleblowers.
Will Scaling Solve Robotics? (15 minute read)
Over 900 people attended last year's Conference on Robot Learning, which featured 11 workshops and almost 200 accepted papers. One of the largest debates at the event was whether training a large neural network on a very large data set is a feasible way to solve robotics. This post presents the different sides of the argument to deepen people's understanding of the debate. Scaling has worked for other similar fields. However, it may be impractical for robotics, as there isn't much robotics data available and there's no clear way to get it. Even if scaling works as well as in other fields, it likely still won't solve robotics.
🧠
Research & Innovation
MMLU Pro (26 minute read)
MMLU is a common benchmark for reasoning tasks. It is often considered to be both the gold standard and something models have overfit to. MMLU Pro is a new, harder, and cleaner benchmark for measuring language model reasoning.
👨‍💻
Engineering & Resources
Image Compression (GitHub Repo)
Control-GIC is a new framework for generative image compression that allows fine-grained bitrate adjustment while maintaining high-quality results.
Omost Image Synthesis (GitHub Repo)
From the creator of ControlNet, Omost is a way to gain control over your image generation. It first rewrites prompts as a set of descriptive code. Then it uses that to render the final image. Importantly, you can edit the code before or after generation to slightly change the model output.
Improved Video Super Resolution (4 minute read)
Researchers have developed a training-free video interpolation method for generative video diffusion models. This new approach, compatible with various models, enhances frame rates without the need for extensive training or large datasets.
Facia (Product)
Prevent fraud and spoofing attacks with advanced facial recognition technology.
Using AI To Decode Dog Vocalizations (2 minute read)
University of Michigan researchers, in collaboration with Mexico's INAOE, have developed AI tools that analyze dog barks to determine playfulness or aggression, as well as identify the dog's breed, age, and sex.