DALL·E 3 understands significantly more nuance and detail than previous systems, allowing you to easily translate your ideas into exceptionally accurate images. It will soon be integrated natively into ChatGPT.
GitHub is expanding the availability of its AI-powered coding chatbot, Copilot Chat. Previously in public beta for business users, Copilot Chat is now available to individual users in Visual Studio and Visual Studio Code.
The Toyota Research Institute (TRI) is showcasing advancements in research that can teach a robot a new skill overnight. TRI utilizes traditional robot learning techniques and diffusion models similar to generative AI models. Its system has been trained on 60 skills, focusing on versatility in different environments, aiming to make robots that can operate effectively in diverse settings and adapt to new tasks.
This study introduces RenderIH, a massive collection of ultra-realistic images showing hands in various positions, built to address the shortcomings of older datasets that were too simple. It also presents TransHand, a new tool that reads these hand positions very accurately.
This study looks at how LLMs, which are revolutionizing many tech areas, can improve sequential recommendation (for example, suggesting which song or movie comes next).
This research focuses on making large language models, which are already great at understanding and generating text, even better at answering medical questions.
Language models regularly hallucinate details in response to factual queries. Retrieving the relevant facts first can reduce these inaccuracies. However, the naive form of Retrieval Augmented Generation (RAG), which embeds the query and searches against a vector store, often performs poorly. This post discusses how to dramatically improve performance with query objects and information retrieval techniques.
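To make the contrast concrete, here is a minimal Python sketch of naive retrieval next to retrieval driven by a structured query object. It is an illustration under stated assumptions, not the post's implementation: the `Doc` and `Query` types, the `min_year` filter, and the bag-of-words `embed` function (a toy stand-in for a real embedding model and vector store) are all hypothetical.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass
class Doc:
    text: str
    year: int  # hypothetical metadata field, used only for filtering


@dataclass
class Query:
    """Hypothetical query object: rewritten search text plus an optional filter."""
    text: str
    min_year: Optional[int] = None


def embed(text: str) -> Counter:
    # Toy stand-in for an embedding model: bag-of-words term counts.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def naive_search(raw_query: str, docs: list[Doc], k: int = 1) -> list[Doc]:
    # Naive RAG: embed the raw user query and rank every document by similarity.
    q = embed(raw_query)
    return sorted(docs, key=lambda d: cosine(q, embed(d.text)), reverse=True)[:k]


def structured_search(query: Query, docs: list[Doc], k: int = 1) -> list[Doc]:
    # Apply the metadata filter first, then rank what is left by similarity.
    pool = [d for d in docs if query.min_year is None or d.year >= query.min_year]
    return naive_search(query.text, pool, k)


DOCS = [
    Doc("release notes for the search api", 2021),
    Doc("release notes for the search api with new ranking", 2023),
    Doc("guide to billing and invoices", 2023),
]
```

With these toy documents, `naive_search("what changed in the latest search api release notes", DOCS)` ranks the 2021 notes first, because the word "latest" carries no signal in a pure similarity search. `structured_search(Query("search api release notes", min_year=2023), DOCS)` returns the 2023 notes instead. In a real system an LLM would presumably produce the `Query` object from the user's question; here it is written by hand.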
Truly great innovation comes in pairs. Technology innovation and business model innovation work together to deliver outlier results. AI is the next technology innovation, but have we found its counterpart on the business model side yet?
Generative AI’s first act of developing foundational models and novelty apps is coming to an end. In its place comes Act Two, where generative AI solves human problems end-to-end.
Google's Bard chatbot now offers a "double-check" feature, which evaluates its responses by checking the web for corroborating or contradictory information when prompted by users. This addition aims to enhance accuracy and accountability but may still require users to verify search results and interpretations themselves.
Google has developed Optimization by PROmpting (OPRO), a technique that uses LLMs to optimize their own prompts. It improves the math skills of LLMs by finding human-style encouragement, such as “let’s think step by step”, that leads to better answers.