- OpenTools' Newsletter
- Posts
- đ¤DeepMindâs AI Is Making Robots Smarter
đ¤DeepMindâs AI Is Making Robots Smarter
PLUS: Whatâs Coming at Nvidia GTC

Reading time: 5 minutes
Today we will discuss:
Sponsored: Carousel StudiosâThe fastest way to create LinkedIn carousels using AI
DeepMindâs AI upgrade for robotsâDeepMindâs latest models are enhancing robot reasoning and dexterity
Nvidiaâs GTC conferenceâExpect AI breakthroughs, next-gen GPUs, and quantum computing insights
In other AI news todayâApple is reportedly developing AirPods with built-in cameras for AI features, Snap debuts AI Video Lenses powered by its own generative model, OpenAI urges the Trump administration to prioritize speed and light regulation in its AI strategy, and Alibaba releases an AI model that reads emotions from video
Donât miss this weekâs Workflow WednesdayâAI and creativity unlocked with Sudowrite.com, Suno.com, and Descript.com; now available at only $20/month!
Saved the best for lastâ9 must-try AI tools
Building a personal brand on Linkedin? You need this. (9500 creators already use it!)
Carousel Studio lets you create LinkedIn carousels directly in Canva in <5 mins with AI.
Hereâs the kicker: It's 100% FREE for you!
Creating carousels manually sucks because you have to:
- Think of a good copy
- Mess around with the layout
- Pick the right colors, fonts, theme
Carousel Studio takes all the hard work out:
Give it a topic, pick some pre-defined colors
Click âGenerateâ and it will create a 4-slide carousel perfect for Linkedin for you
Just edit & post!

GoogleImages
Key Points
Google DeepMind introduced Gemini Robotics and Gemini Robotics-ER to enhance robot adaptability, interactivity, and dexterity in real-world tasks.
DeepMind is prioritizing safety with AI-driven risk assessment and new benchmarks, building on its Asimov-inspired 'Robot Constitution.'
â¨ď¸News - Google DeepMind is rolling out two new AI models designed to help robots take on a broader range of real-world tasks. The first, called Gemini Robotics, is a vision-language-action model that can interpret new situations, even if it hasnât been specifically trained on them.
đ¤Here's the lowdown - Built on Gemini 2.0, Googleâs latest AI model, Gemini Robotics combines multimodal understanding with physical actions. It improves three key areas essential for robotics: generality (adapting to unfamiliar scenarios), interactivity (engaging with people and environments), and dexterity (handling precise tasks like folding paper or opening a bottle cap).
đ¤šââď¸What's more? DeepMind is also launching Gemini Robotics-ER (Embodied Reasoning), an advanced visual language model designed to help robots better understand and navigate complex real-world situations. Senior director Carolina Parada gave an interesting example: when packing a lunchbox, you need to recognize where everything is, open the box, pick up items, and place them correctlyâGemini Robotics-ER is built to tackle that kind of reasoning. It can also integrate with existing robotic systems, adding new AI-powered capabilities.
đ¤What about safety? DeepMind researcher Vikas Sindhwani says the company is taking a âlayered approachâ to safety. Gemini Robotics-ER models are trained to assess whether an action is safe before executing it. DeepMind is also introducing new safety benchmarks and frameworks, expanding on its previously announced âRobot Constitution,â a set of Asimov-inspired rules for AI-driven robots.
đWhat's next? DeepMind is working with robotics company Apptronik to develop the next generation of humanoid robots. Itâs also giving early access to Gemini Robotics-ER to companies like Agile Robots, Agility Robotics, Boston Dynamics, and Enchanted Tools. As Parada puts it, DeepMindâs focus is on building intelligence that can not only understand the physical world but also act within it.

GoogleImages
Key Points
CEO Jensen Huangâs keynote on Tuesday will focus on AI, accelerated computing, and potential updates to Nvidiaâs GPU lineup.
Blackwell Ultra and Rubin GPUs could take center stage, with Nvidia also hosting a dedicated session on quantum computing.
âNews â GTC, Nvidiaâs biggest conference of the year, kicks off Monday and runs through Friday.
â¨Here's what to expect â CEO Jensen Huang will deliver his keynote on Tuesday at 10 a.m. Pacific, focusing on AI and accelerated computing. Nvidia is also teasing major announcements in robotics, sovereign AI, AI agents, and automotive, alongside 1,000 sessions featuring 2,000 speakers and nearly 400 exhibitors.
Hardware news will likely take center stage. Nvidia has a history of using GTC to unveil its latest hardware, and this year, all eyes are on an upgraded iteration of its Blackwell chips. The Blackwell B300 series, known as Blackwell Ultra, is expected in the second half of the year, featuring more memory (288GB) for demanding AI workloads.
Looking further ahead, Nvidiaâs next-gen GPU series, Rubin, is expected to get a mention. Set for release in 2026, Rubin is being positioned as a major leap in performance, with Huang describing it as a âbig, big, huge step up.â
Beyond GPUs, Nvidia has scheduled a dedicated âquantum dayâ at GTC, bringing in industry leaders to discuss how the technology is progressing toward real-world applications.
đđťââď¸What else is happening?
Apple reportedly developing AirPods with built-in cameras for AI-powered features // While this next-gen innovation wonât make it to the AirPods Pro 3, Apple is reportedly planning to introduce the feature in 2027, aligning it with the rumored launch of its smart glasses
Snap introduces AI Video Lenses powered by its in-house generative model // Snapchat is launching with three AI Video LensesââRaccoonâ and âFox,â which add animated furry friends, and âSpring Flowers,â which reveals a bouquet; More Lenses will roll out weekly
OpenAI urges Trump administration to focus 'AI Action Plan' on speed, light regulation // OpenAI called for "a copyright strategy that promotes the freedom to learn" and for "preserving American AI models' ability to learn from copyrighted materialâ
Alibaba releases AI model that reads emotions to take on OpenAI // The open-source r1-omni model infers emotions from video while describing clothing and surroundings, adding another layer of understanding to so-called computer vision
Weâve just launched the Tenth edition of Workflow Wednesday for AI-minded professionals like youâactionable AI workflows delivered directly to your inbox.
This weekâs topic: AI-Powered Creativity
đŠđźâđDiscover mind-blowing AI tools
Learn How to Use AI - Starting January 8, 2025, weâre launching Workflow Wednesday, a series where we teach you how to use AI effectively. Lock in early bird pricing now and secure your spot. Check it out here
OpenTools AI Tools Expert - Find the perfect AI Tool to solve supercharge your workflow. This GPT is connected to our database, so you can ask in depth questions on any AI tool directly in ChatGPT (free)
Avaturn - A platform that enables the creation of life-like 3D avatars from selfies
Anymoji: An app that allows users to design their own emojis or modify existing ones to suit their personal style and preferences
ClarifyPDF - A tool that helps users summarize, extract, and interact with information from PDFs in any language
Infographic Ninja - An AI-powered tool that converts keywords and articles into infographics with customizable templates, icons, fonts, and branding options
YoursTruly.AI - An online service that enables users to create and send handwritten greeting cards using their phone or laptop
Logo Diffusion - A tool that helps users create unique logos, redesign existing logos, convert sketches into digital logos, and transform 2D logos into 3D illustrations
Optimo - An AI-powered platform designed to streamline various marketing tasks

How likely is it that you would recommend the OpenTools' newsletter to a friend or colleague? |
Interested in featuring your services with us? Email us at [email protected] |