- OpenTools' Newsletter
- Posts
- š¤ÆAnthropicās AI Jailbreak Breakthrough
š¤ÆAnthropicās AI Jailbreak Breakthrough
PLUS: OpenAIās Vision for AI Hardware

Reading time: 5 minutes
Today we will discuss:
Sponsored: Guidde - AI-powered how-to videosāCreate professional, detailed video guides in minutes using Guiddeās GPT-powered platform
Anthropicās Defense Against AI JailbreaksāA new security barrier that reduces successful attacks from 86% to 4.4%
OpenAIās Hardware AmbitionsāA closer look at OpenAIās recent trademark filing and what it says about their AI hardware plans
In other AI news todayāThe Beatles win a Grammy with AI, SoftBank's AGI forecast, Cloudflare tackles image authenticity, and Meta limits risky AI development
In case you missed itāLast weekās Workflow Wednesday featured AI solutions for lead generation, ad writing, financial tracking, and a bonus guide on using DeepSeek safely
Saved the best for lastā10 must-try AI tools

Tired of explaining the same thing over and over again to your colleagues? Itās time to delegate that work to AI. Guidde is a GPT-powered tool that helps you explain the most complex tasks in seconds with AI-generated documentation.
Share or embed your guide anywhere.
Turn boring documentation into stunning visual guides.
Save valuable time by creating video documentation 11x faster.
Simply click capture on the browser extension and the app will automatically generate step-by-step video guides complete with visuals, voiceover, and call to actions.
The best part? The extension is 100% free.
Check out Guidde
Key Points
Rather than fixing models, Anthropic created a filter that intercepts jailbreak attempts before they reach the AI.
Anthropicās new shield blocks 95% of AI jailbreak attacks, reducing successful exploits from 86% to just 4.4%.
šØāš»News - Anthropic has introduced a new safeguard designed to stop jailbreak attacksātactics that trick AI models into ignoring their safety rules. This new approach isnāt just another security update; it could be the most effective defense yet.
š¤What has Anthropic done differently? Instead of trying to fix vulnerabilities in its AI models, Anthropic built a filter that prevents jailbreak attempts from working in the first place. The focus was on universal jailbreaksāattacks that can completely shut down a modelās safety mechanisms. A well-known example is the āDo Anything Nowā (DAN) exploit, which tries to override restrictions by making the AI behave differently.
To train this new defense, Anthropic compiled a list of restricted topics and had its AI generate thousands of question-and-answer pairs, covering both safe and unsafe prompts. The company then expanded this dataset by translating and rephrasing the exchanges in ways commonly used by jailbreakers. This data was used to train a filter that blocks suspicious requests before they reach the model.
š§Is it foolproof? Not entirelyābut itās a big step forward. Anthropic tested its system with a bug bounty, offering $15,000 to anyone who could get the model to answer 10 restricted questions. After 3,000+ hours of testing by 183 participants, nobody managed to crack more than five.
A second test using 10,000 AI-generated jailbreak prompts showed similar results. Without the shield, 86% of jailbreaks worked. With it, only 4.4% got through.
Key Points
OpenAIās trademark filing includes AI-powered hardware like smartwatches, VR headsets, and humanoid robots for interactive experiences.
The application hints at custom AI chips, with OpenAI aiming for a release in collaboration with Broadcom and TSMC.
āNews - OpenAI has filed a new trademark application with the U.S. Patent and Trademark Office (USPTO), hinting at a potential expansion into new product areas. While such filings are routine, this one stands out due to the variety of products listed, suggesting a future beyond software.
OpenAI has filed a new trademark application with the U.S. Patent and Trademark Office (USPTO), hinting at a potential expansion into new product areas. While such filings are routine, this one stands out due to the variety of products listed, suggesting a future beyond software.
š¤What's more? The filing also mentions āuser-programmable humanoid robotsā with communication and learning functions designed for entertainment and assistance. OpenAI has recently been building a robotics team, with Caitlin Kalinowski, former head of Metaās AR glasses division, now leading the effort.
Additionally, the trademark application references custom AI chips and the use of quantum computing to optimize AI performance. Reports suggest OpenAI is aiming to release its own AI chips by 2026, possibly in collaboration with Broadcom and TSMC, further emphasizing its push into hardware development.
šš»āāļøWhat else is happening?
The Beatles won a Grammy, thanks to AI // The Beatlesā AI-assisted track āNow and Thenā won the Grammy for Best Rock Performance, marking the first time that a song of its kind has taken home the award
SoftBankās Masayoshi Son says AGI will arrive āmuch earlierā than he thought // SoftBank is also partnering with OpenAI on an AI system called āCristal intelligenceā
Cloudflare is making it easier to track authentic images online // Cloudflare is integrating Adobeās Content Credentials system to help people detect AI-manipulated images
Meta says it may stop development of AI systems it deems too risky // If Meta determines a system is high-risk, the company says it will limit access to the system internally and wonāt release it until it implements mitigations to āreduce risk to moderate levelsā
In last weekās Workflow Wednesday we provided AI workflows for the tasks our readers needed help with, have a question you want answered? Join here
Q&A: Growth Unlocked!
Q: "Iāve been guessing emails, sending cold DMs, and barely getting responses. How can I find and verify the right contacts without wasting my time?"
A: Hunter.io finds verified emails so you can ditch the guesswork. No more cold emails to black holes. Read our guide to better lead generation.
Q: "How can I use AI to accelerate my companyās growth and streamline my daily tasks without spending a ton of time?"
A: Copy.ai turns a few inputs into polished, high-converting Google Ads in seconds. Read our guide to AI-powered ad writing.
Q: "How can I track my expenses more efficiently to scale my business without wasting hours on spreadsheets?"
A: SheetGPT + ChatGPT analyze your spending, set budgets, and even warn you about overspendingāstraight from Google Sheets. Read our guide to AI-powered financial tracking.
+ Bonus Guide: How to Use DeepSeek Without Risking Your Data
š©š¼āšDiscover mind-blowing AI tools
Learn How to Use AI - Starting January 8, 2025, weāre launching Workflow Wednesday, a series where we teach you how to use AI effectively. Lock in early bird pricing now and secure your spot. Check it out here
OpenTools AI Tools Expert - Find the perfect AI Tool to solve supercharge your workflow. This GPT is connected to our database, so you can ask in depth questions on any AI tool directly in ChatGPT (free)
Amplitude - A comprehensive suite of tools designed to provide fast and easy access to customer insights at every step of their journey
Mage.space - An innovative online platform offering a wide array of AI-generated art styles and models for users seeking unique and customized visuals
ClarifyPDF - A tool that helps users summarize, extract, and interact with information from PDFs in any language
Logo Diffusion - A tool that helps users create unique logos, redesign existing logos, convert sketches into digital logos, and transform 2D logos into 3D illustrations
Aspen - A no-code platform for building AI-powered web apps quickly and easily
AdCopy.ai - A platform that combines advanced data analytics and AI to help advertisers create, publish, and optimize ad campaigns efficiently
Scrip AI - A no-cost, user-friendly tool designed for generating hashtags to enhance social media engagement
2short.ai - An innovative AI-powered platform designed to transform long videos into engaging short clips

How likely is it that you would recommend the OpenTools' newsletter to a friend or colleague? |
Interested in featuring your services with us? Email us at [email protected] |