🔥Baidu vs. DeepSeek Heats Up

PLUS: Gemini AI Sparks Copyright Concerns

Reading time: 5 minutes

Today we will discuss:


Key Points 

  • Baidu has launched Ernie 4.5 and Ernie X1, with claims of high performance and strong multimodal capabilities.

  • While Baidu was an early AI leader, rising competition from DeepSeek has pushed the company to accelerate its AI roadmap, with Ernie 5 set to launch later this year.

☕News – Baidu has introduced two new AI models: Ernie 4.5, an upgraded version of its foundational model, and Ernie X1, a reasoning model designed to enhance complex problem-solving.

The company claims that Ernie X1 delivers performance comparable to DeepSeek R1 but at half the cost. Ernie 4.5, meanwhile, is being promoted for its "high EQ," with the ability to understand memes and satire. Both models support multimodal capabilities, meaning they can process video, images, and audio alongside text.

Baidu’s big problem – Despite being one of the first Chinese companies to launch a ChatGPT rival, Baidu has struggled to gain widespread traction. At the same time, DeepSeek has drawn attention by releasing competitive models at significantly lower prices, raising concerns among U.S. AI companies and investors.

In response, Baidu is accelerating its AI roadmap, with plans to release Ernie 5 later this year, promising even more advanced multimodal capabilities.


Key Points 

  • Unlike some AI models, Gemini 2.0 Flash doesn’t block watermark removal and even reconstructs missing image details seamlessly.

  • Although labeled “experimental,” the tool is available in developer platforms, making its potential misuse a concern for copyright holders.

♚News – Social media users have discovered a controversial capability in Google’s new Gemini AI model: the ability to remove watermarks from images, including those from stock media companies like Getty Images.

🧐For context – Last week, Google expanded access to Gemini 2.0 Flash’s image generation and editing tools. While the model is powerful, it appears to have few safeguards in place. It can generate images of celebrities and copyrighted characters without restrictions and, as users have demonstrated, it can also remove watermarks from existing images.

What makes this particularly concerning is that Gemini 2.0 Flash doesn’t just erase a watermark; it attempts to fill in the missing parts of an image, making the edits harder to detect. Other AI tools have similar capabilities, but this model stands out for its effectiveness and for being free to use. That said, it isn’t perfect: it struggles with semi-transparent watermarks and with those covering large portions of an image.

😵‍💫Why this is problematic – Google currently labels Gemini 2.0 Flash’s image feature as “experimental” and “not for production use,” and it’s only available in developer tools like AI Studio. Still, copyright holders are likely to push back. Under U.S. law, removing a watermark without permission is generally illegal, with few exceptions.

By comparison, other AI models, including Anthropic’s Claude 3.7 Sonnet and OpenAI’s GPT-4o, refuse outright to remove watermarks. Claude explicitly states that doing so is “unethical and potentially illegal.”

🙆🏻‍♀️What else is happening?

We’ve just launched the tenth edition of Workflow Wednesday for AI-minded professionals like you: actionable AI workflows delivered directly to your inbox.

This week’s topic: AI-Powered Creativity

👩🏼‍🚒Discover mind-blowing AI tools

  1. Learn How to Use AI - Starting January 8, 2025, we’re launching Workflow Wednesday, a series where we teach you how to use AI effectively. Lock in early bird pricing now and secure your spot. Check it out here

  2. OpenTools AI Tools Expert - Find the perfect AI tool to supercharge your workflow. This GPT is connected to our database, so you can ask in-depth questions about any AI tool directly in ChatGPT (free)

  3. Pika Labs - A Text-to-Video platform that converts text into engaging videos, making it easier for users to communicate their ideas visually

  4. MusicGen - A tool that uses AI to generate music in various genres, including pop, techno, and house

  5. CandyIcons - Offers a simple three-step process to generate unique icons based on keywords and preferences

  6. QRCraft - Generates QR codes and transforms them into works of art

  7. FindWise - An AI-powered search assistant that allows users to ask questions and get answers based on the content of a website

  8. Krizmi - An interactive learning platform that offers auto-generated flashcards and quizzes to help students retain and test their knowledge

  9. Zeliq - An all-in-one sales solution that helps businesses increase their sales and streamline their outreach efforts

  10. Ask Jules - A book discovery companion that helps users find their next book and answers book-related questions


Interested in featuring your services with us? Email us at sales@opentools.ai