DeepSeek has a new rival, and you can try it out right now

Alibaba has just unveiled its latest reasoning model, and it seems that DeepSeek and OpenAI might have something to worry about — at least if all of Alibaba’s promises turn out to be true. It’s open-source, so I checked it out. You can try it out for free, too, although unsurprisingly, you’ll find that there are some things it won’t talk to you about.

The new model, dubbed QwQ-32b (Qwen-with-Questions), runs on far fewer parameters, meaning that it requires fewer resources, but Alibaba claims that it performs at the same level as DeepSeek-R1 or OpenAI’s o1-mini.

DeepSeek’s R1 large language model (LLM) was all the rage when it came out earlier this year, suddenly rivaling the gold standard set by ChatGPT and other alternatives at a much lower cost. It seems that Alibaba might be pushing the envelope even further here.

DeepSeek AI running on an iPhone.
Nadeem Sarwar / Digital Trends

As explained by VentureBeat, DeepSeek-R1 has 671 billion parameters, 37 billion of which are activated for any given token. Meanwhile, Alibaba’s new QwQ-32b gets by with 32 billion parameters. Those numbers are abstract to many, but they translate into a huge difference in hardware requirements: while DeepSeek R1 requires 1600GB of VRAM to run, QwQ-32b can get by with just 24GB. Models like these typically run on data center GPUs such as Nvidia’s H100 or equivalents, but even the gaming-focused RTX 4090 sports 24GB, which is enough for QwQ-32b by that estimate. The latest RTX 5090 ups that to 32GB.
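To make the parameter-to-memory relationship a little less abstract, here is a rough back-of-the-envelope sketch. It assumes that the memory needed for a model's weights is simply the parameter count multiplied by the bytes used per parameter, and it ignores activations, caches, and runtime overhead, so real-world requirements are higher. The function names and the precision table are illustrative, not taken from Alibaba, DeepSeek, or VentureBeat, and the results will not match the 1600GB and 24GB figures quoted above exactly.

```python
# Rough estimate of the memory needed just to hold a model's weights.
# Assumption: memory ~= parameter count x bytes per parameter.
# Activations, caches, and framework overhead are ignored, so treat
# these numbers as a lower bound rather than a real requirement.

# Bytes used to store one parameter at common precisions (illustrative table).
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billions: float, precision: str) -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes).

    Billions of parameters times bytes per parameter gives gigabytes
    directly, because the two factors of 1e9 cancel out.
    """
    return params_billions * BYTES_PER_PARAM[precision]


def fits_in_vram(params_billions: float, precision: str, vram_gb: float) -> bool:
    """True if the weights alone would fit in the given amount of VRAM."""
    return weight_memory_gb(params_billions, precision) <= vram_gb


if __name__ == "__main__":
    models = [("DeepSeek-R1", 671), ("QwQ-32b", 32)]
    for name, params in models:
        for precision in BYTES_PER_PARAM:
            gb = weight_memory_gb(params, precision)
            verdict = "fits" if fits_in_vram(params, precision, 24) else "does not fit"
            print(f"{name} at {precision}: ~{gb:g} GB of weights, {verdict} in 24 GB")
```

Under these assumptions, a 32-billion-parameter model only squeezes into a 24GB card when its weights are stored at reduced precision (around 16GB at 4 bits per parameter versus 64GB at 16 bits), which suggests, though the article does not say so, that the 24GB figure refers to a quantized version of the model.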

Alibaba’s QwQ-32b is available under an Apache 2.0 license, meaning that companies and researchers can use and modify it freely, including for commercial purposes. More importantly for the rest of us, it can be tried out in Alibaba’s Qwen Chat. Like DeepSeek, it comes with some limitations, but it also has a couple of perks that I noticed right away.

It gives quite in-depth answers even to quick, simple questions. That can be useful, but I mostly found it annoying, as it adds a lot of context that you didn’t ask for. I do like that it shows you its whole reasoning process, though, which is similar to ChatGPT’s Deep Thinking feature, but with much less depth.

When asked about political matters, Qwen Chat flags the question as inappropriate. There might be ways to jailbreak it (it was possible with DeepSeek, after all), but I haven’t managed it yet.

Whether Alibaba’s claims turn out to be true remains to be seen, but it looks like ChatGPT and DeepSeek now have a new rival.
