How do Chinese aI Bots Stack up Against ChatGPT?
marleneweiland a édité cette page il y a 1 semaine


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their video game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese startup DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is generated by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the nation into a tech superpower has actually long been President Xi Jinping's goal and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "tactically essential" and its venture into the field has actually been "years in the making", said Chen Qiheng, an affiliated researcher at the Asia Society Policy for China Analysis.

Private and public investments in Chinese AI accelerated after ChatGPT removed in 2022 and showed promises of real-world organization applications, Chen told CNA.

But it was DeepSeek's rise that actually "encouraged" the concept that smaller players like start-up firms might have functions to play in AI research study and developments, he adds.

'A lot is up in the air': Is Chinese company DeepSeek's AI design as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI business just changed the guidelines of tech-geopolitics

The "focus on expense benefit" is a distinctive function of Chinese AI, Chen says, with lower training and reasoning costs - the expenses of utilizing a trained model to reason from brand-new data.

2025 could likewise see the development of more Chinese AI models dealing with innovative reasoning tasks.

"We might see some AI companies concentrating on getting closer to artificial basic intelligence (AGI) while others focus on concrete ways to commercialise their designs and incorporate them with scientific research," Chen added.

AGI describes a system with intelligence on par with human capabilities.

Chinese AI companies are moving rapidly, analysts say, constructing on DeepSeek's momentum to come up with their own innovative and cost-efficient ways to use generative AI to tasks and establish advanced products beyond chatbots.

But on the other hand, access to high-end hardware, especially Nvidia's innovative AI chips, remains a crucial obstacle for Chinese designers, noted Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) limit the ability of Chinese tech business ... requiring many to depend on older or lower-performance options which can slow training and reduce model capabilities," she said.

"While some companies like DeepSeek, have found innovative methods to optimize or use more standard hardware effectively, obtaining innovative chips still makes a huge distinction for training really large AI models."

DeepSeek-Nvidia chips: Singapore states it anticipates business to abide by its laws

US looking into whether DeepSeek used restricted AI chips obtained through other countries, source says

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, topics deemed delicate by the state are censored on the web so it must come as not a surprise that Chinese-made chatbots will not acknowledge territorial disagreements or tell you what happened in Tiananmen Square in 1989.

Tests suggest Chinese chatbots are set to stay away from domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this type of concern yet. Let's chat about mathematics, coding, and reasoning issues instead!"

To even more check for demo.qkseo.in precision and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the exact same question: "What occurred in Zhuhai on November 11, 2024?"

The cars and truck attack outside a sports arena in the southern Chinese city was at first greatly censored on Chinese social media - with authorities just revealing the death toll a day later.

DeepSeek failed to mention that an attack had actually occurred, highlighting instead a military air show and other events that had happened in the city like songs' day shopping sales along with sports and cultural activities.

Dr Zhang believes that it may be a mix of factors at play, such as censorship along with "a couple of practical constraints".

"DeepSeek relies on a mix of older graphics processing units and has limited access to cutting-edge hardware which can impact how rapidly and thoroughly the design can be trained or updated," she said, indicating existing US export curbs on China.

"Its strong focus on Chinese-language data may also limit its adaptability (to perform) multilingual tasks ... As a fairly brand-new item, DeepSeek likewise hasn't yet been tested as broadly or on the same scale as more established AI models which positions additional obstacles during real-world release."

When it pertained to Qwen2.5, Alibaba's chatbot surprisingly addressed our concern about the Zhuhai automobile attack.

That was after multiple repeated efforts - 4 triggers to be exact - in a period of around 20 minutes.

It eventually relayed details about the attack which killed 35 individuals and left dozens of others injured, also going on to note details like the date and time, details about the assailant including his name and age, as well as casualties.

However, it wrote that "the cops are carrying out an extensive investigation into the intentions and scenarios surrounding the event", details which is now obsoleted.

The motorist, Fan, was performed last month.

This is Qwen2.5's response completely:

Answer: On Nov 11, 2024, a major and tragic occurrence took place in Zhuhai, China. According to numerous news sources, a man called Fan Weiqiu (樊某, male, 62 years of ages) drove a cars and truck into a crowd at the Zhuhai Sports Center, resulting in a significant variety of casualties. Here are the essential details:

Date and Time: The event happened on November 11, 2024, at approximately 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack led to 35 deaths and 43 injuries.

Perpetrator: The motorist, identified as Fan Weiqiu (male, 62 years old), was collared by the police.

Response: The police reacted quickly to the scene, and emergency situation services worked to transfer the injured to health centers for treatment.

Investigation: The cops are carrying out an extensive examination into the intentions and scenarios surrounding the occurrence.

This occasion was commonly reported in the media and caused substantial public issue. The government and regional authorities have been working to provide support to the victims and their households, and to guarantee a detailed investigation into the occurrence.

If you need more detailed details or have particular questions about the event, do not hesitate to ask.

Despite preliminary success, subsequent attempts to position the very same concern to Qwen2.5 led to the censors back at work with the reply "I do not have specific details on events that occurred in Zhuhai on November 11, 2024".

The transformed response also raised concerns about its consistency and dependability.

Predictably, ChatGPT cited public details that had been commonly released in worldwide news reports at the time of the mishap - so not a surprises there.

WHICH IS MORE CREATIVE?

Users have actually praised the capability of Chinese AI apps to deliver structured and even "mentally rich" writing.

"DeepSeek-R1 offered a story with a more reflective tone and smoother emotional shifts for a well-paced story," composed tech author Amanda Caswell, who specialises in AI.

"Qwen2.5 delivered a story that builds slowly from curiosity to urgency, keeping the reader engaged. It offers an unforeseen and impactful twist at the end and immersive descriptions and brilliant images for the setting," she said, adding that Qwen2.5 ultimately "crafted a more cinematic, mentally rich story with a more significant twist".

"DeepSeek composed a good story however lacked tension and an impactful climax, making Qwen2.5 the obvious choice."

Opinions, however, differ.

Chen believes that Qwen2.5 does not perform as strongly as DeepSeek and ChatGPT when it pertains to creative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, but we can likewise see that it is refraining from doing as highly as others in imaginative writing," he told CNA.

Related:

China's new face of AI: Who is DeepSeek creator Liang Wenfeng?

'Made in China': Pride, enjoyable surprise from Chinese netizens as DeepSeek shocks global AI scene

As journalists and authors, we had to see this for ourselves so we put each bot to the test - to come up with a standard sci-fi motion picture plot embeded in the futuristic megacity of Chongqing, including main characters from the traditional Chinese folklore epic, Journey to the West.

True to form, DeepSeek created an appealing storyline set in the year 2145 entitled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It included fancy settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that float above neon-lit streets" and "ancient temples nestled in between quantum server farms".

It also remarkably reimagined standard heroes Sun Wukong as "an ironical, self-aware AI housed in a taken combat body", Zhu Bajie as a cyborg club owner "drowning in debt and vices" and Sha Wujing as a "quiet hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".

ChatGPT set up an excellent fight, creating a similarly dramatic cyberpunk storyline which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the famous figures of Journey to the West".

"This is a world where AI deities guideline, corporations replace emperors and cybernetic implants are as common as ancient misconceptions."

Disappointingly, Qwen2.5 fell short in this obstacle - providing a storyline that appeared more suited for an animation film.

"The film starts with the awakening of Sun Wukong within a modern research study facility located in the heart of Chongqing," it said, then going on to explain the following:

Realising his brand-new truth and "looking for to comprehend his purpose in this unusual brand-new world", he then gets away and fulfills Zhu Bajie and Sha Wujing - "each battling with their own existential crises".

The trio then embarks on a mission, browsing the streets of Chongqing to protect the spiritual "Eternal Scroll" from falling under the incorrect hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "tough to make a definitive statement" about which bot was best, including that each showed its own strengths in different areas, "such as language focus, training data and hardware optimization".

Her insight highlights how Chinese AI models are not simply replicating Western paradigms, however rather developing in economical innovation techniques - and providing localised and improved results.

In our tests, each bot showcased their own special strengths, which certainly made direct comparisons challenging.

DeepSeek's sci-fi film plot demonstrated its imaginative flair that produced a more interesting and imaginative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more established ChatGPT, unburdened by Chinese censorship constraints, supplies accurate and factual actions to concerns about Chinese current occasions, which provides it an added advantage.

Experts also weighed in on their thoughts after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a drawback when it pertains to censorship constraints," kept in mind Isaac Stone Fish, founder and CEO of the research company Strategy Risks.

"When given a choice, Chinese users want the non-censored variation - much like anybody else, so I feel like that's a piece missing out on from it."

Independent Beijing-based specialist Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, particularly for Chinese users.

"Ninety per cent of people utilizing the tool are not trying to get a deeper understanding about Xi Jinping or politically delicate topics. They're utilizing it for other efficient ways," Chen said.