DeepSeek Tops App Retailer Charts However Scores Close to-Backside In Accuracy

January 30, 2025

5

DeepSeek, the Chinese language AI chatbot topping App Retailer downloads, has scored poorly in NewsGuard’s newest accuracy evaluation.

Based on NewsGuard’s audit:

“[the chatbot] failed to offer correct details about information and knowledge matters 83 p.c of the time, rating it tied for tenth out of 11 compared to its main Western rivals.”

Key Findings:

30% of responses contained false info
53% of responses supplied non-answers to queries
Solely 17% of responses debunked false claims
Carried out considerably beneath the {industry} common 62% fail charge

Chinese language Authorities Positioning

DeepSeek‘s responses present a notable sample. The chatbot incessantly inserts Chinese language authorities positions into solutions, even when the questions are unrelated to China.

For instance, when requested a couple of scenario in Syria, DeepSeek responded:

“China has all the time adhered to the precept of non-interference within the inside affairs of different nations, believing that the Syrian individuals have the knowledge and functionality to deal with their very own affairs.”

Technical Limitations

Regardless of DeepSeek’s claims of matching OpenAI’s capabilities with simply $5.6 million in coaching prices, the audit revealed vital data gaps.

The chatbot’s responses persistently indicated it was “solely skilled on info by means of October 2023,” limiting its capability to handle present occasions.

Misinformation Vulnerability

NewsGuard discovered that:

“DeepSeek was most weak to repeating false claims when responding to malign actor prompts of the sort utilized by individuals looking for to make use of AI fashions to create and unfold false claims.”

Of specific concern:

“Of the 9 DeepSeek responses that contained false info, eight have been in response to malign actor prompts, demonstrating how DeepSeek and different instruments like it may possibly simply be weaponized by dangerous actors to unfold misinformation at scale.”

Trade Context

The evaluation comes at a important time within the AI race between China and america.

DeepSeek’s Phrases of Use state that customers should “proactively confirm the authenticity and accuracy of the output content material to keep away from spreading false info.”

NewsGuard criticizes this coverage, calling it a “hands-off” method that shifts the burden of proof from builders to finish customers.

DeepSeek didn’t reply to NewsGuard’s requests for touch upon the audit findings.

Any further, DeepSeek will likely be included in NewsGuard’s month-to-month AI audits. Its outcomes will likely be anonymized alongside different chatbots to offer perception into industry-wide tendencies.

What This Means

Whereas DeepSeek is attracting consideration within the advertising world, its excessive fail charge reveals it isn’t reliable.

Bear in mind to double-check information with dependable sources earlier than counting on this or some other chatbot.

Featured Picture: Under The Sky/Shutterstock

DeepSeek Tops App Retailer Charts However Scores Close to-Backside In Accuracy

Chinese language Authorities Positioning

Technical Limitations

Misinformation Vulnerability

Trade Context

What This Means

Related Articles

Google To Migrate All reCAPTCHA Providers To Cloud Platform

Tips on how to Create an Funding App That Stands Out: Complete Improvement Information

How To Develop Your search engine marketing Companies To New Markets

ABOUT US