DeepSeek rolls out latest method to boost AI reasoning

Chinese artificial intelligence (AI) startup DeepSeek has introduced a new method to improve the reasoning capabilities of large language models (LLMs).

The update underscores the company's ongoing efforts to enhance the capabilities of LLMs and broaden their potential applications.

The method was developed in collaboration with researchers from Tsinghua University.

The method combines generative reward modelling (GRM) with self-principled critique tuning to help LLMs respond to general queries with greater precision.

The dual approach enables LLMs to optimise their behaviour based on human feedback.

The resulting DeepSeek-GRM models outperform existing techniques and achieve competitive performance with strong public reward models, according to the published paper.

DeepSeek also plans to make its GRM models open source, although it has not given a timeline.

The news comes amid rising interest in the company's next moves, following the significant attention garnered by its flagship V3 foundation model and its R1 reasoning model.

The R1 reasoning model rose to prominence after performing better than previous models, including OpenAI's first ChatGPT model.
