Cadastre-se agora para um orçamento mais personalizado!

NOTÍCIAS QUENTES

DeepSeek unveils new approach to improve AI reasoning

Apr, 08, 2025 Hi-network.com

Chinese AI firm DeepSeek has unveiled a new method to improve LLM reasoning skills, claiming it offers more accurate and faster responses than current technologies. The approach, developed with researchers from Tsinghua University, combines generative reward modeling (GRM) with a self-principled critique tuning technique.

The method aims to refine how AI LLMs respond to general queries by better aligning their outputs with human preferences. According to a paper published on the arXiv scientific repository, the resulting DeepSeek-GRM models showed stronger performance than existing methods and proved competitive against widely accepted public reward models.

DeepSeek has announced intentions to release these models as open source, though no release date has been set. The move follows increased global interest in the company, which had earlier gained attention for its V3 foundation model and R1 reasoning model.

tag-icon Tags quentes : Inteligência artificial Acesso digital Desenvolvimento DeepSeek publish

Copyright © 2014-2024 Hi-Network.com | HAILIAN TECHNOLOGY CO., LIMITED | All Rights Reserved.
Our company's operations and information are independent of the manufacturers' positions, nor a part of any listed trademarks company.