The Chinese artificial intelligence model DeepSeek learns more when it receives ‘rewards’

By Internationalfinancialnews.com, 21 September 2025


The Chinese artificial intelligence model DeepSeek-R1 learns more, and learns better, when it receives “rewards” for solving problems, but those incentives require human intervention, so the approach can be expensive and can also limit the model's capacity to grow.

This was confirmed by a team of researchers and technologists, among them executives of the Chinese company that launched this open artificial intelligence (AI) model, who analyzed its potential and its limitations; they published the results of their work today in the journal Nature.

Teaching AI models to reason the way humans do is a challenge, and the researchers confirmed that large language models (LLMs) already demonstrate certain reasoning capabilities, although training them requires substantial computational resources.


AI models begin to reason

The DeepSeek-R1 model includes an additional training stage under human supervision to improve the reasoning process, and it uses reinforcement learning instead of human examples to develop the reasoning steps, which, according to the researchers and company executives, reduces the cost and complexity of training.

However, the article acknowledges some limitations of the current version of the model, including that it mixes two languages, Chinese and English, and that it is optimized only for those languages.

They also cite, as a limitation, that the model showed no significant improvement on some tasks, such as software engineering, and they stress that future research must focus on improving these ‘reward’ processes to guarantee the reliability of the reasoning and of the tasks performed by this AI.

The researchers showed that the model obtains good results on mathematics, biology, physics and chemistry tests and in programming competitions. They concluded that training AI to reason with less human intervention is possible, which opens the door to models that are capable of growing, more powerful and cheaper, although many challenges remain to be resolved.

With information from EFE
