'deepseek-r1' 태그의 글 목록

본문 바로가기

deepseek-r1

Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling https://developer.nvidia.com/blog/automating-gpu-kernel-generation-with-deepseek-r1-and-inference-time-scaling/?ncid=so-link-284103&linkId=100000338909940 Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling | NVIDIA Technical BlogAs AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time s.. 더보기

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning https://arxiv.org/abs/2501.12948 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement LearningWe introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrates remarkable reasoninarxiv.org 초록우리의 첫 번째 세대 추론 .. 더보기

이전 1 다음

티스토리툴바