본문 바로가기

인공지능

DeepSeek-V2.5

https://huggingface.co/deepseek-ai/DeepSeek-V2.5

 

deepseek-ai/DeepSeek-V2.5 · Hugging Face

Paper Link👁️ DeepSeek-V2.5 1. Introduction DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct. The new model integrates the general and coding abilities of the two previous versions. For model details, p

huggingface.co

 

v2는 논문하고 구조가 있는데 2.5는 없음...

https://github.com/deepseek-ai/DeepSeek-V2

 

GitHub - deepseek-ai/DeepSeek-V2: DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model - deepseek-ai/DeepSeek-V2

github.com

 

MLA가 메인인 논문