https://huggingface.co/deepseek-ai/DeepSeek-V2.5
V2 has a paper and an architecture description, but V2.5 has neither...
https://github.com/deepseek-ai/DeepSeek-V2
The paper is mainly about MLA (Multi-head Latent Attention).
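As a rough illustration of the core idea behind MLA, here is a minimal sketch of low-rank key/value compression: each token's hidden state is projected down to a small latent vector, only that latent is cached, and keys and values are rebuilt from it when needed. This is a toy under stated assumptions, not the model's actual implementation: all names (`W_down`, `W_up_k`, `W_up_v`, `compress`, `expand`) and all sizes are hypothetical, the weights are random, and the decoupled RoPE key and the attention computation itself are left out.

```python
import random

def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def rand_matrix(rows, cols, rng):
    """Random matrix standing in for learned weights."""
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

# Hypothetical toy sizes, not the real model dimensions.
D_MODEL, D_LATENT, D_HEAD, N_HEADS = 16, 4, 8, 2

rng = random.Random(0)
W_down = rand_matrix(D_LATENT, D_MODEL, rng)           # shared down-projection
W_up_k = rand_matrix(N_HEADS * D_HEAD, D_LATENT, rng)  # up-projection for keys
W_up_v = rand_matrix(N_HEADS * D_HEAD, D_LATENT, rng)  # up-projection for values

def compress(h):
    """Return the latent vector that would be cached for one token."""
    return matvec(W_down, h)

def expand(latent):
    """Rebuild the keys and values of all heads from the cached latent."""
    return matvec(W_up_k, latent), matvec(W_up_v, latent)

hidden = [rng.gauss(0.0, 1.0) for _ in range(D_MODEL)]
latent = compress(hidden)
keys, values = expand(latent)

# Per-token cache cost: full K and V for every head versus one shared latent.
full_cache_floats = 2 * N_HEADS * D_HEAD
mla_cache_floats = len(latent)
print(full_cache_floats, mla_cache_floats)
```

With these toy sizes the cache holds 4 floats per token instead of 32, which is the kind of KV-cache saving the paper is after.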