DeepSeek: 2025 DeepSeek-OCR Technical Report: An Exploratory Study of Visual Compression of Long Text (English version).pdf
Resource Overview
We present DeepSeek-OCR as an initial investigation into the feasibility of compressing long contexts via optical 2D mapping. DeepSeek-OCR consists of two components: DeepEncoder and DeepSeek3B-MoE-A570M as the decoder. Specifically, DeepEncoder serves as the core engine, designed to maintain low activations under high-resolution input while achieving high compression ratios to ensure an optimal and manageable number of vision tokens. Experiments show that when the number of text tokens is wi



