DeepSeek: 2025 DeepSeek-OCR Technical Report: An Exploratory Study of Visually Compressing Long Text (English Edition)

Published by: wx****a5

2025-11-04

7 MB, 22 pages

Artificial Intelligence (AI)

File list:

DeepSeek：2025年DeepSeek-OCR技术报告：视觉压缩长文本的探索性研究（英文版）.pdf

Resource summary

We present DeepSeek-OCR as an initial investigation into the feasibility of compressing long contexts via optical 2D mapping. DeepSeek-OCR consists of two components: DeepEncoder and DeepSeek3B-MoE-A570M as the decoder. Specifically, DeepEncoder serves as the core engine, designed to maintain low activations under high-resolution input while achieving high compression ratios to ensure an optimal and manageable number of vision tokens. Experiments show that when the number of text tokens is wi
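The "compression ratio" mentioned in the summary can be illustrated with a small sketch. It assumes the ratio is defined as the number of text tokens divided by the number of vision tokens used to represent them; that definition, the function name `compression_ratio`, and the token counts below are illustrative assumptions, not values or code from the report.

```python
def compression_ratio(text_tokens: int, vision_tokens: int) -> float:
    """Text tokens represented per vision token (assumed definition)."""
    if vision_tokens <= 0:
        raise ValueError("vision_tokens must be positive")
    return text_tokens / vision_tokens


# Hypothetical token counts, chosen only for illustration.
ratio = compression_ratio(text_tokens=1000, vision_tokens=100)
print(f"{ratio:.1f}x")
```

Under this assumed definition, a higher ratio means each vision token stands in for more text tokens, which is the trade-off the summary describes between compression and a manageable number of vision tokens.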
