OCR识别小模型：GOT-OCR2

2025/1/3 0:18:48 来源：https://blog.csdn.net/weixin_42357472/article/details/142256639 浏览: 次关键词：OCR识别小模型：GOT-OCR2

参考：
https://huggingface.co/ucaslcl/GOT-OCR2_0

模型：

export HF_ENDPOINT=https://hf-mirror.comhuggingface-cli download --resume-download --local-dir-use-symlinks False ucaslcl/GOT-OCR2_0 --local-dir got_ocr

安装：

!pip install verovio megfile -i https://pypi.tuna.tsinghua.edu.cn/simple

使用：

from transformers import AutoModel, AutoTokenizertokenizer = AutoTokenizer.from_pretrained('/ai/got_ocr', trust_remote_code=True)
model = AutoModel.from_pretrained('/ai/got_ocr', trust_remote_code=True, low_cpu_mem_usage=True, device_map='cuda', use_safetensors=True, pad_token_id=tokenizer.eos_token_id)
model = model.eval().cuda()# input your test image
image_file = 'image1.jpg'# plain texts OCR
res = model.chat(tokenizer, image_file, ocr_type='ocr')res
# format texts OCR:
res = model.chat(tokenizer, image_file, ocr_type='format')
res

原图
在这里插入图片描述
结果：

在线latex查看：

https://arachnoid.com/latex/
在这里插入图片描述

公式render，打开html

# render the formatted OCR results:
image_file = 'image1.jpg'
res = model.chat(tokenizer, image_file, ocr_type='format', render=True, save_render_file = './demo.html')print(res)

在这里插入图片描述

image_file = 'road.png'# plain texts OCR
res = model.chat(tokenizer, image_file, ocr_type='ocr')res

原图
在这里插入图片描述

结果
在这里插入图片描述

image_file = 'simles1.jpg'
# format texts OCR:
res = model.chat(tokenizer, image_file, ocr_type='format')
res

在这里插入图片描述

OCR识别小模型：GOT-OCR2

最新新闻

热搜词