Commit 8e19e3e

Update README.md
1 parent 13d605f commit 8e19e3e

1 file changed: +17 -0 lines changed

README.md

Lines changed: 17 additions & 0 deletions
@@ -32,6 +32,7 @@
 </div>
 
 ## 🔥 News
+- **[2026/04/08]** 🎉 Our works on document parsing and text-image machine translation have been accepted to the CVPR 2026 Main Conference! Check out the papers: [Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training](https://arxiv.org/abs/2603.23885) and [MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition-Perception-Reasoning Guided Text-Image Machine Translation](https://arxiv.org/abs/2603.23896).
 - **[2026/01/13]** ⭐ We have released a stable official [online demo](https://hunyuan.tencent.com/chat/HunyuanDefault?modelId=HY-OCR-1.0&mid=308&from=vision-zh), feel free to try it out!
 - **[2025/11/28]** 🛠️ We fixed vLLM inference bugs and hyperparameter configuration issues such as system prompt. It is recommended to use the latest vLLM installation steps and the [inference script](https://github.com/Tencent-Hunyuan/HunyuanOCR/blob/main/Hunyuan-OCR-master/Hunyuan-OCR-vllm/run_hy_ocr.py) for performance testing. Currently, there is still a certain accuracy difference between Transformers and the vLLM framework (we are working on fixing this).
 - **[2025/11/25]** 📝 Inference code and model weights publicly available.
@@ -393,6 +394,22 @@ Our model is able to translate images of minor languages taken into Chines
 journal={arXiv preprint arXiv:2511.19575},
 url={https://arxiv.org/abs/2511.19575},
 }
+
+@misc{li2026mmtitbench,
+title={MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition-Perception-Reasoning Guided Text-Image Machine Translation},
+author={Gengluo Li and Chengquan Zhang and Yupu Liang and Huawen Shen and Yaping Zhang and Pengyuan Lyu and Weinong Wang and Xingyu Wan and Gangyan Zeng and Han Hu and Can Ma and Yu Zhou},
+year={2026},
+journal={arXiv preprint arXiv:2603.23896},
+url={https://arxiv.org/abs/2603.23896},
+}
+
+@misc{li2026towardsrealworlddocument,
+title={Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training},
+author={Gengluo Li and Pengyuan Lyu and Chengquan Zhang and Huawen Shen and Liang Wu and Xingyu Wan and Gangyan Zeng and Han Hu and Can Ma and Yu Zhou},
+year={2026},
+journal={arXiv preprint arXiv:2603.23885},
+url={https://arxiv.org/abs/2603.23885},
+}
 ```
 
 ## 🙏 Acknowledgements
