Algorithm Engineer
Led the document parsing and enterprise knowledge-base (RAG) QA projects for the AI Platform. Drove the migration of document parsing from a multi-model LayoutLM+OCR pipeline to an end-to-end QwenVL approach, building the fine-tuning datasets and evaluation system; reached 94.6% text recognition accuracy and raised table recognition TEDS from 74% to 87%. Built a multi-agent generation framework that synthesized 20k+ multi-turn medical consultation dialogues for SFT.