矿山数工
矿山数工通过构建全流程智能化数据体系,显著提升矿山行业数据应用效能:针对数据标注效率低、成本高的痛点,建立矿山场景级标签体系,融合7大模态数据研发标准化标注模板,实现500亿级传感数据、300万视频数据与200亿词元文本数据的统一标注,支撑小模型敏捷训练;依托太阳石大模型,在跨模态语义增强技术上实现突破,构建行业问答对自动生成机制,攻克长尾数据缺失难题,实现大模型微调数据的高效合成与行业词汇理解力跃升;同步打造非结构化文本智能转换系统,结合合规校验模块与动态知识库划分技术,将行业标准、安全规程等数据实现结构化存储与智能更新,使知识检索准确率突破90%,并形成可迁移复用的一体化知识底座,全面释放矿山数据资产价值,为智能化转型提供核心驱动力。
By building a full-process intelligent data system, the Mine Dt has significantly improved the data application efficiency of the mining industry. Targeting the pain points of low efficiency and high cost in data annotation, it has established a mine scenario-level tag system, developed standardized annotation templates by integrating seven major modal data, and realized unified annotation of 50-billion-level sensing data, 3 million video data and 20-billion-token text data, supporting the agile training of small models. Based on the Solstone Model, breakthroughs have been made in cross-modal semantic enhancement technology. An automatic generation mechanism for industry question-answer pairs has been constructed to tackle the problem of long-tail data scarcity, enabling efficient synthesis of fine-tuning data for large models and a leap in the understanding of industry terminology. In parallel, an intelligent conversion system for unstructured text has been developed. Combined with a compliance verification module and dynamic knowledge base partitioning technology, it has realized structured storage and intelligent updating of data such as industry standards and safety regulations, pushing the knowledge retrieval accuracy to over 90%. An integrated, transferable and reusable knowledge base has been formed, fully unlocking the value of mine data assets and providing core driving force for the intelligent transformation of the mining industry.

长按二维码【关注公众号】,获取更多精彩内容!
煤科总院
联系人:祝广森     联系电话:15110818666