研究报告

酒精饮料食品安全合规风险信息的自动检测方法

  • 陶丹丹 ,
  • 白国志 ,
  • 寇晨光 ,
  • 杨勇 ,
  • 朱慧茹 ,
  • 郑淼 ,
  • 王凯波
展开
  • 1(清华大学 万科公共卫生与健康学院,北京,100084)
    2(重庆邮电大学 软件工程系,重庆,400065)
    3(北京顺鑫农业股份有限公司牛栏山酒厂,北京,101301)
    4(中国食品发酵工业研究院有限公司,北京,100015)
第一作者:博士,助理研究员(郑淼高级工程师和王凯波教授为共同通信作者,E-mail:fjyzm@foxmail.com;kbwang@tisnghua.edu.cn)

收稿日期: 2024-03-07

  修回日期: 2024-04-25

  网络出版日期: 2024-08-21

基金资助

北京市科技计划课题项目(Z221100007122009)

Automated detection of regulatory changes with machine learning to mitigate compliance risks in the alcoholic beverage industry

  • TAO Dandan ,
  • BAI Guozhi ,
  • KOU Chenguang ,
  • YANG Yong ,
  • ZHU Huiru ,
  • ZHENG Miao ,
  • WANG Kaibo
Expand
  • 1(Vanke School of Public Health, Tsinghua University, Beijing 100084,China)
    2(Department of Software Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065,China)
    3(Beijing Shun Xin Agriculture Co.Ltd., Niulanshan Distillery, Beijing 101301,China)
    4(China National Research Institute of Food and Fermentation Industries Co.Ltd., Beijing 100015,China)

Received date: 2024-03-07

  Revised date: 2024-04-25

  Online published: 2024-08-21

摘要

随着食品安全法规的不断发展,紧跟时事,及时了解条款修订情况对企业确保遵守并降低潜在的风险至关重要。利用计算机技术有助于自动识别监管变更,精简监察流程并能够及时地响应。该研究旨在探讨机器学习算法在食品安全风险管理中的应用。研究提出了一个提高合规风险管理效率和有效性的框架,基于Transformer的双向编码器表示(bidirectional encoder representations from transformers, BERT)——一个预训练的自然语言处理模型,以典型的监督学习模型为基线,被用来识别与特定食品类别潜在食品安全风险相关的新闻信息变化情况。新闻报道中的组织、食物、风险、法规等关键实体被BERT模型自动提取。以酒精饮料为例,结合领域专家提供的标注数据,研究得到了一个微调的(有提高的)BERT模型,该模型可以自动检测与酒精饮料和与之相关的关键实体相关的潜在监管变化。结果表明,相关性预测的F1分值为0.88,实体识别的F1分值为0.60。所提出的方法有可能显著减少手工工作,提高检测监管变化的准确性,最终强化食品企业的合规策略。

本文引用格式

陶丹丹 , 白国志 , 寇晨光 , 杨勇 , 朱慧茹 , 郑淼 , 王凯波 . 酒精饮料食品安全合规风险信息的自动检测方法[J]. 食品与发酵工业, 2024 , 50(15) : 281 -288 . DOI: 10.13995/j.cnki.11-1802/ts.039133

Abstract

With the ever-evolving landscape of food safety regulations,staying abreast of modifications is crucial for enterprises to ensure adherence and mitigate potential risks.Leveraging computational technologies facilitates the automatic identification of regulatory changes,streamlining the monitoring process and enabling timely responses.This study explored the implementation of advanced machine learning algorithms in the context of the food industry,presenting a framework that enhanced the efficiency and effectiveness of compliance risk management.In this work,bidirectional encoder representations from transformers (BERT),a pretrained natural language processing model,was employed to identify regulatory news relevant to potential food safety risks of specific food categories,with typical supervised learning models as baselines.Key entities such as organization, food, contaminant,and regulation reported in the news reports were also automatically extracted by the BERT model.Using alcoholic beverage as an example along with the labeled data provided by domain experts,we obtained a fine-tuned BERT model that can automatically detect potential regulatory change related to alcoholic beverage and the critical entities associated with it.The results showed that the F1-score of relevance prediction was 0.81,and the F1-score of entity detection was 0.60.The proposed approach holds the potential to significantly reduce manual efforts,enhance accuracy in detecting regulatory alterations,and ultimately fortify the compliance strategies of food enterprises.

参考文献

[1] HENSON S, CASWELL J.Food safety regulation:An overview of contemporary issues[J].Food Policy, 1999, 24(6):589-603.
[2] KOTSANOPOULOS K V, ARVANITOYANNIS I S.The role of auditing, food safety, and food quality standards in the food industry:A review[J].Comprehensive Reviews in Food Science and Food Safety, 2017, 16(5):760-775.
[3] MENSAH L D, JULIEN D.Implementation of food safety management systems in the UK[J].Food Control, 2011, 22(8):1216-1225.
[4] NGUYEN T T B, LI D.A systematic literature review of food safety management system implementation in global supply chains[J].British Food Journal, 2022, 124(10):3014-3031.
[5] HIGNETTE G, BUCHE P, COUVERT O, et al.Semantic annotation of Web data applied to risk in food[J].International Journal of Food Microbiology, 2008, 128(1):174-180.
[6] TAO D D, ZHANG D Y, HU R F, et al.Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media[J].Scientific Reports, 2021, 11(1):21678.
[7] WANG J, YUE H L.Food safety pre-warning system based on data mining for a sustainable food supply chain[J].Food Control, 2017, 73:223-229.
[8] NYCHAS G J E, PANAGOU E Z, MOHAREB F.Novel approaches for food safety management and communication[J].Current Opinion in Food Science, 2016, 12:13-20.
[9] WU Y N, CHEN J S.Food safety monitoring and surveillance in China:Past, present and future[J].Food Control, 2018, 90:429-439.
[10] PANG X N, LI Z J, CHEN J Y, et al.A comprehensive review of spirit drink safety standards and regulations from an international perspective[J].Journal of Food Protection, 2017, 80(3):431-442.
[11] 胡康, 王雅洁, 杨冰, 等.白酒质量安全风险分析与防范[J].中国酿造, 2019, 38(8):216-223.
HU K, WANG Y J, YANG B, et al.Risk analysis and prevention of Baijiu quality safety[J].China Brewing, 2019, 38(8):216-223.
[12] HE N X, BAYEN S.An overview of chemical contaminants and other undesirable chemicals in alcoholic beverages and strategies for analysis[J].Comprehensive Reviews in Food Science and Food Safety, 2020, 19(6):3916-3950.
[13] BŁASZCZYK I.The management of food safety in beverage industry[M]//Safety Issues in Beverage Production.Amsterdam:Elsevier, 2020:1-38.
[14] WANG Z F, QU P F, ZHAO Y F, et al.A nationwide survey and risk assessment of ethyl carbamate exposure due to daily intake of alcoholic beverages in the Chinese general population[J].Foods, 2023, 12(16):3129.
[15] WALDNER C.Big data for infectious diseases surveillance and the potential contribution to the investigation of foodborne disease in Canada[D].Winnipeg, Canada:National Collaborating Centre for Infectious Diseases, 2018.
[16] VAN DE BRUG F J, LUCAS LUIJCKX N B, CNOSSEN H J, et al.Early signals for emerging food safety risks:From past cases to future identification[J].Food Control, 2014, 39:75-86.
[17] MAEDA Y, KURITA N, IKEDA S.An early warning support system for food safety risks[M]//New Frontiers in Artificial Intelligence.Berlin, Heidelberg:Springer Berlin Heidelberg, 2006:446-457.
[18] KATE K, CHAUDHARI S, PRAPANCA A, et al.FoodSIS:A text mining system to improve the state of food safety in Singapore[C]//Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining.ACM, 2014:1709-1718.
[19] CHEN S Q, HUANG D D, NONG W Y, et al.Development of a food safety information database for Greater China[J].Food Control, 2016, 65:54-62.
[20] ZHU X Y, HUANG I Y, MANNING L.The role of media reporting in food safety governance in China:A dairy case study[J].Food Control, 2019, 96:165-179.
[21] BOUZEMBRAK Y, STEEN B, NESLO R, et al.Development of food fraud media monitoring system based on text mining[J].Food Control, 2018, 93:283-296.
[22] MARVIN H J P, HOENDERDAAL W, GAVAI A K, et al.Global media as an early warning tool for food fraud;an assessment of MedISys-FF[J].Food Control, 2022, 137:108961.
[23] TAO D D, YANG P K, FENG H.Utilization of text mining as a big data analysis tool for food science and nutrition[J].Comprehensive Reviews in Food Science and Food Safety, 2020, 19(2):875-894.
[24] ZHAI C X, MASSUNG S.Text Data Management and Analysis:A Practical Introduction to Information Retrieval and Text Mining[M]. New York:Association for Computing Machinery, 2016.
[25] OLDROYD R A, MORRIS M A, BIRKIN M.Identifying methods for monitoring foodborne illness:Review of existing public health surveillance techniques[J].JMIR Public Health and Surveillance, 2018, 4(2):e57.
[26] HU R, ZHANG D, TAO D, et al.TWEET-FID:An annotated dataset for multiple foodborne illness detection tasks[J].arXiv preprint arXiv, 2022, 2205.10726.
[27] QIAN C, MURPHY S I, ORSI R H, et al.How can AI help improve food safety?[J].Annual Review of Food Science and Technology, 2023, 14:517-538.
[28] ROBERTSON A S, REISIN MILLER A, DOLZ F.Supporting a data-driven approach to regulatory intelligence[J].Nature Reviews.Drug Discovery, 2021, 20(3):161-162.
[29] ROBERTS K, THAKKAR R, ALJUBURI L, et al.A vision for integrated publicly available information on regulated medical products[J].Clinical and Translational Science, 2022, 15(6):1321-1327.
[30] PATIL R S, KULKARNI S B, GAIKWAD V L.Artificial intelligence in pharmaceutical regulatory affairs[J].Drug Discovery Today, 2023, 28(9):103700.
[31] LI F Y, DONG S Y, LEIER A, et al.Positive-unlabeled learning in bioinformatics and computational biology:A brief review[J].Briefings in Bioinformatics, 2022, 23(1):bbab461.
文章导航

/