基于热裂解-气相色谱-质谱法和随机森林的加热卷烟烟叶原料适用性评估
Applicability Evaluation of Heat-Not-Burn Tobacco Raw Materials Based on Thermal Pyrolysis-Gas Chromatography-Mass Spectrometry and Random Forests
-
摘要: 基于热裂解-气相色谱-质谱法(Py-GC-MS)和随机森林(RF),从化学成分角度分析加热卷烟烟叶原料适用性。称取过筛后的样品粉末0.90mg于样品杯中,采用Py-GC-MS对28种不同类型的加热卷烟进行检测,用MZmine软件对Py-GC-MS数据进行处理,获得含有峰强度信息的特征峰表。分别以样品的特征峰表和感官评价得分作为自变量和因变量,采用RF回归算法建立加热卷烟烟叶原料适用性模型。结果显示:RF模型在训练集上的决定系数为0.93,均方根误差为0.85,在测试集上的决定系数为0.92,均方根误差为0.96;根据NIST 2017库的定性结果,共筛选出20个特征重要性评分较高的化学成分。Abstract: Based on thermal pyrolysis-gas chromatography-mass spectrometry (Py-GC-MS) and random forests (RF), the applicability of heat-not-burn tobacco raw materials was analyzed from the perspective of chemical composition. The sieved sample powder (0.90 mg) was placed into the sample cup, Py-GC-MS was used to detect 28 different types of heat-not-burn tobaccos, and MZmine software was used to process the Py-GC-MS data to obtain the characteristic peak table containing intensity information. The characteristic peak table and the sensory evaluation scores of the samples were used as independent variables and dependent variables, respectively, and RF regression algorithm was used to establish the applicability model of heat-not-burn tobacco raw materials. It was shown that the coefficient of determination of RF model on the training set was 0.93, and the root mean square error was 0.85. The coefficient of determination on the testing set was 0.92, and the root mean square error was 0.96. According to the qualitative results of NIST 2017 database, 20 chemical compositions with higher feature importance scores were screened out.