OSPLS：提升数据驱动批处理质量建模与监控的效率与解释性

145 浏览量更新于2024-08-26 收藏 1.04MB PDF 举报

本文主要探讨的是"基于优化的稀疏偏最小二乘的数据驱动批处理端质量建模和监控"这一主题。在现代工业生产环境中，批处理端质量建模是一项关键任务，它通过收集和分析批量数据来预测产品质量，这对于提升生产效率和减少废品至关重要。然而，传统的多路偏最小二乘（PLS）方法在处理大量预测变量时可能会遇到问题，因为其中可能存在众多无关或冗余的变量，这可能导致预测性能下降，模型复杂度增加且解释性减弱。为了克服这个问题，研究人员提出了优化的稀疏PLS（OSPLS）模型。OSPLS的核心理念是通过一种优化策略，同时实现两个目标：一是提高质量预测的准确性，二是进行变量选择，只保留与产品质量紧密相关的变量。这种方法旨在消除那些对质量预测贡献较小或无影响的变量，从而简化模型，增强其稳健性和可靠性。在构建OSPLS模型过程中，作者深入分析了无关变量对质量预测性能的具体影响，强调了相关变量选择的重要性。他们运用了可变分辨率优化技术，结合稀疏PLS，以确保模型能有效地处理高维数据，并找到最优的变量子集。这种方法有助于提高模型的解释性，使工程师能够更好地理解影响产品质量的关键因素。论文还展示了OSPLS方法在实际应用中的有效性，将其应用于补料分批青霉素发酵的Craft.io系统和工业注射成型的Craft.io系统上。通过对这些工业案例的研究，OSPLS方法显示出显著的优势，其预测精度和模型简洁性优于现有的一些最新方法。通过建立基于选定相关变量的统计信息，研究人员还设计了一套有效的质量监控体系，可以实时监测和预警潜在的质量问题。总结来说，这篇研究论文为批处理端质量控制提供了一个创新的解决方案，通过优化的稀疏PLS方法，有效地处理了大量预测变量中的冗余问题，提升了质量预测的精确度和模型的易解读性，对于工业生产过程中的质量管理具有重要的实际价值和理论贡献。

0278-0046 (c) 2019 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TIE.2019.2922941, IEEE

Transactions on Industrial Electronics

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS

Abstract—Batch-end quality modeling is used to predict

the quality by using batch measurements and generally

involves a large number of predictor variables. However,

not all of the variables are beneficial for the prediction.

Conventional multiway partial least squares (PLS) may not

function properly for batch-end quality modeling because

of many irrelevant predictor variables. This study proposes

an optimized sparse PLS (OSPLS) modeling approach for

simultaneous batch-end quality prediction and

relevant-variable selection. The effect of irrelevant

variables on the quality-prediction performance is analyzed,

and the importance of the relevant-variable selection is

emphasized. Then, an OSPLS batch-end quality modeling

approach is developed by incorporating the variable

resolution optimization and sparse PLS modeling. The

quality-prediction accuracy and modeling interpretability

are improved because only quality-relevant variables are

selected, and quality-irrelevant variables are eliminated.

Based on the selected quality-relevant variables, a statistic

is established for monitoring the quality status. The

proposed OSPLS-based modeling and monitoring

approach is applied on a fed-batch penicillin fermentation

process and an industrial injection molding process. The

results are compared with the state-of-the-art methods to

verify the effectiveness of the OSPLS approach.

Index Terms—Sparse modeling, optimized sparse partial

least square, batch-end quality prediction, batch processes,

soft sensing

This work was supported in part by National Natural Science

Foundation of China under Grants 61603138 and 21878081, in part by

Shanghai Pujiang Program under Grant 17PJD009, in part by Hong

Kong Research Grant Council Project under Grant 16207717, and in

part by the Programme of Introducing Talents of Discipline to

Universities (the 111 Project) under Grant B17017. (Corresponding

author: X. Yan)

Q. Jiang and X. Yan are with the Key Laboratory of Advanced Control

and Optimization for Chemical Processes of Ministry of Education, East

China University of Science and Technology, Shanghai 200237, P.R.

China (e-mail: qchjiang@ecust.edu.cn; xfyan@ecust.edu.cn).

H. Yi is with the College of Electronic Engineering and Control

Science, Nanjing University of Technology, Nanjing 211816, P.R. China

(email: jsyihui@126.com).

F. Gao is with the Department of Chemical and Biomolecular

Engineering, The Hong Kong University of Science and Technology,

Clear Water Bay, Kowloon, Hong Kong (e-mail: kefgao@ust.hk).

I. INTRODUCTION

ARGE portions of value-added products are produced in

chemical and pharmaceutical industries by batch processes.

Generally, a batch process consists of several phases, and the

variables in a batch run are expected to follow a pre-defined

recipe. Due to the variations in environmental conditions,

reaction depths, or raw materials, the variable evolution recipe

may be deviated, and the final product quality may be

unsatisfactory. Thus, timely assessment of the process state and

estimation of the final product quality is important [1, 2].

However, the quality variable is generally obtained with some

delay because of the technique used or economic limitation.

Establishing a soft-sensor model for quality prediction is

important. Quality modeling and monitoring techniques are

typically classified into two types, namely, mechanism

(white-box) models and data-driven (black-box) models [3-7].

On the one hand, establishing a mathematic model is difficult

because the reaction during a process is generally complex. On

the other hand, abundant of history data are stored with the

rapid advancement of sensing techniques. Data-driven

modeling and monitoring techniques are gaining increasing

attention [8-13].

Least square (LS) is the basic linear regression method for

quality or key-performance-indicator modeling [14]. However,

the LS generally fails in dealing with high-dimensional and

highly correlated data, because of the regression coefficient

stability and computational efficiency problems. To handle

high-dimensional and highly correlated data, partial least

squares (PLS) is proposed and among the most popular

data-driven soft-sensor development methods [15]. For batch

processes, the multiway PLS (MPLS) that unfolds the

three-way data as two-way data is generally used [16].

However, the following defects of classical MPLS method exist,

which may degrade the prediction performance. After data

unfolding, the number of predictor variables can be remarkably

large, whereas the number of predictor measurements is

generally small. For example, a batch process that has 10

variables and 200 measurements in each batch and a set of data

with 100 batches, the number of predictor variables is

10 200=2000

while the number of predictor measurements is

100. The number of samples n is much smaller than the number

of variables m, this refers to the large m small n problem. Not

all predictor variables are beneficial for predicting the final

quality; the existence of irrelevant variables may damage useful

Data-Driven Batch-End Quality Modeling

and Monitoring Based on Optimized Sparse

Partial Least Squares

Qingchao Jiang, Member, IEEE, Xuefeng Yan, Hui Yi, and Furong Gao

下载后可阅读完整内容，剩余9页未读，立即下载

weixin_38502639

粉丝: 6
资源: 913

OSPLS：提升数据驱动批处理质量建模与监控的效率与解释性

融入深度学习的偏最小二乘优化方法.pdf

关于稀疏最小二乘算法的matlab程序。最小二乘法（又称最小平方法）是一种数学优化技术。它通过最小化误差的平方和寻找数据的最佳函数匹配.zip

二维非负稀疏偏最小二乘在人脸识别中的应用.pdf

最小二乘支持向量机svm

lsqr命令可以得到线性方程的最小范数最小二乘解吗

最小二乘支持向量机(

gomp算法中除了运用最小二乘回归，还有那些方法

共轭梯度法求解最小二乘问题

基于迭代最小化的稀疏贝叶斯重构方法sbrimmatlab\

最新资源