Home
Scholarly Works
A semi-supervised learning algorithm for high and...
Journal article

A semi-supervised learning algorithm for high and low-frequency variable imbalances in industrial data

Abstract

This work introduces a semi-supervised learning algorithm to estimate missing data for processes where measured data is comprised of variables that are measured at high frequency and low frequency. A semi-supervised learning algorithm named “Weight-Adjusted Consistency Regularization Algorithm for Semi-Supervised Learning” (WACR-SSL) based on consistency regularization is proposed. The algorithm splits the irregular unbalanced data set into three parts and processes them separately. To address the loss balancing problem, five loss balancing methods have been tested: Uncertainty Weights (UW), Random Loss Weighting (RLW), Dynamic Weight Average (DWA), Geometric Loss Strategy (GLS) and the logarithmic transformation (LogT). When applied to data from a hydrocracking process, the algorithm effectively leverages partially labeled data. With carefully chosen noise scales and the coefficient for the unsupervised loss, the uncertainty weight (UW) variant performs the best when compared to the other loss balancing methods.

Authors

Zhu J; Fan C; Yang M; Qian F; Mahalec V

Journal

Computers & Chemical Engineering, Vol. 193, ,

Publisher

Elsevier

Publication Date

February 1, 2025

DOI

10.1016/j.compchemeng.2024.108933

ISSN

0098-1354

Labels

Contact the Experts team