Identification of Fault Indicator Variables of Wind Turbine Pitch System Based on SGD-R's Improved K-means Algorithm

Publication Name

2021 Global Reliability and Prognostics and Health Management, PHM-Nanjing 2021


K-means algorithm is a numerical, unsupervised, and uncertain iterative method. This method is simple and fast, and has been proved to be a very effective clustering method in practical applications. This article focuses mainly on the two shortcomings of the K-means algorithm, that is, the algorithm is sensitive to the outliers and noise points of the data sample, and the objective function has a local optimal solution. The improvement of this algorithm is proposed by using the stochastic gradient descent method to replace the traditional K-means algorithm to obtain the average value of the distance within the cluster to update the cluster center. After continuously reducing the learning rate, the algorithm is implemented in the direction of gradient descent, pulling the cluster center nearer to the data sample. Through this improvement, the influence of isolated points on the cluster center is effectively avoided. At the same time, for the situation where the objective function has a local optimal solution, a restart method is used to perform multiple search judgments. It is to determine if the restart conditions are met. When the conditions are met, the action of start to jump is performed in the program. By enlarging increment of the learning rate and then continuously reducing the learning rate, more optimal values are found and the optimized value is finally retained. By combining the principal component analysis with the improved clustering algorithm, and comparing the purity of clustering, the fault indicator variables that effectively reflect the wind turbine pitch system faults are obtained.

Open Access Status

This publication is not available as open access