Integrating active learning and semi-supervised learning for improved data-driven HVAC fault diagnosis performance

Applied Energy (2024)

Abstract
Data-driven methods have drawn increasing interest in HVAC fault diagnosis tasks due to their intrinsic advantages in making real-time automated decisions. To ensure the reliability of data-driven models, it is essential to prepare sufficient labeled data for predictive modeling. In practice, it can be very time-consuming and labor-intensive to determine the actual operating condition or label of each data sample (e.g., Normal or Faulty), making it highly challenging to develop robust data-driven solutions through conventional supervised learning methods. To tackle these challenges, this study proposes a data analytic framework that integrates active learning and semi-supervised learning to utilize massive unlabeled data for improved fault diagnosis performance. More specifically, five active learning methods have been tested to quantify their effectiveness in discovering valuable unlabeled data for expert labeling. Semi-supervised data-driven models have been developed to enable autonomous knowledge discovery from unlabeled building operational data through self-training protocols. Data experiments have been conducted to explore the separate and integrated value of active and semi-supervised learning. The results show that active learning can effectively identify valuable data samples for fault diagnosis, thereby reducing labeling costs by approximately 50%. Cost-effective combinatorial strategies have been derived to integrate active learning and semi-supervised learning for practical applications. The research outcomes are valuable for developing advanced data-driven solutions with substantial decreases in manual costs.
Keywords
Active learning, Semi-supervised learning, HVAC fault diagnosis, Data-driven model, Artificial intelligence
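
The two ideas combined in the abstract can be illustrated with a minimal sketch. The abstract does not name the five active learning methods or the details of the self-training protocol, so everything below is an assumption made for illustration: least-confidence sampling stands in for an active learning query rule, a fixed confidence threshold stands in for the pseudo-labeling rule, and the function names and probability values are hypothetical.

```python
from typing import List, Sequence, Tuple


def least_confidence_query(probs: Sequence[Sequence[float]], budget: int) -> List[int]:
    """Active learning step (assumed rule: least confidence).

    Returns the indices of the `budget` unlabeled samples whose highest
    predicted class probability is lowest, i.e. the most uncertain ones.
    These would be sent to an expert for labeling.
    """
    ranked = sorted(range(len(probs)), key=lambda i: max(probs[i]))
    return ranked[:budget]


def confident_pseudo_labels(
    probs: Sequence[Sequence[float]], threshold: float
) -> List[Tuple[int, int]]:
    """Self-training step (assumed rule: fixed confidence threshold).

    Returns (sample index, predicted class) pairs for samples whose highest
    predicted class probability is at least `threshold`. These would be
    added to the training set with the model's own prediction as the label.
    """
    picked = []
    for i, p in enumerate(probs):
        top = max(range(len(p)), key=lambda k: p[k])
        if p[top] >= threshold:
            picked.append((i, top))
    return picked


# Hypothetical model outputs [P(Normal), P(Faulty)] for four unlabeled samples.
probs = [[0.55, 0.45], [0.98, 0.02], [0.10, 0.90], [0.51, 0.49]]

to_label = least_confidence_query(probs, budget=2)    # uncertain -> expert
pseudo = confident_pseudo_labels(probs, threshold=0.9)  # confident -> self-label
print(to_label, pseudo)
```

In this sketch the uncertain samples go to the expert while the confident ones are labeled automatically, which is one plausible way the two strategies could share a single pool of unlabeled data; the combinatorial strategies actually derived in the paper may differ.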