论文标题
SECOE:减轻机器学习耦合物联网系统中的传感器故障
SECOE: Alleviating Sensors Failure in Machine Learning-Coupled IoT Systems
论文作者
论文摘要
机器学习(ML)应用程序继续彻底改变了许多领域。近年来,人们对为各种物联网(IoT)领域(例如Precision农业,智能城市和智能制造)建立新颖的ML应用程序有很多研究兴趣。物联网域的特征是连续的数据流来自不同的地理传感器,并且通常需要实时或半实时响应。物联网特征在设计和实施有效的ML应用方面构成了一些基本挑战。导致数据流中断的传感器/网络故障就是一个挑战。不幸的是,当面对数据不完整时,许多ML应用程序的性能迅速降低。处理数据不完整的当前技术是基于数据插补的(即,他们试图填写丢失的数据)。不幸的是,这些技术可能会失败,尤其是当多个传感器的数据流并同时不可用时(由于同时的传感器故障)。为了构建强大的物联网耦合的ML应用,本文提出了Secoe,这是一种减轻潜在的同时传感器失败的独特而主动的方法。 SECOE背后的基本思想是创建一个精心选择的ML模型集合,在该模型中,训练了每个模型,假设一组失败的传感器(即,训练集省略了相应的值)。 SECOE包括一种新型技术,可通过利用传感器之间的相关性来最大程度地减少集合中的模型数量。我们通过一系列涉及三个不同数据集的实验来证明SECOE方法的功效。实验发现表明,在存在传感器故障的情况下,SECOE有效地保留了预测准确性。
Machine learning (ML) applications continue to revolutionize many domains. In recent years, there has been considerable research interest in building novel ML applications for a variety of Internet of Things (IoT) domains, such as precision agriculture, smart cities, and smart manufacturing. IoT domains are characterized by continuous streams of data originating from diverse, geographically distributed sensors, and they often require a real-time or semi-real-time response. IoT characteristics pose several fundamental challenges to designing and implementing effective ML applications. Sensor/network failures that result in data stream interruptions is one such challenge. Unfortunately, the performance of many ML applications quickly degrades when faced with data incompleteness. Current techniques to handle data incompleteness are based upon data imputation ( i.e., they try to fill-in missing data). Unfortunately, these techniques may fail, especially when multiple sensors' data streams become concurrently unavailable (due to simultaneous sensor failures). With the aim of building robust IoT-coupled ML applications, this paper proposes SECOE, a unique, proactive approach for alleviating potentially simultaneous sensor failures. The fundamental idea behind SECOE is to create a carefully chosen ensemble of ML models in which each model is trained assuming a set of failed sensors (i.e., the training set omits corresponding values). SECOE includes a novel technique to minimize the number of models in the ensemble by harnessing the correlations among sensors. We demonstrate the efficacy of the SECOE approach through a series of experiments involving three distinct datasets. The experimental findings reveal that SECOE effectively preserves prediction accuracy in the presence of sensor failures.