论文标题
Integriscreen:在视觉上监督受损客户端的远程用户交互
IntegriScreen: Visually Supervising Remote User Interactions on Compromised Clients
论文作者
论文摘要
用户通过其本地客户端(笔记本电脑或台式机)访问的远程服务和应用程序通常会假设,在会话开始时成功的用户身份验证后,所有后续的通信都反映了用户的意图。但是,如果对手获得对客户端的控制,并且可以操纵用户看到的内容以及发送到远程服务器的内容,则这是不正确的。 为了保护用户与远程服务器的通信,尽管有可能受到损害的本地客户端,我们建议使用配备相机的第二个设备连续视觉监督的概念。由于使用前置摄像头(例如增强现实耳机和智能家居助理)迅速增加传入设备数量的动机,我们以核心思想为基础,即用户的实际预期输入是客户屏幕上显示的内容,尽管最终发送到了远程服务器。因此,静态定位的相机设备可以不断地分析客户端的屏幕,以强制执行客户的行为,尽管可能是恶意的。 我们通过开发功能齐全的原型,在三种不同的移动设备上进行了大量实验测试,并进行了一项用户研究,在各种模拟攻击中分析参与者对系统的使用,我们通过开发功能齐全的原型来评估此概念的当前可行性和可部署性。实验评估确实证实了视觉监督概念的可行性,因为该系统始终检测到98%的评估攻击,而教学的研究参与者很少检测到剩余的攻击,其概率很高。
Remote services and applications that users access via their local clients (laptops or desktops) usually assume that, following a successful user authentication at the beginning of the session, all subsequent communication reflects the user's intent. However, this is not true if the adversary gains control of the client and can therefore manipulate what the user sees and what is sent to the remote server. To protect the user's communication with the remote server despite a potentially compromised local client, we propose the concept of continuous visual supervision by a second device equipped with a camera. Motivated by the rapid increase of the number of incoming devices with front-facing cameras, such as augmented reality headsets and smart home assistants, we build upon the core idea that the user's actual intended input is what is shown on the client's screen, despite what ends up being sent to the remote server. A statically positioned camera enabled device can, therefore, continuously analyze the client's screen to enforce that the client behaves honestly despite potentially being malicious. We evaluate the present-day feasibility and deployability of this concept by developing a fully functional prototype, running a host of experimental tests on three different mobile devices, and by conducting a user study in which we analyze participants' use of the system during various simulated attacks. Experimental evaluation indeed confirms the feasibility of the concept of visual supervision, given that the system consistently detects over 98% of evaluated attacks, while study participants with little instruction detect the remaining attacks with high probability.