论文标题

反向GPS野生动植物跟踪系统的信号处理:CPU和GPU实施体验

Signal Processing for a Reverse-GPS Wildlife Tracking System: CPU and GPU Implementation Experiences

论文作者

Rubinpur, Yaniv, Toledo, Sivan

论文摘要

我们介绍了由称为Atlas的高通量野生动物跟踪系统执行的信号处理任务的强大高性能实现。该系统通过估计无线电数据包到达多个接收器(基站)的时间来跟踪野生动物的无线电发射机。宽带无线电信号的到达时间在计算上是昂贵的,尤其是在采集模式下(当传输时间不知道,甚至大致时)。这些计算是限制系统吞吐量的瓶颈。几年前,我们开发了对计算的顺序高性能实施,更是一种GPU实现。就像大多数实际代码一样,都努力平衡性能与简单,可维护性和发展工作。该论文报告了这两个实现,并仔细评估了它们的绩效。评估表明,相对于在当前基站典型的计算机典型的台式CPU上运行的顺序CPU实现,GPU实现可极大地提高性能和功能性能。高端GPU上的性能提高了50倍以上,而GPU平台的功率比CPU平台少了5倍,超过4倍。每瓦的绩效比率也提高了(超过16倍),价格绩效比率也是如此。

We present robust high-performance implementations of signal-processing tasks performed by a high-throughput wildlife tracking system called ATLAS. The system tracks radio transmitters attached to wild animals by estimating the time of arrival of radio packets to multiple receivers (base stations). Time-of-arrival estimation of wideband radio signals is computationally expensive, especially in acquisition mode (when the time of transmission is not known, not even approximately). These computations are a bottleneck that limits the throughput of the system. We developed a sequential high-performance CPU implementation of the computations a few years back, and more recencely a GPU implementation. Both strive to balance performance with simplicity, maintainability, and development effort, as most real-world codes do. The paper reports on the two implementations and carefully evaluates their performance. The evaluations indicates that the GPU implementation dramatically improves performance and power-performance relative to the sequential CPU implementation running on a desktop CPU typical of the computers in current base stations. Performance improves by more than 50X on a high-end GPU and more than 4X with a GPU platform that consumes almost 5 times less power than the CPU platform. Performance-per-Watt ratios also improve (by more than 16X), and so do the price-performance ratios.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源