Research

SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-Powered Intelligent PhlatCam

Authors

Yang Zhang
Yonggan Fu
Yue Wang
Zhihan Lu
Vivek Boominathan
Ashok Veeraraghavan
Yingyan Lin

Cite

Research

SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-Powered Intelligent PhlatCam

ICCV

Cite Paper Code

Authors

Yang Zhang
Yonggan Fu
Yue Wang
Zhihan Lu
Vivek Boominathan
Ashok Veeraraghavan
Yingyan Lin

Published on

10/17/2021

There has been a booming demand for integrating Convolutional Neural Networks (CNNs) powered functionalities into Internet-of-Thing (IoT) devices to enable ubiquitous intelligent “IoT cameras”. However, more extensive applications of such IoT systems are still limited by two challenges. First, some applications, especially medicine- and wearable-related ones, impose stringent requirements on the camera form factor. Second, powerful CNNs often require considerable storage and energy cost, whereas IoT devices often suffer from limited resources. PhlatCam, with its form factor potentially reduced by orders of magnitude, has emerged as a promising solution to the first aforementioned challenge, while the second one remains a bottleneck. Existing compression techniques, which can potentially tackle the second challenge, are far from realizing the full potential in storage and energy reduction, because they mostly focus on the CNN algorithm itself. To this end, this work proposes SACoD, a Sensor Algorithm Co-Design framework to develop more efficient CNN-powered PhlatCam. In particular, the mask coded in the PhlatCam sensor and the backend CNN model are jointly optimized in terms of both model parameters and architectures via differential neural architecture search. Extensive experiments including both simulation and physical measurement on manufactured masks show that the proposed SACoD framework achieves aggressive model compression and energy savings while maintaining or even boosting the task accuracy, when benchmarking over two state-of-the-art (SOTA) designs with six datasets across four different vision tasks including classification, segmentation, image translation, and face recognition.

Please cite our work using the BibTeX below.

@InProceedings{Fu_2021_ICCV,
    author    = {Fu, Yonggan and Zhang, Yang and Wang, Yue and Lu, Zhihan and Boominathan, Vivek and Veeraraghavan, Ashok and Lin, Yingyan},
    title     = {SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-Powered Intelligent PhlatCam},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {5168-5177}
}