Statistical compressive sensing based on convolutional Gaussian mixture model

汪韧; 郭静波; 惠俊鹏; 王泽; 刘红军; 许元男; 刘韵佛

doi:10.7498/aps.68.20190414

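Summary

Gaussian mixture models are widely used in statistical compressive sensing to model the prior probability distribution of the signal. When a Gaussian mixture model is used to model the probability distribution of images, the image usually has to be divided into patches first, and the distribution of the patches is then modeled. This paper proposes a convolutional Gaussian mixture model that models the probability distribution of the whole image. The unknown parameters of the model are estimated by solving the maximum marginal likelihood estimation problem with the expectation-maximization algorithm. In addition, because computation on a whole image is costly, circular convolution is introduced into both the convolutional Gaussian mixture model and the compressive measurement model, so that all training and reconstruction steps can be computed quickly with the two-dimensional fast Fourier transform. Simulation experiments show that the proposed MMLE-convGMM algorithm achieves better reconstruction performance than conventional compressive sensing algorithms.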
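The role of circular convolution can be illustrated with a small numerical check. The sketch below is not the authors' implementation; it only shows the standard identity that a two-dimensional circular convolution equals a pointwise product of two-dimensional FFTs. The function names, image size and kernel size are hypothetical choices made for this illustration.

```python
import numpy as np

def circular_conv2d_direct(image, kernel):
    """Direct 2-D circular convolution (kernel implicitly zero-padded to image size)."""
    out = np.zeros(image.shape, dtype=np.float64)
    kh, kw = kernel.shape
    for i in range(kh):
        for j in range(kw):
            # np.roll shifts with wrap-around, i.e. the circular boundary condition:
            # rolled[m, n] == image[(m - i) % H, (n - j) % W]
            out += kernel[i, j] * np.roll(image, shift=(i, j), axis=(0, 1))
    return out

def circular_conv2d_fft(image, kernel):
    """Same operation via the 2-D FFT: multiply spectra, then invert."""
    kernel_spectrum = np.fft.fft2(kernel, s=image.shape)  # zero-pads the kernel
    return np.real(np.fft.ifft2(np.fft.fft2(image) * kernel_spectrum))

# Hypothetical sizes, chosen only for the demonstration.
rng = np.random.default_rng(0)
image = rng.standard_normal((8, 8))
kernel = rng.standard_normal((3, 3))
same = np.allclose(circular_conv2d_direct(image, kernel),
                   circular_conv2d_fft(image, kernel))
print("direct and FFT results agree:", same)
```

For an H x W image the FFT route costs on the order of HW log(HW) operations regardless of kernel size, which is the usual reason for preferring it when operating on whole images.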

Abstract

Statistical compressive sensing relies on a statistical description of the source signal. The Gaussian mixture model (GMM) has been used to statistically represent image patches, obtained by decomposing a whole image into a set of non-overlapping or overlapping patches. Compressive sensing, however, is imposed on the whole image, and the entire image contains much richer information than its small patches. Extending from small patches to an entire image, we propose a convolutional Gaussian mixture model (convGMM) to describe the statistics of an entire image and apply it to compressive sensing. We present the details of an algorithm that learns a convGMM from training images by maximizing the marginal log-likelihood. The learned convGMM is then used as the model of the underlying image to perform model-based compressive sensing. In addition, because high-dimensional images make learning, estimation and optimization computationally expensive, all of the training and reconstruction steps in our method are computed efficiently in the frequency domain by two-dimensional fast Fourier transforms. The performance of the convGMM on compressive sensing is demonstrated on several image sets.
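As background for how a GMM prior is used in model-based compressive sensing, the following minimal sketch computes the posterior-mean estimate of a signal from linear measurements y = Phi x + noise when x follows a conventional (dense, non-convolutional) GMM. It is an illustration under stated assumptions, not the convGMM algorithm of this paper: the Gaussian noise model, the function name `gmm_posterior_mean` and all numerical values are hypothetical.

```python
import numpy as np

def gmm_posterior_mean(y, Phi, weights, means, covs, noise_var):
    """Posterior mean of x given y = Phi @ x + n, n ~ N(0, noise_var * I),
    with prior x ~ sum_k weights[k] * N(means[k], covs[k])."""
    log_post = []
    comp_means = []
    for pi_k, mu_k, Sigma_k in zip(weights, means, covs):
        S = Phi @ Sigma_k @ Phi.T + noise_var * np.eye(len(y))  # covariance of y under component k
        r = y - Phi @ mu_k                                      # measurement residual
        S_inv_r = np.linalg.solve(S, r)
        _, logdet = np.linalg.slogdet(S)
        # Log posterior weight up to a constant shared by all components.
        log_post.append(np.log(pi_k) - 0.5 * (logdet + r @ S_inv_r))
        # Component-wise posterior mean (linear Gaussian update).
        comp_means.append(mu_k + Sigma_k @ Phi.T @ S_inv_r)
    log_post = np.array(log_post)
    post = np.exp(log_post - log_post.max())  # subtract max for numerical stability
    post /= post.sum()
    return post @ np.array(comp_means), post

# Toy example (all values hypothetical): 4-dimensional signal, 2 measurements.
Phi = np.array([[1.0, 0.0, 0.0, 0.0],
                [0.0, 1.0, 0.0, 0.0]])
weights = [0.5, 0.5]
means = [np.zeros(4), 10.0 * np.ones(4)]
covs = [0.1 * np.eye(4), 0.1 * np.eye(4)]
y = Phi @ means[1]  # measurement generated at the second component's mean
x_hat, post = gmm_posterior_mean(y, Phi, weights, means, covs, noise_var=0.01)
print("posterior weights:", post)
print("estimate:", x_hat)
```

Because each component is Gaussian and the measurement is linear, the posterior is again a GMM with closed-form weights and means, so no iterative optimization is needed at reconstruction time in this simplified setting.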

Authors and contacts

1.
China Academy of Launch Vehicle Technology R&D Center, Beijing 100076, China

2.
Department of Electrical Engineering, Tsinghua University, Beijing 100084, China

Corresponding author: Wang Ren, wangren94@126.com

Funds: Project supported by the National Natural Science Foundation of China (Grant No. 51677094)

Fig. 1. Structure of convGMM with application to compressive sensing.

Fig. 2. Average PSNR of images reconstructed from the CIFAR-10 dataset by different algorithms, as a function of sampling rate.
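The two quantities on the axes of such plots can be computed as in the sketch below. It assumes the standard definitions, which this page does not spell out: PSNR as 10 log10(peak^2 / MSE) with a peak value of 255 for 8-bit images, and sampling rate as the number of measurements divided by the number of pixels. The function names and the example sizes are hypothetical.

```python
import numpy as np

def psnr(reference, reconstruction, peak=255.0):
    """Peak signal-to-noise ratio in dB (assumed definition: 10*log10(peak^2 / MSE))."""
    ref = np.asarray(reference, dtype=np.float64)
    rec = np.asarray(reconstruction, dtype=np.float64)
    mse = np.mean((ref - rec) ** 2)
    if mse == 0.0:
        return float("inf")  # identical images
    return 10.0 * np.log10(peak ** 2 / mse)

def sampling_rate(num_measurements, num_pixels):
    """Assumed definition: ratio of compressive measurements to signal dimension."""
    return num_measurements / num_pixels

# Hypothetical example: a 32 x 32 single-channel image (1024 pixels)
# measured with 410 compressive measurements.
reference = np.zeros((32, 32))
reconstruction = reference + 1.0  # every pixel off by one grey level, so MSE = 1
print("PSNR (dB):", psnr(reference, reconstruction))
print("sampling rate:", sampling_rate(410, 32 * 32))
```

An averaged curve is then obtained by computing the PSNR for each test image at a given sampling rate and taking the mean over the test set.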

Fig. 3. Average PSNR of images reconstructed from the Caltech 101 dataset by different algorithms, as a function of sampling rate.

Fig. 4. Comparison of reconstruction performance on 12 randomly selected “airplane” images from Caltech 101 at a sampling rate of 0.4: (a) original images; (b) images reconstructed by MMLE-convGMM; (c) images reconstructed by MMLE-GMM; (d) images reconstructed by KSVD-YALL1; (e) images reconstructed by DCT-YALL1; (f) images reconstructed by DCT-GAP; (g) images reconstructed by DCT-OMP.

Fig. 5. Average PSNR of CelebA images reconstructed by MMLE-convGMM, as a function of sampling rate.

Fig. 6. Reconstruction of randomly selected CelebA face images: (a) original images; (b) images reconstructed by MMLE-convGMM at a sampling rate of 0.4.
