基于改进沙漏网络的红外图像视线估计

INFRARED IMAGE GAZE ESTIMATION BASED ON IMPROVED HOURGLASS NETWORK

  • 摘要: 随着 VR/AR 等元宇宙概念的兴起,集成虹膜识别的视线估计技术是目前的研究热点,而虹膜识别与视线跟踪都采用近红外摄像头捕捉眼部图像,仅从单眼红外眼部图像进行视线估计是一项挑战性任务。该文提出新的视线估计网络,首先利用圆形眼球与椭圆虹膜的映射得到中间眼部模型标签图像,引入中间监督;再通过改进沙漏网络回归红外虹膜图像的中间眼部模型图像;最后再结合 DenseNet 回归 3D 视线角度。设计近红外光虹膜采集系统,建立红外虹膜数据集 SEPAD-Gaze。实验结果表明,该文提出的视线估计方法在数据集 SEPAD-Gaze 上误差为 4.62°,并且在可见光下的公共数据集 MPIIGaze 上做了泛化性验证实验,误差低至 4.48°,超过目前所有主流算法。

     

    Abstract: With the rise of metauniverse concepts such as VR/AR, gaze estimation technology integrating iris recognition is a research hotspot at present. Both iris recognition and gaze tracking use near-infrared cameras to capture eye images. It is a challenging task to estimate the line of sight only from monocular infrared eye images. In this paper, a new gaze estimation network is proposed. The middle eye model label image was obtained by using the mapping of circular eye and elliptical iris, and the intermediate supervision was introduced. The middle eye model image of the infrared iris image was regressed through the improved hourglass network, and the 3D gaze angle was regressed with DenseNet. The near-infrared rainbow film acquisition system was designed, and the infrared iris dataset SEPAD-Gaze was established. The experimental results show that the error of the gaze estimation method proposed in this paper is 4.62° on the dataset SEPAD-Gaze. The generalization verification experiment is done on the public dataset MPIIGaze under visible light, the error is as low as 4.48°, which exceeds all the current mainstream algorithms.

     

/

返回文章
返回