首页> 外文期刊>Multimedia Tools and Applications >Robust facial landmark extraction scheme using multiple convolutional neural networks
【24h】

Robust facial landmark extraction scheme using multiple convolutional neural networks

机译:使用多个卷积神经网络的鲁棒面部地标提取方案

获取原文
获取原文并翻译 | 示例
           

摘要

Facial landmarks are a set of features that can be distinguished on the human face with the naked eye. Typical facial landmarks include eyes, eyebrows, nose, and mouth. Landmarks play an important role in human-related image analysis. For example, they can be used to determine whether there is a human being in the image, identify who the person is, or recognize the orientation of a face when taking a photograph. General techniques for detecting facial landmarks can be classified into two groups: One is based on traditional image processing techniques, such as Haar cascade classifiers and edge detection. The other is based on machine learning techniques in which landmarks can be detected by training neural network using facial features. However, such techniques have shown low accuracy, especially in some special conditions such as low luminance and overlapped faces. To overcome these problems, we proposed in our previous work a facial landmark extraction scheme using deep learning and semantic segmentation, and demonstrated that with even a small dataset, our scheme could achieve reasonable facial landmark extraction performance under such conditions. Nevertheless, for more extensive dataset, we found several exceptional cases where the scheme could not detect face landmarks precisely. Hence, in this paper, we revise our facial landmark extraction scheme using a deep learning model called Faster R-CNN and show how our scheme can improve the overall performance by handling such exceptional cases appropriately. Also, we show how to expand the training dataset by using image filters and image operations such as rotation for more robust landmark detection.
机译:面部地标是可以用肉眼在人脸上区分的一组功能。典型的面部标志包括眼睛,眉毛,鼻子和嘴巴。地标在与人相关的图像分析中起着重要作用。例如,它们可以用于确定图像中是否有人,识别该人是谁或在拍摄照片时识别脸部朝向。用于检测面部标志的常规技术可以分为两类:一类基于传统的图像处理技术,例如Haar级联分类器和边缘检测。另一个基于机器学习技术,其中可以通过使用面部特征训练神经网络来检测界标。但是,这样的技术显示出较低的精度,特别是在某些特殊条件下,例如低亮度和重叠的面部。为了克服这些问题,我们在先前的工作中提出了一种使用深度学习和语义分割的面部地标提取方案,并证明即使使用很小的数据集,我们的方案也可以在这种条件下实现合理的面部地标提取性能。不过,对于更广泛的数据集,我们发现了几种无法精确检测人脸标志的特殊情况。因此,在本文中,我们使用称为Faster R-CNN的深度学习模型修改了面部标志提取方案,并展示了我们的方案如何通过适当处理此类异常情况来改善整体性能。此外,我们还将展示如何通过使用图像过滤器和图像操作(例如旋转)来扩展训练数据集,以实现更强大的界标检测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号