年龄估计

Dataset

  • FG-Net:1002 images,0到69岁,82个人
  • MORPH1:1690 images
  • MORPH2:55608 images、unbalanced ethnic(96% African and European ,less than 1% Asian),16到77岁
  • AFAD:160K Asian facial images(unrelease)
  • HOIP:300个人的306600幅,150个男0150个女
  • IAD数据库:从Internet上获取的175000幅,0-80岁 未公开
  • OAPFD数据库:865个人的865幅,23-60岁,全是男。清华大学数据库。
  • LHI亚洲人脸库:50000幅,20-80岁
  • PCSO数据集:10036幅,17-68岁
  • Audience benchmark:26580幅,年龄和性别标签,8个年龄组/0-2,4-6,8-13,15-20,25-32,38-43,48-53,60+

Ordinal Regression

paper:Ordinal Regression With Multiple Output CNN for Age EstimationCVPR2016
code:Ordinal Regression

Abstract

To address the non-stationary property of aging patterns, age estimation can be cast as an ordinal regression problem.However, the processes of extracting features and learning a regression model are often separated and optimized independently in previous work. In this paper, we propose an End-to-End learning approach to address ordinal regression problems using deep Convolutional Neural Network, which could simultaneously conduct feature learning andregression modeling. In particular, an ordinal regression problem is transformed into a series of binary classification sub-problems. And we propose a multiple output CNN learning algorithm to collectively solve these classification sub-problems, so that the correlation between these tasks could be explored. In addition, we publish an Asian Face Age Dataset (AFAD) containing more than 160K facial images with precise age ground-truths, which is the largest public age dataset to date. To the best of our knowledge, this is the first work to address ordinal regression problems by using CNN, and achieves the state-of-the-art performance on both the MORPH and AFAD datasets
翻译:为了解决衰老模式的非平稳特性,可以将年龄估计作为序数回归问题。然而,在以前的工作中,提取特征和学习回归模型的过程通常是独立分离和优化的。在本文中,我们提出了一种端到端学习方法,使用深度卷积神经网络解决序数回归问题,可以同时进行特征学习和回归建模。特别是,序数回归问题转化为一系列二元分类子问题。我们提出了一种多输出CNN学习算法来集体解决这些分类子问题,从而可以探索这些任务之间的相关性。此外,我们发布了一个亚洲面部年龄数据集(AFAD),其中包含超过160K的面部图像,具有精确的年龄真实性,这是迄今为止最大的公共年龄数据集。据我们所知,这是通过使用CNN解决序数回归问题的第一项工作,并在MORPH和AFAD数据集上实现了最先进的性能。
Q图片2018120212082

Contributions

  • 利用端到端的深度学习方法解决序数回归问题
  • released a dataset named AFAD , the largest public dataset to date

Summary

Rank的思想比较好,针对有序的问题,将序数回归转化为一系列的二分类问题,但是这篇论文并不是第一个使用ranking思想的论文,和以前的方法(提取手工特征或者深度学习特征,然后用SVM/SVR的分类器作为二分类器)不同的是,将ranking和CNN结合起来,可以端到端的训练。


RankingCNN

paperUsing Ranking-CNN for Age EstimationCVPR2017
code: Using-Ranking-CNN-for-Age-Estimation

Abstract

Human age is considered an important biometric trait for human identification or search. Recent research shows that the aging features deeply learned from large-scale data lead to significant performance improvement on facial image-based age estimation. However, age-related ordinal information is totally ignored in these approaches. In this paper, we propose a novel Convolutional Neural Network (CNN)-based framework, ranking-CNN, for age estimation. Ranking-CNN contains a series of basic CNNs, each of
which is trained with ordinal age labels. Then, their binary outputs are aggregated for the final age prediction. We theoretically obtain a much tighter error bound for rankingbased age estimation. Moreover, we rigorously prove that ranking-CNN is more likely to get smaller estimation errors when compared with multi-class classification approaches. Through extensive experiments, we show that statistically, ranking-CNN significantly outperforms other state-of-theart age estimation models on benchmark datasets .
翻译:
人类年龄被认为是一个重要的生物特征,对人类的识别或搜索。最近的研究表明,从大规模数据中深入学习到的年龄特征使得基于人脸图像的年龄估计的性能有了显着的提高。然而,在这些方法中,与年龄相关的顺序信息被完全忽略。本文提出了一种新的基于卷积神经网络(CNN)的年龄估计框架-Ranking-CNN。Ranjing-CNN包含一系列基本CNN,每个CNN都是用顺序年龄标签训练的。然后,将它们的二进制输出进行聚合,以进行最终的年龄预测。我们从理论上得到了一个更严格的基于秩的年龄估计的误差界。此外,我们严格地证明了,与多类分类方法相比,Ranking-CNN更容易获得较小的估计误差。通过大量的实验,我们表明,在统计上,Ranking-CNN在基准数据集上的性能明显优于其他最先进的年龄估计模型。
Q图片2018120212115Q图片2018120212122

Contributions

  • To the best of our knowledge, ranking-CNN is the first work that uses a deep ranking model for age estimation, in which binary ordinal age labels are used to train a series of basic CNNs, one for each age group.Each basic CNN in ranking-CNN can be trained using all the labeled data, leading to better performance of feature learning and also preventing overfitting.
  • We provide a much tighter error bound for age ranking than that introduced in [2], which claimed that the final ranking error is bounded by the sum of errors generated by all the classifiers. We obtain the approximation for the final ranking error that is controlled by the maximum error produced among sub-problems. From a technical perspective, the tighter error bound provides several advantages for the training of ranking-CNN.
  • We prove that ranking-CNN, by taking the ordinal relation between ages into consideration, is more likely to get smaller estimation errors when compared with multi-class classification approaches (i.e., CNNs using the softmax function). Moreover, through extensive experiments, we show that statistically, ranking-CNN significantly outperforms other state-of-the-art age estimation methods

Summary

这篇论文和上篇oridnal regression的思想类似,但是效果要好很多,不同的是每个二分类器都有单独的模型,所有二分类器都不共享中间层,然后聚合所有二分类器的结果得到最终的年龄估计,另外还会得到最终排序误差的近似值,该误差由子问题中产生的最大误差控制,比上一篇论文的方法误差控制更严格。


SSR-Net

paperSSR-Net: A Compact Soft Stagewise Regression Network for Age EstimationIJCAI2018
code: SSRNet

Abstract

This paper presents a novel CNN model called Soft Stagewise Regression Network (SSR-Net) for age estimation from a single image with a compact model size. Inspired by DEX, we address age estimation by performing multi-class classification andthen turning classification results into regression by calculating the expected values. SSR-Net takes a coarse -to-fine strategy and performs multi-class classification with multiple stages.Each stage is only responsible for refining the decision of its previous stage for more accurate age estimation. Thus, each stage performs a task with few classes and requires few neurons, greatly reducing the model size. For addressing the quantization issue introduced by grouping ages into classes, SSR-Net assigns a dynamic range to each age class by allowing it to be shifted and scaled according to the input face image. Both the multi-stage strategy and the dynamic range are incorporated into the formulation of soft stagewise regression. A novel network architecture is proposed for carrying out soft stagewise regression. The resultant SSR-Net model is very compact and takes only 0.32 MB. Despite its compact size, SSR-Net’ s performance approaches those of the state -of-the -art methods whose model sizes are often more than 1500x larger.
翻译:本文提出了一种新的CNN模型,称为Soft stagewise regression net(SSR-Net),使用一个很小的compact模型对单个图像进行年龄估计。受DEX的启发,我们通过执行多类分类来解决年龄估计,然后通过计算预期值将分类结果转化为回归。 SSR-Net采用粗略到精细的策略并执行多阶段的多级分类。每个阶段仅负责完善其前一阶段的决策以进行更准确的年龄估计。因此,每个阶段执行具有少量类的任务并且需要很少的神经元,从而大大减小了模型尺寸。为了解决通过将年龄分组到类中引入的量化问题,SSR-Net通过允许根据输入面部图像移位和缩放来为每个年龄类别分配动态范围。多阶段策略和动态范围都包含在软阶段回归的公式中。提出了一种新颖的网络架构,用于进行软分段回归。由此产生的SSR-Net模型非常紧凑,仅需0.32MB。尽管尺寸紧凑,但SSR-Net的性能接近于那些模型尺寸通常超过1500倍的方法。
Q图片2018120212095

Contributions

新的年龄估计模型,平均误差绝对值为3.16,与最好的模型(当时)差距0.5岁,但参数不到其1/1000,整个模型参数仅0.3M,非常适合用于嵌入式系统。

Summary

采用多分段的多级分类的思想,大大减少了模型的参数,在准确性和模型大小方面做了一个均衡,另外,这么小的模型做到这种准确率很厉害。


DeepRegressionForest

paper:Deep Regression Forests for Age EstimationCVPR2018
code:caffe-DeepRegressionForests

Abstract

Age estimation from facial images is typically cast as a nonlinear regression problem. The main challenge of this problem is the facial feature space w.r.t. ages is heterogeneous, due to the large variation in facial appearance across different persons of the same age and the nonstationary property of aging patterns. In this paper, we propose Deep Regression Forests (DRFs), an end-to-end model, for age estimation. DRFs connect the split nodes to a fully connected layer of a convolutional neural network (CNN) and deal with heterogeneous data by jointly learning input-dependant data partitions at the split nodes and data abstractions at the leaf nodes. This joint learning follows an alternating strategy: First, by fixing the leaf nodes, the split nodes as well as the CNN parameters are optimized by Back-propagation; Then, by fixing the split nodes, the leaf nodes are optimized by iterating a step-size free and fastconverging update rule derived from Variational Bounding. We verify the proposed DRFs on three standard age estimation benchmarks and achieve state-of-the-art results on all of them.
翻译
来自面部图像的年龄估计通常被视为非线性回归问题。这个问题的主要挑战是面部特征空间w.r.t.年龄是不均匀的,这是由于同一年龄段不同人的面部外观差异很大,以及老龄化模式的非平稳性。在本文中,我们提出了深度回归森林(DRFs),一种端到端模型,用于年龄估计。DRF将分裂节点连接到卷积神经网络(CNN)的完全连接层,并通过联合学习分裂节点处的输入相关数据分区和叶节点处的数据抽象来处理异构数据。这种联合学习遵循交替策略:首先,通过固定叶节点,分裂节点以及CNN参数通过反向传播进行优化;然后,通过修剪拆分节点,通过迭代从变分边界派生的步长自由和快速收敛更新规则来优化叶节点。我们在三个标准年龄估算基准上验证了提议的DRF,并在所有这些基准上取得了最新的成果。
83KC{3A7Y}PVX~YU`383R

Contributions

  • We propose Deep Regression Forests (DRFs), an endto-end model, to deal with heterogeneous data by jointly learning input-dependant data partition at split nodes and data abstraction at leaf nodes .
  • Based on Variational Bounding, the convergence of our update rule for leaf nodes in DRFs is mathematically guaranteed.
  • We apply DRFs on three standard age estimation
    benchmarks, and achieve state-of-the-art results.

Summary

用深度回归森林的思想来进行年龄估计的任务,属于一种distribution learning的办法,论文中涉及到大量的概率论的知识和大量的推导公式,将年龄分布分布到多个叶子节点,最终采用随机森林和决策树的思想,对模型训练结果做融合,模型很好训练,且准确率也不错。


Mean-Variance Loss

paper:Mean-Variance Loss for Deep Age Estimation From a FaceCVPR2018
code:-

Abstract

Age estimation has wide applications in video surveillance, social networking, and human-computer interaction. Many of the published approaches simply treat age estimation as an exact age regression problem, and thus do not leverage a distribution’s robustness in representing labels with ambiguity such as ages. In this paper, we propose a new loss function, called mean-variance loss, for robust age estimation via distribution learning. Specifically, the mean-variance loss consists of a mean loss, which penalizes difference between the mean of the estimated age distribution and the ground-truth age, and a variance loss, which penalizes the variance of the estimated age distribution to ensure a concentrated distribution. The proposed meanvariance loss and softmax loss are jointly embedded into Convolutional Neural Networks (CNNs) for age estimation. Experimental results on the FG-NET, MORPH Album II, CLAP2016, and AADB databases show that the proposed approach outperforms the state-of-the-art age estimation methods by a large margin, and generalizes well to image aesthetics assessment.1
翻译
年龄估计在视频监控,社交网络和人机交互中具有广泛的应用。许多已发表的方法仅将年龄估计视为精确的年龄回归问题,因此不会利用分布的稳健性来表示具有模糊性的标签,例如年龄。在本文中,我们提出了一种新的损失函数,称为均值 - 方差损失,用于通过分布学习进行稳健的年龄估计。具体而言,均值 - 方差损失包括平均损失,其惩罚估计年龄分布的平均值与地面实际年龄之间的差异,以及方差损失,其惩罚估计年龄分布的方差以确保集中分布。 。所提出的均值方差损失和softmax损失共同嵌入到卷积神经网络(CNN)中用于年龄估计。在FG-NET,MORPH Album II,CLAP2016和AADB数据库上的实验结果表明,所提出的方法大大超过了现有技术的年龄估计方法,并且很好地概括了图像美学评估。
Q图片2018120213002

Contributions

  1. propose a new loss:mean-variance loss,aiming at the estimate of an age distribution with its mean as close to the groud-truth age as possible , and its variance as small as possible
  2. diffferent from the age disturibution learning methods,the proposed approach doesn’t require that each training image must have a mean age and a variance (neither real nor assumed) labels during model training, but it can still give a distribution estimate for a face image
  3. the proposed loss can be easily embedded into different CNNs, and the network can be optimized via SGD end-to-end

Summary

思想简单且新颖,有用且有效。