画像処理・認識｜O. Hasegawa Lab.（長谷川研究室） | 東京工業大学　像情報工学研究所

画像処理・認識
研究テーマ
ホーム

このページでは，長谷川研で研究・開発した顔画像処理や画像認識の研究例についてご紹介します。

これまでの発表文献

顔検出 (Face Detection)

Shinji Hayashi and Osamu Hasegawa : "A detection technique for degraded face images", Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2 (CVPR '06), pp.1506-1512, (2007)

ABSTRACT

This paper describes a face detection technique that enables detection of extremely small faces such as 6 × 6 pixel. This is the first approach to detect very low-resolution faces in the field of face detection. First, we use a conventional AdaBoost-based face detector to show that the face detection rate falls to 39% from 88% as face resolution decreases from 24 × 24 pixels to 6 × 6 pixels. Second, we propose a new face detection method comprising four techniques. It improves the face detection rate from 39% to 73% for 6 × 6 pixel faces of the MIT+CMU frontal face test set. Finally, we applied our method to real world data. By merging resultsof some frames and judging using a threshold, we show that our method is effective for processing real world data.

デモビデオ

林伸治，長谷川修 : "低解像度画像からの顔検出", 画像電子学会誌 , Vol.34, No.6, pp.726-737, (2005)[PDF]

ABSTRACT

顔検出はコンピュータビジョンで最も研究が盛んな分野の一つであり、近年、大きな進歩を遂げている。しかし、低解像度画像からの顔検出については、ほとんど研究されていない。本論文では、顔検出における標準的な検出器と評価画像であるAdaBoostベースの顔検出器およびMIT+CMU 評価画像を用い、24×24ピクセルの顔に対し88%であった顔検出率が、6×6 ピクセルの顔に対しては39%まで低下することを示す。これに対し、「肩まで含む顔画像の使用」「画像の拡大」「特徴の周波数制限」「二つの検出器の統合」から構成される提案手法を示し、6×6ピクセルの顔に対して、71%の検出率が得られたことを示す。なお、上記検出率は、いずれも評価画像112枚に対して誤検出数100 における値である。

物体認識

Takahiro Toyoda, and Osamu Hasegawa, “A random field model for integration of local information and global information”, IEEE Trans. Pattern Analysis and Machine Intelligence (PAMI), Vol.30, No.8, pp.1483-1489, (2008)

ABSTRACT

This paper presents a proposal of a general framework that explicitly models local information and global information in a conditional random field. The proposed method extracts global image features as well as local ones and uses them to predict the scene of the input image. Scene-based top-down information is generated based on the predicted scene. It represents a global spatial configuration of labels and category compatibility over an image. Incorporation of the global information helps to resolve local ambiguities and achieves locally and globally consistent image recognition. In spite of the model's simplicity, the proposed method demonstrates good performance in image labeling of two datasets.

豊田崇弘，田上啓介，長谷川修："局所情報と大域情報を統合する条件付確率場による画像ラベリング", 電子情報通信学会論文誌，Vol.J90-D No.6 pp.1542-1554 (2007) [PDF]

ABSTRACT

本研究では画像中のすべての画素にカテゴリーラベルを割り当てる画像ラベリングを行う。提案手法では従来手法のように局所的な特徴のみを用いるのではなく、大域的な特徴も用いてラベルの推定を行う。また、カテゴリー間の同時に起こりやすい関係や起こりにくい関係もモデル化し、ラベルの推定に利用する。局所的な推定と大域的な推定の統合には条件付確率場（Conditional Random Field: CRF）を利用する。従来のCRFは局所的な関係をモデル化するのには優れているが、大域的な関係は十分にモデル化できない。そこで複数の確率場を用いたモデル化が行われているが、モデル構造が複雑となるという欠点がある。これに対し提案手法では一つの確率場において推定を統合するためモデル構造が簡単である。また、従来手法では局所的な特徴への依存が大きいという問題があったが、提案手法では大域的な特徴も利用するためこの問題にも対応している。提案手法の有効性は2種類の屋外シーン画像のラベリング実験により確認した。大域的な特徴を用いることで認識精度は10％程度向上し、大域的な視点から明らかな誤りは大きく減少した。

高次局所自己相関特徴の拡張

Takahiro Toyoda and Osamu Hasegawa : "Extension of higher order local autocorrelation features", Pattern Recognition, Vol.40, No.5, pp.1466-1473 (2007)

ABSTRACT

This study investigates effective image features that are widely applicable in image analysis. We specifically address higher order local autocorrelation (HLAC) features, which are used in various applications. The original HLAC features are restricted up to the second order and are represented by 25 mask patterns. We increase their orders up to eight and extract the extended HLAC features using 223 mask patterns. Furthermore, we create large mask patterns and construct multi-resolution features to support large displacement regions. In texture classification and face recognition, the proposed method outperformed Gaussian Markov random fields, Gabor features, and local binary pattern operator.

豊田崇弘，長谷川修, "高次局所自己相関特徴の拡張", 画像電子学会誌，Vol.34, No.4, pp.390-397 (2005) [PDF]

ABSTRACT

高次局所自己相関特徴は画像解析において広く利用されている基本的な特徴である。これらは従来、次数2次までに限定され、25個のマスクパターンによって表現されていた。一方、本研究では次数を最高8次まで拡張し、最大223個のマスクパターンを用いて特徴を表現する。さらにマスクパターンのサイズを拡大することで広い変位領域に関する特徴も抽出できるようにし、これらを用いて多重解像度化特徴を構成する。提案手法は種々のテクスチャ識別においてガウシアンマルコフ確率場、 Gabor特徴、Local Binary Pattern 特徴より良好な結果を得た。例えば 32x32 画素の小さい画像の識別ではGabor特徴で93.4% の識別率だったのに対し提案手法では97.5%を得た。他にもスケール変化や90度回転を含む画像の識別、300クラス以上の多クラスの識別において良好な識別性能を示した。また顔識別では Gabor特徴で識別率が96.0%だったのに対し、提案手法では98.4%が得られ、特定の構造を持つ対象に対しても有効であることを確認した。

サーチ無し高速フラクタル画像圧縮 (Fractal image compression “without search”)

Shen Furao, Osamu Hasegawa : "A fast no search fractal image coding method", Signal Processing: Image Communication , vol.19, pp.393-404, (2004)

ABSTRACT

A new no search fractal image coding scheme is introduced which is able to improve the speed of fractal image compression greatly. Every time-consuming part of fractal coding is redesigned and accelerated with new techniques. Compared with the most recent scheme of Tong and Wong, this method speeds up the encoding process by 22 times and maintain the compression quality. Experiments on standard images show that the proposed scheme gets the fastest speed of fractal image coding up to the present and holds high reconstruction fidelity. For example, using PII 450 MHz PC, the proposed scheme spends 0.515 s to compress the Lena (512×512×8) with 36.04 dB PSNR decoding quality. Using Dell PIV 2.8 GHz PC, it spends only 0.078 s to finish the encoding process and gets 36.04 dB PSNR.