推定分類精度評価に対する考察

4.2.8 3 層の CNN についてモデルの構成要素を変更した場合の認識結果の比較

4.3 推定分類精度評価に対する考察

第 5 ^{章おわりに}

本研究では，CNNによる衛星画像上の地物認識手法の解析および地物認識のための CNNのモデルの提案を行った．ゴルフ場を対象とした地物認識実験では，既存のCNN モデルを用いて地物認識を行い，その際の入力データの与え方による認識結果への影響やCNNの中間層の解析を行った．メガソーラーを対象とした地物認識実験では，ゴルフ場と同様に既存のCNNモデルを用いて入力データの与え方による認識結果への影響の解析を行い，そのうえで，地物認識のためのCNNモデルの提案と教師データの作成方法の提案を行った．

ゴルフ場を対象とした地物認識実験では，教師データの負例のランダムアンダーサンプリングや入力する衛星画像のバンドの組み合わせ，衛星画像から切り出して入力するセルのサイズについて解析を行った．負例のランダムアンダーサンプリングについては，

それによって負例を半分に減らしても F-valueに大きな影響はないが，precisionとrecall にトレードオフの関係があることが確認された．入力するバンドの組み合わせについて

は，F-valueで評価を行うと，多バンドを入れたほうが良いことが分かったが，ゴルフ場

のテクスチャのコントラストが大きいバンドを単バンドで入力しても比較的良い結果となることが分かった．入力するセルのサイズについては，32×32, 48×48, 64×64の3 通りを比較したところ，48×48の場合が比較的良い結果となった．また，テストデータについてCNNの出力するクラス確率を解析したところ，負例の大部分は正例のクラス確率が0.1以下であったが，それでも正例の数と比較すると無視できない数の負例が0.1以上に分布しており，クラス確率に閾値を設定するのは難しいことが分かった．CNNの中間層の特徴マップの解析では，正例と負例で活性化している特徴マップに違いがあることがわかり，負例の中でも市街地や森，海などの入力の違いによって特徴マップにも違いが出ていることが確認された．

メガソーラーを対象とした地物認識実験では，既存のCNNモデルについて教師データの負例のランダムアンダーサンプリング数や入力する衛星画像のバンドの組み合わせを変えた場合の解析を行った．その結果，負例のランダムアンダーサンプリングについては，ゴルフ場と同じ傾向が確認された．バンドの組み合わせについては，単バンドや可視域のバンドではほとんど認識することができず，近赤外域のバンドの組み合わせが重

要であることが分かった．提案するCNNのモデルについては，バンドを選択せずに同一解像度の7バンドを全て入力することでメガソーラーを認識できることが確認され，既存手法と比べても優れた性能を示すことが確認された．教師データの不完全さについては，CNNで実際には正例であるセルを含んだ負例を学習しても，負例に含まれていた正例も含めて正例と認識できることが確認された．

今後の展望としては，教師データの質や正例の量を向上することが出来れば性能の向上が可能であると考えられる．また，本研究ではLandsat 8衛星画像のみを用いたが，その他の多バンドの衛星画像やより高解像度の衛星画像に対しても同様のフレームワークで適用することが可能であると考えられる．

謝辞

本研究を進めるに当たり，指導教員の石川博教授から毎週丁寧なご指導を頂きました．

望月義彦助教からは，論文作成，発表資料作成など多くのことでご指導を頂きました．飯塚里志研究員助教，エドガー・シモセラ研究員助教からは特に機械学習に関して，研究方針，プログラムの実装，論文作成などでご指導を頂きました．国立情報学研究所の杉本晃宏教授からは研究方針についてご指導を頂きました．鳥取大学の小山田雄仁助教からは研究や発表資料についてご指導を頂きました．石川研究室の学生の皆様にも日頃から大変お世話になりました．また，本研究は産業技術総合研究所リサーチアシスタントとしての研究成果でもあり，機械学習研究チームの中村良介主任研究員にはリモートセンシング分野の視点から丁寧なご指導を頂きました．メガソーラーの教師データの作成では機械学習研究チームの皆様に大変お世話になりました．ここに感謝の意を表します．

2016 年 2 月 1 日石井智大

参考文献

[1] The Landsat-8 data immediate release site, Japan. http://landsat8.geogrid.

org/.

[2] Christopher M. Bishop. Pattern Recognition and Machine Learning. Information Science and Statistics. Springer-Verlag New York, Inc., 2006.

[3] Marco Castelluccio, Giovanni Poggi, Carlo Sansone, and Luisa Verdoliva. Land use classification in remote sensing images by convolutional neural networks. arXiv preprint arXiv:1508.00092, 2015.

[4] Chih-Chung Chang and Chih-Jen Lin. LIBSVM: A library for support vector ma-chines. ACM Transactions on Intelligent Systems and Technology, Vol. 2, pp. 27:1–

27:27, 2011. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm. [5] Nitesh V. Chawla. Data mining for imbalanced datasets: An overview. In Data

Mining and Knowledge Discovery Handbook, pp. 875–886. Springer, 2010.

[6] Piotr Dollar, Zhuowen Tu, and Serge Belongie. Supervised learning of edges and object boundaries. InIEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2006, Vol. 2, pp. 1964–1971. IEEE, 2006.

[7] Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin.

Liblinear: A library for large linear classification. The Journal of Machine Learning Research, Vol. 9, pp. 1871–1874, June 2008.

[8] Kunihiko Fukushima and Sei Miyake. Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in position. Pattern Recognition, Vol. 15, No. 6, pp. 455 – 469, 1982.

[9] Haibo He and Edwardo A. Garcia. Learning from imbalanced data. IEEE Transac-tions on Knowledge and Data Engineering, Vol. 21, No. 9, pp. 1263–1284, 2009.

[10] Sergey Ioﬀe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167, 2015.

[11] Tomohiro Ishii, Ryosuke Nakamura, Hidemoto Nakada, Yoshihiko Mochizuki, and Hiroshi Ishikawa. Surface object recognition with CNN and SVM in landsat 8 images.

In 14th IAPR International Conference on Machine Vision Applications (MVA), 2015, pp. 341–344. IEEE, 2015.

[12] Stefan Kluckner and Horst Bischof. Semantic classification by covariance descriptors within a randomized forest. In IEEE 12th International Conference on Computer Vision Workshops (ICCV Workshops), 2009, pp. 665–672. IEEE, 2009.

[13] Stefan Kluckner, Thomas Mauthner, Peter M. Roth, and Horst Bischof. Semantic classification in aerial imagery by integrating appearance and height information. In Computer Vision–ACCV 2009, pp. 477–488. Springer, 2010.

[14] Alex Krizhevsky. Learning multiple layers of features from tiny images. Master’s thesis, University of Toronto, 2009.

[15] Alex Krizhevsky, Ilya Sutskever, and Geoﬀrey E. Hinton. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, Vol. 25, pp. 1106–1114, 2012.

[16] Yann LeCun, Bernhard Boser, John S Denker, Donnie Henderson, Richard E Howard, Wayne Hubbard, and Lawrence D Jackel. Backpropagation applied to handwritten zip code recognition. Neural computation, Vol. 1, No. 4, pp. 541–551, 1989.

[17] A. Mathur and G.M. Foody. Multiclass and binary SVM classification: Implications for training and classification users. Geoscience and Remote Sensing Letters, IEEE, Vol. 5, No. 2, pp. 241–245, 2008.

[18] Volodymyr Mnih and Geoﬀrey E. Hinton. Learning to detect roads in high-resolution aerial images. In Computer Vision–ECCV 2010, pp. 210–223. Springer, 2010.

[19] Volodymyr Mnih and Geoﬀrey E. Hinton. Learning to label aerial images from noisy data. In Proceedings of the 29th International Conference on Machine Learning

[20] Otavio A. B. Penatti, Keiller Nogueira, and Jefersson A. dos Santos. Do deep fea-tures generalize from everyday objects to remote sensing and aerial scenes domains?

In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, June 2015.

[21] D.P. Roy, M.A. Wulder, T.R. Loveland, C.E. Woodcock, R.G. Allen, M.C. Anderson, D. Helder, J.R. Irons, D.M. Johnson, R. Kennedy, et al. Landsat-8: Science and product vision for terrestrial global change research.Remote Sensing of Environment, Vol. 145, pp. 154–172, 2014.

[22] Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper with convolutions. InThe IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015.

[23] Compton J. Tucker. Red and photographic infrared linear combinations for mon-itoring vegetation. Remote Sensing of Environment, Vol. 8, No. 2, pp. 127 – 150, 1979.

[24] Yi Yang and Shawn Newsam. Bag-of-visual-words and spatial extensions for land-use classification. InProceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 270–279. ACM, 2010.

[25] 平井雄三. はじめてのパターン認識,第7章. 森北出版株式会社, 2012.

[26] 岡谷貴之, 齋藤真樹. コンピュータビジョン最先端ガイド 6, 第4章. アドコム・メディア株式会社, 2013.

[27] 麻生英樹, 安田宗樹, 前田新一, 岡野原大輔, 岡谷貴之, 久保陽太郎, ボレガラダヌシカ. 深層学習: Deep Learning. 株式会社近代科学社, 2015.

[28] 日本リモートセンシング学会. 基礎からわかるリモートセンシング. 理工図書株式会社, 2011.

[29] 中村良介. パーソナル・コミュニケーション, 2015.

付録 A ^実験環境

ドキュメント内 i iv vi (ページ 95-102)

4.2.8 3 層の CNN についてモデルの構成要素を変更した場合の認識結 果の比較

4.3 推定分類精度評価に対する考察

第 5 章 おわりに

謝辞

参考文献

付 録 A 実験環境

4.2.8 3 層の CNN についてモデルの構成要素を変更した場合の認識結果の比較

第 5 ^{章おわりに}

付録 A ^実験環境