Research Overview

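The appropriate way for a game AI to think and learn, and how difficult that is, depends heavily on the properties of the game: perfect versus imperfect information, the number of players (one, two, or three or more), and the reward structure (zero-sum or otherwise). By choosing games with suitable properties as test beds, we can evaluate methods from a variety of aspects. For two-player zero-sum perfect-information games, please also see the pages on Go and shogi.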

Reinforcement learning in HFO (Half Field Offense)

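Half Field Offense in RoboCup 2D Soccer (HFO) is one of the tasks in the RoboCup simulation environment and also serves as a reinforcement learning task. To handle the hierarchical structure of its action space properly (that is, to handle parameterized actions such as turn 30), we proposed HA-PPO. In the 1:1 setting it achieved a goal success rate of about 71%, the highest result so far among reinforcement learning methods.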
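To make the notion of a parameterized action concrete, the following is a minimal Python sketch of a two-level action space: a discrete action type is chosen first, and then the continuous parameters that belong to that type. The action names, parameter names, and value ranges in ACTION_SPEC are illustrative assumptions, not the HFO API, and the random sampling merely stands in for a learned policy; this is not the HA-PPO implementation.

```python
import random
from dataclasses import dataclass

# Hypothetical schema: each discrete action type owns its continuous
# parameters, given as (low, high) bounds. Names and ranges are assumed
# for illustration only.
ACTION_SPEC = {
    "dash": {"power": (0.0, 100.0), "direction": (-180.0, 180.0)},
    "turn": {"direction": (-180.0, 180.0)},
    "kick": {"power": (0.0, 100.0), "direction": (-180.0, 180.0)},
}


@dataclass
class ParameterizedAction:
    kind: str      # discrete action type, e.g. "turn"
    params: dict   # continuous parameters for that type, e.g. {"direction": 30.0}


def sample_action(rng):
    """Two-level sampling: pick the action type, then that type's parameters."""
    kind = rng.choice(sorted(ACTION_SPEC))
    params = {
        name: rng.uniform(low, high)
        for name, (low, high) in ACTION_SPEC[kind].items()
    }
    return ParameterizedAction(kind, params)


def is_valid(action):
    """Check that the parameters match the action type and lie within bounds."""
    spec = ACTION_SPEC.get(action.kind)
    if spec is None or set(action.params) != set(spec):
        return False
    return all(spec[n][0] <= v <= spec[n][1] for n, v in action.params.items())
```

In this representation, "turn 30" corresponds to ParameterizedAction("turn", {"direction": 30.0}). The point of the sketch is that the set of valid parameters depends on the discrete choice, which is the hierarchy a learning method has to respect.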

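We study reinforcement learning in games. The challenge is to learn solely from the rules of the game, or from experience gained through a simulator, without using human knowledge or game records for the target game. AlphaZero can also be regarded as an application of reinforcement learning.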

We conduct research on computer Go.

Publications

Mandai, Y. and T. Kaneko “RankNet for evaluation functions of the game of Go,” ICGA Journal, Vol. 41, No. 2, pp. 78–91 (2019), DOI: 10.3233/ICG-190108.
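Mandai and Kaneko: "Visualization of the Basis for Decisions of Go Neural Networks" (囲碁ニューラルネットワークの判断根拠の可視化), 23rd Game Programming Workshop, pp. 9–15 (2018). http://id.nii.ac.jp/1001/00191957
Watanabe (渡辺順哉), Yoshizoe, and Kaneko: "Optimization of Playout Policies Integrated with Monte Carlo Tree Search" (モンテカルロ木探索を統合したプレイアウト方策の最適化), 20th Game Programming Workshop, pp. 5–11 (2015) (Research Encouragement Award).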
Evaluation of Game Tree Search Methods by Game Records Takeuchi, S.; Kaneko, T.; Yamaguchi, K.; IEEE Transactions on Computational Intelligence and AI in Games, 2 (4), 288 - 302, Dec. 2010.
H. Yoshimoto, K. Yoshizoe, T. Kaneko, A. Kishimoto, and K. Taura: Monte Carlo Go Has a Way to Go, Twenty-First National Conference on Artificial Intelligence (AAAI-06), pages 1070-1075, 2006

Migo

https://github.com/tkaneko/migo

Parallel and distributed search

We study methods for searching efficiently by coordinating a large number of computers.

S. Yokoyama, T. Kaneko, and T. Tanaka: Parameter-Free Tree Style Pipeline in Asynchronous Parallel Game-Tree Search, The 14th International Conference on Advances in Computers and Games
Scalable Distributed Monte-Carlo Tree Search. Kazuki Yoshizoe, Akihiro Kishimoto, Tomoyuki Kaneko, Haruhiro Yoshimoto and Yutaka Ishikawa. In Proceedings of the 4th Symposium on Combinatorial Search (SoCS'2011), pages 180-187, 2011

Monte Carlo tree search

We work on improving the performance of Monte Carlo tree search from both theoretical and practical perspectives.
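For orientation, the selection step of UCT, a widely used form of Monte Carlo tree search, picks the child that maximizes the UCB1 score: the mean reward plus an exploration bonus that shrinks as the child is visited more often. The sketch below is a minimal Python version of that textbook rule; the function name, the (visits, total_reward) representation, and the default constant are assumptions made for illustration, and it is not a description of the specific methods studied here.

```python
import math


def ucb1_select(children, c=math.sqrt(2)):
    """Return the index of the child to descend into.

    children: list of (visits, total_reward) pairs, one per child.
    c: exploration constant (sqrt(2) is a common textbook default).
    """
    total_visits = sum(visits for visits, _ in children)
    best_index, best_score = None, -math.inf
    for i, (visits, total_reward) in enumerate(children):
        if visits == 0:
            return i  # try every child once before comparing scores
        mean = total_reward / visits
        bonus = c * math.sqrt(math.log(total_visits) / visits)
        if mean + bonus > best_score:
            best_index, best_score = i, mean + bonus
    return best_index
```

With c set to 0 the rule becomes purely greedy on the mean reward; larger values of c shift the search toward rarely visited children.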

We conduct research on computer shogi.

Learning

Kaneko, T. and T. Takizawa “Computer Shogi Tournaments and Techniques,” IEEE Transactions on Games, Vol. 11, No. 3, pp. 267–274 (2019), DOI: 10.1109/TG.2019.2939259.
S. Wan and T. Kaneko. Heterogeneous Multi-Task Learning of Evaluation Functions for Chess and Shogi, ICONIP 2018.
Wan, S. and T. Kaneko “Pos2Pos: Automatic Position-to-Position Translation in Chess-Like Games,” in 23rd Game Programming Workshop, pp. 51–54, 11 (2018). http://id.nii.ac.jp/1001/00191964/
S. Wan and T. Kaneko. Building Evaluation Functions for Chess and Shogi with Uniformity Regularization Networks, IEEE CIG 2018
Zhu, H. and T. Kaneko “Comparison of Loss Functions for Training of Deep Neural Networks in Shogi,” in IEEE Technologies and Applications of Artificial Intelligence, pp. 18–23 (2018), DOI: 10.1109/TAAI.2018.00014.
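K. Hoki and T. Kaneko “Large-Scale Optimization for Evaluation Functions with Minimax Search,” Journal of Artificial Intelligence Research (JAIR), Vol. 49, pp. 527–568 (2014).
Kaneko, T.: "Evaluation Functions in Computer Shogi and Machine Learning Supervised by Game Records" (コンピュータ将棋の評価関数と棋譜を教師とした機械学習), Journal of the Japanese Society for Artificial Intelligence (人工知能学会誌), Vol. 27, No. 1, pp. 75–82 (2012).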

Playing style

S. Omori and T. Kaneko: Learning of Evaluation Functions to Realize Playing Styles in Shogi, LNCS, PRICAI. 367-379.

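The 2nd Denou-sen (第二回電王戦) (2013)

In April 2013, GPS Shogi (GPS将棋) played against Miura 8-dan (his rank at the time; now 9-dan) in the fifth game of the 2nd Denou-sen.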
