• 検索結果がありません。

JAIST Repository: マルチエージェント系における組織学習を用いた動的環境への適応に関する研究

N/A
N/A
Protected

Academic year: 2021

シェア "JAIST Repository: マルチエージェント系における組織学習を用いた動的環境への適応に関する研究"

Copied!
3
0
0

読み込み中.... (全文を見る)

全文

(1)JAIST Repository https://dspace.jaist.ac.jp/. Title. マルチエージェント系における組織学習を用いた動的 環境への適応に関する研究. Author(s). 篠田, 孝祐. Citation Issue Date. 2001-03. Type. Thesis or Dissertation. Text version. author. URL. http://hdl.handle.net/10119/723. Rights Description. Supervisor:國藤 進, 知識科学研究科, 修士. Japan Advanced Institute of Science and Technology.

(2) The adaptive learning based on organization learning for dynamic environment on multi-agent system Kosuke Shinoda School of Knowledge Sience, Japan Advanced Institute of Science and Technology February 13, 2001 Keywords: Multi-Agent System, Soccer Simulation Game, Adaptive Learning, Organizational Learning. The research of learning of the agent had aim that was improvement of the action diction ability. In addition, the learning of multi-agent system had same aim. In the field of the learning on multi-agent system, its design of the learning model for organization, which consist of many agents, are used by information sharing or distribute acquisition reward. However, the purpose of these learning models was the construction of the cooperation system by the ability adjustment of the agent. Moreover, these are adapted to the environment, which changed dynamically, by forgetting it was learned once. It’s forgetting has the reason with a lot of uselessness, why all members memorize learning data. Nevertheless, its forgetting is problem for the agent. The agent cannot know all situations in own environment, so that it is difficult to recognize the change in its environment for it. Because, the forgetting is meaning of the creating knowledge of adapting to new environment for the agent. Therefore, the case of when the agent knew own knowledge is not useful, it must use old knowledge until created new knowledge, which is adapting to new environment. Moreover, the agent needs to learning many times in the environment, which experienced once. The reason is the forgetting the old learning data. These are problem for the society basic system, which is needed safety and stability. In this research, we proposed new learning system for multi-agent system to solve the problem, has the purpose was improvement of the agent’s action decision ability, of agent’s learning. Its learning model solves the problem is learning delay which happened because of forgetting and surely adjusts to the change in dynamically environment. Concretely, it uses organizational learning in organization theory for multi-agent system. The purpose of this technique in this report is three points, as follows: c 2001 by Kosuke Shinoda Copyright . 1.

(3) 1. The convergence of learning in short time(in real time) by learning experience sharing. 2. Dynamical organization formation according to a change in the stuation. 3. Operation of phase change of organization according to a change in the environment. We proposed, in this report, the method of reinforcement learning, which used the pair of scenario and action as learning data, in partial observable environment and the model of organizational learning, in multi-agent system, which is sharing learning source with under level agents used by upper level agent. Moreover, the upper level agent sets standard model of under level agent, according to in various environment, by analyzed the environment. This learning model was evaluated by implemented soccer team and gamed it. At that time, we used multi-agent soccer simulator of the RoboCup. Consequently, our organization-learning model for multi-agent system was effective for the learning of agent, as shown Figure 1 and Figure 2.. Figure 1: The result of learning, games with same team, of Personal Learning(PL Team) and Organizational Learning(OL Team): The vertical axis means score difference and the horizon axis mean game number.. Figure 2: The result of learning, games with difference team, of Personal Learning(PL Team) and Organizational Learning(OL Team): The vertical axis means score difference and the horizon axis mean game number.. 2.

(4)

Figure 1: The result of learning, games with same team, of Personal Learning(PL Team) and Organizational Learning(OL Team):

参照

関連したドキュメント

H ernández , Positive and free boundary solutions to singular nonlinear elliptic problems with absorption; An overview and open problems, in: Proceedings of the Variational

Keywords: Convex order ; Fréchet distribution ; Median ; Mittag-Leffler distribution ; Mittag- Leffler function ; Stable distribution ; Stochastic order.. AMS MSC 2010: Primary 60E05

It is suggested by our method that most of the quadratic algebras for all St¨ ackel equivalence classes of 3D second order quantum superintegrable systems on conformally flat

Inside this class, we identify a new subclass of Liouvillian integrable systems, under suitable conditions such Liouvillian integrable systems can have at most one limit cycle, and

Then it follows immediately from a suitable version of “Hensel’s Lemma” [cf., e.g., the argument of [4], Lemma 2.1] that S may be obtained, as the notation suggests, as the m A

To derive a weak formulation of (1.1)–(1.8), we first assume that the functions v, p, θ and c are a classical solution of our problem. 33]) and substitute the Neumann boundary

In this section we state our main theorems concerning the existence of a unique local solution to (SDP) and the continuous dependence on the initial data... τ is the initial time of

After briefly summarizing basic notation, we present the convergence analysis of the modified Levenberg-Marquardt method in Section 2: Section 2.1 is devoted to its well-posedness