This paper proposes a novel Estimation of Distribution Algorithm (EDA) in which each chromosome is represented by a directed graph network. In the proposed algorithm, a probabilistic model is constructed from the promising individuals of the current generation using reinforcement learning, and this model is then used to produce the new population. The probabilistic model is built on the node connection probability, so pairwise interactions between nodes can be captured and used to identify and recombine building blocks. The proposed algorithm is applied to an agent control problem, namely mobile robot control. Experimental results show that the proposed algorithm outperforms conventional algorithms in both the quality and the generalization ability of the solutions.
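The idea of a probabilistic model over node connections can be illustrated with a minimal sketch. It is not the paper's method: the encoding (each node has exactly one outgoing edge), the function names, and the learning rate `alpha` are all assumptions made for illustration, and a simple incremental frequency update in the style of PBIL stands in for the reinforcement-learning step, whose details are not given here.

```python
import random


def uniform_probs(n):
    """Start with every connection i -> j equally likely (assumed initialization)."""
    return [[1.0 / n] * n for _ in range(n)]


def update_connection_probs(probs, elite, alpha=0.1):
    """Shift probs[i][j] toward the frequency of edge i -> j among elite individuals.

    Assumed encoding: an individual is a list where ind[i] is the node
    that node i connects to. alpha is a hypothetical learning rate.
    """
    n = len(probs)
    for i in range(n):
        counts = [0] * n
        for ind in elite:
            counts[ind[i]] += 1
        for j in range(n):
            freq = counts[j] / len(elite)
            # Convex combination keeps each row summing to 1.
            probs[i][j] = (1 - alpha) * probs[i][j] + alpha * freq
    return probs


def sample_individual(probs, rng):
    """Draw a new individual by sampling one successor per node from the model."""
    n = len(probs)
    return [rng.choices(range(n), weights=probs[i])[0] for i in range(n)]


if __name__ == "__main__":
    rng = random.Random(0)
    probs = uniform_probs(4)
    # Hypothetical promising individuals selected from the current generation.
    elite = [[1, 2, 3, 0], [1, 2, 0, 0], [1, 3, 3, 0]]
    update_connection_probs(probs, elite, alpha=0.5)
    print(sample_individual(probs, rng))
```

Because each row of the table is a distribution over the successors of one node, connections that recur among promising individuals become more likely to be sampled together in the next population, which is the sense in which such a model can preserve building blocks.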