GridWorld

class magent2.gridworld.GridWorld(config: Config, **kwargs)

The main MAgent2 class for implementing environments. MAgent2 environments are square Gridworlds wherein each coordinate may contain an agent, a wall, or nothing.

The class attributes are not accessible directly due to them living in the underlying C++ code. Thus, there are get/set methods for retrieving and manipulating their values.

Methods

magent2.gridworld.GridWorld.new_group(self, name: str) c_int

Registers a new group of agents into environment.

Parameters:

name (str) – Name of the group.

Returns:

handle (ctypes.c_int32) – A handle to reference the group in future gets and sets.

magent2.gridworld.GridWorld.add_agents(self, handle: c_int, method: str, **kwargs)

Adds agents to environment.

Parameters:
  • handle (ctypes.c_int32) – The handle of the group to which to add the agents.

  • method (str) – Can be ‘random’ or ‘custom’. If method is ‘random’, then kwargs[“n”] is a int. If method is ‘custom’, then kwargs[“pos”] is a list of coordination.

``` # add 1000 walls randomly >>> env.add_agents(handle, method=”random”, n=1000)

# add 3 agents to (1,2), (4,5) and (9, 8) in map >>> env.add_agents(handle, method=”custom”, pos=[(1,2), (4,5), (9,8)]) ```

magent2.gridworld.GridWorld.add_walls(self, method: str, **kwargs)

Adds walls to the environment.

Parameters:

method (str) – Can be ‘random’ or ‘custom’. If method is ‘random’, then kwargs[“n”] is an int. If method is ‘custom’, then kwargs[“pos”] is a list of coordination

``` # add 1000 walls randomly >>> env.add_walls(method=”random”, n=1000)

# add 3 walls to (1,2), (4,5) and (9, 8) in map >>> env.add_walls(method=”custom”, pos=[(1,2), (4,5), (9,8)]) ```

magent2.gridworld.GridWorld.reset(self)

Resets the environment to an initial internal state.

magent2.gridworld.GridWorld.set_action(self, handle: c_int, actions: ndarray)

Set actions for whole group.

Parameters:
  • handle (ctypes.c_int32) – Group handle.

  • actions (np.ndarray) – Array of actions, 1 per agent. The dtype must be int32.

magent2.gridworld.GridWorld.step(self)

Runs one timestep of the environment using the agents’ actions.

Returns:

done (bool) – Flag indicating whether the game is done or not.

magent2.gridworld.GridWorld.render(self)

Renders a step.

magent2.gridworld.GridWorld.set_seed(self, seed: int)

Set random seed of the engine.

Parameters:

seed (int) – Seed value.

magent2.gridworld.GridWorld.set_render_dir(self, name: str)

Sets the directory to save render file.

Parameters:

name (str) – Name of render directory.

magent2.gridworld.GridWorld.clear_dead(self)

Clears dead agents in the engine. Must be called after step().

magent2.gridworld.GridWorld.get_handles(self) List[c_int]

Returns all group handles in the environment.

Returns:

handles (List[ctypes.c_int32]) – All group handles in the environment.

magent2.gridworld.GridWorld.get_observation(self, handle: c_int) Tuple[ndarray, ndarray]

Returns the observation for each agent in a group.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:

obs (Tuple[np.ndarray, np.ndarray]) – (views, features) Views is a numpy array whose shape is n * view_width * view_height * n_channel. Features is a numpy array whose shape is n * feature_size. For agent i, (views[i], features[i]) is its observation at this step.

magent2.gridworld.GridWorld.get_reward(self, handle: c_int) ndarray

Returns the rewards for all agents in a group.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:

rewards (np.ndarray[float32]) – Rewards for all agents in the group.

magent2.gridworld.GridWorld.get_action_space(self, handle: c_int) Tuple[int]

Returns the action space for a group.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:

action_space (Tuple[int]) – Action space for the group.

magent2.gridworld.GridWorld.get_view_space(self, handle: c_int) Tuple[int, int, int]

Returns the view space for a group.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:

view_space (Tuple[int, int, int]) – View space for the group.

magent2.gridworld.GridWorld.get_feature_space(self, handle: c_int) Tuple[int]

Returns the feature space for a group.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:

feature_space (Tuple[int]) – Feature space for the group.

magent2.gridworld.GridWorld.get_num(self, handle: c_int) int

Returns the number of agents in a group.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:

num (int) – Number of agents in the group.

magent2.gridworld.GridWorld.get_agent_id(self, handle: c_int) ndarray

Returns the ids of all agents in the group.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:

ids (np.ndarray[int32]) – Ids of all agents in the group.

magent2.gridworld.GridWorld.get_alive(self, handle: c_int) ndarray

Returns the alive status of all agents in a group.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:

alives (np.ndarray[bool]) – Whether the agents are alive or not.

magent2.gridworld.GridWorld.get_pos(self, handle: c_int) ndarray

Returns the positions of all agents in a group.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:

pos (np.ndarray[int]) – The positions of all agents in the group. The shape is (n, 2).

magent2.gridworld.GridWorld.get_view2attack(self, handle: c_int) Tuple[int, ndarray]

Get a matrix with the same size of view_range. If element >= 0, then it is an attackable point, and the corresponding action number is the value of that element.

Parameters:

handle (ctypes.c_int32) – Group handle.

Returns:
  • attack_base (int) – Attack action base value.

  • buf (np.ndarray) – Map attack action into view.

magent2.gridworld.GridWorld.get_global_minimap(self, height: int, width: int) ndarray

Compress global map into a minimap of given size.

Parameters:
  • height (int) – Height of minimap.

  • width (int) – Width of minimap.

Returns:

minimap (np.ndarray) – Map of shape (n_group + 1, height, width).