PyDPOMDP

A minimal package providing a Python API to decentralized partially observable Markov decision process (Dec-POMDP) problems in the .dpomdp file format.

Installation

git clone git@github.com:laurimi/pydpomdp.git
pip install ./pydpomdp

Examples

Before starting, download a problem file, for example:

wget https://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/dectiger.dpomdp

The following script demonstrates the functionality of the package.

from pydpomdp import DecPOMDP
d = DecPOMDP("dectiger.dpomdp")

# Basic properties
d.num_agents()
d.num_joint_actions()
d.num_joint_observations()
d.num_states()
d.discount()
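# For the standard DecTiger problem, the calls above return 2, 9, 4, 2,
# and 1.0, respectively (two agents, each with three actions and two
# observations); the actual values depend on the file you loaded.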

# Initial belief state
d.initial_belief_at(state=0)

# Individual agent/state/action/observation properties
d.num_actions(agent=0)
d.num_observations(agent=1)

d.agent_name(agent=1)
d.state_name(state=0)
d.action_name(agent=0, action=1)
d.observation_name(agent=1, observation=1)

# Converting between joint and individual action/observation indices
individual_actions = [0, 1]
ja = d.individual_to_joint_action_indices(individual_actions)
d.joint_to_individual_action_indices(ja) # equals individual_actions

joint_observation = 3
individual_observations = d.joint_to_individual_observation_indices(joint_observation)
d.individual_to_joint_observation_indices(individual_observations) # equals joint_observation

# Rewards, state transition and observation probabilities.
# s is current state, s' is new state after taking joint action a, 
# z' is joint observation received in new state
state = 0
joint_action = 1
new_state = 1

d.reward(state, joint_action) # immediate reward

d.transition_probability(new_state, state, joint_action) # P(s' | s, a)

joint_observation = 0
d.observation_probability(joint_observation, new_state, joint_action) # P(z' | s', a)
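
# Sanity check (an illustrative sketch, not part of the package's examples):
# for a fixed state and joint action, transition probabilities over new
# states and observation probabilities over joint observations each sum to 1.
assert abs(sum(d.transition_probability(sp, state, joint_action)
               for sp in range(d.num_states())) - 1.0) < 1e-9
assert abs(sum(d.observation_probability(z, new_state, joint_action)
               for z in range(d.num_joint_observations())) - 1.0) < 1e-9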

# Sampling new states and observations
import random
state = d.sample_initial_state(random.random())
new_state = d.sample_next_state(state, joint_action, random.random())
d.sample_observation(new_state, joint_action, random.random())
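
Putting these calls together, the following minimal sketch simulates a single episode with uniformly random joint actions. It is illustrative only: it uses just the functions shown above, and the horizon of 5 and the random action choice are assumptions, not part of the package.

import random

horizon = 5  # illustrative choice
state = d.sample_initial_state(random.random())
total_reward = 0.0
for t in range(horizon):
    joint_action = random.randrange(d.num_joint_actions())  # random joint action
    total_reward += d.discount() ** t * d.reward(state, joint_action)
    state = d.sample_next_state(state, joint_action, random.random())
    joint_observation = d.sample_observation(state, joint_action, random.random())
print("Discounted return:", total_reward)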

Notes

  • This package does not provide problem files such as dectiger.dpomdp. Many classic Dec-POMDPs from the research literature may be found here.
  • This package works only with "flat" Dec-POMDPs; support for factored or transition- and observation-independent Dec-POMDPs is not implemented. If you require a parser for such problems, refer to the MADP toolbox.
  • Indices for states, (joint) actions, and (joint) observations are int on the Python side and unsigned int on the C++ side, so overflows may occur with very large Dec-POMDPs.

Acknowledgments

This project depends on the parser from the MADP toolbox, which is gratefully acknowledged.