bettermdptools

bettermdptools

Welcome to bettermdptools, a comprehensive library designed to facilitate working with Markov Decision Processes (MDPs) and reinforcement learning environments. This library is built to provide robust tools for both model-based and model-free reinforcement learning algorithms, along with utilities for environment modeling, visualization, and more.

Key Features

Reinforcement Learning Algorithms

bettermdptools includes implementations of popular model-free reinforcement learning algorithms such as Q-Learning and SARSA. These algorithms are designed to work seamlessly with any gymnasium environments that have single discrete valued state spaces. For environments that do not fit this format, a lambda function can be used to convert state spaces accordingly.

Planning Algorithms

The library also provides model-based planning algorithms like Value Iteration and Policy Iteration. These algorithms are essential for solving MDPs where the transition probabilities and rewards are known.

Environment Models

bettermdptools comes with pre-built environment models for popular problems such as Blackjack, CartPole, and Pendulum. These models include discretized versions of the environments, making it easier to apply traditional reinforcement learning and planning algorithms.

Visualization Tools

To help you better understand and analyze the performance of your algorithms, bettermdptools includes a variety of plotting utilities. These tools can generate heatmaps, line plots, and other visualizations to track the progress and performance of your learning agents.

Getting Started

Installation

You can install bettermdptools via pip or by cloning the GitHub repository:

https://github.com/jlm429/bettermdptools

Modules

  • Utils: Utility functions and classes used across the library.
  • Envs: Environment wrappers and models for various reinforcement learning environments.
  • Algorithms: Implementations of various reinforcement learning and planning algorithms.
 1r"""
 2# bettermdptools
 3
 4Welcome to **bettermdptools**, a comprehensive library designed to facilitate working with Markov Decision Processes (MDPs) and reinforcement learning environments. This library is built to provide robust tools for both model-based and model-free reinforcement learning algorithms, along with utilities for environment modeling, visualization, and more.
 5
 6## Key Features
 7
 8### Reinforcement Learning Algorithms
 9bettermdptools includes implementations of popular model-free reinforcement learning algorithms such as Q-Learning and SARSA. These algorithms are designed to work seamlessly with any gymnasium environments that have single discrete valued state spaces. For environments that do not fit this format, a lambda function can be used to convert state spaces accordingly.
10
11### Planning Algorithms
12The library also provides model-based planning algorithms like Value Iteration and Policy Iteration. These algorithms are essential for solving MDPs where the transition probabilities and rewards are known.
13
14### Environment Models
15bettermdptools comes with pre-built environment models for popular problems such as Blackjack, CartPole, and Pendulum. These models include discretized versions of the environments, making it easier to apply traditional reinforcement learning and planning algorithms.
16
17### Visualization Tools
18To help you better understand and analyze the performance of your algorithms, bettermdptools includes a variety of plotting utilities. These tools can generate heatmaps, line plots, and other visualizations to track the progress and performance of your learning agents.
19
20## Getting Started
21
22### Installation
23
24You can install bettermdptools via pip or by cloning the GitHub repository:
25
26https://github.com/jlm429/bettermdptools
27
28## Modules
29
30- [Utils](./bettermdptools/utils.html): Utility functions and classes used across the library.
31- [Envs](./bettermdptools/envs.html): Environment wrappers and models for various reinforcement learning environments.
32- [Algorithms](./bettermdptools/algorithms.html): Implementations of various reinforcement learning and planning algorithms.
33"""