tf_agents.bandits.environments.movielens_py_environment.MovieLensPyEnvironment

Implements the MovieLens Bandit environment.

Inherits From: BanditPyEnvironment, PyEnvironment

tf_agents.bandits.environments.movielens_py_environment.MovieLensPyEnvironment(
    data_dir: Text,
    rank_k: int,
    batch_size: int = 1,
    num_movies: int = 20,
    csv_delimiter: Text = ',',
    name: Optional[Text] = 'movielens'
)

This environment implements the MovieLens 100K dataset, available at: https://www.kaggle.com/prajitdatta/movielens-100k-dataset

This dataset contains 100K ratings from 943 users on 1682 items. This csv list of: user id | item id | rating | timestamp. This environment computes a low-rank matrix factorization (using SVD) of the data matrix A, such that: A ~= U * V.

The reward of recommending item j to user i is provided as A_{ij}.

Args
`data_dir`	(string) Directory where the data lies (in text form). rank_k : (int) Which rank to use in the matrix factorization.
`batch_size`	(int) Number of observations generated per call.
`num_movies`	(int) Only the first `num_movies` movies will be used by the environment. The rest is cut out from the data.
`csv_delimiter`	(string) The delimiter to use in loading the data csv file.
`name`	The name of this environment instance.

Attributes
`batch_size`	The batch size of the environment.
`batched`	Whether the environment is batched or not. If the environment supports batched observations and actions, then overwrite this property to True. A batched environment takes in a batched set of actions and returns a batched set of observations. This means for all numpy arrays in the input and output nested structures, the first dimension is the batch size. When batched, the left-most dimension is not part of the action_spec or the observation_spec and corresponds to the batch dimension. When batched and handle_auto_reset, it checks `np.all(steps.is_last())`.
`name`

Attributes

batch_size The batch size of the environment.

batched

Whether the environment is batched or not.

If the environment supports batched observations and actions, then overwrite this property to True.

A batched environment takes in a batched set of actions and returns a batched set of observations. This means for all numpy arrays in the input and output nested structures, the first dimension is the batch size.

When batched, the left-most dimension is not part of the action_spec or the observation_spec and corresponds to the batch dimension.

When batched and handle_auto_reset, it checks np.all(steps.is_last()).

name

tf_agents.bandits.environments.movielens_py_environment.MovieLensPyEnvironment Stay organized with collections Save and categorize content based on your preferences.

Args

Attributes

Methods

action_spec

close

compute_optimal_action

compute_optimal_reward

current_time_step

discount_spec

get_info

get_state

observation_spec

render

reset

reward_spec

seed

set_state

should_reset

step

time_step_spec

__enter__

__exit__

tf_agents.bandits.environments.movielens_py_environment.MovieLensPyEnvironment

`action_spec`

`close`

`compute_optimal_action`

`compute_optimal_reward`

`current_time_step`

`discount_spec`

`get_info`

`get_state`

`observation_spec`

`render`

`reset`

`reward_spec`

`seed`

`set_state`

`should_reset`

`step`

`time_step_spec`

`enter`

`exit`