API Reference - ExperienceReplays - BaseExperienceReplay
The base class for most experience replay classes. It serves as a template for using it with the reinforcement learning models.
Constructors
new()
Creates a new base experience replay object.
BaseExperienceReplay.new(batchSize: number, numberOfRunsToUpdate: number, maxBufferSize: number)
Parameters:
-  batchSize: The number of experience to sample from for training. 
-  numberOfRunsToUpdate: The number of run() function needed to be called to run a single event of experience replay. 
-  maxBufferSize: The maximum number of experiences that can be kept inside the object. 
Functions
setParameters()
Change the parameters of a base experience replay object.
BaseExperienceReplay:setParameters(batchSize: number, numberOfRunsToUpdate: number, maxBufferSize: number)
Parameters:
-  batchSize: The number of experience to sample from for training. 
-  numberOfRunsToUpdate: The number of run() function needed to be called to run a single event of experience replay. 
-  maxBufferSize: The maximum number of experiences that can be kept inside the object. 
extendResetFunction()
Adds new function on reset alongside with the current reset() function.
BaseExperienceReplay:setResetFunction(resetFunction)
Parameters:
- resetFunction: The function to run when reset() is called.
reset()
Resets the base experience replay object.
BaseExperienceReplay:reset()
setRunFunction()
Sets the model’s run function.
BaseExperienceReplay:setRunFunction(updateFunction)
Parameters:
- runFunction: The function to run when run() is called.
run()
For every nth experience collected, it will run the updateFunction once.
BaseExperienceReplay:run(updateFunction)
Parameters:
- updateFunction: The update function that updates a model.
addExperience()
Adds an experience to the experiance replay object.
BaseExperienceReplay:addExperience(previousFeatureVector, action: number/string, rewardValue: number, currentFeatureVector)
Parameters:
-  previousFeatureVector: The previous features of the environment. 
-  action: The action selected. 
-  rewardValue: The reward gained at current state. 
-  currentFeatureVector: The currrent features of the environment. 
addTemporalDifferenceError()
Adds a temporal difference error to the experiance replay object.
BaseExperienceReplay:addTemporalDifferenceError(temporalDifferenceErrorVectorOrValue)
Parameters:
- temporalDifferenceErrorVectorOrValue: The temporal difference error in a form of vector or value.
setIsTemporalDifferenceErrorRequired()
Set whether or not to store temporal difference errors to the experiance replay object.
BaseExperienceReplay:setIsTemporalDifferenceErrorRequired(option)
Parameters:
- option: Set whether or not to store temporal difference errors.