schola.scripts.ray.settings.IMPALASettings

类定义

class schola.scripts.ray.settings.IMPALASettings(vtrace=True, vtrace_clip_rho_threshold=1.0, vtrace_clip_pg_rho_threshold=1.0)

基类： RLLibAlgorithmSpecificSettings

IMPALA（Importance Weighted Actor-Learner Architecture）算法特定设置的数据类。此类定义了 IMPALA 算法中使用的参数，包括用于离策略校正的 V-trace 设置。

类型： bool

类型： float

类型： float

类型： str

类型： Type[IMPALAConfig]

类型： bool
默认值： True

是否在 IMPALA 算法中使用 V-trace 算法进行离策略校正。V-trace 是一种纠正使用离策略数据进行训练所引入偏差的方法。它有助于确保价值估计更准确、更稳定。

类型： float
默认值： 1.0

策略梯度中 V-trace rho 值的裁剪阈值。

类型： float
默认值： 1.0

V-trace rho 值的裁剪阈值。

__init__(vtrace=True, vtrace_clip_rho_threshold=1.0, vtrace_clip_pg_rho_threshold=1.0)

返回类型： None

classmethod get_parser()

将设置添加到解析器或子解析器

get_settings_dict()

以 Ray 中正确的参数名称作为键，将设置获取为字典