跳至内容

FRLlibTrainingSettings

struct FRLlibTrainingSettings : public FTrainingSettings

一个保存 RLlib 训练脚本所有设置的结构体。

依赖项

此类型依赖于

继承自: public FTrainingSettings

公共接口

析构函数

~FRLlibTrainingSettings

virtual ~FRLlibTrainingSettings()

属性: virtual

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 95 行,第 9 列)

实现: Schola/Source/Schola/Private/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.cpp (第 39-41 行)

公共函数

GenerateTrainingArgs

virtual void GenerateTrainingArgs(int Port, FScriptArgBuilder &ArgBuilder) const const

使用 ArgBuilder 为脚本生成训练参数。

注意:端口作为脚本的常见参数传入,并在高层设置,但特定子设置可能需要它。

参数

  • Port (int) – [输入] 用于脚本的端口
  • ArgBuilder (FScriptArgBuilder &) – [输入] 用于生成参数的构建器

属性: const, virtual

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 93 行,第 6 列)

实现: Schola/Source/Schola/Private/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.cpp (第 6-37 行)

公共成员

int Timesteps

int Timesteps = = 8000

训练的时长(以时间步为单位)。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 40 行,第 5 列)

float LearningRate

float LearningRate = = 0.0003

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 43 行,第 7 列)

int MinibatchSize

int MinibatchSize = = 128

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 46 行,第 5 列)

int TrainBatchSizePerLearner

int TrainBatchSizePerLearner = = 256

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 49 行,第 5 列)

int NumSGDIter

int NumSGDIter = = 5

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 52 行,第 5 列)

float Gamma

float Gamma = = 0.99

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 55 行,第 7 列)

FRLlibLoggingSettings LoggingSettings

FRLlibLoggingSettings LoggingSettings

训练脚本的日志设置。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 59 行,第 23 列)

FRLlibCheckpointSettings CheckpointSettings

FRLlibCheckpointSettings CheckpointSettings

训练脚本的检查点设置。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 63 行,第 26 列)

FRLlibResumeSettings ResumeSettings

FRLlibResumeSettings ResumeSettings

训练脚本的恢复设置。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 67 行,第 22 列)

FRLlibNetworkArchSettings NetworkArchitectureSettings

FRLlibNetworkArchSettings NetworkArchitectureSettings

训练脚本的网络架构设置。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 71 行,第 27 列)

FRLlibResourceSettings ResourceSettings

FRLlibResourceSettings ResourceSettings

训练脚本的资源设置。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 75 行,第 24 列)

ERLlibTrainingAlgorithm Algorithm

ERLlibTrainingAlgorithm Algorithm = = ERLlibTrainingAlgorithm::PPO

训练期间使用的算法(例如:

SAC, PPO)。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 79 行,第 25 列)

FRLlibPPOSettings PPOSettings

FRLlibPPOSettings PPOSettings

PPO 特定设置。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 83 行,第 19 列)

FRLlibAPPOSettings APPOSettings

FRLlibAPPOSettings APPOSettings

APPO 特定设置。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 87 行,第 20 列)

FRLlibIMPALASettings IMPALASettings

FRLlibIMPALASettings IMPALASettings

IMPALA 特定设置。

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 91 行,第 22 列)

被使用: FScriptSettings

来源: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (第 33 行,第 1 列)

© . This site is unofficial and not affiliated with AMD.