FRLlibTrainingSettings
struct FRLlibTrainingSettings : public FTrainingSettings
A struct that holds all the settings for the RLlib training script.
Dependencies
This type depends on:
- FRLlibAPPOSettings
- FRLlibCheckpointSettings
- FRLlibIMPALASettings
- FRLlibLoggingSettings
- FRLlibNetworkArchSettings
- FRLlibPPOSettings
- FRLlibResourceSettings
- FRLlibResumeSettings
- FScriptArgBuilder
- FTrainingSettings
Inherits from: public FTrainingSettings
Public Interface
Destructors
~FRLlibTrainingSettings
virtual ~FRLlibTrainingSettings()
Attributes: virtual
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 95, column 9)
Implementation: Schola/Source/Schola/Private/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.cpp (lines 39-41)
Public Functions
GenerateTrainingArgs
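virtual void GenerateTrainingArgs(int Port, FScriptArgBuilder &ArgBuilder) const
Generates the training arguments for the script using the ArgBuilder.
Note: The port is passed in because it is a common script argument that is set at a higher level, but specific sub-settings may also need it.
Parameters
Port (int) – [in] The port to use for the script
ArgBuilder (FScriptArgBuilder &) – [in] The builder used to generate the arguments
Attributes: const, virtual
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 93, column 6)
Implementation: Schola/Source/Schola/Private/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.cpp (lines 6-37)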
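The forwarding pattern described in the note for GenerateTrainingArgs can be sketched in plain C++. This is a minimal illustration under stated assumptions, not the Schola implementation: FArgBuilderSketch stands in for FScriptArgBuilder (whose interface is not shown in this reference), and the AddIntArg method, the FSubSettingsSketch type, and the flag names "timesteps" and "sub-port" are all hypothetical.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for FScriptArgBuilder: collects "--name value" pairs.
// The real builder's interface is not documented on this page.
struct FArgBuilderSketch
{
    std::vector<std::string> Args;

    void AddIntArg(const std::string& Name, int Value)
    {
        Args.push_back("--" + Name);
        Args.push_back(std::to_string(Value));
    }
};

// Hypothetical sub-setting that happens to need the port.
struct FSubSettingsSketch
{
    void GenerateTrainingArgs(int Port, FArgBuilderSketch& ArgBuilder) const
    {
        ArgBuilder.AddIntArg("sub-port", Port); // flag name is illustrative only
    }
};

// Hypothetical top-level settings: emits its own values, then forwards
// the port and the builder to each sub-setting.
struct FTrainingSettingsSketch
{
    int Timesteps = 8000; // matches the documented default
    FSubSettingsSketch Sub;

    void GenerateTrainingArgs(int Port, FArgBuilderSketch& ArgBuilder) const
    {
        ArgBuilder.AddIntArg("timesteps", Timesteps);
        Sub.GenerateTrainingArgs(Port, ArgBuilder);
    }
};
```

Passing the builder by reference lets every nested settings struct append to the same argument list, which is presumably why the documented signature takes FScriptArgBuilder & rather than returning a value.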
Public Members
int Timesteps
int Timesteps = 8000
The duration of training, in timesteps.
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 40, column 5)
float LearningRate
float LearningRate = 0.0003
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 43, column 7)
int MinibatchSize
int MinibatchSize = 128
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 46, column 5)
int TrainBatchSizePerLearner
int TrainBatchSizePerLearner = 256
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 49, column 5)
int NumSGDIter
int NumSGDIter = 5
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 52, column 5)
float Gamma
float Gamma = 0.99
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 55, column 7)
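To give a feel for how the scalar defaults above relate to one another, the following sketch mirrors them in a plain C++ struct and derives a few quantities. It assumes the members carry their usual PPO-style meaning (each train batch is split evenly into minibatches and traversed NumSGDIter times) and that a single learner is used; neither assumption is confirmed by this reference. FHyperparameterSketch and the helper functions are hypothetical names.

```cpp
// Plain mirror of the documented scalar defaults; not the real Unreal struct.
struct FHyperparameterSketch
{
    int Timesteps = 8000;
    float LearningRate = 0.0003f;
    int MinibatchSize = 128;
    int TrainBatchSizePerLearner = 256;
    int NumSGDIter = 5;
    float Gamma = 0.99f;
};

// Minibatches in one pass over a train batch (assumes an even split).
int MinibatchesPerPass(const FHyperparameterSketch& S)
{
    return S.TrainBatchSizePerLearner / S.MinibatchSize;
}

// Gradient steps per train batch, assuming NumSGDIter full passes.
int GradientStepsPerBatch(const FHyperparameterSketch& S)
{
    return MinibatchesPerPass(S) * S.NumSGDIter;
}

// Train batches needed to cover Timesteps with one learner, rounded up.
int TrainBatchesForRun(const FHyperparameterSketch& S)
{
    return (S.Timesteps + S.TrainBatchSizePerLearner - 1) / S.TrainBatchSizePerLearner;
}
```

Under these assumptions the defaults give 2 minibatches per pass, 10 gradient steps per train batch, and 32 train batches to cover 8000 timesteps. The training script's actual stopping behavior may differ.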
FRLlibLoggingSettings LoggingSettings
FRLlibLoggingSettings LoggingSettings
The logging settings for the training script.
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 59, column 23)
FRLlibCheckpointSettings CheckpointSettings
FRLlibCheckpointSettings CheckpointSettings
The checkpoint settings for the training script.
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 63, column 26)
FRLlibResumeSettings ResumeSettings
FRLlibResumeSettings ResumeSettings
The resume settings for the training script.
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 67, column 22)
FRLlibNetworkArchSettings NetworkArchitectureSettings
FRLlibNetworkArchSettings NetworkArchitectureSettings
The network architecture settings for the training script.
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 71, column 27)
FRLlibResourceSettings ResourceSettings
FRLlibResourceSettings ResourceSettings
The resource settings for the training script.
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 75, column 24)
ERLlibTrainingAlgorithm Algorithm
ERLlibTrainingAlgorithm Algorithm = ERLlibTrainingAlgorithm::PPO
The algorithm to use during training (e.g. SAC, PPO).
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 79, column 25)
FRLlibPPOSettings PPOSettings
FRLlibPPOSettings PPOSettings
PPO-specific settings.
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 83, column 19)
FRLlibAPPOSettings APPOSettings
FRLlibAPPOSettings APPOSettings
APPO-specific settings.
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 87, column 20)
FRLlibIMPALASettings IMPALASettings
FRLlibIMPALASettings IMPALASettings
IMPALA-specific settings.
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 91, column 22)
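Because the struct carries one Algorithm selector alongside three algorithm-specific settings members, a natural reading is that only the member matching Algorithm contributes to the generated arguments. The sketch below illustrates that dispatch in plain C++. It is an assumption about the design, not a description of the actual implementation: EAlgorithmSketch is a hypothetical mirror of ERLlibTrainingAlgorithm, where only the PPO value is confirmed by the documented default and APPO and IMPALA are inferred from the settings struct names.

```cpp
#include <string>

// Hypothetical mirror of ERLlibTrainingAlgorithm. Only PPO is confirmed by
// this reference; APPO and IMPALA are inferred from the member names.
enum class EAlgorithmSketch
{
    PPO,
    APPO,
    IMPALA
};

// Returns the name of the settings member assumed to apply to the
// selected algorithm.
std::string ActiveSettingsName(EAlgorithmSketch Algorithm)
{
    switch (Algorithm)
    {
        case EAlgorithmSketch::PPO:
            return "PPOSettings";
        case EAlgorithmSketch::APPO:
            return "APPOSettings";
        case EAlgorithmSketch::IMPALA:
            return "IMPALASettings";
    }
    return ""; // unreachable for valid enum values
}
```

Keeping all three settings members on the struct, rather than a single polymorphic member, would let values for each algorithm persist when the selector is switched back and forth.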
Used by: FScriptSettings
Source: Schola/Source/Schola/Public/Subsystem/SubsystemSettings/Ray/RLlibTrainingSettings.h (line 33, column 1)