- 个人 mac 纯cpu模式执行训练, python3 train.py --actor_device_cpu --num_actors 10 --training_device cpu --learning_rate 1e-6
正常运行:
[INFO:39823 dmc:289 2026-05-20 13:42:25,385] After 896956800 (L:308675200 U:290566400 D:297715200) frames: @ 3838.7 fps (avg@ 4103.1 fps) (L:1279.6 U:1279.6 D:1279.6) Stats:
{'loss_landlord': 1.3504133224487305,
'loss_landlord_down': 1.889407992362976,
'loss_landlord_up': 1.6238871812820435,
'mean_episode_return_landlord': -0.2186794877052307,
'mean_episode_return_landlord_down': 0.21825198829174042,
'mean_episode_return_landlord_up': 0.21791546046733856}
2 服务上,同样使用cpu 模式, python3 train.py --actor_device_cpu --num_actors 10 --training_device cpu --learning_rate 1e-6
一直显示 fps = 0
INFO:110833 dmc:299 2026-05-20 13:43:04,796] After 0 (L:0 U:0 D:0) frames: @ 0.0 fps (avg@ 0.0 fps) (L:0.0 U:0.0 D:0.0) Stats:
{'loss_landlord': 0,
'loss_landlord_down': 0,
'loss_landlord_up': 0,
'mean_episode_return_landlord': 0,
'mean_episode_return_landlord_down': 0,
'mean_episode_return_landlord_up': 0}
正常运行:
[INFO:39823 dmc:289 2026-05-20 13:42:25,385] After 896956800 (L:308675200 U:290566400 D:297715200) frames: @ 3838.7 fps (avg@ 4103.1 fps) (L:1279.6 U:1279.6 D:1279.6) Stats:
{'loss_landlord': 1.3504133224487305,
'loss_landlord_down': 1.889407992362976,
'loss_landlord_up': 1.6238871812820435,
'mean_episode_return_landlord': -0.2186794877052307,
'mean_episode_return_landlord_down': 0.21825198829174042,
'mean_episode_return_landlord_up': 0.21791546046733856}
2 服务上,同样使用cpu 模式, python3 train.py --actor_device_cpu --num_actors 10 --training_device cpu --learning_rate 1e-6
一直显示 fps = 0
INFO:110833 dmc:299 2026-05-20 13:43:04,796] After 0 (L:0 U:0 D:0) frames: @ 0.0 fps (avg@ 0.0 fps) (L:0.0 U:0.0 D:0.0) Stats:
{'loss_landlord': 0,
'loss_landlord_down': 0,
'loss_landlord_up': 0,
'mean_episode_return_landlord': 0,
'mean_episode_return_landlord_down': 0,
'mean_episode_return_landlord_up': 0}