ddpg-eddy-detumbling

RL-based Robotic Trajectory Planning for Efficient Eddy Current Debris Detumbling

Master’s Thesis at Politecnico di Milano & Beihang University (中文: 北京航空航天大学)

Author: Daniele Agamennone
Supervisor: Prof. Zhong Rui¹, Prof. Massari Mauro²

Affiliations:

School of Astronautics, Beihang University, Beijing, China.
Department of Aerospace Engineering (DAER), Politecnico di Milano, Milan, Italy.

Year: 2024

Abstract

Capturing space debris is complex, as many objects tumble at angular rates between 3°/s and 30°/s, increasing collision risk during collection and potentially producing further fragments. Detumbling debris using Eddy current-based methods has shown promise as a contactless solution, but existing approaches, such as those employing chaser spacecraft with along-track electromagnets, can take up to 14 days to fully detumble debris. This research proposes a novel approach employing a robotic arm equipped with an electromagnetic end-effector, enabling the application of a magnetic field with a variable direction. It is found that to maximize the Eddy Current Torque (ECT), it is essential to maintain perpendicularity between the relative angular velocity (RAV) vector and the applied magnetic field, a trajectory that is rarely within the manipulator's workspace. A near-optimal feasible solution is achieved using the Deep Deterministic Policy Gradient (DDPG) algorithm. The results demonstrate that the agent can learn a policy that allows detumbling in just 4 days, a 71.73% reduction compared to the along-track method. Additionally, the agent's robustness to stochastic uncertainties in sensor measurements of the RAV is tested by developing statistical ensemble models comprising 500 instances of the trained agent for noise standard deviations of 0.05 rad/s and 0.2 rad/s. The test results show how the agent exhibits strong robustness against uncertainties in the RAV in both scenarios, with just mild performance decreases of 2.54% and 10.91%, respectively, further validating the effectiveness of this approach for potential real-world applications.

Modular SW Architecture

The software architecture follows a modular and independent design¹:

ODE Solver: Utilizes SUNDIALS' CVODE solver for fast and accurate integration of multi-body coupled dynamics.
Equations of Motion (EoMs): Supplies the integrator with pre-computed EoMs based on the chosen Denavit-Hartenberg parameters.
DDPG: Handles online trajectory planning and action generation at each timestep.
Reward Function: A hybrid formulation integrating dense, sparse, and shaped components for distance, α-angle, and inverse kinematics.

The environment (implemented in C++) and the agent (implemented in Python) communicate via a language binding using cppyy. This allows Python to directly interact with C++ classes, functions, and libraries. This setup is advantageous as the main training loop runs in Python, while the integrator—representing the simulation's computational bottleneck—executes in C++. The use of cppyy ensures efficient data exchange, including states and actions, between the two blocks. ↩

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
examples		examples
processing		processing
propagator		propagator
robotics		robotics
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.py		environment.py
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ddpg-eddy-detumbling

RL-based Robotic Trajectory Planning for Efficient Eddy Current Debris Detumbling

Abstract

Modular SW Architecture

About

Uh oh!

Languages

License

whitehole07/ddpg-eddy-detumbling

Folders and files

Latest commit

History

Repository files navigation

ddpg-eddy-detumbling

RL-based Robotic Trajectory Planning for Efficient Eddy Current Debris Detumbling

Abstract

Modular SW Architecture

Footnotes

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages