|
# :orange[Abstract:] |
|
Hall effect thrusters are one of the most versatile and |
|
popular electric propulsion systems for space use. Industry trends |
|
towards interplanetary missions arise advances in design development |
|
of such propulsion systems. It is understood that correct sizing of |
|
discharge channel in Hall effect thruster impact performance greatly. |
|
Since the complete physics model of such propulsion system is not yet |
|
optimized for fast computations and design iterations, most thrusters |
|
are being designed using so-called scaling laws. But this work focuses |
|
on rather novel approach, which is outlined less frequently than |
|
ordinary scaling design approach in literature. Using deep machine |
|
learning it is possible to create predictive performance model, which |
|
can be used to effortlessly get design of required hall thruster with |
|
required characteristics using way less computing power than design |
|
from scratch and way more flexible than usual scaling approach. |
|
:orange[author:] Korolev K.V [^1] |
|
title: Hall effect thruster design via deep neural network for additive |
|
manufacturing |
|
|
|
# Nomenclature |
|
|
|
<div class="longtable*" markdown="1"> |
|
|
|
$U_d$ = discharge voltage |
|
$P$ = discharge power |
|
$T$ = thrust |
|
$\dot{m}_a$ = mass flow rate |
|
$I_{sp}$ = specific impulse |
|
$\eta_m$ = mass utilization efficiency |
|
$\eta_a$ = anode efficiency |
|
$j$ = $P/v$ \[power density\] |
|
$v$ = discharge channel volume |
|
$h, d, L$ = generic geometry parameters |
|
$C_*$ = set of scaling coefficients |
|
$g$ = free-fall acceleration |
|
$M$ = ion mass |
|
|
|
</div> |
|
|
|
# Introduction |
|
|
|
<span class="lettrine">T</span><span class="smallcaps">he</span> |
|
application of deep learning is extremely diverse, but in this study it |
|
focuses on case of hall effect thruster design. Hall effect thruster |
|
(HET) is rather simple DC plasma acceleration device, due to complex and |
|
non linear process physics we don’t have any full analytical performance |
|
models yet. Though there are a lot of ways these systems are designed in |
|
industry with great efficiencies, but in cost of multi-million research |
|
budgets and time. This problem might be solved using neural network |
|
design approach and few hardware iteration tweaks(Plyashkov et al. |
|
2022-10-25). |
|
|
|
Scaled thrusters tend to have good performance but this approach isn’t |
|
that flexible for numerous reasons: first and foremost, due to large |
|
deviations in all of the initial experimental values accuracy can be not |
|
that good, secondly, it is hardly possible to design thruster with |
|
different power density or $I_{sp}$ efficiently. |
|
|
|
On the other hand, the neural network design approach has accuracy |
|
advantage only on domain of the dataset(Plyashkov et al. 2022-10-25), |
|
this limitations is easily compensated by ability to create relations |
|
between multiple discharge and geometry parameters at once. Hence this |
|
novel approach and scaling relations together could be an ultimate |
|
endgame design tool for HET. |
|
|
|
Note that neither of these models do not include cathode efficiencies |
|
and performances. So as the neutral gas thrust components. Most |
|
correlations in previous literature were made using assumption or |
|
physics laws(Shagayda and Gorshkov 2013-03), in this paper the new |
|
method based on feature generation, GAN dataset augmentation and ML |
|
feature selection is suggested. |
|
|
|
## Dataset enlargement using GAN |
|
|
|
As we already have discussed, the data which is available is not enough |
|
for training NN or most ML algorithms, so I suggest using Generative |
|
Adversarial Network to generate more similar points. Generative model |
|
trains two different models - generator and discriminator. Generator |
|
learns how to generate new points which are classified by discriminator |
|
as similar to real dataset. Of course it is very understandable that |
|
model needs to be precise enough not to overfit on data or create new |
|
unknown correlations. Model was checked via Mean Absolute Percentage |
|
Error (MAPE) and physical boundary conditions. After assembling most |
|
promising architecture, the model was able to generate fake points with |
|
MAPE of $~4.7\%$. We need to measure MAPE to be sure point lie on same |
|
domain as original dataset, as in this work we are interested in |
|
sub-kilowatt thrusters. After model generated new points they were check |
|
to fit in physical boundaries of scaled values (for example thrust |
|
couldn’t be more than 2, efficiency more than 1.4 and so on, data was |
|
scaled on original dataset to retain quality), only 0.02% of points were |
|
found to be outliers. The GAN architecture and dataset sample is |
|
provided as follows. |
|
|
|
<!--  |
|
 --> |
|
|
|
# General Relations |
|
|
|
As we will use dataset of only low power hall thrusters, we can just |
|
ignore derivation of any non-linear equations and relations and use |
|
traditional approach here. Let’s define some parameters of anode: |
|
$$\alpha = \frac{\dot{m}\beta}{{\dot{m}_a}},$$ |
|
Where $\alpha$ is anode |
|
parameter of $\beta$ thruster parameter. This is selected because this |
|
way cathode and other losses wont be included in the model. One of key |
|
differences in this approach is fitting only best and most appropriate |
|
data, thus we will eliminate some variance in scaling laws. Though due |
|
to machine learning methods, we would need a lot of information which is |
|
simply not available in those volumes. So some simplifications and |
|
assumptions could be made. Firstly, as it was already said, we don’t |
|
include neutralizer efficiency in the model. Secondly, the model would |
|
be correct on very specific domain, defined by dataset, many parameters |
|
like anode power and $I_{sp}$ still are using semi-empirical modelling |
|
approach. The results we are looking for are outputs of machine learning |
|
algorithm: specific impulse, thrust, efficiency, optimal mass flow rate, |
|
power density. Function of input is solely dependant on power and |
|
voltage range. For the matter of topic let’s introduce semi-empirical |
|
equations which are used for scaling current thrusters. |
|
|
|
<div class="longtable*" markdown="2"> |
|
|
|
$$h=C_hd$$ |
|
|
|
$$\dot{m_a} = C_m hd$$ |
|
|
|
$$P_d=C_pU_dd^2$$ |
|
|
|
$$T=C_t\dot{m_a}\sqrt{U_d}$$ |
|
|
|
$$I_{spa}=\frac{T}{\dot{m_a} g}$$ |
|
|
|
$$\eta_a=\frac{T}{2\dot{m_a}P_d}$$ |
|
|
|
</div> |
|
|
|
Where $C_x$ is scaling coefficient obtained from analytical modelling, |
|
which makes equations linear. Generally it has 95% prediction band but |
|
as was said earlier this linearity is what gives problems to current |
|
thrusters designs (high mass, same power density, average performance). |
|
The original dataset is |
|
|
|
| | | | | | | | | | |
|
|:---------|:---------|:-------|:------|:------|:------|:-------------|:-----|:----------| |
|
| Thruster | Power, W | U_d, V | d, mm | h, mm | L, mm | m_a,.g/s, | T, N | I\_spa, s | |
|
| SPT-20 | 52.4 | 180 | 15.0 | 5.0 | 32.0 | 0.47 | 3.9 | 839 | |
|
| SPT-25 | 134 | 180 | 20.0 | 5.0 | 10 | 0.59 | 5.5 | 948 | |
|
| Music-si | 140 | 288 | 18 | 2 | 6.5 | 0.44 | 4.2 | 850 | |
|
| HET-100 | 174 | 300 | 23.5 | 5.5 | 14.5 | 0.50 | 6.8 | 1386 | |
|
| KHT-40 | 187 | 325 | 31.0 | 9.0 | 25.5 | 0.69 | 10.3 | 1519 | |
|
| KHT-50 | 193 | 250 | 42.0 | 8.0 | 25.0 | 0.88 | 11.6 | 1339 | |
|
| HEPS-200 | 195 | 250 | 42.5 | 8.5 | 25.0 | 0.88 | 11.2 | 1300 | |
|
| BHT-200 | 200 | 250 | 21.0 | 5.6 | 11.2 | 0.94 | 12.8 | 1390 | |
|
| KM-32 | 215 | 250 | 32.0 | 7.0 | 16.0 | 1.00 | 12.2 | 1244 | |
|
| ... | | | | | | | | | |
|
| HEPS-500 | 482 | 300 | 49.5 | 15.5 | 25.0 | 1.67 | 25.9 | 1587 | |
|
| UAH-78AM | 520 | 260 | 78.0 | 20 | 40 | 2 | 30 | 1450 | |
|
| BHT-600 | 615 | 300 | 56.0 | 16.0 | 32 | 2.60 | 39.1 | 1530 | |
|
| SPT-70 | 660 | 300 | 56.0 | 14.0 | 25.0 | 2.56 | 40.0 | 1593 | |
|
| MaSMi60 | 700 | 250 | 60 | 9.42 | 19 | 2.56 | 30 | 1300 | |
|
| MaSMiDm | 1000 | 500 | 67 | 10.5 | 21 | 3 | 53 | 1940 | |
|
| SPT-100 | 1350 | 300 | 85.0 | 15.0 | 25.0 | 5.14 | 81.6 | 1540 | |
|
|
|
Hosting only 24 entries in total. The references are as follows(Beal et |
|
al. 2004-11)(Belikov et al. 2001-07-08)(Kronhaus et al. 2013-07)(Misuri |
|
and Andrenucci 2008-07-21)(Lee et al. 2019-11) |
|
|
|
In the next section the used neural networks architectures will be |
|
discussed. |
|
|
|
# Data driven HET designs |
|
|
|
Neural networks are a type of machine learning algorithm that is often |
|
used in the field of artificial intelligence. They are mathematical |
|
models that can be trained to recognize patterns within large datasets. |
|
The architecture of GAN’s generator was already shown. In this section |
|
we will focus on fully connected networks, which are most popular for |
|
type for these tasks. HETFit code leverages dynamic architecture |
|
generation of these FcNN’s which is done via meta learning algorithm |
|
Tree-structured Parzen Estimator for every data input user selects. This |
|
code uses state-of-art implementation made by OPTUNA. The dynamically |
|
suggested architecture has 2 to 6 layers from 4 to 128 nodes on each |
|
with SELU, Tanh or ReLU activations and most optimal optimizer. The code |
|
user interface is as follows: 1. Specify working environment 2. Load or |
|
generate data 3. Tune the architecture 4. Train and get robust scaling |
|
models |
|
|
|
## FNN |
|
|
|
All of Fully connected neural networks are implemented in PyTorch as it |
|
the most powerful ML/AI library for experiments. When the network |
|
architecture is generated, all of networks have similar training loops |
|
as they use gradient descend algorithm : Loss function: |
|
$$L(w, b) \equiv \frac{1}{2 n} \sum_x\|y(x)-a\|^2$$ This one is mean |
|
square error (MSE) error function most commonly used in FNNs. Next we |
|
iterate while updating weights for a number of specified epochs this |
|
way. Loop for number of epochs: |
|
|
|
\- Get predictions: $\hat{y}$ |
|
|
|
\- Compute loss: $\mathscr{L}(w, b)$ |
|
|
|
\- Make backward pass |
|
|
|
\- Update optimizer |
|
|
|
It can be mentioned that dataset of electric propulsion is extremely |
|
complex due to large deviations in data. Thanks to adavnces in data |
|
science and ML it is possible to work with it. |
|
|
|
This way we assembled dataset on our ROI domain of $P$\<1000 $W$ input |
|
power and 200-500 $V$ range. Sadly one of limitations of such model is |
|
disability to go beyond actual database limit while not sacrificing |
|
performance and accuracy. |
|
|
|
## Physics Informed Neural Networks |
|
|
|
For working with unscaled data PINN’s were introduced, they are using |
|
equations 2-7 to generate $C_x$ coefficients. Yes, it was said earlier |
|
that this method lacks ability to generate better performing HETs, but |
|
as we have generated larger dataset on same domain as Lee et al. |
|
(2019-11) it is important to control that our dataset is still the same |
|
quality as original. Using above mentioned PINN’s it was possible to fit |
|
coefficients and they showed only slight divergence in values of few % |
|
which is acceptable. |
|
|
|
## ML approach notes |
|
|
|
We already have discussed how HETFit code works and results it can |
|
generate, the overiew is going to be given in next section. But here i |
|
want to warn that this work is highly experimental and you should always |
|
take ML approaches with a grain of salt, as some plasma discharge |
|
physics in HET is yet to be understood, data driven way may have some |
|
errors in predictions on specific bands. Few notes on design tool I have |
|
developed in this work: it is meant to be used by people with little to |
|
no experience in ML field but those who wants to quickly analyze their |
|
designs or create baseline one for simulations. One can even use this |
|
tool for general tabular data as it has mostly no limits whatsoever to |
|
input data. |
|
|
|
## Two input variables prediction |
|
|
|
One of main characteristics for any type of thruster is efficiency, in |
|
this work I researched dependency of multiple input values to $\eta_t$. |
|
Results are as follows in form of predicted matrix visualisations. |
|
Figure 3 takes into account all previous ones in the same time, once |
|
again it would be way harder to do without ML. |
|
|
|
|
|
|
|
# Results discussion |
|
|
|
Let’s compare predictions of semi empirical approach(Lee et al. |
|
2019-11), approach in paper(Plyashkov et al. 2022-10-25), and finally |
|
ours. Worth to mention that current approach is easiest to redesign from |
|
scratch. |
|
|
|
## NN architecture generation algorithm |
|
|
|
As with 50 iterations, previously discussed meta learning model is able |
|
to create architecture with score of 0.9+ in matter of seconds. HETFit |
|
allows logging into neptune.ai environment for full control over |
|
simulations. Example trail run looks like that. |
|
|
|
|
|
|
|
## Power density and magnetic flux dependence |
|
|
|
Neither of the models currently support taking magnetic flux in account |
|
besides general physics relations, but we are planning on updating the |
|
model in next follow up paper. For now $\vec{B}$ relation to power |
|
remains unresolved to ML approach but the magnetic field distribution on |
|
z axis is computable and looks like that for magnetically shielded |
|
thrusters: |
|
|
|
|
|
|
|
## Dependency of T on d,P |
|
|
|
Following graph is describing Thrust as function of channel diameter and |
|
width, where hue map is thrust. It is well known dependency and it has |
|
few around 95% prediction band (Lee et al. 2019-11) |
|
|
|
|
|
|
|
## Dependency of T on P,U |
|
|
|
|
|
|
|
## Dependency of T on $m_a$,P |
|
|
|
Compared to(Shagayda and Gorshkov 2013-03) The model accounts for more |
|
parameters than linear relation. So such method proves to be more |
|
precise on specified domain than semi empirical linear relations. |
|
|
|
|
|
|
|
## Dependency of $I_{sp}$ on d,h |
|
|
|
|
|
|
|
We generated many models so far, but using ML we can make single model |
|
for all of the parameters at the same time, so these graphs tend to be |
|
3d projection of such model inference. |
|
|
|
## Use of pretrained model in additive manufacturing of hall effect thruster channels |
|
|
|
The above mentioned model was used to predict geometry of channel, next |
|
the simulation was conducted on this channel. Second one for comparison |
|
was calculated via usual scaling laws. The initial conditions for both |
|
are: |
|
|
|
| Initial condition | Value | |
|
|:------------------|:------------------| |
|
| $n_{e,0}$ | 1e13 \[m\^-3\] | |
|
| $\epsilon_0$ | 4 \[V\] | |
|
| V | 300 \[V\] | |
|
| T | 293.15 \[K\] | |
|
| P\_abs | 0.5 \[torr\] | |
|
| $\mu_e N_n$ | 1e25 \[1/(Vm s)\] | |
|
| dt | 1e-8 \[s\] | |
|
| Body | Ar | |
|
|
|
Outcomes are so that ML geometry results in higher density generation of |
|
ions which leads to more efficient thrust generation. HETFit code |
|
suggests HET parameters by lower estimate to compensate for not included |
|
variables in model of HET. This is experimentally proven to be efficient |
|
estimate since SEM predictions of thrust are always higher than real |
|
performance. Lee et al. (2019-11) |
|
|
|
|
|
|
|
## Code description |
|
|
|
Main concepts: - Each observational/design session is called an |
|
environment, for now it can be either RCI or SCI (Real or scaled |
|
interface) |
|
|
|
\- Most of the run parameters are specified on this object |
|
initialization, including generation of new samples via GAN |
|
|
|
\- Built-in feature generation (log10 Power, efficiency, $\vec{B}$, |
|
etc.) |
|
|
|
\- Top feature selection for each case. (Boruta algorithm) |
|
|
|
\- Compilation of environment with model of choice, can be any torch |
|
model or sklearn one |
|
|
|
\- Training |
|
|
|
\- Plot, inference, save, export to jit/onnx, measure performance |
|
|
|
## COMSOL HET simulations |
|
|
|
The simulations were conducted in COMSOL in plasma physics interface |
|
which gives the ability to accurately compute Electron densities, |
|
temperatures, energy distribution functions from initial conditions and |
|
geometry. Here is comparison of both channels. |
|
|
|
|
|
|
|
# Conclusion |
|
|
|
In conclusion the another model of scaling laws was made and presented. |
|
HETFit code is open source and free to be used by anyone. Additively |
|
manufactured channel was printed to prove it’s manufactureability. |
|
Hopefully this work will help developing more modern scaling relations |
|
as current ones are far from perfect. |
|
|
|
Method in this paper and firstly used in Plyashkov et al. (2022-10-25) |
|
has advantages over SEM one in: ability to preidct performance more |
|
precisely on given domain, account for experimental data. I believe with |
|
more input data the ML method of deisgning thrusters would be more |
|
widely used. |
|
|
|
The code in this work could be used with other tabular experimental data |
|
since most of cases and tasks tend to be the same: feature selection and |
|
model optimization. |
|
|
|
|
|
<div id="refs" class="references csl-bib-body hanging-indent" |
|
markdown="1"> |
|
|
|
<div id="ref-beal_plasma_2004" class="csl-entry" markdown="1"> |
|
|
|
Beal, Brian E., Alec D. Gallimore, James M. Haas, and William A. Hargus. |
|
2004-11. “Plasma Properties in the Plume of a Hall Thruster Cluster.” |
|
*Journal of Propulsion and Power* 20 (6): 985–91. |
|
<https://doi.org/10.2514/1.3765>. |
|
|
|
</div> |
|
|
|
<div id="ref-belikov_high-performance_2001" class="csl-entry" |
|
markdown="1"> |
|
|
|
Belikov, M., O. Gorshkov, V. Muravlev, R. Rizakhanov, A. Shagayda, and |
|
A. Snnirev. 2001-07-08. “High-Performance Low Power Hall Thruster.” In |
|
*37th Joint Propulsion Conference and Exhibit*. Salt Lake |
|
City,UT,U.S.A.: American Institute of Aeronautics; Astronautics. |
|
<https://doi.org/10.2514/6.2001-3780>. |
|
|
|
</div> |
|
|
|
<div id="ref-kronhaus_discharge_2013" class="csl-entry" markdown="1"> |
|
|
|
Kronhaus, Igal, Alexander Kapulkin, Vladimir Balabanov, Maksim |
|
Rubanovich, Moshe Guelman, and Benveniste Natan. 2013-07. “Discharge |
|
Characterization of the Coaxial Magnetoisolated Longitudinal Anode Hall |
|
Thruster.” *Journal of Propulsion and Power* 29 (4): 938–49. |
|
<https://doi.org/10.2514/1.B34754>. |
|
|
|
</div> |
|
|
|
<div id="ref-lee_scaling_2019" class="csl-entry" markdown="1"> |
|
|
|
Lee, Eunkwang, Younho Kim, Hodong Lee, Holak Kim, Guentae Doh, Dongho |
|
Lee, and Wonho Choe. 2019-11. “Scaling Approach for Sub-Kilowatt |
|
Hall-Effect Thrusters.” *Journal of Propulsion and Power* 35 (6): |
|
1073–79. <https://doi.org/10.2514/1.B37424>. |
|
|
|
</div> |
|
|
|
<div id="ref-misuri_het_2008" class="csl-entry" markdown="1"> |
|
|
|
Misuri, Tommaso, and Mariano Andrenucci. 2008-07-21. “HET Scaling |
|
Methodology: Improvement and Assessment.” In *44th AIAA/ASME/SAE/ASEE |
|
Joint Propulsion Conference &Amp; Exhibit*. Hartford, CT: American |
|
Institute of Aeronautics; Astronautics. |
|
<https://doi.org/10.2514/6.2008-4806>. |
|
|
|
</div> |
|
|
|
<div id="ref-plyashkov_scaling_2022" class="csl-entry" markdown="1"> |
|
|
|
Plyashkov, Yegor V., Andrey A. Shagayda, Dmitrii A. Kravchenko, Fedor D. |
|
Ratnikov, and Alexander S. Lovtsov. 2022-10-25. “On Scaling of |
|
Hall-Effect Thrusters Using Neural Nets,” 2022-10-25. |
|
<http://arxiv.org/abs/2206.04440>. |
|
|
|
</div> |
|
|
|
<div id="ref-shagayda_hall-thruster_2013" class="csl-entry" |
|
markdown="1"> |
|
|
|
Shagayda, Andrey A., and Oleg A. Gorshkov. 2013-03. “Hall-Thruster |
|
Scaling Laws.” *Journal of Propulsion and Power* 29 (2): 466–74. |
|
<https://doi.org/10.2514/1.B34650>. |
|
|
|
</div> |
|
|
|
</div> |
|
|
|
[^1]: Founder, Pure EP |