Arabic machine translation using Bidirectional LSTM Encoder-Decoder
BENSALAH Nouhaila*, AYAD Habib*, ADIB Abdellah* and IBN EL FAROUK Abdelhamid+
*Team Networks, Telecoms & Multimedia
LIM@II-FSTM, B.P. 146
Mohammedia 20650, Morocco
+Teaching, Languages and Cultures Laboratory, Mohammedia
bensalah.3.nouhaila@gmail.com, ayad.habib@gmail.com, adib@fstm.ac.ma, farouklettres@gmail.com
Abstract

Due to its language structure, a machine translation approach that works for European languages may not work for Arabic, so there is a real need to develop a model that addresses this issue. Machine Translation (MT) using neural networks has recently become a viable alternative to the most widely used statistical MT. Although a lot of research has been done on MT for Arabic, to the best of our knowledge no work has used a Bidirectional Recurrent Neural Network (BiRNN) encoder-decoder for this task. In this poster, we aim to fill this gap by developing a model based mainly on a Bidirectional Long Short-Term Memory (BiLSTM) network that maps the input sequence to a vector, together with a second Long Short-Term Memory (LSTM) network that decodes the target sequence from that vector. Our work offers encouraging results in terms of correlation with human judgment.
Introduction

Automatic machine translation is considered one of the major problems in natural language processing. It has proved to be both the most attractive and the least accessible task. Since the introduction of MT, many approaches have been applied, from traditional rule-based methods to the more recent statistical methods.
MT has been an active research topic since the 1950s [1]. Originally, MT systems were developed using both dictionaries and rules to generate correct word order. In the 1990s, statistical methods became dominant [2] due to the availability of large corpora, computational speed, and software for performing basic translation processes such as alignment, reordering and filtering.
The particular problem of MT has a long history as well. In 1984, Nagao [3] proposed machine translation by analogy between English and Japanese, transferring grammatical concepts between the two languages. A phrase-based statistical machine translation system between English and Arabic was proposed in [4], with an impressive improvement over other systems without using any neural network. However, the authors state that their statistical machine translation results achieve only a baseline level of success.
Recently, neural networks have proved extremely powerful thanks to their excellent performance on difficult problems such as speech recognition [5] and visual object recognition [6], and, with a modest number of training steps, neural models have achieved close to state-of-the-art accuracy in machine translation [2]. However, RNNs suffer from the vanishing and exploding gradient problem [7]: when translating a paragraph of text, an RNN may leave out important information from the beginning. A common solution is to use either LSTM [8] or Gated Recurrent Unit (GRU) [9] networks, which solve these problems and have proved to perform equally well at capturing long-term dependencies.
In this paper, our aim is therefore to present the first results on Arabic translation using a BiLSTM encoder to map the input sequence to a vector and a simple LSTM decoder to generate the target sentence from that vector.
The outline of the paper is as follows. The next section details the proposed approach, the following section presents the experiments and the obtained results, and the last section concludes the paper.
The proposed Approach

Most state-of-the-art machine translation systems employ RNNs [10, 11, 12, 13]. These models often use an encoder-decoder approach to predict translations. In this section, we explain in detail the architecture of the proposed model used in the experiments.
The architecture of LSTM

LSTM [14] was created to solve the problem of short-term memory. LSTM cells have internal mechanisms called gates that regulate the flow of information. The general architecture of an LSTM cell is illustrated in Fig. 1.

Figure 1: The architecture of LSTM: the forget gate f(t), the input gate i(t) with its candidate update k(t), and the output gate o(t) act on the cell state (Ct-1 to Ct) and the hidden state (ht-1 to ht) through sigmoid (σ) and tanh activations.

The forget gate keeps the important information in memory from previous steps, the input gate decides what information from the current step should be added, and the output gate determines the next hidden state.
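For reference, the gates shown in Fig. 1 correspond to the standard LSTM update equations (a textbook formulation rather than an excerpt of our implementation; W, U and b denote the learned weights and biases, σ the sigmoid function and ⊙ the element-wise product):

f(t) = σ(Wf x(t) + Uf h(t-1) + bf)       (forget gate)
i(t) = σ(Wi x(t) + Ui h(t-1) + bi)       (input gate)
k(t) = tanh(Wk x(t) + Uk h(t-1) + bk)    (candidate update)
C(t) = f(t) ⊙ C(t-1) + i(t) ⊙ k(t)       (new cell state)
o(t) = σ(Wo x(t) + Uo h(t-1) + bo)       (output gate)
h(t) = o(t) ⊙ tanh(C(t))                 (new hidden state)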
The architecture of our model

Figure 2: The architecture of our model. Word embeddings of the input sentence feed a BiLSTM encoder; its final hidden state (h-final) and memory state (c-final) form the thought vector, which initializes the LSTM decoder; a softmax layer on top of the decoder predicts the word probabilities of the output sequence.

As shown in Fig. 2, we first create a vector that represents the English sentence: the embedded input sequence is fed into the BiLSTM encoder word by word until the end of the English sentence. We then take the resulting hidden and cell (or memory) states, which represent the meaning of the sentence, and feed them into the LSTM decoder as its initial state. Finally, the output of the decoder is sent to a softmax layer and compared with the target data.
Results

To build our translation corpus, we used the English-French parallel corpus available on GitHub1. Since our task is machine translation between Arabic and English, we start by translating the English sentences into Arabic; the best Arabic translation is then selected for each English sentence to form our final translation corpus. Finally, all the sentences in both English and Arabic are normalized and tokenized.
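The poster does not detail the normalization and tokenization steps, so the following is only an illustrative sketch of a common preprocessing choice for this language pair: stripping Arabic diacritics, unifying some letter variants, and separating punctuation before whitespace tokenization.

import re

AR_DIACRITICS = re.compile(r"[\u064B-\u0652\u0640]")       # harakat and tatweel

def normalize_arabic(text):
    text = AR_DIACRITICS.sub("", text)                      # drop diacritics
    text = re.sub(r"[\u0622\u0623\u0625]", "\u0627", text)  # unify alef forms
    text = text.replace("\u0629", "\u0647")                 # ta marbuta -> ha
    text = text.replace("\u0649", "\u064A")                 # alef maqsura -> ya
    return text

def tokenize(text):
    text = re.sub(r"([^\w\s])", r" \1 ", text)   # separate punctuation
    return text.lower().split()                  # whitespace tokenization

print(tokenize(normalize_arabic("ذهب الولد إلى المدرسة.")))
# ['ذهب', 'الولد', 'الي', 'المدرسه', '.']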
In our sequence-to-sequence model, an embedding dimension of 20 for the inputs and 15 for the outputs has been used. The maximal sequence length has been set to 186 words for English and 519 words for Arabic. A mini-batch size of 64 has been used, and the training has been carried out with the Adam optimizer [15], a variant of stochastic gradient descent (SGD). Our model, implemented in Python, has been trained on a CPU with 4 GB of memory.

Table 1: Translation results

(a) English-to-Arabic                     (b) Arabic-to-English
Model                   Test BLEU (%)     Model                   Test BLEU (%)
Sutskever et al. [11]   16                Sutskever et al. [11]   26
Our approach            18                Our approach            27

The BLEU scores [16] obtained by our model and by the approach proposed in [11] on our corpus are reported in Table 1a and Table 1b for the English-to-Arabic and Arabic-to-English translation tasks respectively. These results show that our Bi-seq2seq model gives the best results, which demonstrates the efficiency of our proposal for the translation task.
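BLEU [16] measures n-gram overlap between each system output and its reference translation at the corpus level. As an illustration only (the exact evaluation script behind Table 1 is not published, so the smoothing choice and the toy data below are assumptions), such corpus-level scores can be computed with NLTK as follows:

from nltk.translate.bleu_score import corpus_bleu, SmoothingFunction

# One list of tokenized reference translations per test sentence.
references = [[["السلام", "عليكم"]], [["شكرا", "جزيلا"]]]
# The model's tokenized outputs, in the same order as the references.
hypotheses = [["السلام", "عليكم"], ["شكرا", "لك"]]

bleu = corpus_bleu(references, hypotheses,
                   smoothing_function=SmoothingFunction().method1)
print("Corpus BLEU: {:.1f} %".format(100 * bleu))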
Conclusion

In this work, we have presented a BiLSTM encoder and LSTM decoder model for the task of machine translation between English and Arabic texts. Our system addresses the case of machine translation between English and Arabic using a deep learning sequence-to-sequence model, which had not been investigated before, and the obtained performances offer encouraging results in terms of correlation with human judgment. This work can be further developed in various directions. One way is to consider the case of translation between other languages besides French. Another interesting future direction is to integrate this model into an English-to-Arabic machine transliteration system.

References

[1] John Hutchins. The history of machine translation in a nutshell.
[2] Eric Greenstein and Daniel Penner. Japanese-to-English Machine Translation Using Recurrent Neural Networks.
[3] Makoto Nagao. A framework of a mechanical translation between Japanese and English by analogy principle. October 1984.
[4] Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard M. Schwartz, and John Makhoul. Fast and Robust Neural Network Joint Models for Statistical Machine Translation. In ACL, 2014.
[5] G. E. Dahl, D. Yu, L. Deng, and A. Acero. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing, 20(1):30-42, January 2012.
[6] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. ImageNet classification with deep convolutional neural networks. Communications of the ACM, 60(6):84-90, May 2017.
[7] Boris Hanin. Which Neural Net Architectures Give Rise to Exploding and Vanishing Gradients?
[8] Rahul Dey and Fathi M. Salem. Gate-Variants of Gated Recurrent Unit (GRU) Neural Networks.
[9] Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. arXiv:1412.3555 [cs], December 2014.
[10] Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention Is All You Need. arXiv:1706.03762 [cs], June 2017.
[11] Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. Sequence to Sequence Learning with Neural Networks. In Advances in Neural Information Processing Systems, 2014.
[12] Mitchell Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini. Building a Large Annotated Corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313-330, 1993.
[13] Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. arXiv:1406.1078 [cs], June 2014.
[14] Sepp Hochreiter and Jürgen Schmidhuber. Long Short-Term Memory. Neural Computation, 9(8):1735-1780, 1997.
[15] Diederik P. Kingma and Jimmy Ba. Adam: A Method for Stochastic Optimization. arXiv:1412.6980 [cs], December 2014.
[16] Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL '02), pages 311-318, Philadelphia, Pennsylvania, 2002.
1 https://github.com/susanli2016/NLP-with-Python/tree/master/data