Project Plan: 3.1 Procedures and Deliverables

This project aimed to classify Quran verses and other Arabic texts as interesting or not interesting using machine learning algorithms. The project was divided into 13 procedures with defined deliverables. An initial schedule was created but later adjusted for additional tasks like presentations. Background research involved understanding natural language processing algorithms for text classification. The system was designed to classify Quran verses and another Arabic dataset. Features were defined and used to train a classifier on interesting and non-interesting texts. The classifier was implemented in Java programs and evaluated on Quran and hadith datasets using 10-fold cross validation with decision trees and Naive Bayes classifiers.

Uploaded by memo-am

Chapter 3

Project Plan

3.1 Procedures and Deliverables

This project was divided into a number of procedures to achieve the stated aim and
requirements. To keep track of progress, a blog was created to record all the information
taken from the papers used in the background reading chapter, the steps carried out to define
the features, and the design and implementation of the classifier. Understanding the problem
and the requirements was the starting point, after which a suitable schedule was planned.
The implementation produced a number of Java programs and reports as deliverables. The
procedure is explained in the following steps:
1. Gain a better understanding of the problem and the possible ways of implementing a
solution.
2. Understand the natural language processing algorithms that help in text classification,
as described in the background research.
3. Design a system that classifies the Arabic Quran. The system should classify Quran
verses into two classes: interesting or not interesting.
4. The system should also be able to classify another Arabic data set of the same format
into interesting and not interesting.
5. Classification is based on predefined features that characterise the interesting
verses.
6. This project used a supervised learning algorithm, which requires training data. The
training data was provided for the Holy Quran data set and was created for the hadith
data set. A number of text files were used later in the main implementation steps to
train and test the system.
7. Define the features that will classify the verses, then train the system on the
interesting and non-interesting text using WEKA.
8. Implement a Java program that creates the complement set used as training data
(Deliverables: Java program, complement set text file, random subset, WEKA .arff file).
9. Write up a mid-project report that includes the introduction, the background research
and the planned schedule (Deliverables: mid-term report).
10. Work on the further work specified (Deliverables: classification of verses).
11. Organise a demonstration for the assessor and supervisor to present the work that
was accomplished.
12. Evaluate the implemented system.
13. Write up the final report (Deliverables: final report).
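
Step 8 can be illustrated with a minimal sketch. It assumes the complement set means every verse that is not in the set of interesting verses, and that the random subset is drawn from that complement. The class name, method names and sample verses are hypothetical, and the report's actual program reads and writes text files rather than in-memory lists.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Random;
import java.util.Set;

public class ComplementSetSketch {

    // Returns every verse that is NOT in the interesting set, preserving the original order.
    static List<String> complement(List<String> allVerses, List<String> interesting) {
        Set<String> interestingSet = new HashSet<>(interesting);
        List<String> result = new ArrayList<>();
        for (String verse : allVerses) {
            if (!interestingSet.contains(verse)) {
                result.add(verse);
            }
        }
        return result;
    }

    // Draws a random subset of at most `size` verses; a fixed seed makes the draw repeatable.
    static List<String> randomSubset(List<String> verses, int size, long seed) {
        List<String> shuffled = new ArrayList<>(verses);
        Collections.shuffle(shuffled, new Random(seed));
        return new ArrayList<>(shuffled.subList(0, Math.min(size, shuffled.size())));
    }

    public static void main(String[] args) {
        // Placeholder data (assumption): one string per verse.
        List<String> all = List.of("verse 1", "verse 2", "verse 3", "verse 4", "verse 5");
        List<String> interesting = List.of("verse 2", "verse 4");

        List<String> notInteresting = complement(all, interesting);
        System.out.println(notInteresting); // prints [verse 1, verse 3, verse 5]
        System.out.println(randomSubset(notInteresting, 2, 42L));
    }
}
```

Sampling the non-interesting verses down to a subset is one way to keep the two classes closer in size before training; whether the report balanced the classes exactly is not stated here.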

3.2 Schedule
At the beginning of the project, a suitable schedule was planned and organised to fit all
the requirements into the time provided. Despite the effort to keep to time, some adjustments
proved necessary. One reason was the need to include a presentation that had not been planned
for originally. In addition, the background reading took longer than expected, because time
was needed to obtain recent papers (those not published before 2000) and to find papers
related specifically to this project, since text classification is a broad area of study. It
was also necessary to obtain texts other than the Holy Quran that cover the concept of the
hereafter, to help test the performance of the classifier on different texts. One such data
set is hadith: Sahih Muslim was used to create a data set with a format similar to that of
the Holy Quran data set. This data set was created manually, which took some time, since a
hadith corpus was not available on the internet. The data set was then used to test the
classification on this text and to check whether the selected features were appropriate for
this classification.

3.3 Methodology
The classification in this project was carried out in a number of steps to achieve the
requirements and evaluate the language model that was built. To accomplish this, two Java
programs were designed and run on the English data set of the Holy Quran. However, changes
were made later to achieve the same results on different Arabic data sets, including the
Arabic version of the Holy Quran. The changes were needed because the classification did not
perform correctly on the English data set of the Holy Quran, and it was clear that it would
not perform correctly on the Arabic data set either. One of the changes was a single Java
program that returns the files required for classification. The final Java program extracts
features from the random subset selected from the complement text file, counts the
frequencies of these features line by line, and outputs an .arff file. In addition, the lines
in the .arff file were labelled (Yes or No) according to whether they exist in the reference
data. Another change was to use additional Arabic text files and perform the classification
on them. The changes are explained in full in the design and implementation part. Combining
the implemented classification with another classification is an additional option for trying
to increase the accuracy of the results. According to [Atw1], Meccan chapters place more
emphasis on end-of-days topics. This can be used as an extra feature, in which a verse
classified as interesting should be contained in a Meccan chapter. Furthermore,
implementation and tests on the hadith data set were carried out after completing the main
steps of designing the data set and the Java program that retrieves the features and creates
the required .arff file used for classification in WEKA. Since the testing and training sets
were available for both data sets, 10-fold cross-validation was an option for evaluating some
of the results. Additionally, one option for evaluating the classification is to use decision
trees, and another is to use Naïve Bayes classifiers. The results of the classification will
be evaluated using the output produced by WEKA.
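
The feature-counting step described above can be sketched as follows. This is an illustration under stated assumptions, not the program from the report: the class name, method names, feature words and sample lines are hypothetical, each feature is assumed to be a single word, and a line is split on whitespace only, so punctuation attached to a word would prevent a match. The output follows the standard ARFF layout (`@relation`, `@attribute`, `@data`) that WEKA reads.

```java
import java.util.List;

public class ArffSketch {

    // Counts how many whitespace-separated tokens in the line equal the feature word.
    static int countOccurrences(String line, String feature) {
        int count = 0;
        for (String token : line.trim().split("\\s+")) {
            if (token.equalsIgnoreCase(feature)) {
                count++;
            }
        }
        return count;
    }

    // Builds ARFF text: one numeric attribute per feature plus a nominal Yes/No class.
    static String toArff(String relation, List<String> features,
                         List<String> lines, List<Boolean> labels) {
        StringBuilder sb = new StringBuilder();
        sb.append("@relation ").append(relation).append("\n\n");
        for (String feature : features) {
            sb.append("@attribute ").append(feature).append(" numeric\n");
        }
        sb.append("@attribute class {Yes,No}\n\n@data\n");
        for (int i = 0; i < lines.size(); i++) {
            // One data row per input line: feature counts followed by the class label.
            for (String feature : features) {
                sb.append(countOccurrences(lines.get(i), feature)).append(',');
            }
            sb.append(labels.get(i) ? "Yes" : "No").append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Placeholder features and lines (assumptions); the report's feature list is not shown here.
        List<String> features = List.of("day", "fire");
        List<String> lines = List.of("the day of the day", "no match here");
        List<Boolean> labels = List.of(true, false);
        System.out.print(toArff("verses", features, lines, labels));
    }
}
```

A file produced this way can be opened in WEKA, where a decision tree or Naïve Bayes classifier can be evaluated with 10-fold cross-validation.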
