VISVESVARAYA TECHNOLOGICAL UNIVERSITY
Jnana Sangama, Belagavi- 590018
                               A
                INTERNSHIP REPORT
                                on
               “MACHINE LEARNING”
                     Submitted in partial fulfillment
                                   Of
                           INTERNSHIP
                                   in
          INFORMATION SCIENCE AND ENGINEERING
             VIII SEMESTER INTERNSHIP (18CS185)
                                   By
               S VARSHITHA              1HK19IS082
 Under the guidance of                        Under the guidance of
 Dr.V. Balaji Vijayan                         Mr. Mallikarjun Kumbar
 Associate Professor                          Take It Smart Pvt Ltd
     Department of Information Science and Engineering
                          2022 – 2023
           HKBK COLLEGE OF ENGINEERING
              22/1, Nagawara, Bengaluru – 560045
       E-mail: info@hkbk.edu.in, URL: www.hkbk.edu.in
         DEPARTMENT OF INFORMATION SCIENCE AND ENGINEERING
            PROGRAMME EDUCATIONAL OBJECTIVES (PEOs)
PEO 1: To Empower Students through Wholesome Education to achieve academic excellent
education in the field of Information Science and Engineering.
PEO 2: To Provide Students with in-depth disciplinary knowledge in engineering fundamentals
that require to succeed in Information Science and Engineering.
PEO 3: To Create Highly Qualified Professionals in multi-disciplinary areas with the knowledge
ofInformation Technologies, Services Globally
PEO 4: To Inculcate in Students Professional and Ethical attitude with a strong character with
effective communication skills, teamwork skills, multidisciplinary approach, and an ability to
relate Engineering issues to broader social context.
PEO 5: To Provide Students with an academic environment aware of advanced technological
growth leading to life-long learning through innovation and research with professional ethics
that uplifts mankind
                       PROGRAM SPECIFIC OUTCOMES(PSOs)
Professional Skills:
An ability to identify and analyse requirements, and in designing and implementing
well- tested technology solutions for rapidly changing computing problems and
information system environments.
Problem-Solving Skills:
An ability to Design, develop and optimize solutions for information systems
employing fundamentals of system hardware & software, graph theory, finite
automata, data storage and communication networks.
Collaborative Skills:
An ability to communicate and develop leadership skills and work effectively in team
environments. They are capable of collaborating to design and implement well tested
solutions for rapidly changing computing problems and information system
environments.
Successful Career and Entrepreneurship Skills:
An ability to adapt for innovation and changes and be successful in ethical professional
careers along with the impact of computing on society, and platforms in creating innovative
career paths to be an entrepreneur, and a zest for higher studies.
                           Bengaluru – 560045
DEPARTMENT OF INFORMATION SCIENCE AND
        ENGINEERING VISVESVARAYA TECHNOLOGICAL
        UNIVERSITY
                     INTERNSHIP REPORT
                               ON
                   “MACHINE LEARNING”
              Submitted by partial fulfillment of the
                      INTERNSHIP (18CS185)
 VIII Semester, Department of Information Science and Engineering
                            2022 – 2023
                        SUBMITTED BY :
               S VARSHITHA            1HK19IS082
                                 ii
                               ACKNOWLEDGEMENT
I would like to place my regards and acknowledgment to all who helped in making this project
possible. I thank all those whose guidance served as a beacon of light and crowned our efforts
with success.
First of all I would take this opportunity to express our heartfelt gratitude to the management
committee - Chairman Mr. C. M. Ibrahim, Director Mr. C.M. Faiz Mohammed and the in
charge Principal Dr. Tabassum Ara for all the infrastructures provided to complete the
INTERNSHIP in time.
I deeply indebted to Dr. A Syed Mustafa, HOD, Information Science and Engineering for the
ineffable encouragement he provided for the successful completion of the project.
A special and an earnest word of thanks to the internship guide, Dr.V.Balaji Vijayan and our
coordinator Prof. Shravana K for constant assistance, support, patience, endurance and
constructive suggestions for the betterment of the project.
I’m extremely thankful to the teaching and non-teaching staff of the Department of Information
Science and Engineering for their valuable guidance and cooperation throughout our dissertation.
I thank my parents for their support and guidance provided to us to finish the internship well ahead
of time. I thank my friends who lent their support in every way possible to make sure the mini
project has been completed. Last, but not least I would like to thank God for giving us this
opportunity to do everything in the appropriate time to finish this project.
                                                                    S VARSHITHA (1HK19IS082)
                                               iv
                                         ABSTRACT
Machine Learning is used across many ranges around the world. The healthcare industry is no
exclusion. Machine Learning can play an essential role in predicting presence/absence of
locomotors disorders, Heart diseases and more. Such information, if predicted well in advance, can
provide important intuitions to doctors who can then adapt their diagnosis and dealing per patient
basis. Doing internship should involve work related to your interest of area, which makes it
challenging and thisis recognized by the organization that fills the entire work term. For this I did
internship in “Machine Learning” at “Take It Smart”. This company assured me quality internship
on Machine Learning. The project carried out during this internship was based on ‘Heart disease
prediction’, where I learned how to train, test and use a basic machine learning model and
implementing algorithms, visualize the dataset as well as understanding the evaluation metrics and
accuracy score. In this internship report, I have tried to show how I efficiently learned and
incorporated the skills acquired during my internship. This report of internship takes through all
the knowledge and experience that I acquired from the concepts and work done during the
internship period.
                                                  v
                     TABLE OF CONTENTS
Chapter No.                Chapter Title      Page No.
              Acknowledgement                   iv
              Abstract                          v
              List of Figures                  viii
              List of Tables                    ix
              List of Abbreviations             x
    1         Company Profile                   1
                  1.1 Introduction              1
                  1.2 Mission                   1
                  1.3 Vision                    1
                  1.4 Company Strategy          2
                  1.5 Overall Turnover          2
                  1.6 Company Services          2
                  1.7 Domains                   2
                  1.8 Departments               3
                  1.9 About the Company         3
   2          About the Project                 4
                  2.1 Introduction              4
                  2.2 Problem Statement         4
                  2.3 Motivation                5
                  2.4 Objectives                5
                  2.5 Advantages                6
                  2.6 Limitations               6
   3          Structure                         7
                  3.1 Internship Structure      7
                  3.2 Project Assigned          8
                  3.3 Internship Objectives     8
   4          Methodology                       9
                   4.1 Existing System          9
                   4.2 Proposed System         10
                                     vi
5   Project Assigned                      13
       5.1 System Software Requirements   13
       5.2 System Architecture            14
       5.3 Steps to be followed           15
       5.4 Machine Learning               16
       5.5 Algorithms                     17
       5.6 Source code with Snapshots     23
       5.7 Dataset Details                29
       5.8 Performance Analysis           30
       5.9 Results                        32
6   Internship Outcomes                   33
       6.1 Technical Outcomes             33
       6.2 Non-Technical Outcomes         33
7   Conclusion                            34
    Scope of Future work                  34
    References                            35
                        vii
          LIST OF FIGURES
Fig No.     Figure Name                           Page No.
 4.1        Collection of Data                      10
 4.2        Correlation Matrix                      11
 4.3        Data Pre-Processing                      11
 4.4        Data Balancing                          12
 4.5        Prediction of Disease                   12
 5.1        System Architecture                     14
 5.2        K-NN Algorithm                          17
 5.3        K-NN New point                          18
 5.4        Euclidean Distance                      18
 5.5        Nearest Neighbor Points                 19
 5.6        Random Forest Classifier                 21
 5.7        Required Libraries, Load Dataset         23
 5.8        Describing the Dataset                  24
 5.9        Feature Selection                        24
 5.10       Correlation Matrix                       25
 5.11       Histograms                              25
 5.12       Target Values                            26
 5.13       Display Head of Dataset                 26
 5.14       Import KNN Classifier                   27
 5.15       KNN Plotted Graph                       27
 5.16       KNN Classifier Score                    28
 5.17       Random Forest Classifier Mean Score     28
 5.18       Dataset Attributes                      29
 5.19       Performance Analysis                    31
 5.20       KNN Mean Score                          31
                   viii
            LIST OF TABLES
Table No.       Table Name            Page No.
   3.1        Internship Structure       7
   5.1        Attributes of Dataset     30
   5.2        Accuracy Table            32
                     ix
                        LIST OF ABBREVIATIONS
PVT     -   Private
LTD     -   Limited
IT      -   Information Technology
ERP     -   Enterprise Resource Planning
IOS     -   Iphone Operating System
IOT     -   Internet of Things
KNN     -   K-Nearest Neighbors
AI      - Artificial Intelligence
ML      -   Machine Learning
HCL     -   High Compatibility
List GB -   Giga Byte
MB     -    Mega Byte
IDE     -   Integrated Development Environment