UNIT-I
S.No     Level  Level  Question             A                     B                 C                   D                ANSWER
1        L1            Data science is      organizing data       processing        analysing data      All of the       D
                       the process of                             data                                  above
                       working with a
                       diverse set of
                       data through?
2        L3            The       modern     William S.            John              Arthur Samuel       Satoshi          A
                       conception     of                          McCarthy                              Nakamoto
                       data science as
                       an independent
                       discipline     is
                       sometimes
                       attributed to?
3        L2            Which of the         C                     C++               R                   Ruby             C
                       following
                       languages is used
                       in Data science?
4        L4            Which of the         Subsetting can be     Raw data          Merging             None of the      B
                       following is         used to select and    should be         concerns            above
                       false?               exclude variables     processed         combining
                                            and observations      only one time.    datasets on the
                                                                                    same
                                                                                    observations to
                                                                                    produce a result
                                                                                    with more
                                                                                    variables
5        L3            What is the work     utilize large data    work with         build data          All of the       C
                       of a Data            sets to gather        businesses to     solutions that      above
                       Architect?           information that      determine the     are optimized
                                            meets their           best usage of     for performance
                                            company's needs       the               and design
                                                                  information       applications
                                                                  yielded from
                                                                  data
6        L3            Which of the         Probability &         Machine           Data Wrangling      All of the       D
                       following are        Statistics            Learning /                            above
                       correct skills for                         Deep
                       a Data Scientist?                          Learning
7        L1            Which of the         Data Engineering      Advanced          Domain              All of the       D
                       following are                              Computing         expertise           above
                       correct
                       components of
                       data science?
8        L2            Which of the         Discovery             Model             Communication       Operationalize   C
                       following is not                           Planning          Building
                       a part of the data
                       science process?
9        L2            Which of the         Structured            Unstructured      Both A and B        None of the      C
                       following are the                                                                above
                       Data Sources in
                       data science?
10       L5            Which of the         Recommendation        Image &           Online Price        Privacy          D
                       following is not     Systems               Speech            Comparison          Checker
                       an application of                          Recognition
                       data science?
11       L4            Data can be          1                     2                 3                   4                B
                       categorized into
                       ______ groups.
12   L4   Unstructured           TRUE                 FALSE             Can be true or     Can not say        A
          data     is   not                                             false
          organized.
13   L2   A column is a          horizontal           diagonal          vertical           Top                C
          ________
          representation of
          data.
14   L1   A ________ is a        database table       functions         data preparation   data frame         D
          structured
          representation of
          data.
15   L3   We write               npm.                 np.               ng.                ngm.               B
          ______ in front
          of mean to let
          Python know
          that we want to
          activate the
          mean function
          from the NumPy
          library.
16   L4   Point out the          Raw      data   is   Preprocessed      Raw data is the    None of the        A
          correct                original source of   data         is   data obtained      above
          statement.             data                 original          after processing
                                                      source of data    steps
17   L5   Which of the           Statistics           Machine           Data               All of       the   D
          following is one                            Learning          Visualization      above
          of the key data
          science skills?
18   L3   Raw data should        TRUE                 FALSE             Can be true or     Can not say        B
          be       processed                                            false
          only one time.
19   L2   Which of the           Inference            Summarizing       Subsetting         None of the        A
          following is the                                                                 above
          common goal of
          statistical
          modelling?
20   L1   Causal analysis        TRUE                 FALSE             Can be true or     Can not say        B
          is      commonly                                              false
          applied to census
          data.
21   L2   Which of the           Inferential          Descriptive       Causal             All of the         C
          following models                                                                 above
          is usually a gold
          standard for data
          analysis?
22   L4   Which of the           Data Cleaning        Data              Data               All of the         A
          following steps is                          Integration       Replication        above
          performed by a
          data scientist
          after acquiring
          the data?
23   L2   Which of the           Data mining          Big Data          Data wrangling     Machine            A
          following                                                                        Learning
          focuses on the
          discovery of
          (previously)
          unknown
          properties in the
          data?
24   L2   Raw data should        TRUE                 FALSE             Can be true or     Can not say        B
          be processed                                                  false
          only one time.
25   L3   A data scientist     TRUE                   FALSE           Can be true or     Can not say        A
          is a job title for                                          false
          an employee or
          business
          intelligence (BI)
          consultant who
          excels at
          analyzing data,
          particularly large
          amounts of data.
26   L3   Which among          Answer                 Question        Data               None of the        B
          the following is                                                               above
          the top most
          important thing
          in data science?
27   L1   Which approach       Non stratify it        generalize it   randomize it       None of the        C
          should be used if                                                              above
          you can’t fix the
          variable?
28   L2   _________ is a       Have Replication       Generalize to   Measure            All of       the   D
          good way of                                 the problem     variability        above
          performing
          experiments in
          data science.
29   L2   Data fishing is      Data bagging           Data merging    Data dredging      None of the        C
          sometimes                                                                      above
          referred to as
          __________.
30   L1   Data dredging is       Data bagging         Data merging      Data booting       Data               D
          also known as                                                                    snooping
          __________.
31   L3   __________           Data merging           Data booting    Data dredging      All of       the   C
          data mining                                                                    above
          technique is used
          to uncover
          patterns in data.
32   L4   The applications     Healthcare             Fraud     and   Airline Route      All of       the   D
          of Data Science                             Risk            Planning           above
          are __________.                             Detection
33   L5   The data science     Data Science for       Data Science    Drug Discovery     All of       the   D
          applications in      Medical Imaging        for Genomics    with      Data     above
          healthcare are                                              Science
          _______.
34   L3   Features of R are      Open-source          Analytical        Supports           All of the         D
          ________.                                   support           extensions         above
35   L2   Raw Data is also     secondary data         permanent       destination data   eggy data          D
          known as                                    data
          ________.
36   L1   Advantages of        Abundance         of   A Highly Paid   Data Science is    All of       the   D
          Data Science are     Positions              Career          Versatile          above
37   L2   Disadvantages      Large Amount of   Arbitrary Data   Data Science is    All of   the   D
          of Data Science    Domain            May      Yield   Blurry Term        above
          are _______.       Knowledge         Unexpected
                             Required          Results
38   L1   The        most    Encryption        Cryptographic    Encoding           All of   the   D
          common data                          hashing                             above
          loss prevention
          techniques are:
39   L3   What are the       Misconfiguration  Default          Bugs in            All of the     D
          different types                      Settings         Operating          above
          of web server                                         System or web
          vulnerabilities?                                      server
40   L2   What are some      Phishing          Password         All of the above   None of the    C
          common cyber-                        Attacks                             above
          attacks?
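
Question 15 turns on the np. prefix used to call NumPy functions. The sketch below illustrates that convention; it assumes NumPy is installed and imported under its conventional alias np. The sample array and the variable names are hypothetical and chosen only for illustration.

```python
import numpy as np  # "np" is the conventional alias for the NumPy library

# Hypothetical sample data: each row is an observation, each column a variable.
values = np.array([[1.0, 2.0],
                   [3.0, 4.0],
                   [5.0, 6.0]])

# The "np." prefix tells Python to use the mean function from NumPy.
overall_mean = np.mean(values)          # mean of all six values
column_means = np.mean(values, axis=0)  # one mean per column (vertical)

print(overall_mean)   # 3.5
print(column_means)   # [3. 4.]
```

Passing axis=0 averages down each column, which matches the idea in Question 13 that a column is a vertical representation of data.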