UNIT-I
S.No Leve Leve Question A B C D ANSWE
. l l R
1 L1 Data science is organizing data processing analysing data All of the D
the process of data above
diverse set of
data through ?
2 L3 The modern William S. John Arthur Samuel Satoshi A
conception of McCarthy Nakamoto
data science as
an independent
discipline is
sometimes
attributed to?
3 L2 Which of the C C++ R Ruby C
following
language is used
in Data science?
4 L4 Which of the Subsetting can be Raw data Merging None Of the B
following is used to select and should be concerns above
false? exclude variables processed combining
and observations only one time. datasets on the
same
observations to
produce a result
with more
variables
5 L3 What is the work utilize large data work with build data All of the C
of Data sets to gather businesses to solutions that above
Architect? information that determine the are optimized
meets their best usage of for performance
company's needs the and design
information applications
yielded from
data
6 L3 Which of the Probability & Machine Data Wrangling All of the D
following is Statistics Learning / above
correct skills for Deep
a Data Scientist? Learning
7 L1 Which of the Data Engineering Advanced Domain All of the D
following are Computing expertise above
correct
component for
data science?
8 L2 Which of the Discovery Model Communication Operationaliz C
following is not Planning Building e
a part of data
science process?
9 L2 Which of the Structured UnStructured Both A and B None Of the C
following are the above
Data Sources in
data science?
10 L5 Which of the Recommendation Image & Online Price Privacy D
following is not Systems Speech Comparison Checker
a application for Recognition
data science?
11 L4 Data can be 1 2 3 4 B
categorized into
______ groups.
12 L4 Unstructured TRUE FALSE Can be true or Can not say A
data is not false
organized.
13 L2 A column is a horizontal diagonal vertical Top C
________
representation of
data.
14 L1 A ________ is a database table functions data prepration data frame D
structured
representation of
data.
15 L3 We write npm. np. ng. ngm. B
______ in front
of mean to let
Python know
that we want to
activate the
mean function
from the Numpy
library.
16 L4 Point out the Raw data is Preprocessed Raw data is the None of the A
correct original source of data is data obtained above
statement. data original after processing
source of data steps
17 L5 Which of the Statistics Machine Data All of the D
following is one Learning Visualization above
of the key data
science skills?
18 L3 Raw data should TRUE FALSE Can be true or Can not say B
be processed false
only one time.
19 L2 Which of the Inference Summarizing Subsetting None of the A
following is the above
common goal of
statistical
modelling?
20 L1 Causal analysis TRUE FALSE Can be true or Can not say B
is commonly false
applied to census
data.
21 L2 Which of the Inferential Descriptive Causal All of the C
following model above
is usually a gold
standard for data
analysis?
22 L4 Which of the Data Cleaning Data Data All of the A
following step is Integration Replication above
performed by
data scientist
after acquiring
the data?
23 L2 Which of the Data mining BigData Data wrangling Machine A
following Learning
focuses on the
discovery of
(previously)
unknown
properties on the
data?
24 L2 Raw data should TRUE FALSE Can be true or Can not say B
be processed false
only one time.
25 L3 A data scientist TRUE FALSE Can be true or Can not say A
is a job title for false
an employee or
business
intelligence (BI)
consultant who
excels at
analyzing data,
particularly large
amounts of data.
26 L3 Which among Answer Question Data None of the B
the following is above
the top most
important thing
in data science?
27 L1 Which approach Non stratify it generalize it randomize it None of the C
should be used if above
you can’t fix the
variable?
28 L2 _________ is a Have Replication Generalize to Measure All of the D
good way of the problem variability above
performing
experiments in
data science.
29 L2 Data fishing is Data bagging Data merging Data dredging None of the C
sometimes above
referred to as
__________.
30 L1 Data dredging, is Data bagging Data merging Data booting Data D
also known as snooping
__________.
31 L3 __________ Data merging Data booting Data dredging All of the C
data mining above
technique is used
to uncover
patterns in data.
32 L4 The applications Healthcare Fraud and Airline Route All of the D
of Data Science Risk Planning above
are __________. Detection
33 L5 The data science Data Science for Data Science Drug Discovery All of the D
applications in Medical Imaging for Genomics with Data above
healthcare are Science
_______.
34 L3 Features of R are
Analytical Supports All of the
________. Open-source D
support extensions above
35 L2 Raw Data is also secondary data permanent destination data eggy data D
known as data
________.
36 L1 Advantages of Abundance of A Highly Paid Data Science is All of the D
Data Science are Positions Career Versatile above
37 L2 Disadvantages Large Amount of Arbitrary Data Data Science is All of the D
of Data Science Domain May Yield Blurry Term above
are _______. Knowledge Unexpected
Required Results
38 L1 The most Encryption Cryptographic Encoding All of the D
common data hashing above
loss prevention
techniques are:
39 L3 Misconfiguratio Default Bugs in All of the D
What are the n Settings Operating above
System or web
different types server
of web server
vulnerabilities?
40 L2 Phishing Password All of the above None of the C
What are some Attacks above
common cyber-
attacks?