0% found this document useful (0 votes)

35 views7 pages

DM Questions

Uploaded by

Uma Mahesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views7 pages

DM Questions

Uploaded by

Uma Mahesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

www.android.universityupdates.in | www.universityupdates.in | www.ios.universityupdates.

te s
d a
U p
i t y
e r s
n i v
U
QUESTION BANK
UNIT-1
Short Answer Questions
QUESTIONS Blooms taxonomy
level
Course
Outcome
1.Define data mining? Understand CO1
2.Explain the functionalities of data mining? Understand CO1
3.Interpret the major issues in data mining? Knowledge CO1
4.Name the steps in knowledge discovery? Knowledge CO1
5.Distinguish between data ware house and data mining? Analyze CO1
Long Answer Questions

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

www.android.universityupdates.in | www.universityupdates.in | www.ios.universityupdates.in

1.Describe Data Mining? In your answer explain the Understanding CO1

following:
a. Is it another hype?
b. Is it simple transformation of technology developed from
databases, statistics and machine learning?
c. Explain how the evolutions of database technology lead to
data mining?
d. Describe the steps involved in data mining when viewed as
knowledge discovery process?
2.Discuss briefly about data smoothing techniques? Creating CO1
3.List and describe the five primitives for specifying the data Analyzing CO1
mining tasks?
4.Define data cleaning? Express the different techniques in
handling the missing values?
Understanding

te s CO1

a
5.Explain mining of huge amount of data (eg: billions of Analyzing CO1
tuples) in comparison with mining a small amount of data
(Eg: data set of few hundred of tuples).

UNIT-2
p d
U
Short Answer Questions
QUESTIONS Blooms taxonomy Course

y
level Outcomes
1.Explain the frequent item set?

i t
2. Explain about maximal frequent items set and closed item
set?

s
Understanding
Knowledge
CO2
CO2

r
3.Name the steps in association rule mining? Understand CO2

e
4.Explain the efficiency of APRIORI algorithm Analyze CO2
5.Define item set? Interpret the support and confidence rules Understand CO2

n i v
for item set A and item set B?
Long Answer Questions
1.Discuss which algorithm is an influential algorithm for
mining frequent item sets for Boolean association rules?
Analysis CO2

U
Explain with an example?
2.Describe the FP-growth algorithm with an example?
3.Explain how to mine frequent item sets using vertical data
format?
4.Explain how to mine the multi dimensional association
rules from relational data bases and data ware houses?
Analysis
Understand

Understand
CO2
CO2

CO2

5.Explain the APRIORI algorithm with an example? Analysis CO2

UNIT-3
Short Answer Questions
QUESTIONS Blooms taxonomy Course
level Outcomes
1.State classification and define regression analysis? Understand CO2

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

www.android.universityupdates.in | www.universityupdates.in | www.ios.universityupdates.in

2.Name the steps in data classification and define training Knowledge CO2
tuple?
3.Explain the IF-THEN rule in classification? Analysis CO3
4.What is tree pruning and define the Naïve Bayes Knowledge CO3
classification?
5.Explain the decision tree? Understand CO3
Long Answer Questions
1.Explain about the classification and discuss with an Analysis CO2
example?
2.Summarize how does tree pruning work? What are some Understanding CO2
enhancements to basic decision tree induction?
3.Describe the working procedures of simple Bayesian
classifier?
4.Discuss about Decision tree induction algorithm?
Analysis

Evaluate

te s CO3

CO3

5.Explain about IF-THEN rules used for classification with an

example and also discuss about sequential covering

d a
Knowledge CO3

p
algorithm?
UNIT-4

U
Short Answer Questions
QUESTIONS Blooms Course

y
taxonomy level Outcomes

t
1.Define clustering? Knowledge CO3

r s i
2.llustrate the meaning of cluster analysis?
3.Explain the different types of data used in clustering?
4.Explain the fields in which clustering techniques are used?
5.State the hierarchical methods?
Knowledge
Knowledge
Understand
Understand
CO3
CO4
CO4
CO4
Long Answer Questions

i v e
1.Discuss various types of data in cluster analysis?
2.Explain the categories of major clustering methods?
Analysis
Understand
CO3
CO3

k-means?

U n
3.Explain in brief about k-means algorithm and portioning in

4.Describe the different types of hierarchical methods?

5.Discuss about the outliers? Explain the weakness and
strengths in hierarchical clustering methods?
Analysis

Knowledge
Knowledge
CO4

CO4
CO4

UNIT-5
Short Answer Questions
QUESTIONS Blooms Course
taxonomy level Outcomes
1.Define Web mining and text mining? Knowledge CO4
2.Write a short note on web content mining. Understand CO4
3.What are the features of Unstructured text mining. Knowledg CO4
4. Write a short note on web structure mining. Understand CO4
5.Write a short note on web usage mining. Understand CO4

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

www.android.universityupdates.in | www.universityupdates.in | www.ios.universityupdates.in

Long Answer Questions

1.Explain about authoritative and Hub pages? Knowledge CO4
2.Give taxonomy of web mining activities.For what purpose Understand CO4
web usage mining is used?
3. what activities are involved in web usage mining? Knowledge CO4
4.Explain Episode rule discovery for texts. Knowledge CO4
5.Write a short note on Text clustering. Understand CO4
Objective Questions:
UNIT-1
1. The Synonym for data mining is
(a)Data warehouse (b)Knowledge discovery in database (c)ETL (d)Business intelligence
2. Data transformation includes which of the following?
a) A process to change data from a detailed level to a summary level
b). A process to change data from a summary level to a detailed level
c) Joining data from one source into various sources of data
te s
d). Separating data from one source into various sources of data

d a
3. Which of the following process includes data cleaning, data integration, data transformation, data
selection, data mining, pattern evaluation and knowledge presentation?
A. KDD process

(a)Business requirements level

(c) Detailed models level
U p
B. ETL process C. KTL process D. None of the above
4. At which level we can create dimensional models?
(b) Architecture models level
(d)Implementation level (e)Testing level.

i y
5. What are the specific application oriented databases?

t
A. Spatial databases, B. Time-series databases, C. Both a & b D. None of these
UNIT-2

A. Binary attribute.
attribute.
r s
1. Association rules are always defined on________.
B. Single attribute.

e
C. Relational database. D. Multidimensional

v
2. __________ is data about data.

i
A. Metadata. B. Microdata. C. Minidata D. Multidata.
3. Which of the following is the data mining tool?

U n
A. C. B. Weka. C. C++. D. VB.
4. Capability of data mining is to build __________ models.
A. Retrospective. B. Interrogative. C. Predictive. D. Imperative.
5. The _________is a process of determining the preference of customer’s majority.
A. Association. B. Preferencing. C. segmentation. D. classification.
UNIT-3
1. Another name for an output attribute.
a. predictive variable
b. independent variable
c. estimated variable
d. dependent variable
2. Classification problems are distinguished from estimation problems in that
a. classification problems require the output attribute to be numeric.
b. classification problems require the output attribute to be categorical.
c. classification problems do not allow an output attribute.

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

www.android.universityupdates.in | www.universityupdates.in | www.ios.universityupdates.in

d. classification problems are designed to predict future outcome.

3. Which statement is true about prediction problems?
a. The output attribute must be categorical.
b. The output attribute must be numeric.
c. The resultant model is designed to determine future outcomes.
d. The resultant model is designed to classify current behavior.
4. Which statement about outliers is true?
a. Outliers should be identified and removed from a dataset.
b. Outliers should be part of the training dataset but should not be present in the test
data.
c. Outliers should be part of the test dataset but should not be present in the training

s
data.

te
d. The nature of the problem determines how outliers are used.
e. More than one of a,b,c or d is true.
5. Which statement is true about neural network and linear regression models?
a. Both models require input attributes to be numeric.
b. Both models require numeric attributes to range between 0 and 1.
c. The output of both models is a categorical attribute value.

d a
e. More than one of a,b,c or d is true.
Unit IV
U p
d. Both techniques build models whose output is determined by a linear sum of
weighted input attribute values.

Multiple Choice Questions

t y
1. A trivial result that is obtained by an extremely simple method is called _______.

i
A. naive prediction. B. accurate prediction. C. correct prediction. D. wrong prediction.

s
2. K-nearest neighbor is one of the _______.

r
A. learning technique. B. OLAP tool. C. purest search technique. D. data warehousing tool.
3. Enrichment means ____.

e
A. adding external data. B. deleting data. C. cleaning data. D. selecting the data.

v
4. Clustering methods are______.

i
A. Hierarchical. B. Agglomarative. C. PAM algorithm. D. K-nearest neighbor. E. All the
above
UNIT-V
n
U
1. HITS abbreviation in Web Structure?
a. Hyperlink-Index Topic Search b. Hyperlink-Induces Topic Search
c. Hyperlink-Identification Text Search d. Hyperlink-Index Text Search
2. Preprocessing Web log activity is?
a. Count patterns that occur in sessions b. Remove extraneous Information
c. Count Page references d. Pattern Setting
3. Periodic Crawler defines?
a. Visits Portions of the Web b. Selectively searches the Web
c. Visits pages related to a particular subject d. Collect Information from visited pages
4. Which is assigns relevance score to each page based on crawl topic?
a. Distiller b. Hub pages
c. Hypertext Classifier d. scores

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

www.android.universityupdates.in | www.universityupdates.in | www.ios.universityupdates.in

5. What is main Objective of web mining?

a. Web Component, Score and Usage Mining b. Web Control, Text and Utility Mining
c. Web Content, Score and Utility Mining d. Web Content, Structure and Usage

Fill in the blanks:

Unit 1

s
1. Data Mining_________predicts future trends & behaviors, allowing business managers to
make proactive, knowledge-driven decisions

te
2. Data Cleaning is a process that removes …outliers………………..
3. The output of KDD is useful information

d a
4. Data Discrimination is a comparison of the general features of the target class data objects
against the general features of objects from one or multiple contrasting classes

p
5. Strategic value of data mining is time-sensitive

Unit 2

y U
t
1. ____Referencing_________ is a process of determining the preference of customer's
majority.

s i
2. __Data Mart__________ is a metadata repository

r
3. The two steps in Apriori includes …join…………. and ……prune……..
4. FP Growth stands for ……Frequent pattern growth………………..

Unit 3

i v e
5. Use normalization by decimal scaling to transform the value 35 for age……0.35………..

U n
1. Classification is the process of finding a model (or function) that describes and
distinguishes data classes or concepts.
2. Data mining methods discard outliers as noise or exceptions.
3. Prediction also used for to know the unknown or missing values.
4. In a decision tree, leaf nodes represent class labels or class distribution.
5. Decision Tree is constructed in a top-down recursive divide-and-conquer manner.
Unit 4:

1. A cluster analysis is the process of analysing the various clusters to organize the different
objects into meaningful and descriptive object.
2. …Agglomerative…………… clustering follows bottom up strategy
3. PAM means… “partition around medoids”……. …………………..
4. Bayesian classifiers exhibited high accuracy and speed when applied to large databases.

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

www.android.universityupdates.in | www.universityupdates.in | www.ios.universityupdates.in

5. Most data mining methods discard outliers as noise or exceptions.

Unit 5:

1. Hub Pages Contain links to many relevant pages

2. PageRank, CLEVER Techniques used in Web Structure Mining
3. Weighting is used to provide more importance to backlinks coming form important pages
4.PageRank equation PR(p)=c(PR(1)/N1 +...+PR(n)/Nn)
5.What is the use of CLEVER? Finding both Authoritative and Hub pages.
XI.WEBSITES:
1. www.autonlab.org/tutorials : Statistical Data mining Tutorials
2. www- db.standford.edu /`ullman/mining/mining.html : Data mining lecture notes

s
3.ocw.mit.edu/ocwweb/slon-School-of-management/15-062Data- MiningSpring2003/course

te
home/index.htm: MIT Data mining open courseware
XII.EXPERT DETAILS:
1. Jiaweihan, Abel Bliss Professor, Department of Computer Science, Univ. of Illinois at Urbana-
Champaign Rm 2132, Siebel Center for Computer Science

d a
2. Michelinekamber, Researcher,Master's degree in computer science (specializing in artificial
intelligence) from Concordia University, Canada

XIII.JOURNALS:

U p
3. Arun k pujari, Vice Chancellor, Central University Of Rajasthan - Central University Of
Rajasthan

1. Data warehousing, data mining, OLAP and OLTP technologies are essential elements to support
decision-making process in Industries

t y
2. Effective navigation of query results based on concept hierarchy

i
3. Advanced clustering data mining text algorithm

1. Fundamentals of Data Mining

e
2. Data Mining functionalities s
XIV.LIST OF TOPICS FOR STUDENT SEMINARS:

r
3. Classification of data mining system

v
4. Pre-processing Techniques

i
5. APRIORI Algorithm

n
6. FP-Growth Algorithm
7. Spatial data mining

U
8. Web mining
9. Trends and applications of data mining
10. Text mining

XV.CASE STUDIES / SMALL PROJECTS:

Case study-1:
Search queries on biomedical databases, such as PubMed, often return a large number of results,
only a small subset of which is relevant to the user. Ranking and categorization, which can also be
combined, have been proposed to alleviate this information overload problem. Results
categorization for biomedical databases is the focus of this work. A natural way to organize
biomedical citations is according to their MeSH annotations. First, the query results are organized
into a navigation tree

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

Data Mining Question Bank
No ratings yet
Data Mining Question Bank
8 pages
Bca IV Data Minining Qestion Bank - Dr. KK Sharma Socsa
No ratings yet
Bca IV Data Minining Qestion Bank - Dr. KK Sharma Socsa
5 pages
DM-Question Bank 2024-25 Objective Question Bank
No ratings yet
DM-Question Bank 2024-25 Objective Question Bank
14 pages
Write Your Roll Number: Time: Hours Max. Marks
No ratings yet
Write Your Roll Number: Time: Hours Max. Marks
2 pages
Updated DWDM Question Bank 2021-22, I Sem
No ratings yet
Updated DWDM Question Bank 2021-22, I Sem
4 pages
Subject Question Bank-1
No ratings yet
Subject Question Bank-1
6 pages
DM QB
No ratings yet
DM QB
25 pages
DM Question Bank
No ratings yet
DM Question Bank
5 pages
DM - One Word Old
No ratings yet
DM - One Word Old
13 pages
Data Mining Mid Question Bank 2025 2026
No ratings yet
Data Mining Mid Question Bank 2025 2026
20 pages
Classification
No ratings yet
Classification
14 pages
Script of E - Previous Question Papers - URR18 03.08.2023 - VI Semester - U18CS605 PDF
No ratings yet
Script of E - Previous Question Papers - URR18 03.08.2023 - VI Semester - U18CS605 PDF
10 pages
Q1R Ext
No ratings yet
Q1R Ext
4 pages
191CSC503T - Data Mining-Cat 2-Question Bank
No ratings yet
191CSC503T - Data Mining-Cat 2-Question Bank
6 pages
Data Mining Question Bank
0% (1)
Data Mining Question Bank
7 pages
Data Mining IMP Objective Questions - Sep 2023
No ratings yet
Data Mining IMP Objective Questions - Sep 2023
4 pages
Data Mining Long Answers
No ratings yet
Data Mining Long Answers
4 pages
Data Mining Question Bank
No ratings yet
Data Mining Question Bank
4 pages
DMA Question Bank
No ratings yet
DMA Question Bank
4 pages
Data Mining CO2 2marks Answers
No ratings yet
Data Mining CO2 2marks Answers
2 pages
DM
No ratings yet
DM
7 pages
BTech Data Mining Exam Prep
No ratings yet
BTech Data Mining Exam Prep
8 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
13 pages
Department of Computer Science and Design
No ratings yet
Department of Computer Science and Design
4 pages
Data Mining - DM 1-5 Question Bank
No ratings yet
Data Mining - DM 1-5 Question Bank
10 pages
Unit4 Mcqs
No ratings yet
Unit4 Mcqs
7 pages
DM IV YR MID2 Set2
No ratings yet
DM IV YR MID2 Set2
4 pages
Data Mining and Warehousing
No ratings yet
Data Mining and Warehousing
7 pages
Data Mining Merged
No ratings yet
Data Mining Merged
10 pages
Subject Code: 80359 Subject Name: Data Warehousing and Data Mining Common Subject Code (If Any)
No ratings yet
Subject Code: 80359 Subject Name: Data Warehousing and Data Mining Common Subject Code (If Any)
9 pages
Data Warehousing & Clustering Guide
No ratings yet
Data Warehousing & Clustering Guide
9 pages
Mcqs Unit 3
No ratings yet
Mcqs Unit 3
6 pages
DM Imp Bits
No ratings yet
DM Imp Bits
4 pages
CXCXX C C
No ratings yet
CXCXX C C
14 pages
QB Data Mining
No ratings yet
QB Data Mining
5 pages
Vi Sem Bca Qbank - Wcms - Fds
50% (2)
Vi Sem Bca Qbank - Wcms - Fds
11 pages
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
No ratings yet
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
31 pages
DWDM QB
No ratings yet
DWDM QB
6 pages
DM Obj
No ratings yet
DM Obj
16 pages
Data Warehousing & Mining Exam 2019
No ratings yet
Data Warehousing & Mining Exam 2019
4 pages
1 - Page
No ratings yet
1 - Page
11 pages
DMW MCQ
No ratings yet
DMW MCQ
388 pages
Data Mining
100% (1)
Data Mining
7 pages
Oral Questions LP II
No ratings yet
Oral Questions LP II
21 pages
Datamining Quiz
No ratings yet
Datamining Quiz
173 pages
Data Mining
No ratings yet
Data Mining
8 pages
DWDM SR2
No ratings yet
DWDM SR2
21 pages
Seperated
No ratings yet
Seperated
11 pages
DWDM Previous
No ratings yet
DWDM Previous
10 pages
DMDW QB
No ratings yet
DMDW QB
4 pages
Pec Cs 602b Cse Final
No ratings yet
Pec Cs 602b Cse Final
6 pages
DWDM MID - 2 Question Paper and Online Bits
No ratings yet
DWDM MID - 2 Question Paper and Online Bits
3 pages
BE Information Technology 0
No ratings yet
BE Information Technology 0
655 pages
DM 100
No ratings yet
DM 100
17 pages
Introduction To Statistical Data Analysis For The Life Sciences 1st Edition Srensen PDF Download
100% (4)
Introduction To Statistical Data Analysis For The Life Sciences 1st Edition Srensen PDF Download
77 pages
Scales and Its Types
No ratings yet
Scales and Its Types
7 pages
Data Collection Essentials
No ratings yet
Data Collection Essentials
1 page
Unit 4 Big Data Complete Notes
No ratings yet
Unit 4 Big Data Complete Notes
32 pages
Practical Research 2: Quarter 1 - Module 1
90% (10)
Practical Research 2: Quarter 1 - Module 1
35 pages
Psychological Research Goals
No ratings yet
Psychological Research Goals
11 pages
Factors in Uencing The Usage and Selection of Project Management Software
No ratings yet
Factors in Uencing The Usage and Selection of Project Management Software
12 pages
Chi Square Test of Proportion
No ratings yet
Chi Square Test of Proportion
3 pages
Engineering Data Analysis
No ratings yet
Engineering Data Analysis
12 pages
Household Food Insecurity
100% (1)
Household Food Insecurity
31 pages
Unit 1
No ratings yet
Unit 1
27 pages
Fundamentals of Biostatistics 8th Edition PDF
No ratings yet
Fundamentals of Biostatistics 8th Edition PDF
39 pages
Business Research Essentials
No ratings yet
Business Research Essentials
19 pages
ML-QB-Unit 1
No ratings yet
ML-QB-Unit 1
41 pages
R Manual To Agresti's Categorical Data Analysis
100% (1)
R Manual To Agresti's Categorical Data Analysis
280 pages
Discriminant Function Analysis Guide
100% (1)
Discriminant Function Analysis Guide
30 pages
Feature Selection Techniques in Machine Learning - Javatpoint
No ratings yet
Feature Selection Techniques in Machine Learning - Javatpoint
9 pages
Association Between Socioeconomic Status, Food Security, and Dietary Diversity Among Sociology Students at The Central University of Venezuela
No ratings yet
Association Between Socioeconomic Status, Food Security, and Dietary Diversity Among Sociology Students at The Central University of Venezuela
9 pages
R19 - Mech - VI - AUTOMATION AND ARTIFICIAL INTELLIGENCE - Sample - Question - Bank PDF
100% (1)
R19 - Mech - VI - AUTOMATION AND ARTIFICIAL INTELLIGENCE - Sample - Question - Bank PDF
5 pages
Thesis
No ratings yet
Thesis
37 pages
Flight Disruption Analysis Project
No ratings yet
Flight Disruption Analysis Project
17 pages
03b.session Notes On Dummy Variable Regression
No ratings yet
03b.session Notes On Dummy Variable Regression
5 pages
(Ebook PDF) Intro Stats Pearson New International Edition Install Download
No ratings yet
(Ebook PDF) Intro Stats Pearson New International Edition Install Download
50 pages
Prediction of Ocean Import Shipment Lead Time Using Machine Learning Methods
No ratings yet
Prediction of Ocean Import Shipment Lead Time Using Machine Learning Methods
20 pages
HR Analytics-: Data & Analysis Strategies
No ratings yet
HR Analytics-: Data & Analysis Strategies
27 pages
Stats Unit1
No ratings yet
Stats Unit1
27 pages
Adstat Final Exam Reviewer2
No ratings yet
Adstat Final Exam Reviewer2
29 pages
WUOAE 3 4 Manuscript FULL
No ratings yet
WUOAE 3 4 Manuscript FULL
30 pages
02 DataCategorization
No ratings yet
02 DataCategorization
41 pages
1.1 Descriptive Statistics
100% (1)
1.1 Descriptive Statistics
56 pages

DM Questions

Uploaded by

DM Questions

Uploaded by

www.android.universityupdates.in | www.universityupdates.in | www.ios.universityupdates.

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

1.Describe Data Mining? In your answer explain the Understanding CO1

5.Explain the APRIORI algorithm with an example? Analysis CO2

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

5.Explain about IF-THEN rules used for classification with an

4.Describe the different types of hierarchical methods?

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

Long Answer Questions

(a)Business requirements level

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

d. classification problems are designed to predict future outcome.

Multiple Choice Questions

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

5. What is main Objective of web mining?

Fill in the blanks:

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

5. Most data mining methods discard outliers as noise or exceptions.

1. Hub Pages Contain links to many relevant pages

1. Fundamentals of Data Mining

XV.CASE STUDIES / SMALL PROJECTS:

www.android.previousquestionpapers.com | www.previousquestionpapers.com | www.ios.previousquestionpapers.com

You might also like