0% found this document useful (0 votes)

75 views5 pages

Major Issues in DM

The document discusses major issues in data mining, including the need for diverse knowledge mining techniques, interactive mining processes, and the incorporation of domain knowledge. It highlights challenges related to handling various data types, ensuring efficiency and scalability of algorithms, and evaluating the interestingness of discovered patterns. Additionally, it addresses the complexities of mining from heterogeneous databases and the necessity for specialized systems to manage different data forms.

Uploaded by

sajusancharam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

75 views5 pages

Major Issues in DM

Uploaded by

sajusancharam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

1.

10 Major Issues in Data Mining

and interaction issues
Mining methodology user

These reflect the kinds of knowledge mined, the ability to mine

knowledge at multiple granularities, the use of domain knowledge, ad
hoc mining and knowledge visualization.
Mining different kinds of knowledge in databases:
Since different
users are interested in different kinds of knowledge, data mining should
cover a wide spectrum of data analysis and knowledge discovery tasks
including data characterization, discrimination, association and
correlation analysis, classification, prediction, clustering and outlier
different
analysis. These tasks may use the same database in ways and

require the development of numerous data mining techniques.

Interactive mining of knowledge at multiple levels ofabstraction
Since it is difficult to know exactly what can be discovered within a
database, the data mining process should be interactive. For databases

containing a huge amount of data, appropriate sampling techniques can

be applied to facilitate interactive data exploration. Interactive mining
allow users to focus the search for patterns, refining data mining requests
based on returned results. In this way, user can interact with the data
mining system to view data and discovered patterns at multiple
granularities and from different angles.

Incorporation of background knowledge: Background knowledge

or information regarding the domain under study may be used to guide.
the discovery process and allow discovered patterns to be expressed in
concise terms and at different levels of abstraction. Domain knowledge
related to databases, such as integrity constraints can help focus and
speed up a data mining process, or judge the interestingness of discovered
patternsS.

Data mining query languages and adhoc data mining: Relational

query languages (such as SQL) allow users to pose adhoc queries for
data retrieval. In a similar way, high-level data mining query languaged
need to be developed to allow users to describe adhoc data mining. This
include tasks of specifying relevant sets of data for analysis, the doma"
uledge. the kinds of knowledge to be mined and the conditions and
to be enforced on the discovered patterns. Such
o n s t r a i n t s

a
language
l d be integrated with a database or data warehouse query language

optimizcd for efticient and flexible data mining

and

Preventation and visualization ofdata mining results: Discovered

nawledge should be expressed in high-level languages, visual

forms so that the knowledge can be

reprcsentations. or other expressive
and directly usable by humans. This is especiallIy
casily understood
is to be interactive. This requires the
crucial if the data mining system
such
svstem to adopt expressive knowledge representation techniques,
as trees. tables, rules, graphs, charts, crosstabs. matrices or curves.

data stored in database

Handling noisy or incomplete data: The
a

When
may reflect noise, exceptional cases, or incomplete data objects.
confuse the process, causing
mining data regularities, these objects may
the knowledge model constructed to overtit the
data. As a result, the

can be poor. Data cleaning methods

accuracy of the discovered patterns
and data required, as well as
analysis methods that can handle noise are

outlier mining methods for the discovery and analysis of exceptional

cases.

Pattern evaluation- the interestingness problem: A data mining

system can uncover thousands of patterns. Many of the patterns
discovered may be uninteresting to the given user, either because they

represent common knowledge or lack novelty. Several challenges remain

Tegarding the development of techniques to assess the interestingness
of discovered patterns. The use o f interestingness measures or ser-

p e c i l i e d c o n s t r a i n t s to g u i d e the d i s c o v e r y p r o c e s s and reduce t e

space is another active arca of research.

P'erformance issues

efticiency. scalability
and parallelization of ddata
These include

mining algorithms.

algorithms: To efectivel
Eficiency and scalability of data mining
extract informationfrom a amount
huge data in databases, data minino
of g
words, the runnino
algorithms must be efficient and scalable. In other Ang
must be predictable and acceptable in
time of a data mining algorithm
on knowledge discovery
large databases. From a database perspective
of data
efticiency and scalability are key issues in the implementation
issues discussed above under mining
mining systems. Many of the
consider efficiency and
methodology and user interaction must also
scalability.
Parallel, distributed, and incremental mining algorithms: The huge
size of many databases, the wide distribution of data and the

computational complexity of some data mining methods are factors

motivating the development of parallel and distributed data mining

algorithms. Such algorithms divide the data into partitions, which are
processed in parallel. The results from the partitions are then merged.

Moreover, the high cost of some data mining processes promotes

the need for incremental data mining algorithms. Such algorithms
perform knowledge modification incrementally to amend and strengthen
what was previously discovered.

Issues relating to the diversity of database types

Handling of relational and complex types of data: Since relationa

databases and data warehouses are widely used, the development ol

nt.
efficient and effective data mining systems for such data is important
However, other databases may contain complex data objects, hyperteext
and multimedia data, spatial lata, temporal data, or
transaction data. It
unrealistie
is unrealistic
is
to expect one system to mine all kinds of data,
given the
diversity of data types and dilferent
goals of data mining. Specific data
nining systems should be constructed for
mining specific kinds of data
Therefore. one may expect to have different data mining systems for

different kinds of data.

Mining information from heterogeneous databases and global

information systems: Local and wide-area computer networks (such as
the Internet) connect many sources of data, forming huge, distributed
and heterogeneous databases. The discovery of knowledge from different
sources of structured, semi-structured, or unstructured data with diverse
data semantics poses great challenges to data mining. Data mining may

help to disclose high-level data regularities in multiple heterogeneous

databases. They are unlikely to be discovered by simple query systems
and may improve information exchange and interoperability in

heterogeneous databases. Web mining, which uncovers interesting

knowledge about Web contents, Web structures, Web usage and Web
dynamics, becomes a very challenging area in data mining.

Data Mining Issues
No ratings yet
Data Mining Issues
5 pages
5 Major Issues 10 Feb 2021material I 10 Feb 2021 Mod1 Issues
No ratings yet
5 Major Issues 10 Feb 2021material I 10 Feb 2021 Mod1 Issues
5 pages
Major Issues in Data Mining
No ratings yet
Major Issues in Data Mining
1 page
Data Mining Task Primitives and Major Issues
No ratings yet
Data Mining Task Primitives and Major Issues
18 pages
Unit 3 Data Mining
No ratings yet
Unit 3 Data Mining
21 pages
DM Lesson3
No ratings yet
DM Lesson3
14 pages
Data Mining
No ratings yet
Data Mining
22 pages
Data Mining: Key Issues and Tasks
No ratings yet
Data Mining: Key Issues and Tasks
5 pages
Laq 1
No ratings yet
Laq 1
2 pages
L-1 Data Mining Issues
No ratings yet
L-1 Data Mining Issues
24 pages
Advanced Databases and Mining Unit 4
No ratings yet
Advanced Databases and Mining Unit 4
10 pages
Major Issues in Data Mining
No ratings yet
Major Issues in Data Mining
2 pages
Notes For DMDWH - Module1
No ratings yet
Notes For DMDWH - Module1
21 pages
Data Mining Challenges Explained
No ratings yet
Data Mining Challenges Explained
4 pages
1.data Mining Functionalities
No ratings yet
1.data Mining Functionalities
14 pages
Adm 4 ND 5
No ratings yet
Adm 4 ND 5
51 pages
Unit III
No ratings yet
Unit III
101 pages
Unit-1 Notes Onl
No ratings yet
Unit-1 Notes Onl
25 pages
DM Chapter 1
No ratings yet
DM Chapter 1
10 pages
Week 1-2
No ratings yet
Week 1-2
3 pages
Data Mining & Warehousing Basics
No ratings yet
Data Mining & Warehousing Basics
30 pages
Chapter 1. Introduction
No ratings yet
Chapter 1. Introduction
323 pages
Data Mining
No ratings yet
Data Mining
44 pages
WINSEM2024-25 MCSE615L TH VL2024250502897 2024-12-19 Reference-Material-I
No ratings yet
WINSEM2024-25 MCSE615L TH VL2024250502897 2024-12-19 Reference-Material-I
58 pages
Data Mining - KTUweb PDF
No ratings yet
Data Mining - KTUweb PDF
82 pages
Whats App
No ratings yet
Whats App
23 pages
Data Mining Essentials for Analysts
No ratings yet
Data Mining Essentials for Analysts
73 pages
Chapter-1 - Introduction To Data Mining
No ratings yet
Chapter-1 - Introduction To Data Mining
10 pages
Data Mining & KDD Overview
No ratings yet
Data Mining & KDD Overview
22 pages
Data Mining Notes UNIT I
No ratings yet
Data Mining Notes UNIT I
21 pages
DMWH M1
No ratings yet
DMWH M1
25 pages
Data Mining Summaries PDF
No ratings yet
Data Mining Summaries PDF
22 pages
Unit
No ratings yet
Unit
27 pages
Unit 1 DMW
No ratings yet
Unit 1 DMW
41 pages
DWDM Unit II Notes
No ratings yet
DWDM Unit II Notes
22 pages
Week1 2
No ratings yet
Week1 2
24 pages
Data Mining
No ratings yet
Data Mining
26 pages
DM-Model Question Paper Solutions
No ratings yet
DM-Model Question Paper Solutions
27 pages
Unit-I Data Mining
No ratings yet
Unit-I Data Mining
28 pages
DW and DM Notes
No ratings yet
DW and DM Notes
89 pages
Data Mining
No ratings yet
Data Mining
44 pages
Data Mining Notes
No ratings yet
Data Mining Notes
82 pages
Data Mining Essentials Explained
No ratings yet
Data Mining Essentials Explained
24 pages
DWM - Module 2
No ratings yet
DWM - Module 2
74 pages
Data Mining Mod 1 Notes
No ratings yet
Data Mining Mod 1 Notes
25 pages
DWH Unit 3
No ratings yet
DWH Unit 3
7 pages
Data Mining Notes1
No ratings yet
Data Mining Notes1
56 pages
DM Notes
No ratings yet
DM Notes
91 pages
Unit 3
No ratings yet
Unit 3
34 pages
Assignment 1
No ratings yet
Assignment 1
6 pages
Data Mining
No ratings yet
Data Mining
27 pages
Kinds of Data: 1. Data Bases Data 2.data Warehouses Data 3. Transactional Data
No ratings yet
Kinds of Data: 1. Data Bases Data 2.data Warehouses Data 3. Transactional Data
24 pages
Data Mining-CH5
No ratings yet
Data Mining-CH5
49 pages
Module 4
No ratings yet
Module 4
54 pages
DWM Notes Class by Proff
No ratings yet
DWM Notes Class by Proff
88 pages
DM&DW SEE Module 1
No ratings yet
DM&DW SEE Module 1
6 pages
Data Warehousing & Data Mining Syllabus Subject Code:56055 L:4 T/P/D:0 Credits:4 Int. Marks:25 Ext. Marks:75 Total Marks:100
No ratings yet
Data Warehousing & Data Mining Syllabus Subject Code:56055 L:4 T/P/D:0 Credits:4 Int. Marks:25 Ext. Marks:75 Total Marks:100
52 pages
Unit 1 and 2
No ratings yet
Unit 1 and 2
145 pages
Analyses Attacker Techniques Using Honeypots
No ratings yet
Analyses Attacker Techniques Using Honeypots
30 pages
Ey Does Integration Playbook Tackle Merger Acquisition Challenge Manufacturing v1 20180914
No ratings yet
Ey Does Integration Playbook Tackle Merger Acquisition Challenge Manufacturing v1 20180914
4 pages
Hs 285 Group Presentation Grading Rubric Group 6 Sec 06
No ratings yet
Hs 285 Group Presentation Grading Rubric Group 6 Sec 06
4 pages
Wealth Management Client Portal
No ratings yet
Wealth Management Client Portal
3 pages
Tute1 Stacks
No ratings yet
Tute1 Stacks
3 pages
SLC 2023 Scheme PPT For BOS
No ratings yet
SLC 2023 Scheme PPT For BOS
15 pages
Nelson 5930i Easy Set Hose Timer Owners Manual
No ratings yet
Nelson 5930i Easy Set Hose Timer Owners Manual
2 pages
Salesforce & ServiceNow Certification Program
No ratings yet
Salesforce & ServiceNow Certification Program
14 pages
Compal Confidential: CSL50/CSL52 Schematics Document
No ratings yet
Compal Confidential: CSL50/CSL52 Schematics Document
43 pages
B1.2 Speaking
No ratings yet
B1.2 Speaking
5 pages
Syem Modelling and Simulation Final Exam
No ratings yet
Syem Modelling and Simulation Final Exam
2 pages
Kali Prospective Customer List11.ods
100% (1)
Kali Prospective Customer List11.ods
6 pages
Antamedia Features PDF
No ratings yet
Antamedia Features PDF
5 pages
ALS30C1023NP Capacitor Kit 6 PCS
No ratings yet
ALS30C1023NP Capacitor Kit 6 PCS
2 pages
BESS Control
No ratings yet
BESS Control
7 pages
FS-DG701 Gas Detector Install Guide
100% (1)
FS-DG701 Gas Detector Install Guide
4 pages
Applications of Matrices To Business and Economics
93% (175)
Applications of Matrices To Business and Economics
24 pages
Visual Basic Programming Assignments
No ratings yet
Visual Basic Programming Assignments
74 pages
Cycle Count1
No ratings yet
Cycle Count1
2 pages
Bba 3 - Iit - U1
No ratings yet
Bba 3 - Iit - U1
5 pages
Literature Review On Production Management
100% (1)
Literature Review On Production Management
6 pages
Fujitsu LZAS Service Manual
No ratings yet
Fujitsu LZAS Service Manual
171 pages
Day 5
No ratings yet
Day 5
4 pages
C C Bill April-Merged
No ratings yet
C C Bill April-Merged
4 pages
eyeOS 2.5 Styling Guide
No ratings yet
eyeOS 2.5 Styling Guide
2 pages
IBM Power System E980: Technical Overview and Introduction
No ratings yet
IBM Power System E980: Technical Overview and Introduction
184 pages
ECE422L Activity No. 01 Ohm's Law MarasiganAA, UmaliGCD
No ratings yet
ECE422L Activity No. 01 Ohm's Law MarasiganAA, UmaliGCD
13 pages
Valvulas Zme de Bosch
No ratings yet
Valvulas Zme de Bosch
5 pages
Kingston Memory KHX18C10AT3K2 - 16X
No ratings yet
Kingston Memory KHX18C10AT3K2 - 16X
2 pages
Session 24 - SRAM & Computation-In-Memory
No ratings yet
Session 24 - SRAM & Computation-In-Memory
153 pages

Major Issues in DM

Uploaded by

Major Issues in DM

Uploaded by

1.

10 Major Issues in Data Mining

These reflect the kinds of knowledge mined, the ability to mine

require the development of numerous data mining techniques.

containing a huge amount of data, appropriate sampling techniques can

Incorporation of background knowledge: Background knowledge

Data mining query languages and adhoc data mining: Relational

optimizcd for efticient and flexible data mining

Preventation and visualization ofdata mining results: Discovered

forms so that the knowledge can be

data stored in database

can be poor. Data cleaning methods

outlier mining methods for the discovery and analysis of exceptional

Pattern evaluation- the interestingness problem: A data mining

represent common knowledge or lack novelty. Several challenges remain

p e c i l i e d c o n s t r a i n t s to g u i d e the d i s c o v e r y p r o c e s s and reduce t e

space is another active arca of research.

computational complexity of some data mining methods are factors

Moreover, the high cost of some data mining processes promotes

Issues relating to the diversity of database types

Handling of relational and complex types of data: Since relationa

different kinds of data.

Mining information from heterogeneous databases and global

help to disclose high-level data regularities in multiple heterogeneous

heterogeneous databases. Web mining, which uncovers interesting

You might also like