0% found this document useful (0 votes)
32 views10 pages

Thúy

sxsxs

Uploaded by

vbui9695
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
32 views10 pages

Thúy

sxsxs

Uploaded by

vbui9695
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 10

1

Table of content
I. Introduction............................................................................................................................... 1
II. Data, information and knowledge...........................................................................................1
2.1. The concept of data, information and knowledge..........................................................1
2.2. The process of transforming data into information and knowledge...............................1
2.3. Example........................................................................................................................1
III. Data evaluation using ,ethods of analysis..............................................................................1
3.1. Descriptive data analysis...............................................................................................1
3.2. Exploratory data analysis...............................................................................................1
3.3. Confirmatory data analysis............................................................................................1
3.4. Evaluate data from variety of source.............................................................................1
3.5. “Critically evaluate” “ in application”.............................................................................1
IV. Analyze and evaluate data using statistical methods............................................................1
4.1. Qualitative and quantitative data..................................................................................1
4.2. Statiscal methods..........................................................................................................2
4.2.1. Descriptive statistics...............................................................................................2
4.2.2. Inforential statistics................................................................................................2
4.2.3. Measuring association............................................................................................2
4.2.4. The differences in application between Descriptive statistics, Inferential statistics &
Measuring association......................................................................................................2
4.3. Analyze and evaluate qualitative data...........................................................................3
4.4. Analyze and evaluate quantitative data.........................................................................3
V. Conclusion.................................................................................................................................3

I. Introduction

In the age of data-driven decision making, the ability to understand, analyze, and interpret data
has become a fundamental skill across various disciplines. Probability and statistics serve as
essential tools in transforming raw data into meaningful information and actionable knowledge.
2

This subject provides a structured approach to evaluate uncertainty, quantify relationships


between variables, and support evidence-based reasoning.
This report introduces the foundational concepts of data, information, and knowledge, and
outlines how data can be systematically transformed through statistical methods. It also
explores multiple layers of data analysis—including descriptive, exploratory, and confirmatory
approaches—to extract insights from a variety of data sources. Furthermore, the report
compares and contrasts statistical methods such as descriptive statistics, inferential statistics,
and measures of association, providing critical evaluation of their applications in practice.
By integrating both qualitative and quantitative perspectives, this study aims to equip learners
with the ability to analyze and evaluate data rigorously. Ultimately, the course in probability
and statistics not only enhances analytical thinking but also lays the groundwork for informed
decision-making in fields ranging from business and economics to healthcare and social
sciences.

II. Data, information and knowledge

2.1. The concept of data, information and knowledge

Data: Data is a term used to refer to various sets of information and facts, which are collected,
stored, processed and shared to obtain useful information. Data can exist in many different
forms such as letters, numbers written on text, or bytes in computer memory, or events
recorded in human memory. (Hue, 2025)
Information: Information is the output from analyzing, contextualizing, structuring, interpreting
or otherwise processing data. Information imparts meaning and value to data. It facilitates
understanding, communication and learning, and is a key element in system design and
strategic planning, as well as in problem solving and decision making. Information brings
context to data, turning what would otherwise be meaningless content into something
understandable and usable. (Sheldon, 2024)
Knowledge: Knowledge is “information in context to create actionable understanding”.
Knowledge is the most cherished remedy for complexity and uncertainty. It is a higher level of
abstraction that resides in the human mind. It is broader, richer, and more elusive than data or
information. People seek knowledge because it helps them succeed in their work. Knowledge is
actionable (relevant) information that is available in the right format, at the right time, and in
the right place for success. It is important to remember that knowledge has different meanings
depending on the discipline in which it is used. In the context of knowledge management,
knowledge is “human understanding of a specialized area of interest that has been acquired
through learning and experience”. In a way, it is information that has been enhanced by
experience provided by the validation process over time. It is based on learning, thinking, and
familiarity with the problem area within a department, division, or across the entire company.
The focus is on sustainable competitive advantage.(Chadha, 2021)

2.2. The process of transforming data into information and knowledge


3

Nature of Data
Data is raw, unorganized facts or figures that are collected and stored. Data can be in the form
of numbers, text, images or any other type of input data.
Data itself has no context and meaning. It is the most basic form of representation and needs
further processing to become useful. (Spilker, 2024)
Transforming data into information
Define goals and problems to solve: This is the first and most important step, where you define
what needs to be achieved or what problems need to be analyzed. From here, goals or KPIs
(Key Performance Indicators) are set based on the identified problems.
Data collection: This involves collecting data from various sources, in line with the set goals.
Therefore, data tracking will ensure that the data needed for analysis is collected
systematically.
Data Processing and Verification: In this step, the data will be screened so that only accurate
and relevant data is used for analysis. This step will support better Data Mapping to understand
the structure and connections between different data.
Data Analysis: Using statistical and analytical methods to draw information and insights from
processed data. This is the stage where data analysis focuses on revealing patterns, trends and
relationships, providing insights for subsequent strategies.
Visualization and interpretation of results: Finally, the analyzed data will be presented in a
graphical form for easy understanding, making data-driven decision making smarter and more
effective. This visualization not only helps to better understand the analysis results but also aids
in making accurate strategic decisions. (Hanh, 2024)
The nature of information
Information is a form of data that has been processed and organized. It is data that has been
analyzed, structured, and given context. Information provides meaning and can be used to
answer questions or make decisions. This is the result of data being transformed into a more
meaningful and useful state. (Spilker, 2024)
Transforming into knowledge
Information creates a framework for developing knowledge, knowledge drives good decision-
making, and wisdom upholds the culture to sustain decisions. A focused effort on the
transformation of information to knowledge and knowledge to wisdom is the soul of a
successful knowledge management system. Whether your knowledge management system is
set up to organize e-mail messages or to integrate the needs of a facility's complex web of
users, the fundamentals are basically the same-the ability to organize, interpret, communicate,
and make decisions using multiple sources and bits of information.
Knowledge is information combined with experience, context, interpretation, and reflection. It
is not just the sum of what is known that is important, it is the ability to both create and
effectively use knowledge that will distinguish successful organizations in the future. Knowledge
creation is the act of taking relatively random data from across a broad spectrum and
translating that data into a meaningful insightful context.
Knowledge is more than the accumulation of random facts and data relevant to a particular
topic. Knowledge is not merely information. It is the transformation of information into a
meaningful and insightful context through study, investigation, observation, and experience.
4

Wisdom is the ability to sustain and integrate the collected knowledge through developed
cultural norms and common understandings.
Transforming information to useful knowledge that will lead to good decision making is an
awesome task. The use of technology as a storage and retrieval tool is widely discussed as the
answer to this challenge. New systems and platforms are constantly being introduced providing
greater integration, increased storage, and faster retrieval causing greater frustration,
increased confusion, and faster tempers for those responsible for these efforts.( no date)

2.3. Example

According to data from the General Statistics Office of Vietnam (Gso.gov.vn, 2024), revenue
from accommodation, food and beverage and travel services in the first quarter of 2024 grew
strongly. Specifically, revenue from accommodation and food services reached VND 174.9
trillion, up 13.4% over the same period; while revenue from travel and tourism services reached
VND 14.1 trillion, up 46.3%. At this stage, the above numbers are considered raw data, because
when standing alone, they do not fully reflect any specific meaning or trend.
To convert data into information, it is necessary to put them in a specific context. In this case,
the context is: “Revenue from accommodation, food and beverage and travel services
increased sharply in the first quarter of 2024.” From this, two important information is drawn:
First, the total revenue from the three service groups above reached about 189 trillion VND,
equivalent to an increase of 15.3% over the same period last year.
Second, the number of international visitors to Vietnam reached more than 4.6 million, an
increase of 72% compared to the first quarter of 2023 and an increase of 3.2% compared to the
same period in 2019 - the time before the Covid-19 pandemic.
From the information that has been analyzed and linked, in-depth knowledge can be formed:
that is, the Vietnamese tourism industry is entering a period of strong recovery and growth
after the pandemic, with great potential in attracting international visitors and developing
related services. This can suggest strategies for local economic development, infrastructure
investment, tourism image promotion and regional linkage expansion. (NATIONAL STATISTICS
OFFICE, no date)

III. Data evaluation using ,ethods of analysis

3.1. Descriptive data analysis

Descriptive Analytics is one of the four processes of Data Analytics. Specifically, Descriptive
Analytics is the act of collecting and arranging a business's past data in a summary form,
thereby helping the business better understand its past performance to develop the right
strategies for the future.
Some cases of applying descriptive analytics include: annual financial reports, company
personnel statistics, statistics on the number of participants of an event, etc. In which, data is
divided into 2 main types as follows:
5

Numerical data (quantitative): specific numerical data such as height, weight, number of
invoices, sales revenue, etc.
Non-numeric data (qualitative): data that reflects the nature and is difficult to determine the
average value such as color, gender, industry, etc. (Top cv, 2023)

3.2. Exploratory data analysis

Exploratory data analysis (EDA) is the process of describing data using statistical and
visualization techniques to bring important aspects of that data into focus for further analysis.
This involves examining a dataset from multiple perspectives, describing and summarizing it
without making any assumptions about its content. EDA is an important step to take before
diving into statistical or machine learning modeling, to ensure that the data is what it claims to
be and that there are no obvious errors. It should be part of data science projects in every
organization. (Pratik, 2025)

3.3. Confirmatory data analysis

Confirmatory factor analysis (CFA) is a complex statistical technique used to verify the factor
structure of a set of observed variables. It allows researchers to test the hypothesis that there is
a relationship between the observed variables and their underlying latent constructs. CFA
differs from Exploratory Factor Analysis (EFA), in that the analysis determines the structure of
the data rather than having a pre-determined structure. (Statistics Solutions, 2025)

3.4. Evaluate data from variety of source

3.5. “Critically evaluate” “ in application”

IV. Analyze and evaluate data using statistical methods

4.1. Qualitative and quantitative data

Criteria Quantitative Analysis Qualitative Analysis

Measurement, hypothesis testing, Deep understanding of behavior,


Objective
relationship determination motivation, emotion or experience

Type of data Numerical data (numbers, Non-digital data (text, audio, images,
used statistics) speech, observation)

Survey using questionnaires with


Collection In-depth interviews, focus groups,
scales, secondary statistics,
method field observations
experiments
6

Data Analysis using statistical software: Coding, categorization,


processing SPSS, R, Excel, Stata, ... content/narrative analysis

Numerical results: average,


Results in the form of descriptions,
Output results standard deviation, correlation
themes, quotes, stories
coefficient, ...

Can be generalized to the whole if


Not generalizable, only specific to
Generality the sample is large enough and
each specific context
random

High - less affected by the Lower – depends on the researcher’s


Objectivity
subjectivity of the researcher perspective and interpretation

Understanding customer emotions


Application Consumer behavior research
about the brand through in-depth
examples through survey statistics
interviews

Easy to compare, accurate, Deep understanding of complex


Advantages
reliable, can be tested phenomena, grasping the context

Lack of depth, difficult to grasp Not generalizable, time-consuming to


Limitations
specific context analyze

4.2. Statiscal methods

4.2.1. Descriptive statistics


Descriptive statistics is a method in statistics used to summarize and present important
information about data. It helps us understand the basic characteristics of the data set we are
working with. Descriptive statistics often include calculating parameters such as mean, median,
variance, standard deviation, and other characteristics of the data.(CoderSchool, 2023)
4.2.2. Inforential statistics
Inferential statistics is a branch of statistics that makes the use of various analytical tools to
draw inferences about the population data from sample data. Apart from inferential statistics,
descriptive statistics forms another branch of statistics. Inferential statistics help to draw
conclusions about the population while descriptive statistics summarizes the features of the
data set.(CUEMATH, no date)
4.2.3. Measuring association
The measures of association refer to a wide variety of coefficients (including bivariate
correlation and regression coefficients) that measure the strength and direction of the
relationship between variables; these measures of strength, or association, can be described in
several ways, depending on the analysis.(statisticssolutions, no date)
7

4.2.4. The differences in application between Descriptive statistics, Inferential statistics &
Measuring association
Aspect Descriptive Statistics Inferential Statistics Measuring Association

To summarize and To draw conclusions To identify and measure


Main
describe the main about a population relationships between
Purpose
features of a dataset based on a sample variables

Sample (used to Two or more variables


Type of Entire population or
generalize to the (either in a sample or
Data sample
population) population)

Mean, median, mode,


Confidence intervals, Correlation coefficient,
standard deviation,
Key Outputs hypothesis tests, p- odds ratio, relative risk,
frequency, percent,
values, estimations regression coefficients
charts

t-tests, chi-square
Tables, graphs, Pearson/Spearman
tests, ANOVA, z-tests,
Common histograms, measures correlation, Cramér’s V, Phi
regression (for
Tools Used of central tendency coefficient, regression
prediction or
and dispersion models
inference)

Reporting average
Testing whether a new Examining whether there is
income, showing age
Application drug improves health a relationship between
distribution, or
Example outcomes in the exercise frequency and
summarizing test
general population blood pressure
scores

SPSS, R, Stata, Python R, Python, SPSS, SAS, STATA


Statistical Excel, SPSS (basic), R
(scipy.stats, (correlation, regression, chi-
Software (summary functions)
statsmodels) square for association)

Results depend on
Cannot be used to
assumptions May not imply causation
make generalizations
Limitations (normality, even if a strong association
or predictions beyond
independence, etc.), exists
the data
sampling errors

"What could be
inferred" about the "How variables are related"
Focus "What is" in the data
population from the to each other
data
8

4.3. Analyze and evaluate qualitative data

4.4. Analyze and evaluate quantitative data

V. conclusion

In conclusion, probability and statistics play a critical role in transforming raw data into
meaningful information and actionable knowledge. Through understanding key concepts such
as descriptive, exploratory, and confirmatory data analysis, and applying both qualitative and
quantitative methods, individuals can uncover patterns, test hypotheses, and make informed
decisions.
The distinction and interplay between descriptive statistics, inferential statistics, and measures
of association highlight the importance of selecting the appropriate analytical approach
depending on the research objectives. Furthermore, the ability to critically evaluate data from
diverse sources enhances the reliability and relevance of the findings.
By mastering statistical thinking and analytical techniques, learners not only develop a strong
foundation for academic research but also gain practical tools applicable in real-world contexts
—ranging from business and economics to healthcare, education, and public policy. Ultimately,
statistics empowers us to make sense of complexity, manage uncertainty, and support
decisions with evidence.
References
Huế, N.T. (2025) Data LÀ GÌ? Vai Trò quan Trọng Của dữ liệu đối Với Doanh Nghiệp, Base Blog.
Available at: https://base.vn/blog/data-la-gi/ (Accessed: 07 July 2025).
Sheldon, R. (2024) What is information?: Definition from TechTarget, Search Data
Management. Available at:
https://www.techtarget.com/searchdatamanagement/definition/information (Accessed: 07
July 2025).
Chadha, A. (2021) Defining knowledge, information, data, PHPKB. Available at:
https://www.phpkb.com/kb/article/defining-knowledge-information-data-239.html (Accessed:
07 July 2025).
Spilker, J. (2024) Data vs information vs knowledge: What are the differences? , Tettra. Available
at: https://tettra.com/article/data-information-knowledge/ (Accessed: 07 July 2025).
Hạnh, L. (2024) Ứng dụng dữ Liệu Trong Marketing #1: Tổng Quan VỀ việc ứng dụng dữ liệu
Trong Marketing: Hạnh LÊ, Advertising Vietnam. Available at:
https://advertisingvietnam.com/ung-dung-du-lieu-trong-marketing-1-tong-quan-ve-viec-ung-
dung-du-lieu-trong-marketing-p24620 (Accessed: 07 July 2025).
Spilker, J. (2024) Data vs information vs knowledge: What are the differences? , Tettra. Available
at: https://tettra.com/article/data-information-knowledge/ (Accessed: 07 July 2025).
(No date) Erappa 2021 by-law changes online ballot. Available at:
https://www1.appa.org/FacilitiesManager/article.cfm?ItemNumber=314&parentid=198
(Accessed: 07 July 2025).
9

CƠ QUAN THỐNG KÊ QUỐC GIA (no date) Doanh Thu Dịch vụ Lưu Trú, ăn uống và lữ Hành Tăng
Cao Trong Quý I/2024, General Statistics Office of Vietnam. Available at:
https://www.nso.gov.vn/du-lieu-va-so-lieu-thong-ke/2024/04/doanh-thu-dich-vu-luu-tru-an-
uong-va-lu-hanh-tang-cao-trong-quy-i-2024/ (Accessed: 07 July 2025).
Top cv (2023) TỪ A - Z | diagnostic analytics LÀ gì, hiểu Ngay Trong 5 Phút! Available at:
https: /tuyendung.topcv.vn/bai-viet/diagnostic-analytics-la-gi/ (Accessed: 07 July 2025).
Pratik (2025) What is exploratory data analysis ?, Analytics Vidhya. Available at:
https://www.analyticsvidhya.com/blog/2021/08/exploratory-data-analysis-and-visualization-
techniques-in-data-science/ (Accessed: 07 July 2025).
Confirmatory factor analysis (CFA): A detailed overview (2025) Statistics Solutions. Available at:
https://www.statisticssolutions.com/free-resources/directory-of-statistical-analyses/
confirmatory-factor-analysis/ (Accessed: 07 July 2025).
Phương Pháp thống kê MÔ tả (descriptive statistics) LÀ Gì và áp dụng ra Sao Trong Phân Tích
dữ liệu (2023) CoderSchool. Available at: https://www.coderschool.vn/blog/phuong-phap-
thong-ke-mo-ta-descriptive-statistics-la-gi-va-ap-dung-ra-sao-trong-phan-tich-du-lieu
(Accessed: 08 July 2025).
Inferential statistics - definition, types, examples, formulas (no date) Cuemath. Available at:
https://www.cuemath.com/data/inferential-statistics/ (Accessed: 08 July 2025).
Natural Resources Biometrics (no date) Chapter 7: Correlation and Simple Linear Regression |
Natural Resources Biometrics. Available at: https://courses.lumenlearning.com/suny-natural-
resources-biometrics/chapter/chapter-7-correlation-and-simple-linear-regression/ (Accessed:
08 July 2025).
10

You might also like