APRIL 2025 51432/SU45A/SE26B
Maximum: 75 marks
Time: Three hours
PART A - (10 x 2 = 20 marks)
Answer any TEN questions, each in 30 words.
1. Define Data Science.
2. What is Big Data?
3. Define Distributed File System.
4. What is data modeling?
5. What is data preparation?
6. Define Machine Learning.
7. What is SciPy?
8. What is Hadoop?
9. What are the core principles of relational databases?
10. What is NoSQL?
11. What is data transformation?
12. What is Elasticsearch?
PART B - (5 x 5 = 25 marks)
Answer any FIVE questions, each in 200 words.
13. What are the benefits and uses of data science?
14. What are the main categories of data? Explain.
15. Write short notes on data exploration.
16. What are the applications of Machine Learning in data science?
17. How does Hadoop achieve parallelism? Explain.
18. Explain the different types of NoSQL databases.
19. Explain the data retrieval process in data science.
PART C - (3 x 10 = 30 marks)
Answer any THREE questions, each in 500 words.
20. Explain the Data Science process in detail.
21. Give an overview of the techniques used to handle missing data.
22. What are the Python tools used in Machine Learning? Explain in detail.
23. Explain the MapReduce process flow with a neat diagram.
24. Explain the presentation and automation process in data science.