0% found this document useful (0 votes)
326 views3 pages

Part B Questions

This document contains a question bank for the course "Big Data" covering 5 units: Unit 1 covers classification of digital data, challenges with big data, the 5 V's of big data, definitions of big data analytics, challenges facing big data analytics, and differences between parallel and distributed systems. Unit 2 compares reporting and analysis, discusses advanced, operationalized, and monetized analytics, skill requirements for analysts, analytical approaches, and analytical tools. Unit 3 covers MapReduce, Hadoop architecture including HDFS, ensuring data integrity, streaming access, and metadata. Unit 4 discusses NoSQL databases, differences between SQL and NoSQL, Hadoop architecture and components, Hadoop distributions, and HDF

Uploaded by

sangeetha
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
326 views3 pages

Part B Questions

This document contains a question bank for the course "Big Data" covering 5 units: Unit 1 covers classification of digital data, challenges with big data, the 5 V's of big data, definitions of big data analytics, challenges facing big data analytics, and differences between parallel and distributed systems. Unit 2 compares reporting and analysis, discusses advanced, operationalized, and monetized analytics, skill requirements for analysts, analytical approaches, and analytical tools. Unit 3 covers MapReduce, Hadoop architecture including HDFS, ensuring data integrity, streaming access, and metadata. Unit 4 discusses NoSQL databases, differences between SQL and NoSQL, Hadoop architecture and components, Hadoop distributions, and HDF

Uploaded by

sangeetha
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

QUESTION BANK 2019

VSB ENGINEERING COLLEGE


DEPARTMENT OF IT
QUESTION BANK – PART B
Subject with Code : Big Data (CS8091) Course & Branch: B.Tech.,
Sem: III-B.Tech & VI-Sem Regulation: R17

UNIT –I
1. Elaborate about the Classification of Digital Data. Explain?
2. Discuss about challenges with Big Data?
3. Explain about 5v's?
4. What is Big Data Analytics?
5. What Big Data Analytics isn't?
6. Explain about classification of Analytics?
7. Brief about top challenges facing big data?
8. Discuss why is big data analytics important?
9. Explain about Data Science?
10. Explain the difference between parallel and distributed system?
11. Explain CAP Theorem?

UNIT 2
1. Compare Reporting and Analysis with its process.
2. Explain the following
a. Advanced analytics
b. Operationalized analytics
c. Monetized analytics
3. How to develop an analytical team and what is the skill required for an analyst?
4. Distinguish statistical significance and business importance.
5. What are the roles of analytical team and IT team with a detailed note on text analysis?
6. Explain in detail the commonly used analytical approaches?
7. Discuss in detail the history of analytical tools.
8. How analytical tools have evolved from graphical user interfaces to point solutions to data
visualization tools?

Big Data (CS8091) Page 1


QUESTION BANK 2019

9. Give a detailed note on features and limitations of R programming and IBM SPSS.
10. Explain in detail the following
a. SAS
b. Compare various analytical tools.

UNIT 3
1. Brief about the main feature of MapReduce.
2. Describe the working of Map reduce with an relevant example.
3. Discuss the techniques which is used to optimize the map reduce jobs.
4. Discuss the points to be considered while designing a file system in mapreduce.
5. What is HBASE? Give detailed note on features of HBASE.
6. Write a short note on the Hadoop ecosystem and HDFS archiecture.
7. How does HDFS ensure data integrity in a Hadoop cluster?
8. Discuss the following terms
a.Streaming information access.
b.Low latency information access.
c.Rest and thrift
d.org.apcahe.hadoop.io.package
9. What is Meta data? What information does it provide and explain the role of Namenode in a HDFS
clusters?
10. Define Command line interface using HDFS files and give a brief note on Hadoop-specific file
system types and HDFS commands.

UNIT 4

1. What is NoSQL? What are the advantages of NoSQL? And Explain types of NoSQL Databases?
2. Differentiate between SQL vs NoSQL?
3. What is NewSQL? Differentiate between NewSQL and NoSQL?
4. With Neat sketch explain in detail Hadoop architecture and its components?
5. a) List hadoop distributions
b) Compare Hadoop vs SQL
6. With neat sketch explain HDFS?
7. With neat sketch explain processing data with Hadoop?
8. Explain in detail interacting with Hadoop Ecosystem?

Big Data (CS8091) Page 2


QUESTION BANK 2019

9. List and Explain HDFC commands?


10. What are the limitations of Hadoop 1.0? Explain Hadoop 2: HDFS and Hadoop 2: YARN?

UNIT 5
1. List some key elements of social media.
2. Describe the steps to perform text mining.
3. Discuss some commonly used text mining software.
4. List some common online tools used to perform sentiment analysis.
5. What do you understand by sentiment analysis?
6. Discuss some application areas of mobile analytics.
7. Briefly explain some popular mobile analytics tools available in the market.
8. What is the importance of location –based tracking tools?
9. Discuss the necessity of keeping data secure while conducting analytics.
10. Discuss some fields where mobile analytics can be used.

Big Data (CS8091) Page 3

You might also like