Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

*X86564*

Reg. No. :

Question Paper Code : X86564


M.E./M.Tech. Degree Examinations, April/may 2021
Second Semester
Computer Science and Engineering
CP5293 – big data analytics
(Common to M.E. Mobile and Pervasive Computing/M.E. Software
Engineering/M.Tech. Information Technology)
(Regulations 2017)

Time : Three Hours Maximum : 100 Marks

Answer all questions

Part – a (10×2=20 Marks)

1. What is Big Data ?

2. List out the data analytics tools.

3. Define Map reduce.

4. How big data and Hadoop related to each other ?

5. Distinguish between correlation and regression.

6. What is R in programming language ?

7. List out the issues in stream processing.


8. What is sampling data in a stream ?

9. What is NoSQL ?

10. Identify in which pig programs can be executed.

Part – B (5×13=65 Marks)

11. a) Explain in detail about the challenges of conventional system.

(OR)
b) Discuss about the tools, trends and technology in big data.
X86564 *X86564*

12. a) Discuss the features of Apache Hadoop in detail with neat diagram.
(OR)

b) i) Describe Map Reduce framework in detail. Draw the architectural


diagram for physical organization of compute nodes. (7)
ii) What are the different configuration files in Hadoop ? (6)

13. a) Describe the various hierarchical methods of cluster analysis.


(OR)
b) Explain the K-means clustering algorithm with an example.

14. a) What are streams ? Explain stream data model with its architecture.
(OR)
b) Discuss about decaying window in detail.

15. a) List the classification of NoSQL Databases and explain about Key Value Stores.
(OR)
b) What is HBase ? Discuss about features of HBase in detail.

Part – C (1×15=15 Marks)

16. a) State the significances of MapReduce and discuss about Hadoop distributed
file system architecture with neat diagram.

(OR)
b) Analyze the use of Hive. How does Hive interact with Hadoop explain in
detail ?

–––––––––––––

You might also like