Skip to content
Most important Questions unit wise
Unit : 1
- What is Big data define big data platform & characteristic of Big Data?
- Compare Structure & Instructure data to each other.
- Define Importance & Architecture of big data.
- What is Big data Security & What are the step for Synchronizing big data compare conditional database with intelligent data tools.
- Define different type of data analytic tools.
Unit : 2
- Define Apache Hadoop with the short history of Hadoop.
- What is HDFS & Define the working HDFS.
- Describe architecture of Hadoop & also describe the Component use in Apache Hadoop.
- Write a short note on following topics : (i) Hadoop streaming (ii) Hadoop Pipe (iii) Scaling out (iv) Hadoop ecosystem.
- Explain different Phases of Map Reduce.
- Explain with example of Real world of MapReduce & also explain Input & output format.
Unit : 3
- Describe the design concept of HDFS also explain it’s benefit & challenges.
- Explain how HDFS can read & write file.
- Write short note on following topics: (i) Name Node (ii) Data Node (iii) Block Node (iv) HDFS federation.
- What is block abstraction.
- Explain Sqoop, flume and Avro.
Unit : 4
- Write a short notes on MR version1, MR version 2.
- Write a short note on Advantage & disadvantage of NoSQL.
- What Is MangoDB. Explain update & delete document in MangoDB.
- What is a spark .
- Short notes on : (i) Spark application (ii)Job (iii) stages and task
- What is Scala & how to use classes, fields method in scala.
Unit : 05
- What are various data processing in operator in Pig & also Compare b/w Pig Latin & SQL.
- What is Apache Hive & also explain its Architecture.
- Compare Hive with traditional database.
- Explain working of Hive with Hadoop & also explain. its services.
- What is H-Base Concept & Compare H-Base with RDBMS.
- What is Zookeeper & How it help in monitor a cluster.
- What is IBM big data strategy & Pig SQL.