@yogeshc
Created July 17, 2023 17:17
Q 1 - In a Hadoop cluster, what is true for an HDFS block that is no longer available due to disk corruption or machine failure?
B - It can be replicated from its alternative locations to other live machines.
Q 2 - Which one of the following is not a main component of HBase?
B - Nagios.
Q 3 - When a client communicates with the HDFS file system, it needs to communicate with
C - both the namenode and datanode
Q 4 - Which of the following is not a Hadoop operation mode?
B - Globally distributed mode
Q 5 - The information mapping data blocks with their corresponding files is stored in
D - Namenode
Q 6 - HDFS stands for
D - Hadoop Distributed File System.
Q 7 - The source of HDFS architecture in Hadoop originated as
A - Google File System (GFS)
Q 8 - The current limiting factor to the size of a Hadoop cluster is
C - Upper limit of the RAM in namenode
Q 9 - The namenode loses its only copy of fsimage file. We can recover this from
C - Checkpoint node
Q 10 - Which of the following technologies is a document store database?
D - CouchDB
Assignment: You are given a CSV data file containing movie names and their release years. Your assignment is as follows:
1) Divide the information into two columns: “ReleasedYear” and “MovieTitle”.
2) Identify the duplicate entries. Answer: The Expendables 3, The Expendables 2, The Expendables, Escape Plan, Last Action Hero.
3) Find out how many times the word “red” appears in a title. Answer: 4.
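The three assignment steps could be approached roughly as in the sketch below. The gist does not show the CSV itself, so the row layout ("Movie Title (YYYY)" in a single field), the sample rows, and the helper names `split_row`, `find_duplicates`, and `count_word` are all assumptions made for illustration. The sample rows are not the assignment's data, so the results will not match the answers above. The sketch also assumes "red" should be counted as a whole word, case-insensitively.

```python
import re
from collections import Counter

# Hypothetical sample rows; the real CSV's layout is not shown in the source.
# Assumed layout: one field per row, formatted as "Movie Title (YYYY)".
SAMPLE_ROWS = [
    "The Expendables (2010)",
    "Escape Plan (2013)",
    "The Expendables (2010)",
    "Red Heat (1988)",
    "Red Dawn (1984)",
    "Last Action Hero (1993)",
]

# Lazily capture the title, then a four-digit year in parentheses.
ROW_PATTERN = re.compile(r"^\s*(?P<title>.+?)\s*\((?P<year>\d{4})\)\s*$")


def split_row(row):
    """Step 1: split one row into a (ReleasedYear, MovieTitle) pair."""
    match = ROW_PATTERN.match(row)
    if match is None:
        raise ValueError(f"unexpected row format: {row!r}")
    return int(match.group("year")), match.group("title")


def find_duplicates(records):
    """Step 2: return each record that occurs more than once."""
    counts = Counter(records)
    return [record for record, n in counts.items() if n > 1]


def count_word(records, word):
    """Step 3: count whole-word, case-insensitive occurrences in titles."""
    pattern = re.compile(rf"\b{re.escape(word)}\b", re.IGNORECASE)
    return sum(len(pattern.findall(title)) for _, title in records)


records = [split_row(row) for row in SAMPLE_ROWS]
duplicates = find_duplicates(records)
red_count = count_word(records, "red")

print(records)
print(duplicates)
print(red_count)
```

To run this against the actual file, the rows would be read with the standard `csv` module instead of `SAMPLE_ROWS`, and `ROW_PATTERN` adjusted to whatever layout the file really uses. If "red" is meant to match as a substring (for example inside "Hundred"), drop the `\b` word boundaries.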