Hadoop Test Questions & Answers

What project was Hadoop originally a part of, and what idea was that project based on? - >>>> Nutch. It was based on returning web search results faster by distributing ... data and calculations across different computers.

Fill in the blank. The solution to cataloging the increasing number of web pages in the late 1990s and early 2000s was _______. - >>>> Automation

T/F: The Cassandra File System has many advantages over HDFS, but simpler deployment is not one of them. - >>>> False

Name three features of Hive. - >>>> HiveQL, indexing, different storage types

Describe how Sqoop transfers data from a relational database to Hadoop. - >>>> It runs a query on the relational database and exports the results into files in a variety of formats. The files are then saved on HDFS.

List three drawbacks of using Hadoop. - >>>> It does not work well with small amounts of data, MapReduce is difficult to implement or understand, and it does not guarantee atomic transactions.

Explain why the performance of the join operation in Hadoop is inefficient. - >>>> Hadoop does not have indices, so the entire dataset is copied in the join operation.

T/F: Hadoop is not recommended for a company with a small amount of data, but it is highly recommended if this data requires instant analysis. - >>>> False

Fill in the blank. Hadoop lacks the notions of ________ and _______. Therefore, the analyzed result generated by Hadoop may or may not be 100% accurate. - >>>> Transaction consistency, recovery checkpoint

T/F: The main benefit of HadoopDB is that it is more scalable than Hadoop while maintaining the same performance level on structured data analysis workloads. - >>>> False

Which of the following is NOT a drawback of Hadoop? A) inefficient join operation B) security issue C) does not optimize query for user D) high cost E) MapReduce is difficult to implement - >>>> D

The command for removing a file from Hadoop recursively is: hadoop dfs ___________ <directory> - >>>> -rmr

The __________ holds the data in the HDFS and the application connects with the __________ to send and retrieve data from the cluster. - >>>> Datanode, Namenode

T/F: Hadoop is open source. - >>>> True

Which of the following is NOT true: A) Hadoop is decentralized. B) Hadoop is distributed. C) Hadoop is open source. D) Hadoop is highly scalable. - >>>>
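Study note on the join question above: the answer says that, without indices, the entire dataset takes part in the join. The toy sketch below mimics a reduce-side join in plain Python to show that effect. It does not use any Hadoop API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `reduce_side_join`) and the sample rows are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def map_phase(records, tag, key_field):
    """Emit (join_key, (tag, record)) for every record; none can be skipped."""
    for record in records:
        yield record[key_field], (tag, record)


def shuffle(pairs):
    """Group every emitted pair by its join key, as a shuffle step would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """For each key, combine every left record with every right record."""
    joined = []
    for values in groups.values():
        left = [rec for tag, rec in values if tag == "L"]
        right = [rec for tag, rec in values if tag == "R"]
        for l_rec in left:
            for r_rec in right:
                joined.append({**l_rec, **r_rec})
    return joined


def reduce_side_join(left_rows, right_rows, key_field):
    """Return the joined rows and the number of records that were shuffled."""
    pairs = list(map_phase(left_rows, "L", key_field))
    pairs += list(map_phase(right_rows, "R", key_field))
    return reduce_phase(shuffle(pairs)), len(pairs)


# Hypothetical sample data: only uid 1 appears in both inputs.
users = [{"uid": 1, "name": "ann"}, {"uid": 2, "name": "bob"}]
orders = [{"uid": 1, "item": "disk"}, {"uid": 3, "item": "cable"}]

rows, shuffled = reduce_side_join(users, orders, "uid")
print(rows)      # joined rows
print(shuffled)  # records moved through the shuffle
```

Only one row matches, yet all four input records pass through the shuffle. An index would let a database look up matching keys directly instead of moving every record.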
Last updated: 3 years ago
About the document
Uploaded On
Oct 30, 2022
Number of pages
4