Question

    In Hadoop’s distributed storage system, which of the following is responsible for managing and storing the actual data blocks?

    A Job Tracker Correct Answer Incorrect Answer
    B NameNode Correct Answer Incorrect Answer
    C DataNode Correct Answer Incorrect Answer
    D YARN Correct Answer Incorrect Answer
    E HDFS Client Correct Answer Incorrect Answer

    Solution

    DataNode in Hadoop's HDFS (Hadoop Distributed File System) is responsible for managing and storing the actual data blocks. DataNodes perform read/write operations on the data blocks as directed by the NameNode and report back with the status of the operations. a) Job Tracker : The Job Tracker manages the jobs and distributes tasks to various TaskTrackers, not data blocks. b) NameNode : The NameNode manages the metadata and directory structure of HDFS but does not store data itself. d) YARN : YARN (Yet Another Resource Negotiator) is responsible for cluster resource management and job scheduling, not data storage. e) HDFS Client : The HDFS Client is used to interact with the HDFS, not to store or manage data blocks.

    Practice Next