Tuesday, 21 January 2014

Capacitive Scheduling in Hadoop


Capacity Scheduler
·        The capacity scheduler shares some of the principles of the fair scheduler but has distinct differences, too.

·        First, capacity scheduling was defined for large clusters, which may have multiple, independent consumers and target applications. For this reason, capacity scheduling provides greater control as well as the ability to provide a minimum capacity guarantee and share excess capacity among users. The capacity scheduler was developed by Yahoo!.

·        In capacity scheduling, instead of pools, several queues are created, each with a configurable number of map and reduce slots. Each queue is also assigned a guaranteed capacity (where the overall capacity of the cluster is the sum of each queue's capacity).


·        Queues are monitored; if a queue is not consuming its allocated capacity, this excess capacity can be temporarily allocated to other queues. Given that queues can represent a person or larger organization, any available capacity is redistributed for use by other users.

·        Another difference of fair scheduling is the ability to prioritize jobs within a queue. Generally, jobs with a higher priority have access to resources sooner than lower-priority jobs. The Hadoop road map includes a desire to support pre-emption (where a low-priority job could be temporarily swapped out to allow a higher-priority job to execute), but this functionality has not yet been implemented.


·        Another difference is the presence of strict access controls on queues (given that queues are tied to a person or organization). These access controls are defined on a per-queue basis. They restrict the ability to submit jobs to queues and the ability to view and modify jobs in queues.




2 comments:

  1. may i know on what basis does the queues are given capacity, is it a static or dynamic queue and what happens if all the queues are full n a new job enters where will it wait??

    ReplyDelete
  2. This is real to useful blogs and it has to be more useful innovations ideas from your blog sites Selenium Automation Training.We'll share keep it many numbers of useful blogs techniques. Eagerly waiting for your new blog and useful information. Keep doing more.Selenium Online Training

    ReplyDelete