Abstract:Standard models in stochastic resource allocation concern the economic processing of all jobs in some set J. We consider a set up in which tasks in various subsets of J are deemed to be alternative to one another, in that only one member of such a subset of alternative tasks will be completed during the evolution of the process. Existing stochastic scheduling methodology for single-machine problems is developed and extended to this novel class of models. A major area of application is in research planning.
“…From Theorem 1 of Glazebrook and Fay (1987), a result for the independence case, we deduce from (6) and 7that…”
Section: Optimal Policies For Two-job Problemsmentioning
confidence: 87%
“…Here a (discounted) reward R, is earned upon the jth job completion, j = 1, 2. For examples of-such models, see Glazebrook and Fay (1987). In (2) E:J( is an expectation conditional upon the application of policy n: Note that we shall introduce job-specific reward models in Section 4.…”
Section: A Class Of Two-job Single-machine Stochastic Scheduling Probmentioning
confidence: 99%
“…their own right. Glazebrook and Fay (1987) study such models for the case of independent processing times. They also cite examples from research planning in which the jobs are alternative routes to some research objective, only one of which needs to be completed successfully, see Ritchie (1972).…”
A single machine is available to process a collection of stochastic jobs preemptively. Rewards are received at job completions. We seek policies for machine allocation which maximize the total reward. Application areas point to the need to study such models for resource allocation when job processing requirements are dependent. To this end, models are developed in which the nature of such dependence is derived from various notions of positive and negative dependence in common usage in reliability. Optimal policies for resource allocation of simple structure are obtained for a variety of such models.
“…From Theorem 1 of Glazebrook and Fay (1987), a result for the independence case, we deduce from (6) and 7that…”
Section: Optimal Policies For Two-job Problemsmentioning
confidence: 87%
“…Here a (discounted) reward R, is earned upon the jth job completion, j = 1, 2. For examples of-such models, see Glazebrook and Fay (1987). In (2) E:J( is an expectation conditional upon the application of policy n: Note that we shall introduce job-specific reward models in Section 4.…”
Section: A Class Of Two-job Single-machine Stochastic Scheduling Probmentioning
confidence: 99%
“…their own right. Glazebrook and Fay (1987) study such models for the case of independent processing times. They also cite examples from research planning in which the jobs are alternative routes to some research objective, only one of which needs to be completed successfully, see Ritchie (1972).…”
A single machine is available to process a collection of stochastic jobs preemptively. Rewards are received at job completions. We seek policies for machine allocation which maximize the total reward. Application areas point to the need to study such models for resource allocation when job processing requirements are dependent. To this end, models are developed in which the nature of such dependence is derived from various notions of positive and negative dependence in common usage in reliability. Optimal policies for resource allocation of simple structure are obtained for a variety of such models.
“…We now apply the theory of Section 2 to a model first discussed by Glazebrook and Fay (1987). In this example of MDPs in parallel, each constituent MDP may be thought of as an R&D project with two phases.…”
Section: Scheduling Alternative Stochastic Tasksmentioning
confidence: 99%
“…In Section 2 we give more explicit expression to the ideas described here and then proceed to give an upper bound for the loss in total expected reward incurred when making the above reduction. These results are applied in Sections 3 and 4 to two classes of decision processes previously discussed by Glazebrook (1979) and Glazebrook and Fay (1987) respectively.…”
Whittle enunciated an important reduction principle in dynamic programming when he showed that under certain conditions optimal strategies for Markov decision processes (MDPs) placed in parallel to one another take actions in a way which is consistent with the optimal strategies for the individual MDPs. However, the necessary and sufficient conditions given by Whittle are by no means always satisfied. We explore the status of this computationally attractive reduction principle when these conditions fail.
Nash's generalization of Gittins’ classic index result to so-called generalized bandit problems (GBPs) in which returns are dependent on the states of all arms (not only the one which is pulled) has proved important for applications. The index theory for special cases of this model in which all indices are positive is straightforward. However, this is not a natural restriction in practice. An earlier proposal for the general case did not yield satisfactory index-based suboptimality bounds for policies — a central feature of classical Gittins index theory. We develop such bounds via a notion of duality for GBPs which is of independent interest. The index which emerges naturally from this analysis is the reciprocal of the one proposed by Nash.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.