We consider the problem of optimally allocating a fixed budget to construct a test collection with associated relevance judge that it can (i) accurately evaluate the relative performance of the participating systems, and (ii) systems. We propose a two stage approach. For a given set of queries, we adopt the traditional pooling method and use a porti budget to evaluate a set of documents retrieved by the participating systems. Next, prioritize the queries and associated documents for further refinement of the test collection. Our objective is to increase t the test collection for comparative evaluation and extendibilit optimization problem, thereby permitting efficient solution and providing a flexible framework to incorporate various constra the remaining budget to evaluate query-document p phase are expended during the construction of the test collection and consider only the documents that have been retrieved by participating systems. We evaluate our resource optimization approach on two TREC test collections namely TREC 8 and TREC 2004 Robust Track. We demonstrate that our optimization techniques are cost efficient and yield a significant improvement in the r the test collections. We consider the problem of optimally allocating a fixed budget to construct a test collection with associated relevance judge that it can (i) accurately evaluate the relative performance of the participating systems, and (ii) generalize to new, previously unseen systems. We propose a two stage approach. For a given set of queries, we adopt the traditional pooling method and use a porti budget to evaluate a set of documents retrieved by the participating systems. Next, we analyze the relevance judgments in order to prioritize the queries and associated documents for further refinement of the test collection. Our objective is to increase t the test collection for comparative evaluation and extendibility to new systems. The query prioritization is formulated as a convex optimization problem, thereby permitting efficient solution and providing a flexible framework to incorporate various constra document pairs with the highest priority scores. The budgets for the initial and the refinement phase are expended during the construction of the test collection and consider only the documents that have been retrieved by esource optimization approach on two TREC test collections namely TREC 8 and TREC 2004 Robust Track. We demonstrate that our optimization techniques are cost efficient and yield a significant improvement in the r
Prioritizing Relevance Judgments to Improve the IR Test CollectionsWe consider the problem of optimally allocating a fixed budget to construct a test collection with associated relevance judgements, such generalize to new, previously unseen systems. We propose a two stage approach. For a given set of queries, we adopt the traditional pooling method and use a portion of the we analyze the relevance judgments in order to prioritize the queries and associated documents for further refinement of the test collection....