Facilitating CBR for Incompletely-Described Cases: Distance Metrics for Partial Problem Descriptions

Bogaerts, Steven; Leake, David

doi:10.1007/978-3-540-28631-8_6

Cited by 27 publications

(16 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Alternatively, when the variability is high, feature values are distributed across the respective dimension, the organization of cases is sparse, and the retrieval capabilities are hampered. In order to show the correlation between the statistical properties of the data and the retrieval performance, we computed the values of pairs case , and two variability factors for the Free-Text and Numeric features 7 . var FT of a corpus was computed as a weighted average of the numbers of possible values of all the features that appear in a corpus.…”

Section: Discussionmentioning

confidence: 99%

“…This metric can be characterized as a pessimistic metric [7], as it requires exact matching of the values of the given feature. This pessimistic metric can be substituted, and softened metrics may be applied for non-matching values, e.g., assigning an average distance between the features, or a predefined value 0<x<1 to the distance between non-matching values within the given features [7].…”

Section: Similarity Metricsmentioning

confidence: 99%

“…This pessimistic metric can be substituted, and softened metrics may be applied for non-matching values, e.g., assigning an average distance between the features, or a predefined value 0<x<1 to the distance between non-matching values within the given features [7]. Although we admit that…”

Section: Similarity Metricsmentioning

confidence: 99%

“…Assume that the case c t is mapped to a location (5,6,7) in the hypercube. Then the retrieval will compare the c t with the candidate cases in the locations (*, 6, 7), (5, *, 7), and (5, 6, *), where * denotes any possible value in the respective dimension.…”

Section: Retrieved-cases = Retrieved-cases  {Test-case} (10) Return mentioning

confidence: 99%

“…For example, to accomplish the comparison with all (*, 6, 7) cases, a node (5,6,7), storing the target case c t , queries his immediate logical neighbors, i.e., sends the description of c t to the nodes (4,6,7) and (6,6,7). Upon receiving this query, these nodes autonomously perform two operations: (i) forward the description of the target case c t to their next logical neighbors, i.e., nodes (3,6,7) and (7,6,7), and (ii) compute the similarity between the target case c t and the cases stored in their nodes, and back propagate the similarity value to the user storing the target case c t on the same route by which the query was received. As a result of the pure distributed P2P communication middleware of UNSO, the propagated similarity computation both parallelizes the retrieval of most similar cases and distributes the required computational overhead among the MLH nodes (i.e., the respective users managing these nodes), actually storing the cases stored in the case base.…”

Section: Retrieved-cases = Retrieved-cases  {Test-case} (10) Return mentioning

confidence: 99%

See 4 more Smart Citations

P2P case storage and retrieval with an unspecified ontology

2007

View full text Add to dashboard Cite

Abstract. Traditional similarity-based retrieval of structured data, as in CaseBased Reasoning (CBR) approaches, has been largely implemented using centralized storage systems. In such systems, when the cases or records contain both numeric and symbolic attributes, similarity-based retrieval cannot exploit standard speedup techniques based on multi-dimensional indexing, and the retrieval is implemented by an exhaustive comparison of the case to be solved with the whole set of stored cases. In this work, to improve the performance of the case retrieval step and build CBR systems that can scale up to large case bases, we propose a novel approach for storage of the case base in a decentralized Peer-to-Peer environment using the notion of Unspecified Ontology. We also develop an algorithm for efficient retrieval of approximated most-similar cases, that exploits inherent characteristics of the unspecified ontology in order to improve the performance of the case retrieval step. The experiments show that the algorithm successfully retrieves cases that are very close to the mostsimilar cases, while reducing the number of cases to be compared. Hence, it improves the performance of the retrieval step, the first stage of the CBR problem solving cycle. Moreover, the distributed nature of our approach eliminates the need for a centralized server that not only becomes a computational bottleneck, but is also a single point of failure. Abstract. Traditional similarity-based retrieval of structured data, as in CaseBased Reasoning (CBR) approaches, has been largely implemented using centralized storage systems. In such systems, when the cases or records contain both numeric and symbolic attributes, similarity-based retrieval cannot exploit standard speedup techniques based on multi-dimensional indexing, and the retrieval is implemented by an exhaustive comparison of the case to be solved with the whole set of stored cases. In this work, to improve the performance of the case retrieval step and build CBR systems that can scale up to large case bases, we propose a novel approach for storage of the case base in a decentralized Peer-to-Peer environment using the notion of Unspecified Ontology. We also develop an algorithm for efficient retrieval of approximated most-similar cases, that exploits inherent characteristics of the unspecified ontology in order to improve the performance of the case retrieval step. The experiments show that the algorithm successfully retrieves cases that are very close to the mostsimilar cases, while reducing the number of cases to be compared. Hence, it improves the performance of the retrieval step, the first stage of the CBR problem solving cycle. Moreover, the distributed nature of our approach eliminates the need for a centralized server that not only becomes a computational bottleneck, but is also a single point of failure. Manuscript

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Similarity Metricsmentioning

confidence: 99%

Section: Similarity Metricsmentioning

confidence: 99%

Section: Retrieved-cases = Retrieved-cases  {Test-case} (10) Return mentioning

confidence: 99%

Section: Retrieved-cases = Retrieved-cases  {Test-case} (10) Return mentioning

confidence: 99%

See 3 more Smart Citations

P2P case storage and retrieval with an unspecified ontology

2007

View full text Add to dashboard Cite

show abstract

CBR for Modeling Complex Systems

Weber

Proctor

Waldstein

et al. 2005

Case-Based Reasoning Research and Development

View full text Add to dashboard Cite

Evaluating CBR Systems Using Different Data Sources: A Case Study

Aamodt

2006

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. The complexity and high construction cost of case bases make it very difficult, if not impossible, to evaluate a CBR system, especially a knowledge-intensive CBR system, using statistical evaluation methods on many case bases. In this paper, we propose an evaluation strategy, which uses both many simple case bases and a few complex case bases to evaluate a CBR system, and show how this strategy may satisfy different evaluation goals. The identified evaluation goals are classified into two categories: domain-independent and domain-dependent. For the evaluation goals in the first category, we apply the statistical evaluation method using many simple case bases (for example, UCI data sets); for evaluation goals in the second category, we apply different, relatively weak, evaluation methods on a few complex domain-specific case bases. We apply this combined evaluation strategy to evaluate our knowledge-intensive conversational CBR method as a case study.

show abstract

Facilitating CBR for Incompletely-Described Cases: Distance Metrics for Partial Problem Descriptions

Cited by 27 publications

References 14 publications

P2P case storage and retrieval with an unspecified ontology

P2P case storage and retrieval with an unspecified ontology

CBR for Modeling Complex Systems

Evaluating CBR Systems Using Different Data Sources: A Case Study

Contact Info

Product

Resources

About