Explainable Multimedia Feature Fusion for Medical Applications

Wagenpfeil, Stefan; Mc Kevitt, Paul; Cheddad, Abbas; Hemmje, Matthias

doi:10.3390/jimaging8040104

Cited by 4 publications

(6 citation statements)

References 34 publications


“…The extracted features are contributed to the MMFG, which can be further processed. Extensions of MMFGs have led to semantic analysis, such as Semantic Multimedia Feature Graphs (SMMFGs) and Explainable SMMFGs (ESMMFGs) [24]. Despite these extensions, the graph-based structure of MMFGs remains and can lead to slow processing times.…”

Section: Multimedia Features and Multimedia Feature Graphs

mentioning

confidence: 99%

“…With the introduction of semantics to the MMFGs in [24], we introduced additional metrics to improve the efficiency and effectiveness of Graph Codes for MMIR. First, the feature discrimination M_DIS is defined as the difference in the number of nonzero Graph Code fields for two feature vocabulary terms of a given Graph Code or Semantic Graph Code.…”

Section: Graph Codes and Algorithms

mentioning

confidence: 99%
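
The feature discrimination metric quoted above can be illustrated with a minimal sketch. It rests on assumptions that the excerpt does not confirm: the Graph Code is taken to be a square matrix indexed by vocabulary terms, the "fields for a term" are taken to be that term's row and column (diagonal counted once), and the difference is taken as an absolute value. The function names `nonzero_fields` and `feature_discrimination` and the sample matrix are hypothetical.

```python
from typing import List

Matrix = List[List[int]]


def nonzero_fields(graph_code: Matrix, term: int) -> int:
    """Count nonzero fields in the row and column of `term`.

    Assumption: a term's fields are its row and its column in the
    matrix; the diagonal cell is shared and is counted only once.
    """
    row = sum(1 for value in graph_code[term] if value != 0)
    col = sum(1 for r in graph_code if r[term] != 0)
    shared = 1 if graph_code[term][term] != 0 else 0
    return row + col - shared


def feature_discrimination(graph_code: Matrix, term_a: int, term_b: int) -> int:
    """Absolute difference of nonzero-field counts (assumed reading of M_DIS)."""
    return abs(nonzero_fields(graph_code, term_a) - nonzero_fields(graph_code, term_b))


# Hypothetical 3-term vocabulary: 0 = "person", 1 = "hat", 2 = "dog".
example_code: Matrix = [
    [1, 2, 0],
    [0, 1, 0],
    [0, 0, 1],
]
m_dis_person_dog = feature_discrimination(example_code, 0, 2)
m_dis_person_hat = feature_discrimination(example_code, 0, 1)
```

Under this reading, a term that takes part in more relationships has more nonzero fields, so a larger value separates a highly connected term from an isolated one.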

“…This method has been used in similar cases [55]. Another approach is to employ the previously described Feature Relevance [24]. Graph Codes with only relevant features should increase the performance with a minimal loss of accuracy.…”

Section: Potential Parallelization On Modern Processors

mentioning

confidence: 99%


Parallelization Strategies for Graph-Code-Based Similarity Search

Steinert, Wagenpfeil, Kevitt, et al. 2023. BDCC

Self Cite


The volume of multimedia assets in collections is growing exponentially, and the retrieval of information is becoming more complex. The indexing and retrieval of multimedia content is generally implemented by employing feature graphs. Feature graphs contain semantic information on multimedia assets. Machine learning can produce detailed semantic information on multimedia assets, reflected in a high volume of nodes and edges in the feature graphs. While increasing the effectiveness of the information retrieval results, the high level of detail and also the growing collections increase the processing time. Addressing this problem, Multimedia Feature Graphs (MMFGs) and Graph Codes (GCs) have been proven to be fast and effective structures for information retrieval. However, the huge volume of data requires more processing time. As Graph Code algorithms were designed to be parallelizable, different paths of parallelization can be employed to prove or evaluate the scalability options of Graph Code processing. These include horizontal and vertical scaling with the use of Graphic Processing Units (GPUs), Multicore Central Processing Units (CPUs), and distributed computing. In this paper, we show how different parallelization strategies based on Graph Codes can be combined to provide a significant improvement in efficiency. Our modeling work shows excellent scalability with a theoretical speedup of 16,711 on a top-of-the-line Nvidia H100 GPU with 16,896 cores. Our experiments with a mediocre GPU show that a speedup of 225 can be achieved and give credence to the theoretical speedup. Thus, Graph Codes provide fast and effective multimedia indexing and retrieval, even in billion-scale use cases.



“…"Smart MMIR" thus describes expressive, scalable, interoperable, explainable and human understandable MMIR solutions. In previous work [2][3][4], we already introduced, defined, and evaluated the core components, which contribute to Smart MMIR. However, the interoperability of these components and a corresponding formal model is a foundation for further improvements in the problem areas, which were mentioned above.…”

mentioning

confidence: 99%

“…In [3], we further showed that not only feature graphs, but also the indexing structures, such as, for example, graph codes, can be automatically transformed into human-understandable texts. Based on this, further metrics for semantic graph codes were introduced [4] as follows:…”

mentioning

confidence: 99%

Smart Multimedia Information Retrieval

Wagenpfeil, Kevitt, Hemmje 2023. Analytics

Self Cite


The area of multimedia information retrieval (MMIR) faces two major challenges: the enormously growing number of multimedia objects (i.e., images, videos, audio, and text files), and the fast increasing level of detail of these objects (e.g., the number of pixels in images). Both challenges lead to a high demand of scalability, semantic representations, and explainability of MMIR processes. Smart MMIR solves these challenges by employing graph codes as an indexing structure, attaching semantic annotations for explainability, and employing application profiling for scaling, which results in human-understandable, expressive, and interoperable MMIR. The mathematical foundation, the modeling, implementation detail, and experimental results are shown in this paper, which confirm that Smart MMIR improves MMIR in the area of efficiency, effectiveness, and human understandability.
