2024
DOI: 10.31219/osf.io/9y5ds
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Multimodal Integration in Large Language Models: A Case Study with Mistral LLM

Nuraini Sulaiman,
Farizal Hamzah

Abstract: This work presents significant advancements in the multimodal capabilities of the Mistral 8x7B model, a large language model designed with eight experts of seven billion parameters each. We introduce comprehensive modifications to its architecture, data fusion techniques, and training procedures, aimed at improving the integration and processing of text, image, and audio data. Our experimental results demonstrate that these enhancements lead to superior performance across multiple modalities when compared to e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 13 publications
0
0
0
Order By: Relevance