Taming Simulators: Challenges, Pathways and Vision for the Alignment of Large Language Models

Bereska, Leonard; Gavves, Efstratios

doi:10.1609/aaaiss.v1i1.27478

AAAI-SS

2023

DOI: 10.1609/aaaiss.v1i1.27478

|View full text |Cite

Taming Simulators: Challenges, Pathways and Vision for the Alignment of Large Language Models

Leonard Bereska,

Efstratios Gavves

Abstract: As AI systems continue to advance in power and prevalence, ensuring alignment between humans and AI is crucial to prevent catastrophic outcomes. The greater the capabilities and generality of an AI system, combined with its development of goals and agency, the higher the risks associated with misalignment. While the concept of superhuman artificial general intelligence is still speculative, language models show indications of generality that could extend to generally capable systems. Regarding agency, this pap… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

References 13 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Towards Cognition-Aligned Visual Language Models via Zero-Shot Instance Retrieval

Ma,

Organisciak,

et al. 2024

Electronics

View full text Add to dashboard Cite

The pursuit of Artificial Intelligence (AI) that emulates human cognitive processes is a cornerstone of ethical AI development, ensuring that emerging technologies can seamlessly integrate into societal frameworks requiring nuanced understanding and decision-making. Zero-Shot Instance Retrieval (ZSIR) stands at the forefront of this endeavour, potentially providing a robust platform for AI systems, particularly large visual language models, to demonstrate and refine cognition-aligned learning without the need for direct experience. In this paper, we critically evaluate current cognition alignment methodologies within traditional zero-shot learning paradigms using visual attributes and word embedding generated by large AI models. We propose a unified similarity function that quantifies the cognitive alignment level, bridging the gap between AI processes and human-like understanding. Through extensive experimentation, our findings illustrate that this similarity function can effectively mirror the visual–semantic gap, steering the model towards enhanced performance in Zero-Shot Instance Retrieval. Our models achieve state-of-the-art performance on both the SUN (92.8% and 82.2%) and CUB datasets (59.92% and 48.82%) for bi-directional image-attribute retrieval accuracy. This work not only benchmarks the cognition alignment of AI but also sets a new precedent for the development of visual language models attuned to the complexities of human cognition.

show abstract

Towards Cognition-Aligned Visual Language Models via Zero-Shot Instance Retrieval

Ma,

Organisciak,

et al. 2024

Electronics

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Taming Simulators: Challenges, Pathways and Vision for the Alignment of Large Language Models

Cited by 1 publication

References 13 publications

Towards Cognition-Aligned Visual Language Models via Zero-Shot Instance Retrieval

Towards Cognition-Aligned Visual Language Models via Zero-Shot Instance Retrieval

Contact Info

Product

Resources

About