Proceedings of the 19th ACM International Conference on Multimodal Interaction 2017
DOI: 10.1145/3136755.3143021
Rapid development of multimodal interactive systems: a demonstration of platform for situated intelligence

Cited by 38 publications (27 citation statements) · References 2 publications
“…This component consumes video streams from video-stream producers and produces detection results for each frame, which can be consumed by subsequent components in the pipeline. Additionally, the base functionality of OpenFace is extended with a facial expression recognition model 5. Integration of this functionality into OpenSense involves the use of ML.NET 6 with the Open Neural Network Exchange 7 to import the pre-trained model into the system.…”
Section: Exporters
Citation type: mentioning, confidence: 99%
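The quoted passage describes a pipeline component that consumes a video stream and emits one detection result per frame. The shape of that producer/consumer contract can be sketched in Python; the `Detection` record and the stand-in model callable below are hypothetical illustrations, not the actual OpenSense or ML.NET API:

```python
from collections.abc import Callable, Iterable, Iterator
from dataclasses import dataclass


@dataclass
class Detection:
    """One per-frame result, e.g. a facial-expression class (hypothetical)."""
    frame_id: int
    label: str
    score: float


def detector(frames: Iterable[bytes],
             model: Callable[[bytes], tuple[str, float]]) -> Iterator[Detection]:
    """Consume a stream of video frames and yield one detection per frame.

    `model` stands in for an imported pre-trained model (e.g. an ONNX
    model wrapped in a callable); downstream pipeline components would
    consume the yielded Detection stream.
    """
    for frame_id, frame in enumerate(frames):
        label, score = model(frame)
        yield Detection(frame_id, label, score)


# Stand-in model that classifies every frame as "neutral".
results = list(detector([b"frame-0", b"frame-1"], lambda f: ("neutral", 0.9)))
```

The generator form mirrors the streaming nature of the pipeline: each frame is processed as it arrives rather than after the whole video is buffered.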
“…Figure 1 provides a screenshot of the system in action (currently only available on Windows). OpenSense is built on Microsoft's Platform for Situated Intelligence 2 (\psi) [5], an open-source and extensible framework that enables the development of situated integrative-AI systems. Consequently, OpenSense inherits all computational tools provided by the \psi runtime and core libraries, including parallel computing over streams of data, reasoning about time, data-stream synchronization, and multimodal data fusion.…”
Citation type: mentioning, confidence: 99%
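The data-stream synchronization the quote attributes to the \psi runtime can be illustrated with a toy example: pairing each message from one timestamped stream with the temporally nearest message from another. This is a conceptual Python sketch of the idea, not \psi's actual C# stream-operator API:

```python
from bisect import bisect_left


def nearest_join(a, b):
    """Pair each (time, value) message in stream `a` with the message in
    stream `b` whose timestamp is closest -- a toy version of the
    originating-time-based synchronization a runtime like \\psi provides.
    Assumes `b` is sorted by timestamp and non-empty.
    """
    times_b = [t for t, _ in b]
    out = []
    for t, value_a in a:
        i = bisect_left(times_b, t)
        # Candidates are the neighbors on each side of the insertion point.
        best = min(
            (j for j in (i - 1, i) if 0 <= j < len(b)),
            key=lambda j: abs(times_b[j] - t),
        )
        out.append((t, value_a, b[best][1]))
    return out


audio = [(0, "a0"), (10, "a1"), (20, "a2")]
video = [(1, "v0"), (9, "v1"), (21, "v2")]
pairs = nearest_join(audio, video)
# pairs == [(0, 'a0', 'v0'), (10, 'a1', 'v1'), (20, 'a2', 'v2')]
```

Real runtimes must also handle latency and out-of-order delivery, which is why the quote singles out reasoning about time as a core service rather than an application concern.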
“…TeamTalk (Marge and Rudnicky, 2019), for example, controls multiple ground robots by way of a predefined grammar, while DIARC (Scheutz et al., 2019) also supports dialogue with multiple robots and has been implemented on ground, aerial, and social robots. Open-source architectures such as OpenDial (Lison and Kennington, 2016), IrisTK (Skantze and Al Moubayed, 2012), and Microsoft's PSI (Bohus et al., 2017) can be used to build many situated dialogue agents, including robots. Compared to similar architectures, MultiBot leverages wizard-swappable components from ScoutBot and extends the mode of interaction to multi-participant dialogue.…”
Section: Related Work
Citation type: mentioning, confidence: 99%
“…One such system is the virtual receptionist, which keeps track of users' attention and engagement through visual cues (such as gaze tracking, head orientation, etc.) to initiate the interaction at the most appropriate moment [45]. Further, it can also make use of hesitation (e.g., “hmmm .…”
Section: ASI: A New Challenge
Citation type: mentioning, confidence: 99%