AI that can process different types of information simultaneously – text, images, audio, and video. Like having an assistant who can watch a film, read the book, and tell you which one was better, all while analyzing the soundtrack.
Synonyms:
Cross-modal AI