Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Imagine communicating with someone thousands of miles away and seeing their facial expressions in real time, as if they were seated next to you. This level of interaction is no longer a fantasy, thanks to new technologies being developed at the Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) in Abu Dhabi.
During a demonstration at MBZUAI’s Data Observatory, researchers unveiled two remarkable innovations, Voodoo XP and XMem++, which are set to transform virtual communication and digital interaction. Here’s how they work and what makes them extraordinary.
Voodoo XP, developed at MBZUAI, enables the creation of realistic digital avatars in real time using just a single webcam. Unlike earlier systems, which required expensive setups or specialized hardware, Voodoo XP simplifies the process dramatically.
Professor Hao Li, Director of MBZUAI’s Metaverse Centre, demonstrated how the technology mirrors every movement and facial detail instantly. “What you’re seeing is me controlling the avatar in real time, with no special hardware—just an ordinary camera,” explained Professor Li.
This simplicity opens the door to countless applications, from enhancing virtual meetings to enabling remote collaboration in ways that feel more personal and engaging.
MBZUAI’s innovation stands in stark contrast to technologies like Meta’s Codec Avatars, which require 171 cameras and hours of training to create similar results. PhD student Ariana Bermudez noted, “With Voodoo XP, you just need one webcam, and it creates an avatar in seconds.”
The system captures even the smallest movements, such as blinking and smiling, making the avatars incredibly lifelike. What’s more, the technology doesn’t require complicated equipment, making it accessible to a wider audience.
The second technology showcased was XMem++, a cutting-edge video object segmentation tool. This innovation enhances memory efficiency and segmentation accuracy, making it highly effective for long video sequences.
According to Bermudez, XMem++ is a powerful tool designed for video editing, augmented reality (AR), and autonomous systems. She demonstrated how it can simplify tedious manual tasks for visual effects (VFX) artists. “Usually, VFX artists have to refine a lot of details. With XMem++, they can stop, make fixes, and the system will propagate the adjustments seamlessly.”
Bermudez noted that the VFX community adopted XMem++ shortly after its launch in 2023. A prime example is its integration with Nuke, an industry-standard compositing application. VFX artists use it to perform advanced tasks, such as making objects disappear from a scene, with far greater ease and efficiency.
Both Voodoo XP and XMem++ underscore MBZUAI’s ambition to make advanced technology accessible and practical. These innovations could have far-reaching implications across numerous industries.
With their ease of use and state-of-the-art capabilities, these technologies open up new possibilities for how humans communicate and interact in digital spaces.
Voodoo XP and XMem++ are the result of collaboration among MBZUAI researchers and global experts. For Voodoo XP, Phong Tran, Egor Zakharov, Long-Nhat Ho, Anh Tuan Tran, and Liwen Hu contributed to its success alongside Professor Li. Meanwhile, XMem++ was developed with a team of experts, including Maksym Bekuzarov (an MBZUAI alumnus) and Joon-Young Lee (Adobe).
Such collaborations highlight MBZUAI’s role as a global leader in artificial intelligence education and research, attracting top talent from around the world.
Innovations like Voodoo XP and XMem++ showcase how artificial intelligence is revolutionizing industries by bridging the gap between physical and virtual interactions. By eliminating the need for costly equipment and simplifying complex processes, these technologies make advanced digital experiences more accessible to individuals and organizations alike.
MBZUAI’s breakthroughs represent not only technological progress but also an opportunity to redefine how we connect, collaborate, and create in a rapidly evolving digital world.
MBZUAI isn’t just developing cutting-edge technologies; it’s also educating the next generation of AI leaders. Whether you’re a curious individual or an aspiring AI professional, explore their programs and resources to stay ahead in this exciting field.