Apple researchers unveil new AI system for enhanced voice assistant interactions

Abdul Raouf Al Sbeei - Apple Reporter
3 Min Read

Apple researchers recently unveiled an advancement in artificial intelligence designed to improve voice assistant interactions: ReALM (Reference Resolution As Language Modeling), a system that tackles a key challenge, understanding user references to what’s on their screen (via VentureBeat).

Voice assistants usually struggle to interpret ambiguous user commands, particularly those referencing visual elements on a device’s display. ReALM tackles this hurdle by leveraging the power of large language models. These models analyze the on-screen content and contextualize user queries, enabling them to pinpoint the specific information being referenced.

Being able to understand context, including references, is essential for a conversational assistant. Enabling the user to issue queries about what they see on their screen is a crucial step in ensuring a true hands-free experience in voice assistants.

Apple research team

This innovation hinges on ReALM’s ability to reconstruct the user’s screen. By parsing on-screen elements and their locations, it generates a textual representation that captures the visual layout, translating visual information into the text format language models already handle well. This approach, combined with fine-tuned language models, surpasses existing systems like GPT-4 in understanding screen-based references.
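For a rough sense of how a screen could be flattened into text for a language model, here is a minimal sketch in Python. The ScreenElement structure, the row-grouping rule, and the prompt format are illustrative assumptions for this article, not the encoding Apple’s researchers describe.

```python
# Illustrative sketch: turning parsed on-screen elements into a plain-text
# layout that a language model can read alongside the user's query.
# The element format and grouping rules are assumptions, not Apple's method.

from dataclasses import dataclass

@dataclass
class ScreenElement:
    text: str   # visible text of the UI element
    x: float    # left edge of its bounding box (0..1)
    y: float    # top edge of its bounding box (0..1)

def screen_to_text(elements: list[ScreenElement], row_tolerance: float = 0.02) -> str:
    """Group elements into visual rows by vertical position, then order each
    row left-to-right, producing one line of text per on-screen row."""
    rows: list[list[ScreenElement]] = []
    for el in sorted(elements, key=lambda e: e.y):
        if rows and abs(rows[-1][0].y - el.y) <= row_tolerance:
            rows[-1].append(el)   # close enough vertically: same row
        else:
            rows.append([el])     # start a new row
    lines = [" ".join(e.text for e in sorted(row, key=lambda e: e.x)) for row in rows]
    return "\n".join(lines)

# Hypothetical example: a list of businesses the user might refer to by voice.
screen = [
    ScreenElement("Pizza Palace", 0.1, 0.10),
    ScreenElement("(555) 010-2345", 0.6, 0.10),
    ScreenElement("Sushi Spot", 0.1, 0.15),
    ScreenElement("(555) 010-9876", 0.6, 0.15),
]
prompt = f"Screen:\n{screen_to_text(screen)}\n\nUser: call the second one"
print(prompt)  # the textual layout plus the query is what the model would see
```

In a setup like this, resolving “the second one” becomes an ordinary text task: the model only needs to match the reference against the serialized screen rather than process pixels directly.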

The benefits extend beyond convenience. ReALM paves the way for a truly hands-free experience. Users can interact with their devices seamlessly, issuing voice commands directly related to what they see on the screen. This is particularly valuable for visually impaired users or situations where touching the device is impractical.

Apple researchers acknowledge the limitations of this technology. ReALM relies on automated parsing, which can struggle with complex visual references, such as distinguishing between multiple images. Future iterations might incorporate computer vision and multi-modal techniques to address these challenges.

Apple’s upcoming Worldwide Developers Conference (WWDC) on June 10 is expected to serve as a platform for showcasing its AI advancements alongside iOS 18, a major update for iPhones. Speculation also suggests the unveiling of a new large language model framework, an “Apple GPT” chatbot, and a broader integration of AI features across its ecosystem.
