Agora Inc.

11/13/2024 | News release | Distributed by Public on 11/13/2024 15:36

Reinvent IoT with Real-Time Multimodal Agents Powered by Conversational AI and RTC

The Internet of Things (IoT) turns everyday objects into intelligent, connected tools that generate actionable insights. But IoT has been held back by limited "smart" assistants that struggle to understand requests and can't speak naturally or make decisions based on real-time conditions. Conversational AI is poised to change this by enabling multimodal communication with advanced large language models (LLMs) that can understand complex requests and hold natural conversations, and take actions based on live video feeds. Additionally, integrating low-latency communication and real-time synchronization enables a new wave of IoT innovation from smart homes to autonomous vehicles.

Agora's AI-driven IoT solutions pave the way for groundbreaking advancements in automation, communication, and real-time decision-making. Agora's Conversational AI SDK seamlessly integrates with OpenAI's Realtime API, offering users effortless access to voice communication and IoT operation powered by GPT models. Agora's IoT SDK enables seamless synchronization and real-time communication on IoT devices, whether speaking to an AI agent or video calling a friend.

In this post, we'll dive into some of the most exciting use cases across various industries and explore how these advancements are reshaping the future of IoT.

Redefining baby monitoring with AI-driven features

Ellie makes AI-powered baby monitors that integrate real-time video streaming with advanced AI features. Ellie uses Agora's Conversational AI to further enhance these features, enabling personalized and dynamic interaction.

Focused on helping parents effortlessly keep an eye on their baby's activities, Ellie aims to provide peace of mind to new parents with reliable and trustworthy AI baby care products.

Before moving to Agora, Ellie AI baby monitor relied on traditional P2P connections for streaming which was highly unreliable. The solution faced issues with poor video and audio quality, and constant delays in streaming, especially in regions with lower quality internet infrastructure. Ellie also did not have an effective way to measure quality of experience (QoE) in real-time to address issues quickly.

Key Features:

  • AI monitoring: Continuously streams real-time video and audio, detects crying and face coverings, and analyzes sleep patterns and movements to ensure optimal safety.
  • Key word filter: provides daily reports, summarizing key events such as sleep quality, movement, and any critical alerts, giving parents a comprehensive overview of their baby's well-being.
  • Parent voice cloning: Uses conversational AI to clone parents' voices, allowing them to soothe their baby in real time or narrate stories when they are unavailable.

Why Ellie chose Agora

  • Highest-quality video and audio streaming experience
  • Compatibility with child safety and AI features
  • Extensive global network coverage
  • Real-time quality monitoring to immediately address any issues impacting users
"Agora enabled us to seamlessly expand our advanced real-time AI features, delivering an unparalleled baby monitoring experience for parents."

Joe Tham, Co-founder, Ellie

Revolutionizing educational play with immersive interaction

AI smart toys provide engaging and personalized play experiences by combining interactive storytelling with conversational AI for immersive, creative, and educational play. Toys like Curio use kid-safe conversational AI to chat with children and tell them stories. The smart robot toy, Miko, entertains, educates, monitors, and guides activities for children while making learning fun and playful. It also helps parents connect and engage with their little ones anywhere through video calling and remote navigation while tracking and reporting on their learning progress.

Miko uses Agora to incorporate high-quality video calls with minimal delays. This feature enables parents to stay connected with and monitor their children, whether away from home or in another room. Agora's Signaling product enables real-time data synchronization, allowing parents to move Miko around to follow the child during a video call.

Key Features:

  • AI storytelling: Using Conversational AI for voice cloning, tone and accent adaptation, the toy tells ever-evolving stories in a parent's voice, adapting to the child's preferences, reactions, and even regional dialect.
  • AI companionship: Conversational AI enables the toy to evolve its responses and behaviors based on the child's emotions and interactions, providing a dynamic, learning companion that grows with the child.

Why Miko chose Agora:

  • High-quality video calling
  • Remote control with zero latency
  • Safe and secure IoT support
  • Advanced AI features
"Adding Agora's secure, high-quality video communication to Miko led to a massive boost in parent engagement with the app."

Sneh Vaswani, Co-Founder & CEO, Miko

Turning video feeds into smart security agents

Video feeds no longer need to record passively-they can actively protect.

Agora brings advanced intelligence to existing surveillance systems by utilizing computer vision to analyze real-time footage and generate predictive insights. This innovation transforms cameras into more than monitoring devices; they become proactive tools that learn, detect, and respond.

Key features:

  • Security agent: Agora's technology can instantly identify suspicious activities, preventing theft through real-time object recognition and immediate alerts. Additionally, computer vision capabilities extend beyond security, allowing for improved traffic flow analysis and the prediction of congestion patterns, optimizing public transportation schedules and logistics operations.
  • Voice driven AI video search: Integrating conversational AI into functionality like search enables users to ask questions like "how many people have passed by my door in the past 2 days" and get a quick answer without having to go back through video footage manually.

Redefining autonomous mobility with AI and real-time monitoring

Autonomous transportation is shaping the future of smart cities. Powered by Conversational AI, Agora's technology transforms autonomous vehicles like robo taxis into personalized, ride-sharing experiences designed to elevate urban mobility.

Agora's technology enables passengers to interact seamlessly in multiple languages for navigation help, music selection, or ride-related questions. Real-time safety monitoring ensures that operators can track vehicle performance and passenger safety. This gives parents peace of mind by allowing them to remotely monitor children or elderly family members traveling alone. In more complex situations, such as heavy traffic or emergencies, operators can even take control of the vehicle remotely to ensure a smooth and safe journey.

Key Features:

  • Robotaxi reception agent: Agora's technology enables passengers to interact seamlessly in multiple languages for navigation help, music selection, or ride-related questions.
  • Real-time safety monitoring: Ensures that operators can track vehicle performance and passenger safety. This gives parents peace of mind by allowing them to remotely monitor children or elderly family members traveling alone.
  • Teleoperation: In more complex situations, such as heavy traffic or emergencies, operators can even take control of the vehicle remotely to ensure a smooth and safe journey.

AI and the future of lawn care

Lawnmowers equipped with cameras and audio can be enhanced with real-time video feeds, computer vision models, and conversational AI for adaptive and predictive landscaping maintenance.

For example, the robotic lawn mower can continuously learn and adapt to its outdoor surroundings, creating optimized mowing patterns and predicting obstacles before they arise. This intelligent technology ensures smooth operation and efficiency, even in complex landscapes. These robotic mowers can recognize different types of grass and their unique growth patterns. They can also alert users when certain lawn areas need extra attention, such as watering or reseeding, allowing for real-time adjustments through a user-friendly app.

Key Features:

  • Grass recognition & adaptive lawn care: The mower can identify different grass types and their growth patterns. It alerts users when areas need special attention, such as watering or reseeding, and allows for real-time adjustments via a user-friendly app.
  • Safety & obstacle detection: AI ensures safety by pausing the mower if pets, children, or other obstacles are detected nearby. Users receive voice alerts like, "There is an obstacle in the mowing path. Resuming once clear."
  • Voice command control: Users can control the mower with simple voice commands, such as "Start mowing the backyard" or "Avoid the flower bed," allowing hands-free operation.
  • Gardener agent: The mower provides live updates like a real gardener, answering questions like "Does the lawn need to be watered?" and giving progress reports like "Backyard area complete, moving to the front yard."

Unlocking new possibilities in IoT

As we've seen, the combination of IoT and AI unlocks new possibilities across industries, transforming everything from security systems to autonomous transportation. Agora's AI-driven IoT solutions, powered by seamless integration with OpenAI's Realtime API, are at the forefront of this transformation.

By turning IoT devices into more intelligent tools powered by real-time AI, Agora is enhancing automation and communication and shaping the future of how we interact with technology. Whether it's improving safety, personalizing experiences, or optimizing operations, the potential for IoT in our increasingly connected world is limitless-and Agora is leading the charge. The future is smarter, more connected, and driven by innovation.