Voice Enhancement
Direction of Arrival


 

 
 
 
 
 
 
 

 

 

 

About

 

Alango’s Direction of Arrival (DOA) technology reliably determines the direction of human speech in an acoustic signal in real time. The technology uses an array of three or more microphones separated by known distances. Because sound travels at a known speed, the differences between the sound's arrival times at the microphones can be used to calculate the DOA.

The time difference is the slight delay between a sound wave reaching one microphone and reaching another, caused by their different distances from the sound source. By precisely measuring these time differences across the array, the system can geometrically determine the direction from which the human voice originated.
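The time-difference principle described above can be illustrated with a minimal Python sketch. It is not Alango's implementation: the function names, the three-microphone layout, the far-field (plane wave) model, and the speed of sound of 343 m/s are all assumptions made for illustration. The first function estimates the delay between two microphone signals by cross-correlation; the second converts the delays into an angle.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s in air at roughly 20 degrees C (assumed value)

def estimate_delay(ref, sig, sample_rate, max_lag):
    """Return the delay of `sig` relative to `ref`, in seconds.

    Picks the lag (in samples) that maximises the cross-correlation.
    A positive result means the sound reached the `sig` microphone later.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = sum(
            ref[n] * sig[n + lag]
            for n in range(len(ref))
            if 0 <= n + lag < len(sig)
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag / sample_rate

def doa_from_delays(mics, delays):
    """Far-field direction of arrival, in degrees, for three microphones.

    mics:   three (x, y) positions in metres, not all on one line.
    delays: arrival delay of each microphone relative to mics[0], in seconds.
    Solves (p_i - p_0) . u = -c * delay_i for the vector u that points
    from the array towards the source (hypothetical plane-wave model).
    """
    (x0, y0), (x1, y1), (x2, y2) = mics
    a, b = x1 - x0, y1 - y0
    c, d = x2 - x0, y2 - y0
    r1 = -SPEED_OF_SOUND * delays[1]
    r2 = -SPEED_OF_SOUND * delays[2]
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("microphones must not be collinear")
    ux = (r1 * d - b * r2) / det  # Cramer's rule for the 2x2 system
    uy = (a * r2 - c * r1) / det
    return math.degrees(math.atan2(uy, ux)) % 360.0

# Usage: simulate a talker at 60 degrees and recover the angle.
mics = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1)]
u = (math.cos(math.radians(60.0)), math.sin(math.radians(60.0)))
delays = [-(x * u[0] + y * u[1]) / SPEED_OF_SOUND for x, y in mics]
angle = doa_from_delays(mics, delays)
```

In this sketch a microphone closer to the talker hears the sound earlier, so its delay is negative. A real system would also need to handle noise, reverberation, and more than three microphones.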

 

 

Demonstration Video

 
 
 
 
Direction of Arrival (DOA) Technology

 

 

Key Capabilities

 
  • Real-Time Direction Estimation:
    Pinpoints the direction of a speaker using microphone array input.
  • Speaker Tracking:
    Continuously monitors a speaker’s position—even while moving—enabling consistent voice pickup in dynamic environments.
  • Multi-Speaker Awareness:
    Differentiates between speakers in different directions, allowing selective attention.
  • Low Latency & Low Power:
    Designed for efficient embedded implementation.
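
As a rough illustration of the speaker-tracking capability listed above, the following Python sketch smooths a stream of per-frame DOA estimates. The function name `smooth_doa` and the smoothing factor `alpha` are hypothetical, and exponential smoothing is only one possible approach, not necessarily the one the product uses. The sketch averages unit vectors instead of raw angles so that a talker crossing the 359 to 0 degree boundary is followed correctly.

```python
import math

def smooth_doa(angles_deg, alpha=0.3):
    """Exponentially smooth a sequence of DOA estimates given in degrees.

    Each angle is converted to a unit vector, the vectors are blended,
    and the blended vector is converted back to an angle in [0, 360].
    """
    smoothed = []
    sx = sy = None
    for angle in angles_deg:
        x = math.cos(math.radians(angle))
        y = math.sin(math.radians(angle))
        if sx is None:
            sx, sy = x, y  # first frame: nothing to blend with yet
        else:
            sx = (1 - alpha) * sx + alpha * x
            sy = (1 - alpha) * sy + alpha * y
        smoothed.append(math.degrees(math.atan2(sy, sx)) % 360.0)
    return smoothed

# Usage: a talker walking across the 0 degree direction.
track = smooth_doa([350.0, 355.0, 0.0, 5.0, 10.0])
```

A smaller `alpha` gives a steadier but slower-reacting track; a larger one follows fast movement more closely at the cost of more jitter.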
 

 

Applications

 

Direction of Arrival (DOA) voice pickup technology enables a wide range of sophisticated applications, especially in environments where several people are speaking. The ability to locate a speaker in space makes intelligent, automated responses possible.

Here are some key applications:

 
 
         
   

Camera Steering and Tracking
in Conference Rooms & Education

 
         
 
 

Conference Room/Huddle Room Camera Auto-Framing: When a person speaks, DOA identifies their location, and the PTZ (Pan-Tilt-Zoom) camera automatically frames them, which ensures participants can clearly see who is talking.

Lecture Hall/Classroom Auto-Tracking: Similar to conference rooms, DOA can track a professor as they move around a lecture hall, keeping them in frame for remote students or recording. It can also switch to a student asking a question from the audience.

Courtroom Recording & Analysis: Automatically focuses cameras on the speaker (judge, witness, lawyer) for clear recording.

 
 
 
         
   

Smart Home and Voice Assistant Integration

 
         
 
 

Enhanced Voice Assistant Accuracy: Instead of a voice assistant just hearing "hey Google" or "Alexa," DOA allows it to know *where* the command came from in the room. This can enable:

  • Directional Commands: When a command such as "turn on the light" is issued, the light that turns on is the one closest to the origin of the voice.
  • Targeted Control: If multiple people give commands, the system can prioritize or confirm with the person located in the identified direction. Knowing who spoke can link the command to that individual's user profile, ensuring personalized responses.
  • Location-Aware Reminders/Notifications: A smart home system could deliver a reminder only when you are in a specific part of the house, identified by your voice's DOA.
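
The directional-command idea above could be sketched as follows in Python. Everything here is an assumption for illustration: the function names, the device names, and the idea that each device's bearing (as seen from the microphone array) is known in advance. Note that DOA provides a direction rather than a distance, so this sketch picks the device whose bearing is closest in angle to the voice.

```python
def angular_distance(a_deg, b_deg):
    """Smallest absolute difference between two angles, in degrees."""
    diff = abs(a_deg - b_deg) % 360.0
    return min(diff, 360.0 - diff)

def nearest_device(doa_deg, device_bearings):
    """Return the name of the device whose bearing best matches the DOA.

    device_bearings: dict mapping a device name to its bearing in degrees,
    measured from the microphone array (hypothetical configuration data).
    """
    return min(
        device_bearings,
        key=lambda name: angular_distance(doa_deg, device_bearings[name]),
    )

# Usage: a command arriving from 350 degrees selects the kitchen light.
lights = {"kitchen": 10.0, "sofa": 170.0, "desk": 260.0}
chosen = nearest_device(350.0, lights)
```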
 
 
 
         
   

Automotive Applications

 
         
 
 

Targeted Voice Assistant Interaction: When multiple people are in a car, the voice assistant can pinpoint who issued a command, reducing confusion ("Did *you* say 'call mom,' or did *he*?").

Personalized Infotainment Zones: Directing audio content (e.g., a phone call) only to the driver's headrest speakers, while other passengers listen to different content.

 
 
 
         
   

Robotics and Human-Robot Interaction

 
         
 
 

Directional Listening for Robots: Robots can understand where a sound or command came from. This is crucial for:

  • Following Instructions: "Robot, come here" from a specific direction.
  • Identifying Danger: Pinpointing the location of a person in distress.
  • Social Robotics: A robot can turn its "head" or orient itself towards the person speaking, making interactions feel more natural and engaging.
 
 
 
         
   

Security and Surveillance

 
         
 
 

Identification: In security settings, DOA can identify the precise location of a person speaking, automatically directing surveillance cameras to that person.

 
 

 

 

Integration

 

DOA works in concert with other Alango products, such as the Voice Communication Package (VCP).

 
 
 

 
