Sound Modem
Short-Range Sound Modem

Products  / Sound Modem  / Short-Range Sound Modem








Alango Short-Range Sound Modem (SRSM) technology allows for transferring arbitrary digital data, over-the-air, to a device via a narrow-band voice acoustic channel. The frequency of the data is within the human-audible spectrum utilizing a standard speaker from the transmit device to a standard microphone on the receive device; it’s simply data-over-sound.

SRSM can be compared to a traditional telephone modem (without a physical communication line) where data is encoded and transmitted on one side and then received, decoded and passed to the data consumer on the other side. This depicts a one-way communication scenario, but two-way communication is also possible with SRSM.

The proliferation of devices (e.g., smart home, toy/gaming, robot, industrial control, etc.) led to the development of SRSM, a nondestructive method of control and the ability to reconfigure the sealed out-of-the-box device by transferring data when RF means are not available; non-RF or environment is RF-restricted. Even where the device has connectivity through standard RF protocols (e.g., Bluetooth, Wi-Fi) they are limited in auxiliary data transfer and fine-tuning. An example of a non-RF device is a voice-activated toy. If it has a standard microphone, it has the possibility to update its settings via data-over-sound. Hence, there are many cases where SRSM can add extra functionality to devices in which no other means of non-destructive communication are possible.

PSK modulation is used to satisfy the requirement for communication that is short-range, low sampling rate, and high data rate. SRSM filters out signal distortions and follows clock skew to ensure reliable communication since PSK is inherently not robust to reverberation, multi-path propagation, and clock skew. Low bandwidth is utilized (1.2kHz ~ 2.6kHz) since almost all speakers and microphones provide good frequency response in this range. Sampling rates of 8kHz, 16kHz or higher are possible.

SRSM utilizes 8-channel OFDM with PSK (QAM-4) for each channel modulation scheme. It encodes 16 bits of data per symbol with symbol length of 10ms. This provides a net data rate of about 1.5kBits per second. The preamble, a safeguard against spoofing with similar sounds, takes 250ms. Processor load is under 2MHz (on Arm Cortex M4), requiring less than 10kB data memory.



Block Diagram







Alango uses SRSM in our technology packages such as Voice Communication Package (VCP), which is used for human-human speech pre-processing. For instance, SRSM allows for VCP acoustic parameters to be updated during the tuning process without the need for a hard-wired interface. This is especially useful for earbuds and other small form-factor devices, since the process of accessing wired interfaces may undermine the acoustic integrity and tuning of VCP parameters. Tuning is a repetitive process to advance the goal of achieving optimal acoustic performance. SRSM allows us and our customers to upload updated VCP acoustic profiles “on the fly” ensuring efficient tuning and an acoustically optimized end-product.

SRSM provides a reliable software-only solution for data transfer to sound-equipped devices that does not necessitate the OEM to modify or add existing hardware. Here are some potential applications:

IoT: Configuring “smart” devices engaged in a variety of tasks: Communicating with other devices, monitoring environmental sounds, checking sensors, and sending data to the database. With SRSM the user can change some parameters without intervention to the device itself and without accessing the database. For example, changing a thermostat temperature threshold.

Proximity detection: SRSM is short range and the signal cannot be recorded from long distances for the purpose of spoofing. As such, proximity detection, door opener, and presence indicator are potential applications. The data payload can include the user’s ID and other information.

Two-way acoustic communication: There are a variety of two-way communication scenarios where two devices talk to each other. But with SRSM the dialogue cannot be reliably recorded, like with RF communication. This brings, for example, key exchange to the next level of security.

Communication in an RF-restricted environment: In some cases, no RF communication is possible (in some hospitals rooms, for example). SRSM overcomes this limitation bringing reliable data transfer to these places.

Toy and robot control: As a method, entertaining or otherwise, for novel devices to communicate with each other.



Demonstration Video

Alango Short-Range Sound Modem Demo



White Paper



Top ▲