This document defines a set of JavaScript APIs that let a Web application manage how audio is rendered on the user audio output devices.

This document is not complete. It is subject to major changes and, while early experimentations are encouraged, it is therefore not intended for implementation.

Introduction

This proposal allows JavaScript to direct the audio output of a media element to authorized devices other than the system or user agent default. This can be helpful in a variety of real-time communication scenarios as well as general media applications. For example, an application can use this API to programmatically direct output to a device such as a Bluetooth headset or speakerphone.

HTMLMediaElement Extensions

This section specifies additions to the HTMLMediaElement [[!HTML5]] when the Audio Output Devices API is supported.

readonly attribute DOMString sinkId
The unique ID of the audio device through which output is being delivered, or empty string if it is using the user agent default. This ID should be one of the MediaDeviceInfo.deviceId values returned from MediaDevices.enumerateDevices() [[!GETUSERMEDIA]], or one of the predefined IDs listed below.
Promise<void> setSinkId ()
Sets the ID of the audio device through which audio output should be rendered if the application is authorized to play out of a given device.
DOMString sinkId
The id corresponding to the audio output device.

When this method is invoked, the user agent must run the following steps:

  1. If sinkId and the sinkId attribute are the same object, return a resolved promise.

  2. Let promise be a new promise.

  3. Run the following steps asynchronously:

    1. If sinkId does not match any audio output device identified by enumerateDevices(), reject promise with a new DOMException whose name is NotFoundError.

    2. If application is not authorized to play audio through the device identified by sinkId, reject promise with a new DOMException whose name is SecurityError.

    3. Associate the audio output device represented by sinkId with this object for playout.

    4. If the preceding step failed, reject promise with a new DOMException whose name is AbortError.

    5. If the media element's paused attribute is false, stop playing this object's audio out of the device represented by the sinkId attribute.

    6. Set the sinkId attribute to sinkId.

    7. If the media element's paused attribute is false, start playing this object's audio out of the device represented by sinkId.

    8. Resolve promise.

  4. Return promise.

Algorithms

Sink no longer available

The audio device associated with HTMLMediaElement.sinkId may become unavailable, for example if it is unplugged.

When the audio device identified by the sinkId attribute is no longer available, the user agent must take no action. For example, if the media element's paused attribute is false when the device identified by the sinkId is no longer available, then playback will continue as normal. Of course, audio will not be rendered because the device to which the media element is attached is unavailable.

The following paragraph is non-normative.

If the application wishes to react to the device change, the application can listen to the devicechange event and query enumerateDevices() for the list of updated devices. If the HTMLMediaElement.sinkId is no longer present in the list of MediaDeviceInfo.deviceIds returned, the device is no longer available and the application can choose to react accordingly.

New sink available

New audio devices may become available to the user agent, or an audio device (identified by HTMLMediaElement.sinkId) that had previously become unavailable may become available again, for example, if it is unplugged and later plugged back in.

In this scenario, the user agent must run the following steps:

  1. Let sinkId be the identifier for the newly available device.

  2. For each sinkId whose value is equal to sinkId:

    1. If the media element's paused attribute is false, start rendering this object's audio out of the device represented by the sinkId attribute.

The following paragraph is non-normative.

If the application wishes to react to the device change, the application can listen to the devicechange event and query enumerateDevices() for the list of updated devices.

Web Audio API Extensions

This is a work in progress. This section discusses modifications of the Web Audio API [[!WEBAUDIO]] when the Audio Output Devices API is supported.

AudioContext constructor argument

The sink ID is passed as an argument to the AudioContext constructor, e.g.,

new AudioContext({ sinkId: requestedSinkId });
Requiring the sink ID to be set at construction time simplifies the implementation, since the output sample rate is fixed.

Predefined Device IDs

User agents must also support the following predefined pseudo-identifiers which reference an abstract device. For example, if the application wants to route audio to a headset, but doesn't care exactly which headset, it can use the predefined id for the system communications device.

MediaDeviceId Enum

id-multimedia
Indicates the system multimedia playout device. This is the default, i.e., equivalent to not setting the sink id, or setting it to "".
id-communications
Indicates the system audio communications device, if one exists, typically a headset of some sort. [NOTE: what should happen if the system does not support this concept, but does know about headsets?]

Privacy Considerations

This specification defines conformance criteria that apply to a single product: the user agent that implements the interfaces that it contains.

Conformance requirements phrased as algorithms or specific steps may be implemented in any manner, so long as the end result is equivalent. (In particular, the algorithms defined in this specification are intended to be easy to follow, and not intended to be performant.)

Implementations that use ECMAScript to implement the APIs defined in this specification must implement them in a manner consistent with the ECMAScript Bindings defined in the Web IDL specification [[!WEBIDL]], as this specification uses that specification and terminology.

Acknowledgements

The following people have contributed directly to the developmment of this specification: Harald Alvestrand, Rick Byers, Dominique Hazael-Massieux (via the HTML5Apps project), Philip Jägenstedt, Victoria Kirst, Shijun Sun, Martin Thomson, Chris Wilson.