Home Cyber Attack Voice Assistants Can Store And Leak Texts Typed On Smartphones In Proximity

Voice Assistants Can Store And Leak Texts Typed On Smartphones In Proximity

by Abeerah Hashim

A team of researchers from the University of Cambridge, UK, has discovered how smart voice assistants can cause data leak. They noticed that voice assistants can record the texts typed on smartphones within close proximity of the smart speakers. An adversary can process these audio recordings to retrieve the texts and steal data.

Briefly, the attack becomes possible due to how the voice assistants behave.

As the researchers explained, almost every voice assistant, like Alexa, Echo, or Google Assistant, includes multiple microphones. These microphones are very sensitive, and often directional too. Hence, they can record sounds that we otherwise don’t or can’t hear easily. For instance, the sound of typing on virtual keyboards on smartphones.

Whereas, these microphones are also dangerous since they remain in an ‘always-on’ state. Although, the assistants activate only with their respect trigger keywords, like “Alexa”, or “Hey Google”, they remain active and record audios all the time. This functionality helps them to activate immediately when called.

However, their continuous recording of sounds is also a privacy breach for the users. Especially, considering that many studies have demonstrated how perpetrators can process sounds to steal sensitive data.

Stealing Texts Typed On Smartphones Via Voice Assistants

Although, the microphones on the smartphone devices are also vulnerable. But exploiting them requires access to a particular target phone. However, in the case of microphones present in smart speakers, an adversary can spy on multiple devices.

As the researchers stated in their paper,

In this work, we show that attacks on virtual keyboards do not necessarily need to assume access to the device, and can actually be performed with external microphones. For example, we show how keytaps performed on a smartphone can be reconstructed by nearby smart speakers.

In their study, the researchers demonstrated that by exploiting a smart speaker, an attacker can steal the recordings obtained through the speakers’ microphone. Then processing and reconstructing these recordings allows deciphering what a user typed on a smartphone by detecting the tap locations.

While this detection worked somewhat differently in the case of numbers and letters, it still gave pretty accurate results. Whereas, the attack could work at a distance of up to half a meter from the smart speaker.

As stated in the paper,

Given just 10 guesses, 5-digit PINs can be found up to 15 % of the time, and text can be reconstructed with 50 % accuracy.

staling texts typed on smartphones via voice assistants

Source: Zarandy et al.

For their experiment, the researchers used a Raspberry Pi with a ReSpeaker 6-mic circular array. Whereas, the victim devices included HTC Nexus 9 tablet, a Nokia 5.1 smartphone, and a Huawei Mate20 Pro smartphone. The Pi also had a WiFi setup on it to allow other devices to connect to it.

The researchers have shared the details about their findings in a research paper.

Limitations and Countermeasures

While the attack method sounds easy to execute, it has some limitations as well, the major one being tap detection.

This limitation is relatively less effective in the case of internal microphones. However, for external microphones, it’s a major hurdle as the environment can increase false positives.

Likewise, for detecting texts, the struggle of correct tap detection together with large dictionaries, the attack becomes quite impractical for a real-world exploit.

As for countermeasures, the use of screen protectors and phone cases alter tap acoustics. Thus, they provide partial protection against this attack.

Whereas, the researchers also advise smartphone makers to introduce “silent tap-like sounds” that trigger upon opening keyboards. Such sounds would also increase false positives, thereby mitigating this attack.

You may also like

Latest Hacking News

Privacy Preference Center


The __cfduid cookie is used to identify individual clients behind a shared IP address and apply security settings on a per-client basis.

cookie_notice_accepted and gdpr[allowed_cookies] are used to identify the choices made from the user regarding cookie consent.

For example, if a visitor is in a coffee shop where there may be several infected machines, but the specific visitor's machine is trusted (for example, because they completed a challenge within your Challenge Passage period), the cookie allows Cloudflare to identify that client and not challenge them again. It does not correspond to any user ID in your web application, and does not store any personally identifiable information.

__cfduid, cookie_notice_accepted, gdpr[allowed_cookies]


DoubleClick by Google refers to the DoubleClick Digital Marketing platform which is a separate division within Google. This is Google’s most advanced advertising tools set, which includes five interconnected platform components.

DoubleClick Campaign Manager: the ad-serving platform, called an Ad Server, that delivers ads to your customers and measures all online advertising, even across screens and channels.

DoubleClick Bid Manager – the programmatic bidding platform for bidding on high-quality ad inventory from more than 47 ad marketplaces including Google Display Network.

DoubleClick Ad Exchange: the world’s largest ad marketplace for purchasing display, video, mobile, Search and even Facebook inventory.

DoubleClick Search: is more powerful than AdWords and used for purchasing search ads across Google, Yahoo, and Bing.

DoubleClick Creative Solutions: for designing, delivering and measuring rich media (video) ads, interactive and expandable ads.



The _ga is asssociated with Google Universal Analytics - which is a significant update to Google's more commonly used analytics service. This cookie is used to distinguish unique users by assigning a randomly generated number as a client identifier. It is included in each page request in a site and used to calculate visitor, session and campaign data for the sites analytics reports. By default it is set to expire after 2 years, although this is customisable by website owners.

The _gat global object is used to create and retrieve tracker objects, from which all other methods are invoked. Therefore the methods in this list should be run only off a tracker object created using the _gat global variable. All other methods should be called using the _gaq global object for asynchronous tracking.

_gid works as a user navigates between web pages, they can use the gtag.js tagging library to record information about the page the user has seen (for example, the page's URL) in Google Analytics. The gtag.js tagging library uses HTTP Cookies to "remember" the user's previous interactions with the web pages.

_ga, _gat, _gid