Home Cyber Security News 2.2 Billion Emails Discovered in New Data Dump

2.2 Billion Emails Discovered in New Data Dump

by Abeerah Hashim
Collection #2 to #5

A couple of weeks ago, a huge database of breached credentials startled almost everyone. The huge collection of dumped data, referred as “Collection #1” flaunted 21 million passwords. However, we now hear about an even larger data dump. The Collection #2 to #5 data dumps supposedly include 2.2 billion email addresses.

2.2 Billion Email Addresses Exposed In Collection #2 To #5

After the news of the huge database named Collection #1 surfaced online, the researchers began working out to reveal further details. At that time, KrebsOnSecurity reported that the seller offering the Collection #1 also bears other huge subsets of databases. He found the database folders labelled as Collection #2, Collection #3, Collection #4 and Collection #5.However, there wasn’t much information available about these subsets then.

Collection 1 and other subsets

Source: KrebsOnSecurity

Nonetheless, a German security firm Heise Online has now disclosed the details of Collection #2 to #5. Reportedly, the combined data of these subsets make up to 2.2 billion email addresses along with corresponding passwords, according to the Hasso Plattner Institute.

According to Heise, the total data volume seems over 600GB. If we take into account the details revealed by KrebsOnSecurity, then the cumulative volume of Collection #2 to Collection #5 may be 784GB.

Exposed Data May Be Massive But Not Alarming

The researchers reveal something that may lessen the impact. According to them, the database includes combined data obtained from previous breaches. Supposedly, it does not include any new data.

“The data are apparently not completely new, but come to a large extent from older leaks.”

Nonetheless, one may not rule out the risks for abuse of these credentials by the malefactors.

“Nevertheless, the compilation and re-release is likely to increase the likelihood that cyber crooks will try out the credentials.”

The researchers re-emphasize on the importance of using unique passwords for every online account. They also advise using password managers to avoid the hassle of remembering so many passwords without compromising security.

Perhaps, recalling the recent wave of credential stuffing attacks at DailyMotion and Basecamp, one can realize the importance of password security measures.

You may also like

Latest Hacking News

Privacy Preference Center


The __cfduid cookie is used to identify individual clients behind a shared IP address and apply security settings on a per-client basis.

cookie_notice_accepted and gdpr[allowed_cookies] are used to identify the choices made from the user regarding cookie consent.

For example, if a visitor is in a coffee shop where there may be several infected machines, but the specific visitor's machine is trusted (for example, because they completed a challenge within your Challenge Passage period), the cookie allows Cloudflare to identify that client and not challenge them again. It does not correspond to any user ID in your web application, and does not store any personally identifiable information.

__cfduid, cookie_notice_accepted, gdpr[allowed_cookies]


DoubleClick by Google refers to the DoubleClick Digital Marketing platform which is a separate division within Google. This is Google’s most advanced advertising tools set, which includes five interconnected platform components.

DoubleClick Campaign Manager: the ad-serving platform, called an Ad Server, that delivers ads to your customers and measures all online advertising, even across screens and channels.

DoubleClick Bid Manager – the programmatic bidding platform for bidding on high-quality ad inventory from more than 47 ad marketplaces including Google Display Network.

DoubleClick Ad Exchange: the world’s largest ad marketplace for purchasing display, video, mobile, Search and even Facebook inventory.

DoubleClick Search: is more powerful than AdWords and used for purchasing search ads across Google, Yahoo, and Bing.

DoubleClick Creative Solutions: for designing, delivering and measuring rich media (video) ads, interactive and expandable ads.



The _ga is asssociated with Google Universal Analytics - which is a significant update to Google's more commonly used analytics service. This cookie is used to distinguish unique users by assigning a randomly generated number as a client identifier. It is included in each page request in a site and used to calculate visitor, session and campaign data for the sites analytics reports. By default it is set to expire after 2 years, although this is customisable by website owners.

The _gat global object is used to create and retrieve tracker objects, from which all other methods are invoked. Therefore the methods in this list should be run only off a tracker object created using the _gat global variable. All other methods should be called using the _gaq global object for asynchronous tracking.

_gid works as a user navigates between web pages, they can use the gtag.js tagging library to record information about the page the user has seen (for example, the page's URL) in Google Analytics. The gtag.js tagging library uses HTTP Cookies to "remember" the user's previous interactions with the web pages.

_ga, _gat, _gid