A staggering data leak involving 24 billion records has exposed one of the largest credential datasets ever discovered, raising urgent concerns across the cybersecurity community.

Security researchers identified a massive publicly accessible database containing usernames, email addresses, plaintext passwords, and login URLs—data that could provide direct access to billions of online accounts.

Unlike traditional breaches targeting a single company, this dataset appears to aggregate information collected from multiple sources, including infostealer malware infections, leaked databases, and cybercrime distribution channels.

Key Details

The exposed dataset was hosted on an Elasticsearch cluster and contained more than 8.3 terabytes of data.

Researchers believe the vast majority of the 24 billion entries are infostealer logs, which are records harvested from devices infected with credential-stealing malware.

Each record typically includes:

Username or email address
Plaintext password
Associated login URL
Metadata indicating the source of the data

The dataset was compiled from 36 distinct sources, including:

Telegram-based cybercrime channels
Breach compilations from previous incidents
Large aggregated “collections” of stolen credentials
Database exports from live systems

Although the database has since been taken offline, the exposure window may have been long enough for threat actors to copy or distribute the information.

Technical Analysis

The structure of the exposed data reveals a highly organized and continuously updated credential aggregation system.

Infostealer Data Collection

Infostealer malware is designed to harvest sensitive information directly from infected devices. This includes:

Saved browser credentials
Autofill data
Session cookies
Application login details

Unlike traditional breaches, infostealers capture data at the endpoint level, meaning credentials are often valid and current at the time of collection.

Aggregation and Data Sources

The dataset combined multiple categories of stolen information:

1. Telegram Channels

A significant portion of the data originated from hacking-focused Telegram channels, where stolen credentials are frequently shared or sold.

These channels contained:

Hundreds of millions to billions of records
Data from various regions and services
Credentials often categorized by type or platform

Some datasets were even linked to historically significant cybercriminal activity sources.

2. “Collections”

The largest portion—over 22 billion records—was grouped under generic “collections.”

These likely represent:

Previously leaked infostealer datasets
Combined credential lists targeting major platforms
Aggregated breach data organized for exploitation

3. Breach Compilations and Dumps

Additional data came from:

Historical breach compilations
Local database exports (“dumps”)
Known credential leak collections such as AntiPublic

These sources indicate that attackers are continuously merging older breaches with newly stolen data to maintain relevance.

Continuous Threat Intelligence Gathering

Interestingly, a subset of the dataset contained:

CVE vulnerability references
Links to exploit repositories
News articles about recent breaches
Social media discussions on cyber incidents

This suggests the dataset owner was actively monitoring the cybersecurity landscape to update and enrich their collection.

Impact and Risks

The sheer scale of this data leak significantly amplifies risk across multiple dimensions.

Account Takeover Risk

With billions of exposed credentials in plaintext, attackers can:

Perform large-scale credential stuffing attacks
Access email, banking, cloud, and social media accounts
Exploit reused passwords across services

Identity and Financial Risk

Compromised accounts may lead to:

Identity theft
Financial fraud
Unauthorized transactions
Data loss or ransomware exposure

Organizational Risk

For businesses, the impact can extend to:

Internal system access via compromised employee accounts
Exposure of confidential or proprietary data
Supply chain or partner compromise

The Unknown Factor

Researchers were unable to determine:

How many records are duplicates
How many unique individuals are affected
The exact age of all records

However, even with duplication, the scale of 24 billion records suggests widespread exposure.

Why This Matters

This incident highlights a major shift in cyber threats.

Attackers are no longer relying on single large breaches. Instead, they are building massive aggregated databases that combine:

Fresh infostealer logs
Historical breach data
Real-time intelligence updates

This approach increases the effectiveness of attacks by ensuring credential datasets remain relevant and actionable.

Expert Recommendations

Users and organizations must take immediate action to reduce risk.

Critical Steps

Change passwords immediately, especially for important accounts
Avoid reusing passwords across multiple services
Enable multi-factor authentication (MFA) wherever possible
Use a password manager to generate strong, unique credentials

Device Security

Keep operating systems and applications updated
Avoid downloading software from untrusted sources
Be cautious with email links and attachments
Use secure networks and VPNs when necessary

Organizational Controls

Monitor for unusual login activity
Implement credential breach detection systems
Enforce strict password and access policies
Train users on phishing and malware risks

Industry Context

Massive credential leaks are becoming increasingly common.

Recent years have seen multiple incidents involving billions of records, reflecting a shift toward large-scale data aggregation and reuse.

Infostealer malware-as-a-service has lowered the barrier to entry for cybercriminals, allowing even low-skilled attackers to collect and distribute stolen credentials.

As a result, the cybersecurity landscape is evolving from isolated breaches to continuous data harvesting ecosystems.

Conclusion

The 24 billion record exposure is a stark reminder of the scale at which personal data is being collected, traded, and exploited.

While the database has been taken offline, the risk persists. Once credentials are leaked, they can circulate indefinitely across cybercriminal networks.

For users, vigilance and proactive security practices are essential.

For organizations, the challenge is clear: traditional defenses must evolve to address a world where breach data is not just leaked—but continuously aggregated and weaponized.

FAQ SECTION

What is an infostealer?

An infostealer is malware designed to collect sensitive data such as usernames, passwords, and cookies directly from infected devices.

How dangerous is this data leak?

Highly dangerous due to its size and the presence of plaintext credentials, increasing the risk of account takeovers.

Are all 24 billion records unique?

No. Many records may be duplicates, but the overall impact remains significant.

What should I do if my data is exposed?

Change your passwords, enable multi-factor authentication, and monitor your accounts for suspicious activity.

Can this data still be used by attackers?

Yes. Even though the database is no longer public, the data may have already been copied and distributed.

24 Billion Credentials Exposed in Massive Data Leak Shaking Global Security