Get Your Free Report
Start for Free
SOCRadar® Cyber Intelligence Inc. | Public Elasticsearch Servers Expose 9.8 Billion Credential Records Across Enterprise, Cloud, and AI Platforms 
Apr 20, 2026
7 Mins Read
Apr 21, 2026
Moon

Public Elasticsearch Servers Expose 9.8 Billion Credential Records Across Enterprise, Cloud, and AI Platforms

Misconfigured Elasticsearch servers continue to expose massive volumes of sensitive data. This time, SOCRadar’s AI-powered Sensitive Data Exposure Monitoring service identified three publicly accessible Elasticsearch instances containing more than 9.8 billion credential records spread across separate datasets.

The exposed data included both simple email-password combinations and more detailed ULP records linking credentials to specific services and login URLs. In aggregate, these datasets demonstrate a critical shift in the threat landscape; credential exposure is no longer confined to isolated breach dumps. Instead, it reflects a sophisticated data ecosystem that facilitates large-scale credential stuffing, account takeover (ATO), enterprise compromise, and unauthorized infrastructure access.

What Was Found in Publicly Exposed Elasticsearch Servers?

Across the three servers, the exposed data totaled more than 9,879,060,029 records. Our team analyzed these exposed servers, reviewed the structure of the data, and notified the relevant parties. Following responsible disclosure, all identified servers were secured and taken offline.

The datasets were distributed as follows:

Server #1 – ULP Dataset

  • Records: 3,926,010,491
  • Size: Approximately 818 GB
  • Data type: URL + Email + Password

Server #2 – Credential Dataset

  • Records: 4,606,063,150
  • Size: Approximately 496 GB
  • Data type: Email + Password pairs

Server #3 – ULP Dataset

  • Records: 1,346,986,388
  • Size: Approximately 229 GB
  • Data type: URL + Email + Password

The two ULP datasets are especially valuable from a threat intelligence perspective because they do not just expose credentials. They also connect those credentials to specific target services through URLs, which makes the records more useful for service-level analysis and more actionable for attackers.

Example ULP records showing credentials tied to specific login URLs

Example ULP records showing credentials tied to specific login URLs

The sections below outline the main findings from our investigation, including enterprise credential exposure, identity provider risks, platform targeting patterns, and the broader implications for organizations.

1. More Than Half of the Exposed Credentials Were Corporate Accounts

One of the most concerning findings came from the second dataset. Analysis showed that out of 4.6 billion total email records, roughly 2.39 billion were corporate email addresses, representing about 52% of the exposed credentials.

Corporate accounts often serve as entry points into much larger environments. When exposed credentials belong to enterprise users, the risk extends beyond personal account compromise. Attackers can test reused passwords against VPNs, cloud portals, collaboration tools, and internal platforms.

This makes credential reuse a direct enterprise risk, especially in environments where employees use the same or similar passwords across business and personal services.

Screenshot of exposed email and password records from the credential dataset

Screenshot of exposed email and password records from the credential dataset

2. Identity Provider Exposure Raises the Risk to the Authentication Layer

The first ULP dataset revealed a significant number of records tied directly to enterprise identity services, including:

  • microsoftonline.com: 2.6 million+
  • auth0.com: around 200,000
  • okta.com: around 29,000

These domains are central to identity and access management across modern organizations. They are commonly used for Microsoft Entra ID, Okta, and Auth0 authentication workflows.

This is a more serious scenario than a standard credential leak affecting a single website. Credentials tied to identity providers can expose the authentication layer itself. If attackers gain access there, they may reach multiple connected services, not just one application. In practice, that can mean broader lateral movement, persistent access, and faster enterprise compromise.

3. Internal Platforms and Supply Chain Systems Were Also Exposed

The datasets also included credentials associated with widely used business platforms such as:

  • zendesk.com: 143,000
  • atlassian.com: 75,000
  • salesforce.com: 74,000

These platforms often contain customer records, internal support workflows, development projects, and collaboration data. Exposure at this level can affect daily operations and business continuity, not just user accounts.

Attackers with access to these systems may gain visibility into customer issues, internal documentation, project environments, and service processes. That turns credential exposure into a supply chain and operational risk.

4. AI Platforms Are Becoming Part of the Credential Exposure Problem

Another notable finding was the presence of credentials linked to modern AI services:

  • openai.com: 620,000+
  • huggingface.co: 24,000+
  • leonardo.ai: 18,000+

This shows that AI platforms are already part of large-scale credential reuse patterns. As organizations increasingly integrate AI services into internal workflows, these accounts may provide access to prompts, proprietary data, automation pipelines, and business logic.

This is an emerging attack surface. Many organizations still focus credential monitoring on email, cloud, and remote access systems, while AI services may not yet receive the same level of security attention.

5. Cloud and Infrastructure Credentials Increase the Impact

The exposed records also referenced cloud and infrastructure-related platforms, including:

  • amazon.com: around 178,000
  • windowsazure.com: around 26,000
  • fortinet.com: around 21,000

These services are commonly linked to cloud resource management, hosting, VPN connectivity, and remote access. That expands the impact of the exposure from account abuse to potential infrastructure compromise.

At the same time, the most frequently targeted services in the first ULP dataset included google.com, facebook.com, live.com, discord.com, netflix.com, amazon.com, and paypal.com, reflecting the same platforms often abused in credential stuffing and account takeover campaigns.

ULP records showing credentials mapped to different online services

ULP records showing credentials mapped to different online services

6. Weak Passwords Still Dominate at Billion-Record Scale

Despite years of awareness efforts, weak passwords remain widespread. The most common passwords observed included:

  • 123456: 28 million+
  • 123456789: 16 million+
  • password: 13 million+
  • qwerty: 10 million+
  • qwerty123: 7 million+

These patterns explain why credential stuffing remains effective. Even at this scale, poor password hygiene continues to fuel automated abuse.

What Do These Exposed Credential Datasets Mean for Organizations?

The combined datasets strongly suggest aggregation from multiple sources, including infostealer logs, historical breach collections, and credential stuffing lists. The repeated patterns, massive record counts, and mixed data structures all support that assessment.

For organizations, the risks are broad and immediate:

  • Widespread account compromise across consumer and enterprise services
  • Unauthorized access to corporate environments
  • Exposure of internal systems and business operations
  • Greater risk from credential reuse and automated attacks
  • Expansion of attack surface into AI, cloud, and identity platforms

This is also where external visibility becomes critical. Many of these exposures happen outside the traditional security perimeter, which means internal controls alone may not detect them early enough. Publicly exposed databases, credential reuse patterns, and third-party platform access often remain invisible until attackers begin exploiting them.

How Can SOCRadar Help?

SOCRadar Threat Intelligence helps close the gap by combining Attack Surface Management, Digital Risk Protection, and Cyber Threat Intelligence to identify exposed services and sensitive data before they are weaponized. When credential datasets surface in exposed databases or other public-facing sources, SOCRadar helps security teams understand what was exposed, which platforms are affected, and how the data could be used in real-world attacks.

This gives organizations a more actionable view of risk. Security teams can detect whether exposed credentials are tied to identity providers, internal business platforms, cloud services, or emerging AI tools, then prioritize response based on business impact. That visibility supports faster remediation, stronger credential monitoring, better executive and enterprise protection, and a more proactive response to account takeover and fraud risks.

SOCRadar’s Attack Surface Management, Digital Footprint

SOCRadar’s Attack Surface Management, Digital Footprint

Organizations can use external threat visibility to identify the warning signs earlier and reduce the chance that exposed data turns into a full-scale incident.