ClassAction.org Data Breach Lawsuit Scraper
Scrapes the ClassAction.org data breach lawsuits index and returns structured records for each active data breach lawsuit, investigation, and open settlement. Each run delivers the full current feed of data breach cases with eligibility info, law firm attribution, impacted data types, and lead summaries.
What you get
| Field | Description |
|---|---|
| `name` | Full lawsuit / investigation title (e.g. "Gastro Health Data Breach") |
| `company` | Defendant company or breached organization |
| `lawsuit_type` | Always `data-breach` for this category |
| `status` | `investigating`, `filed`, or `open settlement` (inferred from article text) |
| `filed_date` | Month + year the case was filed or investigation opened (e.g. "May 2026") |
| `summary` | Lead paragraph describing the breach allegation |
| `impacted_data` | Comma-separated list of exposed data types (e.g. "Social Security Number, Date of Birth") |
| `eligibility` | Who qualifies to join or file a claim |
| `law_firm` | Law firm sponsoring the investigation |
| `source_url` | Full classaction.org detail page URL |
| `category` | `data-breach-lawsuits` |
Additional fields (incident_date, affected_count, jurisdiction, claim_deadline, potential_award, proof_required) are returned as null for investigation-stage pages where the data is not yet available.
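For downstream code it can help to pin the record shape down as a type. The following is a minimal sketch, not the scraper's actual schema: the class name `BreachCase`, the helper `impacted_list`, and all field types are assumptions (every field is treated as a string, with the additional fields optional), and the placeholder values in the example are illustrative only.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class BreachCase:
    # Core fields from the table above; string types are an assumption.
    name: str
    company: str
    lawsuit_type: str
    status: str
    filed_date: str
    summary: str
    impacted_data: str
    eligibility: str
    law_firm: str
    source_url: str
    category: str
    # Additional fields; None mirrors the null returned for
    # investigation-stage pages. Their real types are not documented.
    incident_date: Optional[str] = None
    affected_count: Optional[str] = None
    jurisdiction: Optional[str] = None
    claim_deadline: Optional[str] = None
    potential_award: Optional[str] = None
    proof_required: Optional[str] = None

    def impacted_list(self) -> List[str]:
        """Split the comma-separated impacted_data string into a list."""
        return [part.strip() for part in self.impacted_data.split(",") if part.strip()]


# Illustrative record: parenthesised values are placeholders, not real data.
example = BreachCase(
    name="Gastro Health Data Breach",
    company="Gastro Health",
    lawsuit_type="data-breach",
    status="investigating",
    filed_date="May 2026",
    summary="(lead paragraph)",
    impacted_data="Social Security Number, Date of Birth",
    eligibility="(eligibility text)",
    law_firm="(law firm name)",
    source_url="https://www.classaction.org/data-breach-lawsuits/(slug)",
    category="data-breach-lawsuits",
)
print(example.impacted_list())
```

Keeping the additional fields optional means an investigation-stage record can be constructed without special-casing the missing values.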
How it works
The scraper performs a two-hop crawl:
- Index page: fetches the full data breach listing at `/data-breach-lawsuits`, which renders all 100+ active cards in a single server-rendered HTML response. Each card provides the title, date, summary, and a link to the detail page.
- Detail pages: visits each `/data-breach-lawsuits/<slug>` page to enrich the record with full article content, law firm attribution, eligibility information, and inferred status.
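The two hops described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the scraper's real code: the names `DetailLinkParser`, `infer_status`, and `crawl` are hypothetical, the index is assumed to expose plain `<a href>` links to detail pages, and the keyword rules for status are invented for the example. The `fetch` callable is injected so the sketch runs offline against in-memory pages.

```python
from html.parser import HTMLParser
from typing import Callable, Dict, List
from urllib.parse import urljoin, urlparse

BASE_URL = "https://www.classaction.org"
INDEX_PATH = "/data-breach-lawsuits"


class DetailLinkParser(HTMLParser):
    """Collects unique link paths of the form /data-breach-lawsuits/<slug>."""

    def __init__(self) -> None:
        super().__init__()
        self.paths: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        parsed = urlparse(urljoin(BASE_URL + INDEX_PATH, href))
        path = parsed.path.rstrip("/")
        same_host = parsed.netloc == urlparse(BASE_URL).netloc
        if same_host and path.startswith(INDEX_PATH + "/") and path not in self.paths:
            self.paths.append(path)


def infer_status(article_text: str) -> str:
    """Hypothetical keyword heuristic; the real inference rules are not documented."""
    text = article_text.lower()
    if "settlement" in text:
        return "open settlement"
    if "filed" in text:
        return "filed"
    return "investigating"


def crawl(fetch: Callable[[str], str], max_items: int = 10) -> List[Dict[str, str]]:
    parser = DetailLinkParser()
    parser.feed(fetch(BASE_URL + INDEX_PATH))  # hop 1: index page
    paths = parser.paths if max_items == 0 else parser.paths[:max_items]
    records = []
    for path in paths:
        url = urljoin(BASE_URL, path)
        detail_html = fetch(url)  # hop 2: detail page
        records.append({
            "source_url": url,
            "lawsuit_type": "data-breach",
            "category": "data-breach-lawsuits",
            # For brevity the raw HTML is scanned, not the extracted article text.
            "status": infer_status(detail_html),
        })
    return records


# Offline stand-in for the site; slugs and page bodies are made up.
PAGES = {
    BASE_URL + INDEX_PATH: (
        '<a href="/data-breach-lawsuits/example-one">Example One</a>'
        '<a href="/data-breach-lawsuits/example-two">Example Two</a>'
        '<a href="/about">About</a>'
    ),
    BASE_URL + INDEX_PATH + "/example-one": "<p>Attorneys are investigating the incident.</p>",
    BASE_URL + INDEX_PATH + "/example-two": "<p>A class action lawsuit was filed.</p>",
}

records = crawl(PAGES.__getitem__, max_items=0)
for record in records:
    print(record["source_url"], record["status"])
```

Passing `fetch` as a parameter keeps the crawl logic separate from transport concerns such as proxy routing and retries.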
Input
| Parameter | Type | Description |
|---|---|---|
| `maxItems` | integer | Maximum number of records to return. Set to 0 to scrape all current cases. Default: 10 |
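The `maxItems` semantics (0 means no limit, default 10) could be expressed as below. This is a sketch only: the helper name `limit_items` is hypothetical, and rejecting negative values is an assumption, since the description does not say how they are handled.

```python
from itertools import islice
from typing import Iterable, Iterator, TypeVar

T = TypeVar("T")


def limit_items(items: Iterable[T], max_items: int = 10) -> Iterator[T]:
    """Yield at most max_items items; 0 means return everything."""
    if max_items < 0:
        # Assumption: negative values are treated as invalid input.
        raise ValueError("maxItems must be 0 or a positive integer")
    if max_items == 0:
        return iter(items)
    return islice(items, max_items)


# Example input object requesting the full current feed.
run_input = {"maxItems": 0}
cases = [f"case-{i}" for i in range(25)]  # stand-in for scraped records

print(len(list(limit_items(cases, run_input["maxItems"]))))  # all 25 stand-ins
print(len(list(limit_items(cases))))  # default cap of 10
```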
Usage notes
- The index page is refreshed continuously as new data breach lawsuits are filed. Running with `maxItems: 0` captures the full current feed (~100 records per run).
- Investigation-stage cases (status: `investigating`) do not yet have settlement deadlines, award amounts, or proof requirements; those fields are `null` until the case settles.
- The scraper uses residential proxy routing for reliable access to classaction.org.
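Because investigation-stage records carry `null` settlement fields, consumers may want to separate them from records that already have claim details. A minimal sketch, assuming records arrive as plain dictionaries keyed by the field names above; the helper name `missing_settlement_fields` and the sample records are illustrative, not real output.

```python
from typing import Any, Dict, List

SETTLEMENT_FIELDS = ("claim_deadline", "potential_award", "proof_required")


def missing_settlement_fields(record: Dict[str, Any]) -> List[str]:
    """Return the settlement-related fields that are null (None) or absent."""
    return [field for field in SETTLEMENT_FIELDS if record.get(field) is None]


# Placeholder records; parenthesised values stand in for real text.
sample = [
    {"name": "Example A", "status": "investigating",
     "claim_deadline": None, "potential_award": None, "proof_required": None},
    {"name": "Example B", "status": "open settlement",
     "claim_deadline": "(deadline)", "potential_award": "(award)",
     "proof_required": "(proof)"},
]

with_details = [r["name"] for r in sample if not missing_settlement_fields(r)]
print(with_details)
```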