OrbTop

ClassAction.org Data Breach Lawsuit Scraper

BUSINESSLEAD GENERATION

ClassAction.org Data Breach Lawsuit Scraper

Scrapes the ClassAction.org data breach lawsuits index and returns structured records for each active data breach lawsuit, investigation, and open settlement. Each run delivers the full current feed of data breach cases with eligibility info, law firm attribution, impacted data types, and lead summaries.

What you get

Field Description
name Full lawsuit / investigation title (e.g. "Gastro Health Data Breach")
company Defendant company or breached organization
lawsuit_type Always data-breach for this category
status investigating, filed, or open settlement (inferred from article text)
filed_date Month + year the case was filed or investigation opened (e.g. "May 2026")
summary Lead paragraph describing the breach allegation
impacted_data Comma-separated list of exposed data types (e.g. "Social Security Number, Date of Birth")
eligibility Who qualifies to join or file a claim
law_firm Law firm sponsoring the investigation
source_url Full classaction.org detail page URL
category data-breach-lawsuits

Additional fields (incident_date, affected_count, jurisdiction, claim_deadline, potential_award, proof_required) are returned as null for investigation-stage pages where the data is not yet available.

How it works

The scraper performs a two-hop crawl:

  1. Index page — fetches the full data breach listing at /data-breach-lawsuits, which renders all 100+ active cards in a single server-rendered HTML response. Each card provides the title, date, summary, and links to the detail page.
  2. Detail pages — visits each /data-breach-lawsuits/<slug> page to enrich the record with full article content, law firm attribution, eligibility information, and inferred status.

Input

Parameter Type Description
maxItems integer Maximum number of records to return. Set to 0 to scrape all current cases. Default: 10

Usage notes

  • The index page is refreshed continuously as new data breach lawsuits are filed. Running with maxItems: 0 captures the full current feed (~100 records per run).
  • Investigation-stage cases (status: investigating) do not yet have settlement deadlines, award amounts, or proof requirements — those fields are null until the case settles.
  • The scraper uses residential proxy routing for reliable access to classaction.org.