OrbTop

State PE / RA License Scraper - Engineers and Architects

LEAD GENERATIONBUSINESSOTHER

State PE and RA License Scraper

Scrape Professional Engineer (PE) and Registered Architect (RA) license records from US state boards. Phase 1 covers California, Texas, New York, Florida, and Illinois — five high-volume states with structured online registers. Returns license number, status, expiration, PE discipline, firm name, and business address.


State License Scraper Features

  • Pulls from five state boards in one run — CA, TX, NY, FL, IL
  • Returns license number, status (Active, Inactive, Expired, Suspended, Revoked), and renewal date
  • PE discipline broken out (civil, mechanical, electrical, structural, etc.) where the board reports it
  • Optional last-name and license-status filters at the input stage
  • Profession switch — PE for engineers, RA for architects (states that issue both register both)
  • No proxy required — every Phase 1 source is a public state register

Who Uses State PE/RA License Data?

  • AEC staffing firms — Verify PE/RA credentials on candidate slates against the authoritative state board, not a self-reported LinkedIn field
  • A&E insurance underwriters — Confirm license status and discipline before binding professional-liability coverage
  • Construction-tech CRMs — Enrich engineer/architect contacts with license expiration so renewals can drive a workflow
  • Procurement and prequalification teams — Validate a contractor's professional headcount has the licenses they claim
  • Compliance and audit teams — Cross-check expired or revoked licenses against project rosters

How the State License Scraper Works

  1. Pick profession and statesPE or RA, plus any subset of the five Phase 1 states. Default is all five.
  2. Per-state handlers — Each state has its own data source — Socrata API (IL), bulk CSV (TX), public-form HTML (CA, FL), and the rosa V2 API (NY). The scraper wires up the right handler per state.
  3. FilteringlastName and licenseStatus apply at the state-board level where supported. Where the board doesn't support filtering, the scraper post-filters the result set.
  4. Export — One record per license in your Apify dataset with a canonical schema regardless of source.

Input

All Phase 1 states, all active PEs

{
  "profession": "PE",
  "states": ["CA", "TX", "NY", "FL", "IL"],
  "licenseStatus": "active",
  "maxItems": 0
}

Architects in New York and Illinois

{
  "profession": "RA",
  "states": ["NY", "IL"],
  "maxItems": 500
}

Name search across Texas and California

{
  "profession": "PE",
  "states": ["CA", "TX"],
  "lastName": "Smith",
  "maxItems": 200
}
Field Type Default Description
profession string PE PE (Professional Engineer) or RA (Registered Architect).
states array All five Phase 1 states Subset of CA, TX, NY, FL, IL.
lastName string Last-name substring filter where supported by the board.
licenseStatus string active, inactive, suspended, revoked, expired, or empty for all.
maxItems integer 10 Cap on total records across all selected states.
proxyConfiguration object none Proxy settings. Off by default.

State License Scraper Output Fields

{
  "profession": "PE",
  "first_name": "Jane",
  "middle_name": "M",
  "last_name": "Smith",
  "license_number": "PE-87421",
  "license_state": "CA",
  "discipline": "Civil",
  "license_status": "Active",
  "license_issue_date": "2014-06-22",
  "license_expiration_date": "2026-06-30",
  "firm_name": "Bechtel Engineering Inc.",
  "business_city": "San Francisco",
  "business_state": "CA",
  "business_zip": "94104",
  "source_state_board": "California Department of Consumer Affairs",
  "source_url": "https://search.dca.ca.gov/details/...",
  "disciplinary_actions": ""
}
Field Type Description
profession string PE or RA
first_name string First name
middle_name string Middle name or initial
last_name string Last name
license_number string License number, unique within the issuing state
license_state string Issuing state (two-letter code)
discipline string PE discipline (civil, mechanical, electrical, structural, etc.)
license_status string Current status (Active, Inactive, Expired, Suspended, Revoked)
license_issue_date string Date the license was originally issued
license_expiration_date string License expiration or renewal date
firm_name string Employer or firm name
business_city string Business address city
business_state string Business address state
business_zip string Business address ZIP code
source_state_board string State board that issued the license
source_url string URL of the source record on the state board
disciplinary_actions string Pipe-delimited disciplinary actions (`type

FAQ

How do I verify a PE or RA license against the state board?

State License Scraper hits the five Phase 1 state boards directly. Set profession, the states list, and any name or status filter, then run. Output is one record per license with the source URL kept intact for human verification.

How much does this actor cost to run?

State License Scraper uses pay-per-event pricing on the default_2603_basic profile at a 1.0x coefficient. No proxy fees. A nationwide active-license pull across the five states costs a few dollars in platform fees.

Which states are covered?

State License Scraper Phase 1 covers California (DCA), Texas (TBPE roster CSV), New York (rosa V2 API for both PE and RA), Florida (DBPR), and Illinois (IDFPR Socrata). Additional states follow the same multi-handler pattern and can be added.

Can I filter by discipline or firm?

State License Scraper exposes name and status filters at input. Discipline (PE specialty) and firm name come back as fields on every record, so downstream filtering by discipline = 'Structural' or firm_name LIKE '%Bechtel%' is straightforward.

Does this actor need proxies?

State License Scraper runs proxy-free. All Phase 1 sources are public state registers that accept direct traffic at the volumes this actor produces.


Need More Features?

Need additional states (PA, OH, WA, GA, etc.), NCEES Record / NCARB Record cross-matching, or scheduled renewal-deadline alerts? Open an issue or get in touch.

Why Use State License Scraper?

  • Authoritative source per state — Each handler pulls from the state board's own register or its bulk export. No middleman, no stale aggregator data.
  • Canonical schema across boards — Every state's record maps to the same columns. Downstream you don't care whether it came from a Socrata API or a public form.
  • License plus business address — Returns firm name and business city/state/zip when the board publishes them, which turns a license check into a usable B2B record.