OrbTop

Ritchie Bros Auctioneers Scraper

ECOMMERCEBUSINESS

Ritchie Bros Auctioneers Scraper

Scrape heavy equipment auction listings from Ritchie Bros Auctioneers (rbauction.com). Extract lot details across all equipment categories including excavators, wheel loaders, cranes, tractors, and more — without an account or proxy.

What it does

  • Discovers up to 1,801 equipment categories via the rbauction.com sitemap
  • Extracts all lots from each category with full pagination (?from=N)
  • Returns 36 structured fields per lot: manufacturer, model, year, serial number, usage hours, auction dates, location, prices, and condition flags
  • No login required — all listing data is publicly accessible

Input

Field Type Description
maxItems integer Maximum number of lots to scrape. Leave blank to scrape all.
categories array List of category slugs to scrape (e.g. ["excavators", "wheel-loaders"]). Leave blank to scrape all 1,801 categories.
startUrls array Optional list of specific rbauction.com category or lot URLs. Overrides categories when provided.

Example input

{
  "maxItems": 100,
  "categories": ["excavators", "wheel-loaders", "crawler-dozers"]
}

Output

Each item in the dataset represents one auction lot:

Field Description
lot_id Lot item number
asset_id Asset GUID
url Direct URL to the lot page
title Full equipment description
manufacturer Equipment manufacturer
model Equipment model
year Year of manufacture
serial_number Serial number
usage_hours Usage hours
category Equipment category
asset_type Asset type (localized)
industry Primary industry
lot_number Auction lot number
buying_format Buying format (auction, buy now, etc.)
buying_platform Buying platform
auction_name Auction / sale event name
auction_date Auction start date (ISO 8601)
auction_end_date Auction end date (ISO 8601)
location_city Equipment location city
location_state Equipment location state/province
location_country Equipment location country
location_region Regional location label
location_coordinates Lat/lon as "lat,lon" string
start_price Starting bid price
price_currency Price currency code
image_url Primary equipment image URL
is_unused Whether the equipment is new/unused
is_in_yard Whether the equipment is in a yard
is_ironclad_assured Whether covered by IronClad Assurance
inoperable Whether the equipment is inoperable
inspection_status Inspection status
quick_response_id Quick response / lot reference ID
catalogue_notes Catalogue notes from seller
listing_status Current listing status (New, Open, Preview)
is_online_bidding Whether online bidding is enabled
scraped_at ISO 8601 timestamp when the record was scraped

Technical notes

  • Uses Chrome TLS fingerprinting (impit) to clear Cloudflare's fingerprint gate — no proxy needed
  • Discovery: /sitemaps/search yields 1,801 category slugs
  • Data is fully embedded in Next.js __NEXT_DATA__ on server-rendered pages
  • Pagination: ?from=N parameter, 60 records per page
  • Concurrency: 5 parallel requests with automatic rate-limit handling