OrbTop

Decathlon Brasil Corrida Catalog Scraper

ECOMMERCE

Decathlon Brasil Corrida Catalog Scraper

Scrapes the complete Decathlon Brasil running category catalog — shoes, apparel, and accessories — and returns structured product data including prices (BRL), sizes, colors, stock status, and category metadata.

What it does

Crawls https://www.decathlon.com.br/esportes/corrida (the running category), paginating through all ~714 products across 30 pages. Product data is extracted directly from the server-rendered JSON embedded in each page (VTEX + Gatsby stack) — no JavaScript rendering required.

Covers Decathlon's house brands (Kalenji, Kiprun) as well as third-party brands stocked in the running category (Asics, Nike, etc.).

Output fields

Field Type Description
product_id string Decathlon product ID
product_sku string SKU / item ID for the specific variant
product_name string Full product name
product_url string Canonical product page URL
product_brand string Brand name (e.g. Kalenji, Kiprun, Asics)
product_category string Top-level category (e.g. Calcados, Roupas)
product_subcategory string Subcategory (e.g. Tenis, Jaquetas)
product_price_brl number Current selling price in BRL
product_original_price_brl number List price before discount in BRL
product_discount_pct number Discount percentage (0 if no discount)
product_rating number Product rating (1–5 scale)
product_review_count integer Review count (null — not exposed in catalog JSON)
product_color string Primary color of the first available variant
product_sizes_available array Available sizes across all variants
product_in_stock boolean True if any variant is available
product_image_url string Primary product image URL
product_description string Product description text
product_specs string JSON-encoded technical specs (sport, terrain, origin, etc.)

Input

Parameter Type Default Description
maxItems integer 10 Maximum number of products to return (set higher or remove cap for full catalog)

Usage

Set maxItems to the number of products you need. The running category has approximately 714 products across 30 pages (24 products per page). A full catalog run completes in under 2 minutes.

Notes

  • The actor fetches server-rendered HTML — no browser, no proxy required.
  • All product data (prices, sizes, availability) reflects the first variant of each product. Multiple color/size variants per product share the same product_id.
  • product_review_count is always null — the catalog API does not expose review counts.
  • The Decathlon Brasil store runs on VTEX with a Gatsby SSR frontend. Product data is embedded in window.pageData on every page.