Decathlon Brasil Corrida Catalog Scraper
Scrapes the complete Decathlon Brasil running category catalog — shoes, apparel, and accessories — and returns structured product data including prices (BRL), sizes, colors, stock status, and category metadata.
What it does
Crawls https://www.decathlon.com.br/esportes/corrida (the running category), paginating through all ~714 products across 30 pages. Product data is extracted directly from the server-rendered JSON embedded in each page (VTEX + Gatsby stack) — no JavaScript rendering required.
Covers Decathlon's house brands (Kalenji, Kiprun) as well as third-party brands stocked in the running category (Asics, Nike, etc.).
Output fields
| Field | Type | Description |
|---|---|---|
| `product_id` | string | Decathlon product ID |
| `product_sku` | string | SKU / item ID for the specific variant |
| `product_name` | string | Full product name |
| `product_url` | string | Canonical product page URL |
| `product_brand` | string | Brand name (e.g. Kalenji, Kiprun, Asics) |
| `product_category` | string | Top-level category (e.g. Calcados, Roupas) |
| `product_subcategory` | string | Subcategory (e.g. Tenis, Jaquetas) |
| `product_price_brl` | number | Current selling price in BRL |
| `product_original_price_brl` | number | List price before discount in BRL |
| `product_discount_pct` | number | Discount percentage (0 if no discount) |
| `product_rating` | number | Product rating (1–5 scale) |
| `product_review_count` | integer | Review count (always null; not exposed in catalog JSON) |
| `product_color` | string | Primary color of the first available variant |
| `product_sizes_available` | array | Available sizes across all variants |
| `product_in_stock` | boolean | True if any variant is available |
| `product_image_url` | string | Primary product image URL |
| `product_description` | string | Product description text |
| `product_specs` | string | JSON-encoded technical specs (sport, terrain, origin, etc.) |
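
Two of the fields above usually need a little post-processing on the consumer side: `product_specs` arrives as a JSON-encoded string, and `product_discount_pct` can be cross-checked against the two price fields. The sketch below is a minimal illustration, not the actor's own code. The helper names `decode_specs` and `discount_pct` are hypothetical, and the discount formula and two-decimal rounding are assumptions, since the actor's exact calculation is not documented here.

```python
import json
from typing import Any, Optional


def decode_specs(record: dict[str, Any]) -> dict[str, Any]:
    """Decode the JSON-encoded `product_specs` string into a dict.

    Returns an empty dict when the field is missing or empty.
    """
    raw = record.get("product_specs")
    if not raw:
        return {}
    return json.loads(raw)


def discount_pct(price: Optional[float], original: Optional[float]) -> float:
    """Recompute a discount percentage from the two price fields.

    Assumption: discount = (original - price) / original * 100, rounded to
    two decimals. Returns 0.0 when either price is missing or when there is
    no discount, matching the "0 if no discount" convention in the table.
    """
    if price is None or original is None or original <= 0 or price >= original:
        return 0.0
    return round((original - price) / original * 100, 2)
```

A recomputed value that differs from `product_discount_pct` by a small amount most likely reflects a different rounding rule rather than a data error.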
Input
| Parameter | Type | Default | Description |
|---|---|---|---|
| `maxItems` | integer | 10 | Maximum number of products to return (set higher or remove cap for full catalog) |
Usage
Set `maxItems` to the number of products you need. The running category has approximately 714 products across 30 pages (24 products per page). A full catalog run completes in under 2 minutes.
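
To estimate how many catalog pages a given `maxItems` value touches, the figures above (24 products per page, 30 pages) are enough. This is a rough sketch. The function name `pages_needed` is hypothetical, and the page count is a snapshot that will drift as the catalog changes.

```python
import math

PRODUCTS_PER_PAGE = 24  # figure quoted in this README
TOTAL_PAGES = 30        # figure quoted in this README; changes with the catalog


def pages_needed(max_items: int) -> int:
    """Number of category pages required to collect `max_items` products."""
    if max_items <= 0:
        return 0
    # Round up to whole pages, but never beyond the last catalog page.
    return min(math.ceil(max_items / PRODUCTS_PER_PAGE), TOTAL_PAGES)
```

With the default `maxItems` of 10, a single page is enough; any value above roughly 696 reaches the final page.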
Notes
- The actor fetches server-rendered HTML — no browser, no proxy required.
- All product data (prices, sizes, availability) reflects the first variant of each product. Multiple color/size variants per product share the same `product_id`.
- `product_review_count` is always `null`; the catalog API does not expose review counts.
- The Decathlon Brasil store runs on VTEX with a Gatsby SSR frontend. Product data is embedded in `window.pageData` on every page.
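
As a rough illustration of reading the embedded data without a browser, the sketch below pulls the object assigned to `window.pageData` out of a page's HTML. It assumes the data appears as a plain `window.pageData = {...}` assignment inside a script tag; the real markup may differ, and the helper name `extract_page_data` is hypothetical. The structure inside the object is not documented here, so the sketch stops at returning the parsed dict.

```python
import json
import re

# Assumption: the page contains a literal `window.pageData = {...}` assignment.
_PAGE_DATA_RE = re.compile(r"window\.pageData\s*=\s*")


def extract_page_data(html: str) -> dict:
    """Return the JSON object assigned to window.pageData in `html`."""
    match = _PAGE_DATA_RE.search(html)
    if match is None:
        raise ValueError("window.pageData assignment not found")
    # raw_decode parses exactly one JSON value starting at the given index
    # and ignores whatever follows it (e.g. `;</script>`).
    data, _end = json.JSONDecoder().raw_decode(html, match.end())
    if not isinstance(data, dict):
        raise ValueError("window.pageData is not a JSON object")
    return data
```

Using `raw_decode` instead of a regex over the whole object avoids having to guess where the JSON ends, which matters when the payload contains nested braces or `</script>`-like strings.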