OrbTop

Cobasi Pet Product Catalog Scraper (Brazil)

Scrape the full product catalog from Cobasi, Brazil's largest pet retail chain. Extract prices, brands, ratings, and Clube Cobasi subscription discounts across ~150,000 products covering dogs, cats, birds, fish, and other animals.

What You Get

Each record includes:

Field                      Description
sku                        Numeric product ID (from JSON-LD)
product_name               Full product name
product_url                Canonical product page URL
category_path              Breadcrumb path (pipe-separated, e.g. Cachorro|Petiscos|Bifinhos)
brand                      Brand name (e.g. Royal Canin, Pedigree, Premier)
price_brl                  Regular price in Brazilian Reais
price_subscription_brl     Clube Cobasi subscription price (usually ~10% off)
subscription_discount_pct  Subscription discount percentage
in_stock                   Whether the product is currently available
rating_avg                 Average customer rating (0–5)
rating_count               Total number of reviews
weight                     Product weight extracted from URL slug (e.g. 15kg, 500g)
target_species             Animal species inferred from breadcrumbs (cachorro, gato, etc.)
target_life_stage          Life stage inferred from product name (filhote, adulto, senior)
target_size                Breed size inferred from product name (pequenas, medias, grandes)
flavor                     Flavor extracted from product name when present
images                     Product image URLs (pipe-separated)
scraped_at                 ISO timestamp of the crawl
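
The derived fields in the table can be illustrated with a minimal sketch. This is not the actor's actual code: the function names, the kg/g token pattern, the integer rounding of the discount, and the choice of the first breadcrumb as the species are all assumptions made for illustration.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# Assumption: a weight appears in the slug as its own hyphen-separated token,
# such as "65g" or "15kg". Other units or decimal weights are not handled.
WEIGHT_TOKEN = re.compile(r"\d+(?:kg|g)")


def weight_from_url(product_url: str) -> Optional[str]:
    """Return the first weight-like token in the URL slug, or None."""
    parts = urlparse(product_url).path.strip("/").split("/")
    slug = parts[0]  # path looks like "<slug>-<sku>/p"
    for token in slug.split("-"):
        if WEIGHT_TOKEN.fullmatch(token):
            return token
    return None


def subscription_discount_pct(price_brl: Optional[float],
                              price_subscription_brl: Optional[float]) -> Optional[int]:
    """Percentage saved with the subscription price, rounded to an integer."""
    if not price_brl or price_subscription_brl is None:
        return None
    return round((1 - price_subscription_brl / price_brl) * 100)


def species_from_breadcrumbs(category_path: str) -> Optional[str]:
    """Assumption: the first breadcrumb names the species."""
    first = category_path.split("|")[0].strip().lower()
    return first or None
```

With the values from the sample record, `subscription_discount_pct(4.49, 4.04)` evaluates to 10, which matches the `subscription_discount_pct` shown there.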

How It Works

  1. Reads Cobasi's sitemap index at /sitemap.xml to discover all 8 product sitemaps (~150k URLs total)
  2. Crawls each product detail page (/<slug>-<sku>/p)
  3. Extracts structured data from JSON-LD blocks on the page:
    • @type: "Product" for single-variant products
    • @type: "ProductGroup" for multi-variant products (selects the first in-stock variant)
  4. Extracts Clube Cobasi subscription pricing from the DOM
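
Step 3 can be sketched with the Python standard library alone. The sketch below assumes the page embeds JSON-LD in `<script type="application/ld+json">` tags, that variants sit under the schema.org `hasVariant` property, and that availability is a schema.org URL ending in `InStock`. The helper names are hypothetical, and the fallback to the first variant when nothing is in stock is an assumption, not documented behavior.

```python
import json
import re
from typing import Iterator, Optional

JSON_LD_RE = re.compile(
    r'<script[^>]*type=["\']application/ld\+json["\'][^>]*>(.*?)</script>',
    re.DOTALL | re.IGNORECASE,
)


def iter_json_ld(html: str) -> Iterator[object]:
    """Yield every JSON-LD node embedded in the page."""
    for match in JSON_LD_RE.finditer(html):
        try:
            data = json.loads(match.group(1))
        except json.JSONDecodeError:
            continue  # skip malformed blocks
        # A block may hold one object or a list of objects.
        yield from (data if isinstance(data, list) else [data])


def is_in_stock(product: dict) -> bool:
    """True when the (first) offer's availability ends in 'InStock'."""
    offers = product.get("offers") or {}
    if isinstance(offers, list):
        offers = offers[0] if offers else {}
    return str(offers.get("availability", "")).endswith("InStock")


def pick_product(html: str) -> Optional[dict]:
    """Return the Product node, or the first in-stock variant of a ProductGroup."""
    for node in iter_json_ld(html):
        if not isinstance(node, dict):
            continue
        if node.get("@type") == "Product":
            return node
        if node.get("@type") == "ProductGroup":
            variants = node.get("hasVariant") or []
            in_stock = [v for v in variants if is_in_stock(v)]
            # Assumed fallback: first variant when none is in stock.
            return (in_stock or variants or [None])[0]
    return None
```

A regex is enough for a sketch; a production crawler would more likely use an HTML parser so that unusual attribute quoting or nested markup does not break extraction.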

Input

Parameter  Type     Default  Description
maxItems   integer  10       Maximum number of products to scrape. Use 0 for all (~150k).
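
The `maxItems` semantics (a positive cap, with 0 meaning no cap) can be expressed as a small helper. This is a sketch of the documented behavior only; `limit_urls` is a hypothetical name, and how the actor applies the limit internally is not stated in this README.

```python
from itertools import islice
from typing import Iterable, Iterator


def limit_urls(urls: Iterable[str], max_items: int = 10) -> Iterator[str]:
    """Yield at most max_items URLs; 0 means no limit."""
    if max_items < 0:
        raise ValueError("max_items must be 0 (no limit) or a positive integer")
    return iter(urls) if max_items == 0 else islice(urls, max_items)
```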

Example Output

{
  "sku": "3792616",
  "product_name": "Petisco Bifinho Joy Beef Ossobuco",
  "product_url": "https://www.cobasi.com.br/petisco-bifinho-joy-beef-ossobuco-65g-3792616/p",
  "category_path": "Cachorro|Petiscos|Bifinhos",
  "brand": "Joy",
  "price_brl": 4.49,
  "price_subscription_brl": 4.04,
  "subscription_discount_pct": 10,
  "in_stock": true,
  "rating_avg": 4.7,
  "rating_count": 219,
  "weight": "65g",
  "target_species": "cachorro",
  "target_life_stage": null,
  "target_size": null,
  "flavor": null,
  "images": "https://cobasi.vteximg.com.br/arquivos/ids/1092503-194-194/JOY_COMBO_OSSOBUCO_65g.jpg",
  "scraped_at": "2026-05-30T15:00:00.000Z"
}
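
When consuming records like the one above, the pipe-separated fields and the timestamp usually need unpacking. A minimal sketch, assuming each record has already been loaded into a Python dict (`normalize_record` is a hypothetical helper, not part of the actor):

```python
from datetime import datetime


def normalize_record(record: dict) -> dict:
    """Split pipe-separated fields into lists and parse scraped_at."""
    out = dict(record)
    for key in ("category_path", "images"):
        value = record.get(key)
        out[key] = value.split("|") if value else []
    stamp = record.get("scraped_at")
    if stamp:
        # Replace the trailing "Z" so fromisoformat also works before Python 3.11.
        out["scraped_at"] = datetime.fromisoformat(stamp.replace("Z", "+00:00"))
    return out
```

Fields that are null in the output, such as `target_size`, pass through unchanged as `None`.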

Performance

  • Crawls ~10 pages per second with default concurrency settings
  • Memory usage: ~1 GB
  • A full catalog run (~150k products) completes in approximately 4 hours
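
The 4-hour figure follows directly from the stated throughput: 150,000 pages at 10 pages per second is 15,000 seconds. The helper below is a hypothetical back-of-the-envelope estimator that ignores retries, sitemap fetches, and startup time.

```python
def estimated_hours(total_pages: int, pages_per_second: float) -> float:
    """Rough crawl duration in hours at a constant page rate."""
    if pages_per_second <= 0:
        raise ValueError("pages_per_second must be positive")
    return total_pages / pages_per_second / 3600


print(round(estimated_hours(150_000, 10), 2))  # 4.17
```

The same function gives a quick estimate for partial runs, for example `estimated_hours(10_000, 10)` for a 10,000-product sample.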

Use Cases

  • Brand managers: Track SKU coverage, pricing, and ratings across Cobasi's catalog
  • Price monitoring: Compare regular vs. Clube Cobasi subscription pricing for competitive analysis
  • Market research: Analyze product mix and brand presence in Brazil's R$36B pet retail market
  • Distributor tracking: Monitor stock availability and price changes across categories

Notes

  • Cobasi is the dominant Brazilian pet retail chain following the Petz/Cobasi merger announced in 2024
  • The Clube Cobasi subscription discount (typically 10%) is exclusive to this actor and is not available in any other Apify Store scraper
  • Product weights, flavors, life stages, and breed sizes are best-effort inferences from product names (and, for weight, the URL slug) and may be null for some products