OrbTop

Proven Winners Plant Catalog Scraper

ECOMMERCE

Proven Winners Plant Catalog Scraper

Scrape the complete Proven Winners branded plant catalog — the dominant US branded-ornamental grower program — and capture structured data for ~2,600 trademarked varieties.

What This Actor Does

Proven Winners is the largest branded ornamental plant program in the US, sold at every major garden center. This actor walks the Proven Winners sitemap, fetches each plant detail page, and extracts the full structured trait dataset for every variety in the catalog.

Output includes:

  • Cultivar and common names, genus/species, botanical name
  • Plant type (annual / perennial / shrub / tropical), growth habit
  • Height and spacing ranges (in inches)
  • USDA hardiness zones, light and water requirements
  • Bloom times, flower and foliage colors
  • Container design roles: Thriller, Filler, or Spiller (proprietary PW classification)
  • Landscape uses, maintenance notes, care notes
  • US/Canadian plant patent numbers (USPP / PPAF)
  • Photo URLs, plant page URL

Who Needs This Data

  • Garden-design apps — Proven Winners is the source of the Thriller/Filler/Spiller container design framework; no generic plant database carries this proprietary classification
  • Nursery / garden center competitive intelligence — track the branded-cultivar catalog, new introductions, and series
  • Horticulture market analysts — monitor the largest branded ornamental program for trends, patent activity, and regional availability
  • Plant taxonomy enrichment — add patent numbers and branded-variety metadata to any generic plant dataset

Input Parameters

Parameter Type Default Description
maxItems integer 10 Maximum number of plant records to scrape. Set to 0 for the full catalog (~2,600 varieties).

Output Schema

Each record contains:

Field Type Description
plant_id string Drupal node ID — primary key
common_name string Common name (e.g. Panicle Hydrangea)
cultivar_name string Trademarked variety name (e.g. Bobo, Limelight)
botanical_name string Full botanical name (genus + species)
genus string Genus
species string Species epithet
series string PW marketing series (e.g. Supertunia, Endless Summer)
brand string Always "Proven Winners"
plant_type string Annual / Perennial / Shrub / Tropical
description string Marketing description and plant features
height_min_in number Minimum mature height in inches
height_max_in number Maximum mature height in inches
spacing_min_in number Minimum recommended spacing in inches
spacing_max_in number Maximum recommended spacing in inches
habit string Growth habit — Mounded, Upright, Trailing, etc.
usda_hardiness_zones string USDA zone range (e.g. "3a, 3b ... 9b")
light_requirement string Pipe-delimited light needs (e.g. "Full Sun | Part Sun")
water_requirement string Watering needs — Average, Low, High
bloom_time string Pipe-delimited bloom seasons (e.g. "Early Summer | Fall")
flower_colors string Pipe-delimited flower colors (e.g. "Pink | White")
foliage_colors string Pipe-delimited foliage colors
container_role string Thriller, Filler, or Spiller
landscape_uses string Pipe-delimited uses (e.g. "Container | Landscape | Edging Plant")
maintenance_notes string Cultural care instructions
pruning string Pruning / uses notes
features string Pipe-delimited traits (deer resistant, drought tolerant, etc.)
patent_info string USPP number or PPAF designation
image_urls string Pipe-delimited photo URLs
plant_url string Full URL to the plant detail page

Sample Output

{
  "plant_id": "272528",
  "common_name": "Panicle Hydrangea",
  "cultivar_name": "Bobo",
  "botanical_name": "Hydrangea paniculata",
  "genus": "Hydrangea",
  "species": "paniculata",
  "series": null,
  "brand": "Proven Winners",
  "plant_type": "Shrub",
  "height_min_in": 30,
  "height_max_in": 36,
  "spacing_min_in": 48,
  "spacing_max_in": 60,
  "habit": "Mounded",
  "usda_hardiness_zones": "3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a, 7b, 8a, 8b, 9a, 9b",
  "light_requirement": "Part Sun to Sun | Sun",
  "water_requirement": "Average",
  "bloom_time": "Early Summer | Mid Summer | Late Summer | Fall",
  "flower_colors": "Pink | White",
  "foliage_colors": "Green",
  "container_role": "Thriller",
  "landscape_uses": "Container | Landscape",
  "patent_info": "USPP 22,782",
  "plant_url": "https://www.provenwinners.com/plants/hydrangea/bobo-panicle-hydrangea-hydrangea-paniculata"
}

Notes

  • The full catalog (~2,600 varieties) runs to completion in roughly 2-4 hours at the default concurrency of 5, respecting the site's robots.txt crawl-delay: 10 guidance.
  • The where_to_buy_count field is always null — retailer inventory is loaded dynamically by a JavaScript web component and is not present in the static HTML.
  • The fertilize field is always null — fertilizing instructions appear only on /culture subpages, which robots.txt disallows.