OrbTop

Houseplant411 Care Guide Scraper

DEVELOPER TOOLS, EDUCATION

Extract structured indoor houseplant care data from Houseplant411.com — 140+ care guides covering light requirements, watering, humidity, temperature, soil, fertilizer, toxicity, pet safety, FAQ, and images.

What this scraper collects

For each indoor plant care guide:

Field               Description
id_plant            URL slug (primary key)
common_name         Common plant name
botanical_name      Scientific name (where available)
description         Introductory description
light_requirement   Normalized: low / medium / bright indirect / direct
light_detail        Full light care text
water_requirement   Watering instructions
humidity            low / average / high
temperature_range   Preferred temperature range
soil                Soil type requirements
fertilizer          Feeding schedule and type
propagation         Propagation method
repotting           Pot size and repotting guidance
pruning             Pruning instructions
flowering           Flowering information
common_problems     Disease and problem list
common_pests        Pest list and treatment
toxicity            toxic / non-toxic
toxicity_detail     Full toxicity text
is_pet_safe         Boolean: safe for pets
is_air_purifying    Boolean: air-purifying plant
faq                 FAQ entries (Q/A formatted strings)
image_urls          Image URLs from the guide
guide_url           Source URL
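
Because some fields are only filled in where the guide provides them (botanical_name is listed as "where available"), it can help to check which documented fields a given record actually carries. The following is a minimal sketch, not part of the scraper itself; DOCUMENTED_FIELDS and missing_fields are hypothetical names chosen for illustration.

```python
from typing import Any, Dict, List

# Field names copied from the table above; the tuple itself is a
# hypothetical helper, not something the scraper exports.
DOCUMENTED_FIELDS = (
    "id_plant", "common_name", "botanical_name", "description",
    "light_requirement", "light_detail", "water_requirement", "humidity",
    "temperature_range", "soil", "fertilizer", "propagation", "repotting",
    "pruning", "flowering", "common_problems", "common_pests", "toxicity",
    "toxicity_detail", "is_pet_safe", "is_air_purifying", "faq",
    "image_urls", "guide_url",
)


def missing_fields(record: Dict[str, Any]) -> List[str]:
    """Return documented fields that are absent or None in a record."""
    return [name for name in DOCUMENTED_FIELDS if record.get(name) is None]
```

Calling missing_fields on each scraped record gives a quick picture of how complete the data is before it is loaded into a database.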

Usage

Input

Parameter   Type      Default   Description
maxItems    integer   10        Maximum guides to scrape. Set to 0 for all ~140 guides.

Example input

{
  "maxItems": 50
}
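
The maxItems semantics (a positive cap, with 0 meaning "no cap") can be sketched as below. This is only an illustration of the documented behaviour under the assumption that the scraper works from a list of guide URLs; apply_max_items is a hypothetical helper, not the scraper's actual code.

```python
from typing import List, Sequence


def apply_max_items(urls: Sequence[str], max_items: int = 10) -> List[str]:
    """Limit a list of guide URLs the way maxItems is documented to work."""
    if max_items < 0:
        raise ValueError("max_items must be 0 or a positive integer")
    if max_items == 0:
        # 0 is documented as "scrape all guides".
        return list(urls)
    return list(urls[:max_items])
```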

Example output record

{
  "id_plant": "dracaena-marginata-how-to-grow-care-guide",
  "common_name": "Dracaena Marginata",
  "light_requirement": "medium",
  "light_detail": "Dracaena marginata plants prefer medium light, but survive in low light situations...",
  "water_requirement": "Over-watering causes root-rot and is the main reason a Dracaena marginata dies...",
  "humidity": "average",
  "temperature_range": "70-80 F",
  "soil": "Dracaena marginata plants grow well in a loose, quick-draining soil...",
  "toxicity": "toxic",
  "is_pet_safe": false,
  "faq": ["Q: Why are the leaves turning brown?\nA: Usually fluoride in tap water..."],
  "image_urls": ["https://houseplant411.com/wp-content/uploads/dracaena-m.jpg"],
  "guide_url": "https://houseplant411.com/houseplant/dracaena-marginata-how-to-grow-care-guide/"
}
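
Each faq entry is a single string that carries both the question and the answer. Assuming every entry follows the "Q: ...\nA: ..." shape shown in the example record, it could be split into a pair as sketched here; split_faq is a hypothetical helper, and entries in another shape would need extra handling.

```python
from typing import Tuple


def split_faq(entry: str) -> Tuple[str, str]:
    """Split a 'Q: ...\\nA: ...' string into (question, answer)."""
    # Everything before the first newline-plus-"A:" is the question part.
    question, _, answer = entry.partition("\nA:")
    question = question.strip()
    if question.startswith("Q:"):
        question = question[2:]
    return question.strip(), answer.strip()
```

If an entry contains no "A:" marker, the answer comes back as an empty string rather than raising an error.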

Use cases

  • Plant-care apps (Planta, Greg, Vera, Blossom) — enrich plant databases with per-plant care parameters
  • Pet-safety tools — filter plants by the is_pet_safe flag and the toxicity field (toxic / non-toxic)
  • RAG / LLM plant assistants — ground plant Q&A in structured care data and FAQ corpus
  • Plant marketplace enrichment — add care info to product listings
  • Indoor gardening content sites — import structured care data for content
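
As an illustration of the pet-safety use case, the sketch below keeps only the records that are explicitly flagged as safe. It assumes the records have been loaded as Python dictionaries with the field names documented above; pet_safe_guides is a hypothetical helper, and treating a missing flag as "not safe" is a deliberately conservative choice rather than documented scraper behaviour.

```python
from typing import Any, Dict, Iterable, List


def pet_safe_guides(records: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Keep records explicitly marked pet safe and non-toxic."""
    return [
        r for r in records
        # A missing or null flag is treated as unsafe (conservative assumption).
        if r.get("is_pet_safe") is True and r.get("toxicity") == "non-toxic"
    ]
```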

Coverage

  • ~140 indoor houseplant care guides
  • Popular plants: Pothos, Dracaena, Philodendron, Monstera, Ficus, Jade, Orchid, Snake Plant, Peace Lily, and more
  • Taxonomy: lighting level, ease of care, toxicity, plant type (floor, hanging, table, flowering, bonsai, clean-air)

Technical notes

  • Crawls post-sitemap.xml, filtered to /houseplant/ URLs
  • No JavaScript rendering required — fully server-rendered WordPress
  • No proxy needed — no bot wall observed
  • Respects rate limits via built-in concurrency control
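
The sitemap filtering step can be sketched roughly as follows. This is not the scraper's actual code: it assumes post-sitemap.xml follows the standard sitemaps.org urlset format, and guide_urls and slug_from_url are hypothetical names. The slug derivation mirrors the example record, where id_plant is the last path segment of guide_url.

```python
import xml.etree.ElementTree as ET
from typing import List
from urllib.parse import urlparse

# Standard sitemap namespace; assumed to apply to post-sitemap.xml.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def guide_urls(sitemap_xml: str) -> List[str]:
    """Return the <loc> URLs whose path starts with /houseplant/."""
    root = ET.fromstring(sitemap_xml)
    locs = [
        el.text.strip()
        for el in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if el.text
    ]
    return [u for u in locs if urlparse(u).path.startswith("/houseplant/")]


def slug_from_url(url: str) -> str:
    """Use the last path segment of a guide URL as the id_plant slug."""
    return urlparse(url).path.rstrip("/").rsplit("/", 1)[-1]
```

Fetching the sitemap is left out on purpose so the sketch stays free of network access; any HTTP client can supply the XML string.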