Houseplant411 Care Guide Scraper
DEVELOPER TOOLSEDUCATION
Houseplant411 Care Guide Scraper
Extract structured indoor houseplant care data from Houseplant411.com — 140+ care guides covering light requirements, watering, humidity, temperature, soil, fertilizer, toxicity, pet safety, FAQ, and images.
What this scraper collects
For each indoor plant care guide:
| Field | Description |
|---|---|
id_plant |
URL slug (primary key) |
common_name |
Common plant name |
botanical_name |
Scientific name (where available) |
description |
Introductory description |
light_requirement |
Normalized: low / medium / bright indirect / direct |
light_detail |
Full light care text |
water_requirement |
Watering instructions |
humidity |
low / average / high |
temperature_range |
Preferred temperature range |
soil |
Soil type requirements |
fertilizer |
Feeding schedule and type |
propagation |
Propagation method |
repotting |
Pot size and repotting guidance |
pruning |
Pruning instructions |
flowering |
Flowering information |
common_problems |
Disease and problem list |
common_pests |
Pest list and treatment |
toxicity |
toxic / non-toxic |
toxicity_detail |
Full toxicity text |
is_pet_safe |
Boolean — safe for pets |
is_air_purifying |
Boolean — air purifying plant |
faq |
FAQ entries (Q/A formatted strings) |
image_urls |
Image URLs from the guide |
guide_url |
Source URL |
Usage
Input
| Parameter | Type | Default | Description |
|---|---|---|---|
maxItems |
integer | 10 | Maximum guides to scrape. Set to 0 for all ~140 guides. |
Example input
{
"maxItems": 50
}
Example output record
{
"id_plant": "dracaena-marginata-how-to-grow-care-guide",
"common_name": "Dracaena Marginata",
"light_requirement": "medium",
"light_detail": "Dracaena marginata plants prefer medium light, but survive in low light situations...",
"water_requirement": "Over-watering causes root-rot and is the main reason a Dracaena marginata dies...",
"humidity": "average",
"temperature_range": "70-80 F",
"soil": "Dracaena marginata plants grow well in a loose, quick-draining soil...",
"toxicity": "toxic",
"is_pet_safe": false,
"faq": ["Q: Why are the leaves turning brown?\nA: Usually fluoride in tap water..."],
"image_urls": ["https://houseplant411.com/wp-content/uploads/dracaena-m.jpg"],
"guide_url": "https://houseplant411.com/houseplant/dracaena-marginata-how-to-grow-care-guide/"
}
Use cases
- Plant-care apps (Planta, Greg, Vera, Blossom) — enrich plant databases with per-plant care parameters
- Pet-safety tools — filter plants by
is_pet_safeflag and toxicity level - RAG / LLM plant assistants — ground plant Q&A in structured care data and FAQ corpus
- Plant marketplace enrichment — add care info to product listings
- Indoor gardening content sites — import structured care data for content
Coverage
- ~140 indoor houseplant care guides
- Popular plants: Pothos, Dracaena, Philodendron, Monstera, Ficus, Jade, Orchid, Snake Plant, Peace Lily, and more
- Taxonomy: lighting level, ease of care, toxicity, plant type (floor, hanging, table, flowering, bonsai, clean-air)
Technical notes
- Crawls
post-sitemap.xml, filtered to/houseplant/URLs - No JavaScript rendering required — fully server-rendered WordPress
- No proxy needed — no bot wall observed
- Respects rate limits via built-in concurrency control