DogTime Dog Breed Profile Scraper
Scrape complete breed profiles for all 550+ dog breeds listed on DogTime.com. Each record includes up to 25 numeric trait scores (1–5 scale), vital statistics, and detailed text sections — covering both AKC-recognized and rare/mixed breeds that other breed directories miss.
What you get
| Field group | Fields |
|---|---|
| Identity | breed_name, breed_slug, breed_group, akc_recognized |
| Vital stats | vital_stats (JSON: height, weight, lifespan, breed group) |
| All 25 trait scores | apartment_living, novice_owners, sensitivity_level, tolerates_being_alone, cold_tolerance, heat_tolerance, dog_friendly, cat_friendly, kid_friendly, stranger_friendly, amount_of_shedding, drooling_potential, easy_to_groom, general_health, potential_for_weight_gain, size, easy_to_train, intelligence, mouthiness_potential, prey_drive, tendency_to_bark_or_howl, wanderlust_potential, energy_level, exercise_needs, potential_for_playfulness |
| Full text | personality, history, care, feeding, coat_color_and_grooming, children_and_other_pets, description_long |
| Lists | highlights, health_concerns, rescue_groups_recommended |
| Meta | hero_image_url, profile_url, scraped_at |
Use cases
- Breed-matching apps — use the 25-axis numeric scorecard to recommend dogs by lifestyle (apartment size, activity level, family composition)
- Pet insurance underwriting — health scores, shedding potential, weight gain tendency
- Content enrichment — supplement AKC data with DogTime's richer trait grid covering 250+ breeds the AKC doesn't recognize
- Research / ML — structured numerical features across 550+ breeds for recommendation models
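As a rough illustration of the breed-matching use case, the sketch below ranks records by the mean gap between an owner's desired scores and each breed's scraped scores, skipping traits that are null. The `match_gap` and `rank_breeds` helpers, the preference values, and the second record are hypothetical; only the field names and the Basset Bleu de Gascogne scores come from this README.

```python
from typing import Optional


def match_gap(record: dict, prefs: dict[str, int]) -> Optional[float]:
    """Mean absolute gap between desired and scraped scores (lower is better).

    Traits that are missing or null in the record are skipped. Returns None
    when the record has no score for any requested trait.
    """
    gaps = [
        abs(record[trait] - wanted)
        for trait, wanted in prefs.items()
        if record.get(trait) is not None
    ]
    return sum(gaps) / len(gaps) if gaps else None


def rank_breeds(records: list[dict], prefs: dict[str, int]) -> list[tuple[float, str]]:
    """Return (gap, breed_name) pairs sorted from best to worst match."""
    ranked = []
    for record in records:
        gap = match_gap(record, prefs)
        if gap is not None:
            ranked.append((gap, record["breed_name"]))
    return sorted(ranked)


# First record uses scores from the example output; the second is made up.
records = [
    {"breed_name": "Basset Bleu de Gascogne", "apartment_living": 1,
     "kid_friendly": 5, "energy_level": None},
    {"breed_name": "Hypothetical Breed", "apartment_living": 5,
     "kid_friendly": 4, "energy_level": 2},
]

# Hypothetical owner preferences on the same 1-5 scale.
prefs = {"apartment_living": 5, "kid_friendly": 5, "energy_level": 2}

for gap, name in rank_breeds(records, prefs):
    print(f"{name}: {gap:.2f}")
```

Averaging over only the available traits keeps partially scored breeds comparable, though a real matcher would probably want per-trait weights as well.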
How it works
- Reads the dedicated `dog-breed-sitemap.xml` (556 URLs, updated monthly)
- Crawls each breed profile page with Cheerio (server-rendered HTML, no JS required)
- Extracts star ratings from the `xe-breed-card` accordion, breed stats from the sidebar, and text content by section heading
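The first step above can be sketched as follows. The actor itself runs on Cheerio, so this Python version only illustrates the idea. It assumes the sitemap uses the standard sitemaps.org `urlset` namespace and that `breed_slug` is the last path segment of the profile URL. The inline sample XML and its second URL are hypothetical.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# Assumption: the sitemap follows the standard sitemaps.org schema.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Tiny inline stand-in for dog-breed-sitemap.xml (second URL is made up).
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://dogtime.com/dog-breeds/basset-bleu-de-gascogne</loc></url>
  <url><loc>https://dogtime.com/dog-breeds/example-breed/</loc></url>
</urlset>"""


def breed_urls(sitemap_xml: str) -> list[str]:
    """Collect every <loc> URL listed in the sitemap, in document order."""
    root = ET.fromstring(sitemap_xml)
    return [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]


def breed_slug(url: str) -> str:
    """Derive the slug from the last path segment, ignoring a trailing slash."""
    return urlparse(url).path.rstrip("/").rsplit("/", 1)[-1]


for url in breed_urls(SAMPLE_SITEMAP):
    print(breed_slug(url), url)
```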
Note on format coverage: DogTime is migrating its most-popular breeds (e.g. Labrador, Golden Retriever) to a new prose-only layout. These pages will have all text fields populated but numeric trait scores will be null. The majority of the 556 breeds still use the original scored format.
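Downstream code may want to separate scored profiles from prose-only ones before using the trait grid. The sketch below treats a record as prose-only when all 25 trait fields are null or absent. That rule is an assumption based on the note above, not a flag the scraper emits, and `has_trait_scores` and `split_by_layout` are hypothetical helper names. A scored page can still have individual null traits, as `energy_level` does in the example output.

```python
# The 25 trait fields listed in the "What you get" table.
TRAIT_FIELDS = [
    "apartment_living", "novice_owners", "sensitivity_level",
    "tolerates_being_alone", "cold_tolerance", "heat_tolerance",
    "dog_friendly", "cat_friendly", "kid_friendly", "stranger_friendly",
    "amount_of_shedding", "drooling_potential", "easy_to_groom",
    "general_health", "potential_for_weight_gain", "size",
    "easy_to_train", "intelligence", "mouthiness_potential", "prey_drive",
    "tendency_to_bark_or_howl", "wanderlust_potential", "energy_level",
    "exercise_needs", "potential_for_playfulness",
]


def has_trait_scores(record: dict) -> bool:
    """True if at least one of the 25 trait fields holds a non-null value."""
    return any(record.get(field) is not None for field in TRAIT_FIELDS)


def split_by_layout(records: list[dict]) -> tuple[list[dict], list[dict]]:
    """Split records into (scored, prose_only) lists, preserving order."""
    scored = [r for r in records if has_trait_scores(r)]
    prose_only = [r for r in records if not has_trait_scores(r)]
    return scored, prose_only


# Hypothetical records: one partially scored, one with no scores at all.
sample = [
    {"breed_name": "Scored Example", "kid_friendly": 5, "energy_level": None},
    {"breed_name": "Prose-Only Example", "personality": "Some text..."},
]
scored, prose_only = split_by_layout(sample)
print(len(scored), "scored,", len(prose_only), "prose-only")
```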
Input
| Parameter | Type | Default | Description |
|---|---|---|---|
| `maxItems` | integer | 10 | Maximum number of breed profiles to scrape. Set to 0 for all 556. |
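The `maxItems` semantics described in the table can be sketched like this. How the actor applies the limit internally is an assumption; `apply_max_items` is a hypothetical helper, and the URL list is placeholder data.

```python
import json


def apply_max_items(urls: list[str], max_items: int = 10) -> list[str]:
    """Return the URLs to crawl: the first max_items, or all of them when 0."""
    if max_items < 0:
        raise ValueError("maxItems must be 0 (all) or a positive integer")
    return list(urls) if max_items == 0 else list(urls[:max_items])


# Placeholder URL list standing in for the 556 sitemap entries.
all_urls = [f"https://dogtime.com/dog-breeds/example-{i}" for i in range(556)]

# Input payload requesting every breed.
run_input = {"maxItems": 0}
print(json.dumps(run_input))
print(len(apply_max_items(all_urls, run_input["maxItems"])))
```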
Example output
{
"breed_name": "Basset Bleu de Gascogne",
"breed_slug": "basset-bleu-de-gascogne",
"breed_group": "Hound Dogs",
"akc_recognized": false,
"vital_stats": null,
"apartment_living": 1,
"kid_friendly": 5,
"dog_friendly": 5,
"easy_to_train": 2,
"intelligence": 4,
"energy_level": null,
"tendency_to_bark_or_howl": 5,
"personality": "The Basset Bleu de Gascogne is a playful breed who makes a great addition to a high energy family...",
"history": "The Basset Bleu de Gascogne originated in the Gascogny region of France...",
"hero_image_url": "https://dogtime.com/wp-content/uploads/sites/12/2022/07/GettyImages-1226718668-e1681236423656.jpg",
"profile_url": "https://dogtime.com/dog-breeds/basset-bleu-de-gascogne",
"scraped_at": "2026-06-12T19:40:43.047Z"
}
Pricing
Pay-per-event. A full 556-breed run typically costs under $1.