OrbTop

Keep Fitness Course Catalog Scraper

Scrapes Keep (gotokeep.com) — China's largest fitness platform, 200M+ users. Returns course listings across yoga, meditation, qigong, HIIT, dance, and dozens of other categories with title, difficulty, participant count, equipment, and cover image in clean JSON.

Keep is the WeChat of fitness for mainland China. Its web course catalog has no public API, so the data has to be scraped from the site.

What You Get

Each record is a fitness course from Keep's curated category groups.

| Field | Type | Description |
|---|---|---|
| course_id | string | Unique plan ID extracted from the URL |
| title | string | Course title in Chinese |
| category | string | Category group name (from Keep's workout hashtag collection) |
| category_url | string | URL of the category this course belongs to |
| difficulty | string | Difficulty level: K1 (beginner) through K5 (advanced) |
| equipment_required | string[] | Equipment needed (e.g. 瑜伽垫/yoga mat, 哑铃/dumbbells) |
| total_participants | integer | Number of people who have trained with this course |
| cover_image_url | string | Course cover image URL (CDN, direct) |
| source_url | string | Full URL of the course plan page |
| scraped_at | string | ISO timestamp of when the record was scraped |
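
If you load records into Python, a small typed wrapper can make the field list above easier to work with. The sketch below mirrors the documented fields; the `CourseRecord` class, the `difficulty_rank` helper, and every value in the sample record are made up for illustration and do not come from a real run. It also assumes `difficulty` is always one of K1 to K5, which the actor may not guarantee.

```python
from dataclasses import dataclass
from typing import List

# Keep's K-scale as described in this README, easiest first.
K_LEVELS = ("K1", "K2", "K3", "K4", "K5")


@dataclass
class CourseRecord:
    """One output record; field names follow the table above."""
    course_id: str
    title: str
    category: str
    category_url: str
    difficulty: str
    equipment_required: List[str]
    total_participants: int
    cover_image_url: str
    source_url: str
    scraped_at: str


def difficulty_rank(record: CourseRecord) -> int:
    """Map K1..K5 to 1..5 so records can be sorted by difficulty."""
    if record.difficulty not in K_LEVELS:
        raise ValueError(f"unexpected difficulty: {record.difficulty!r}")
    return K_LEVELS.index(record.difficulty) + 1


# Placeholder values only: the URLs, ID, and counts are invented.
sample = CourseRecord(
    course_id="example-plan-id",
    title="示例课程",
    category="瑜伽",
    category_url="https://example.com/category",
    difficulty="K3",
    equipment_required=["瑜伽垫"],
    total_participants=12345,
    cover_image_url="https://example.com/cover.jpg",
    source_url="https://example.com/plans/example-plan-id",
    scraped_at="2024-01-01T00:00:00Z",
)
```

Building the dataclass from a dataset item is then `CourseRecord(**item)`, provided the item carries exactly these ten keys.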

Input Parameters

| Parameter | Type | Default | Description |
|---|---|---|---|
| maxItems | integer | 0 (no limit) | Maximum number of courses to scrape. Use 0 for the full catalog. |
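
The only subtlety in the input is that 0 means "no limit" rather than "zero items". The sketch below shows that convention in isolation; `limit_items` is a hypothetical helper written for this README, not the scraper's own code.

```python
from itertools import islice
from typing import Iterable, Iterator, TypeVar

T = TypeVar("T")


def limit_items(items: Iterable[T], max_items: int = 0) -> Iterator[T]:
    """Yield at most max_items items; 0 means no limit."""
    if max_items < 0:
        raise ValueError("maxItems must be 0 or a positive integer")
    if max_items == 0:
        return iter(items)          # full catalog
    return islice(items, max_items)  # stop after max_items


# A run input that would cap the run at 50 courses.
run_input = {"maxItems": 50}
```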

How It Works

Keep's fitness catalog is organized as curated workout hashtag groups. The scraper seeds from /training (224 category groups), visits each group page, and extracts all course cards. No browser required — the catalog is server-rendered.
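
The seed, group, course flow described above can be sketched with the standard library alone. This is a minimal illustration, not the actor's implementation: the `fetch` callable is injected by the caller, the path prefixes passed to `crawl` are assumptions about how Keep's links look, and taking the last URL path segment as `course_id` is a guess at what "extracted from the URL" means.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect absolute <a href> links whose path starts with a prefix."""

    def __init__(self, base_url, path_prefix):
        super().__init__()
        self.base_url = base_url
        self.path_prefix = path_prefix
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        url = urljoin(self.base_url, href)
        if urlparse(url).path.startswith(self.path_prefix):
            self.links.append(url)


def extract_links(html, base_url, path_prefix):
    collector = LinkCollector(base_url, path_prefix)
    collector.feed(html)
    return collector.links


def course_id_from_url(url):
    """Assumption: the plan ID is the last path segment of the URL."""
    return urlparse(url).path.rstrip("/").rsplit("/", 1)[-1]


def crawl(fetch, seed_url, group_prefix, course_prefix):
    """Visit every group linked from the seed page and return one
    record per unique course URL (first category seen wins)."""
    seen = set()
    records = []
    for group_url in extract_links(fetch(seed_url), seed_url, group_prefix):
        html = fetch(group_url)
        for course_url in extract_links(html, group_url, course_prefix):
            if course_url in seen:
                continue  # course already found in an earlier group
            seen.add(course_url)
            records.append({
                "course_id": course_id_from_url(course_url),
                "category_url": group_url,
                "source_url": course_url,
            })
    return records
```

Because the pages are server-rendered, `fetch` can be any plain HTTP GET that returns the response body as a string; no headless browser is involved in this sketch.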

Full run coverage: ~224 category groups × ~5 courses per group on average ≈ 1,120 course cards, which comes to roughly 1,000+ unique course records after deduplication.

Note: fields like instructor details, calorie burn, rating, and price are available only through Keep's authenticated mobile API and are not accessible from the public website.

Use Cases

  • Wellness brand market research — Understand which course categories dominate Keep's CN audience
  • Fitness app competitive analysis — Track participant counts and content gaps vs. Western platforms (Peloton, Apple Fitness+)
  • Influencer and trainer discovery — Identify course creators by scraping plan titles and cross-referencing with social profiles
  • Athleisure activation intelligence — Build CN fitness market snapshots for brands entering the market (Lululemon, Nike Yoga, Decathlon)
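
For the market research use case, a first pass is often just ranking categories by total participants. The sketch below assumes records are plain dicts with the documented `category` and `total_participants` fields; `participants_by_category` is a hypothetical helper. Two caveats: a deduplicated course is counted under one category only, and summing participants across courses counts the same person more than once if they trained with several courses.

```python
from collections import defaultdict


def participants_by_category(records):
    """Return (category, total_participants) pairs, largest first."""
    totals = defaultdict(int)
    for record in records:
        totals[record["category"]] += record["total_participants"]
    return sorted(totals.items(), key=lambda pair: pair[1], reverse=True)
```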

Notes

  • Keep's CDN throttles burst requests, so the scraper runs at 3 concurrent requests with automatic rate-limit backoff.
  • Some courses appear in multiple category groups. CoreCrawler deduplicates URLs automatically — you won't get duplicate records.
  • Difficulty ratings follow Keep's K-scale: K1 (easiest) → K5 (most demanding).
  • Participant counts reflect totals across all time — popular courses like Pamela Reif's stretch series exceed 4 million.
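
If you build your own fetcher against Keep, the concurrency cap and backoff from the first note can be approximated as follows. This is a sketch under assumptions: the exponential delay with jitter, the retry count, and the `RateLimited` exception are illustrative choices, not the scraper's actual settings, and `fetch` is a coroutine you supply.

```python
import asyncio
import random


class RateLimited(Exception):
    """Raised by the caller's fetch when the server signals throttling."""


async def fetch_with_backoff(fetch, url, semaphore,
                             max_retries=4, base_delay=1.0):
    for attempt in range(max_retries + 1):
        async with semaphore:  # at most N requests in flight
            try:
                return await fetch(url)
            except RateLimited:
                if attempt == max_retries:
                    raise
        # Sleep outside the semaphore so the slot is free for others.
        delay = base_delay * 2 ** attempt + random.uniform(0, base_delay)
        await asyncio.sleep(delay)


async def fetch_all(fetch, urls, concurrency=3,
                    max_retries=4, base_delay=1.0):
    """Fetch all URLs with a fixed concurrency cap, in input order."""
    semaphore = asyncio.Semaphore(concurrency)
    tasks = [
        fetch_with_backoff(fetch, url, semaphore, max_retries, base_delay)
        for url in urls
    ]
    return await asyncio.gather(*tasks)
```

Releasing the semaphore before sleeping means a throttled request does not block the other two slots while it waits.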

Built by OrbTop — Apify scrapers for professional data teams.