Bicycling Magazine Gear Reviews Product Scraper
Bicycling Magazine Gear Reviews Scraper
Scrapes gear review articles from Bicycling Magazine — the dominant US cycling editorial brand. Returns article metadata, product details, author bylines, publish dates, body text, and affiliate/retailer links for every review in the /bikes-gear/ section.
Pairs with BikeRadar Gear Review Scraper for full English-language geo coverage — US + UK.
What You Get
Each record is one article from the bikes-gear section. The schema covers the fields affiliate networks, comparison engines, and brand monitoring teams actually need.
| Field | Type | Notes |
|---|---|---|
article_id |
string | Hearst article ID from URL (/a{id}/) |
title |
string | Article headline |
product_name |
string | Product/article name from OG metadata |
brand |
string | Detected brand name from title |
category |
string | Section tag (Bikes & Gear, Mountain Bikes, etc.) |
rating |
number | Structured rating if present |
author |
string | Reviewer byline |
publish_date |
string | ISO 8601 publication date |
msrp_usd |
number | MSRP extracted from body text, when stated |
body_text |
string | Full article body (up to 50K chars) |
pros |
string | Pipe-separated pros list, when structured |
cons |
string | Pipe-separated cons list, when structured |
hero_image |
string | Article hero image URL (Hearst CDN) |
retailer_links |
string | Affiliate/retailer links, pipe-delimited (text|url) |
source_url |
string | Canonical article URL |
How It Works
Discovery uses two sources: the Google News sitemap (sitemap_google_news.xml) for recent articles, and the section listing pages (/bikes-gear/, /mountain-bikes/, /road-bikes/) for broader archive coverage. Each article page is fetched once. No headless browser required — Bicycling.com renders server-side via Next.js.
Structured metadata comes from the JSON-LD NewsArticle block embedded in every article. Retailer and affiliate links are extracted from anchor tags matching Amazon, REI, Walmart, and a handful of cycling-specific retailers. Pros and cons come from structured <h2>/<ul> patterns when the review uses them.
No proxy needed. Bicycling.com has no Cloudflare protection and no paywall for gear content.
Input
| Parameter | Type | Default | Description |
|---|---|---|---|
maxItems |
integer | 10 | Maximum articles to return |
Set maxItems to 0 for no cap (returns all discoverable articles from the listing pages and news sitemap).
Example Output
{
"article_id": "71536453",
"title": "Which Road Tire Size Is Fastest?",
"product_name": "I Tested 25, 28, 30, and 32mm Road Tires. One Size Was Nearly 6 Seconds Per Mile Faster.",
"brand": "Schwalbe",
"category": "Bikes & Gear",
"rating": null,
"author": "Dan Chabanov",
"publish_date": "2026-06-09T15:35:35.842216Z",
"msrp_usd": null,
"body_text": "It's safe to say that bike tires have been trending wider...",
"pros": null,
"cons": null,
"hero_image": "https://hips.hearstapps.com/hmg-prod/images/209fd3e8-bb9c-40cb-a29b-29c930bacc1e.jpg",
"retailer_links": "Schwalbe Pro One|https://www.amazon.com/dp/B0D5LRMRSB?tag=bicycling-auto-20",
"source_url": "https://www.bicycling.com/bikes-gear/a71536453/road-tire-size-test/"
}
Use Cases
Affiliate network monitoring — Track which products Bicycling links to and through which retailers. The US-side complement to BikeRadar for complete English-language coverage.
Comparison engine data — Pull MSRP, ratings, and product names from editorial reviews to populate product comparison pages.
Brand monitoring — Monitor how and when your brand appears in Bicycling's coverage, alongside the competitive set.
Content research — Access the full body text corpus for NLP, trend analysis, or training data.
Coverage Notes
Discovery covers:
- Recent articles via the Google News sitemap (last few days)
- All articles visible on
/bikes-gear/,/mountain-bikes/, and/road-bikes/listing pages (~50–60 articles each, server-rendered)
The site does not expose a full historical article sitemap. The listing pages show the most recent articles per section. For deep archive access (3000+ articles), run repeatedly with date filtering on publish_date.
Legal
This scraper accesses only publicly available content from bicycling.com. No authentication, paywalled content, or personal data is involved. Affiliate links are extracted as-is — do not redistribute them in ways that modify or misrepresent the affiliate relationship.
Built by OrbTop. Questions or feature requests → actor-support@orbtop.com