CALT Long March Rocket Fleet News Scraper
Scrapes the official CALT (China Academy of Launch Vehicle Technology / 中国运载火箭技术研究院) news portal for Long March rocket mission updates, launch success/failure reports, and fleet news.
CALT builds the Long March (长征) family — CZ-2, CZ-3, CZ-5, CZ-7, CZ-8, CZ-10, CZ-11, and the Jielong (捷龙) commercial series — which collectively account for roughly 80% of all Chinese orbital launches. CALT mission bulletins are the primary source for post-flight outcome data and appear minutes after liftoff.
What this Actor does
- Crawls all ~570 news articles from http://www.calt.com/n689/
- Extracts full article text, publish date, and embedded images
- Derives structured mission metadata: rocket family (CZ-5, CZ-7A, CZ-8A, CZ-10, JL-3, etc.), launch site (文昌/酒泉/太原/西昌), and mission outcome (成功/失利/部分成功)
- Respects the `maxItems` cap for quick data samples
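
The metadata derivation step listed above could look roughly like the following Python sketch. It is not the Actor's actual code: the regular expression, the suffix mapping (for example 改 to "A", as in 长征七号改 for CZ-7A), the keyword precedence, and all function names are assumptions made for illustration.

```python
import re
from typing import Optional

# Assumed lookup tables; the Actor's real rules are not documented in this README.
_NUM = {"一": 1, "二": 2, "三": 3, "四": 4, "五": 5, "六": 6,
        "七": 7, "八": 8, "九": 9, "十": 10, "十一": 11, "十二": 12}
_VARIANT = {"甲": "A", "乙": "B", "丙": "C", "丁": "D", "改": "A"}
_PREFIX = {"长征": "CZ", "捷龙": "JL"}
_ROCKET_RE = re.compile(
    r"(长征|捷龙)(十[一二]?|[一二三四五六七八九])号(甲|乙|丙|丁|改|[A-Z])?"
)
_SITES = ("文昌", "酒泉", "太原", "西昌")


def rocket_family(text: str) -> Optional[str]:
    """Map the first name such as 长征八号甲 to a designation such as CZ-8A."""
    m = _ROCKET_RE.search(text)
    if m is None:
        return None
    series, number, variant = m.groups()
    suffix = _VARIANT.get(variant, variant) if variant else ""
    return f"{_PREFIX[series]}-{_NUM[number]}{suffix}"


def launch_site(text: str) -> Optional[str]:
    """Return the launch site keyword that appears earliest in the text."""
    hits = [(text.find(site), site) for site in _SITES if site in text]
    return min(hits)[1] if hits else None


def mission_outcome(text: str) -> Optional[str]:
    """Classify by keyword; 部分成功 is checked before the shorter 成功."""
    if "失利" in text:
        return "失利"
    if "部分成功" in text:
        return "部分成功"
    if "成功" in text:
        return "成功"
    return None
```

Keyword matching of this kind is deliberately simple. An article that mentions several rockets or sites would need stricter rules, for example restricting the search to the title.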
Use cases
- Launch-insurance underwriting: real-time success/failure classification for actuarial models
- Trade press: syndication feed for coverage in the style of SpaceNews, NASASpaceFlight, or Andrew Jones
- Complement to SpaceDevs Launch Library: SpaceDevs gives the pre-launch schedule; CALT gives the post-flight outcome and payload detail
- Defense/space-situational-awareness analysts: mission tempo, payload IDs, anomaly bulletins
Output fields
| Field | Description |
|---|---|
| `article_id` | CALT CMS numeric article ID |
| `title_zh` | Article title in Chinese |
| `rocket_family` | Rocket designation (CZ-5, CZ-7A, CZ-8A, JL-3, etc.) |
| `publish_date` | Publication date (YYYY-MM-DD) |
| `launch_site` | Launch site (文昌, 酒泉, 太原, 西昌) |
| `mission_outcome` | 成功 / 失利 / 部分成功 / null |
| `body_text` | Full article body text |
| `body_html` | Full article body HTML |
| `images` | Array of image URLs embedded in the article |
| `source_url` | Canonical article URL |
| `subchannel` | Source subchannel (最新) |
| `scrapedAt` | ISO-8601 scrape timestamp |
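
As a sketch of how a downstream consumer might sanity-check items against the fields listed above, the following Python helper reports missing keys and malformed values. The function name `validate_item` and the specific checks are assumptions for illustration, not part of the Actor.

```python
from datetime import datetime

# Field names taken from the output table; the checks themselves are assumptions.
REQUIRED_FIELDS = (
    "article_id", "title_zh", "rocket_family", "publish_date",
    "launch_site", "mission_outcome", "body_text", "body_html",
    "images", "source_url", "subchannel", "scrapedAt",
)
OUTCOMES = ("成功", "失利", "部分成功", None)


def validate_item(item: dict) -> list:
    """Return a list of human-readable problems; an empty list means the item looks fine."""
    problems = [f"missing field: {key}" for key in REQUIRED_FIELDS if key not in item]

    if item.get("mission_outcome") not in OUTCOMES:
        problems.append("mission_outcome is not one of the documented values")

    publish_date = item.get("publish_date")
    if publish_date is not None:
        try:
            datetime.strptime(publish_date, "%Y-%m-%d")
        except (TypeError, ValueError):
            problems.append("publish_date is not YYYY-MM-DD")

    if not isinstance(item.get("images", []), list):
        problems.append("images is not a list")

    return problems
```

Returning a list of problems rather than raising on the first one makes it easier to log every issue for an item in a single pass.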
Input
| Parameter | Type | Default | Description |
|---|---|---|---|
| `maxItems` | integer | 10 | Maximum number of articles to return. Leave blank for full archive. |
Notes
- Site is plain HTTP, no anti-bot protection
- Chinese-language content — titles and body text are in Simplified Chinese
- ~570 articles available as of June 2026 across 38 listing pages
- Actor uses direct HTTP connection (no proxy needed)
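
The figures above imply about 15 articles per listing page (570 / 38), assuming pages are evenly filled. Under that assumption, a rough Python estimate of how many listing pages a given `maxItems` value touches is sketched below. The helper `pages_needed` is hypothetical, and the Actor's real pagination logic may differ.

```python
import math
from typing import Optional

TOTAL_ARTICLES = 570   # from the notes above
LISTING_PAGES = 38     # from the notes above
PER_PAGE = TOTAL_ARTICLES // LISTING_PAGES  # 15, assuming evenly filled pages


def pages_needed(max_items: Optional[int]) -> int:
    """Estimate listing pages to fetch; None stands for a blank maxItems (full archive)."""
    if max_items is None:
        return LISTING_PAGES
    wanted = min(max(max_items, 0), TOTAL_ARTICLES)
    return math.ceil(wanted / PER_PAGE)
```

With the default of 10 items this estimate is a single listing page, which is why small samples finish quickly.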