OrbTop

CALT Long March Rocket Fleet News Scraper

NEWSBUSINESS

CALT Long March Rocket Fleet News Scraper

Scrapes the official CALT (China Academy of Launch Vehicle Technology / 中国运载火箭技术研究院) news portal for Long March rocket mission updates, launch success/failure reports, and fleet news.

CALT builds the Long March (长征) family — CZ-2, CZ-3, CZ-5, CZ-7, CZ-8, CZ-10, CZ-11, and the Jielong (捷龙) commercial series — which collectively account for roughly 80% of all Chinese orbital launches. CALT mission bulletins are the primary source for post-flight outcome data and appear minutes after liftoff.

What this Actor does

  • Crawls all ~570 news articles from http://www.calt.com/n689/
  • Extracts full article text, publish date, and embedded images
  • Derives structured mission metadata: rocket family (CZ-5, CZ-7A, CZ-8A, CZ-10, JL-3, etc.), launch site (文昌/酒泉/太原/西昌), and mission outcome (成功/失利/部分成功)
  • Respects maxItems cap for quick data samples

Use cases

  • Launch-insurance underwriting: real-time success/failure classification for actuarial models
  • Trade press: syndication feed for SpaceNews, NASASpaceFlight, Andrew Jones type coverage
  • Complement to SpaceDevs Launch Library: SpaceDevs gives pre-launch schedule; CALT gives post-flight outcome and payload detail
  • Defense/space-situational-awareness analysts: mission tempo, payload IDs, anomaly bulletins

Output fields

Field Description
article_id CALT CMS numeric article ID
title_zh Article title in Chinese
rocket_family Rocket designation (CZ-5, CZ-7A, CZ-8A, JL-3, etc.)
publish_date Publication date (YYYY-MM-DD)
launch_site Launch site (文昌, 酒泉, 太原, 西昌)
mission_outcome 成功 / 失利 / 部分成功 / null
body_text Full article body text
body_html Full article body HTML
images Array of image URLs embedded in the article
source_url Canonical article URL
subchannel Source subchannel (最新)
scrapedAt ISO-8601 scrape timestamp

Input

Parameter Type Default Description
maxItems integer 10 Maximum number of articles to return. Leave blank for full archive.

Notes

  • Site is plain HTTP, no anti-bot protection
  • Chinese-language content — titles and body text are in Simplified Chinese
  • ~570 articles available as of June 2026 across 38 listing pages
  • Actor uses direct HTTP connection (no proxy needed)