OrbTop

CAHEC China Animal Disease Surveillance Bulletin Scraper

BUSINESSOTHER

CAHEC China Animal Disease Surveillance Bulletin Scraper

Extract disease-surveillance bulletins from China's national animal health authority — CAHEC (中国动物卫生与流行病学中心).

Crawls the China Animal Health and Epidemiology Centre website across seven content sections and returns structured bulletin records including full text, NLP-tagged disease terms, species mentions, and province-level geographic references.


What does this scraper do?

CAHEC (China Animal Health & Epidemiology Centre) is the national veterinary epidemiology body under China's Ministry of Agriculture and Rural Affairs. It publishes bulletins on disease outbreaks, epidemiology surveys, regulatory notices, and biosecurity research.

This scraper crawls all seven public sections:

Section Description
中心动态 (Center News) Center announcements and activities
通知公告 (Notices) Regulatory and administrative notices
业务进展 (Business Progress) Operational and program updates
流行病学调查 (Epidemiology Surveys) Field survey reports and research
党的建设 (Party Building) Organization governance
政策法规 (Policies & Regulations) Regulatory documents
行业信息 (Industry Information) Sector news and updates

Output data

Each record includes:

Field Type Description
bulletin_id String Numeric article ID from URL
bulletin_url String Full URL to article
section String Content section (e.g., 流行病学调查)
title String Article title (Chinese)
published_at String Publication date/time (YYYY-MM-DD HH:MM)
issuer String Source department or sub-laboratory
body_text String Full article body text
body_html String Full article body HTML
disease_terms String Comma-separated disease terms matched (ASF, H5N1, FMD, etc.)
species_terms String Comma-separated species terms matched (pig, poultry, cattle, etc.)
region_terms String Comma-separated Chinese province/region mentions
attachments String Pipe-separated attachment links (text
first_seen String ISO 8601 scrape timestamp
scraped_at String ISO 8601 scrape timestamp

Input

Parameter Type Default Description
maxItems Integer 5 Maximum number of bulletins to extract

Example use cases

  • Veterinary pharma signal tracking — Monitor ASF, H5N1, FMD, and brucellosis outbreak bulletins before they appear in English-language media
  • Agricultural commodity risk — Track disease spread across Chinese provinces that affect livestock production regions
  • Regulatory intelligence — Extract CAHEC policy notices and technical guidance documents
  • Biosecurity research — Build historical datasets of Chinese animal disease surveillance reporting

Notes

  • Site is Chinese government static HTML with no anti-bot measures and no JavaScript rendering required
  • Content is in Simplified Chinese; disease, species, and region term extraction covers common English and Chinese terminology
  • The site is accessible from Apify cloud infrastructure; local connectivity from non-China IPs may be intermittent
  • Historical archive spans approximately 1,500+ bulletins across all sections