China Tier-1 VC Portfolio Aggregator
LEAD GENERATIONBUSINESSAUTOMATION
China Tier-1 VC Portfolio Aggregator
Aggregates portfolio company data from 9 top-tier Chinese venture capital firms in a single run. Returns a unified dataset with company names (Chinese and English), sector, stage, website, and investment metadata.
Supported Sources
| VC Firm | Method |
|---|---|
| HongShan 红杉中国 (Sequoia China) | Sitemap-based: 131 companies via company-sitemap.xml |
| Qiming Venture Partners 启明创投 | SSR listing: 427 companies in one page load |
| IDG Capital China | SSR listing with h3 company names and taglines |
| ZhenFund 真格基金 | Featured company descriptions + external website links |
| Sinovation Ventures 创新工场 | External company link anchors |
| 5Y Capital 五源资本 | H2/H3 company names (geo-dependent) |
| Matrix Partners China 经纬创投 | H1/H2 company references |
| Legend Capital 君联资本 | H2/H3 company references |
| Hillhouse Investment 高瓴 | Company-spotlight blog post slugs |
What You Get
| Field | Description |
|---|---|
vcFirm |
VC firm key: hongshan, qiming, idg_china, zhenfund, sinovation, 5y_capital, matrix_china, legend, hillhouse |
vcUrl |
Portfolio listing URL used for extraction |
nameZh |
Company name in Chinese |
nameEn |
Company name in English |
taglineZh |
Chinese description or tagline |
taglineEn |
English description or tagline |
website |
Company website URL |
sector |
Industry sector (AI, biotech, consumer, enterprise, fintech, etc.) |
stage |
Investment stage (seed, A, B, growth, IPO) |
status |
Company status: active, acquired, ipo, shutdown |
hqCity |
Headquarters city when available |
country |
Country code: CN, US, SG, HK |
exitType |
Exit venue: HKEX, STAR/SSE, ChiNext/SZSE, NASDAQ, NYSE |
vcDetailUrl |
VC firm's company profile page |
logoUrl |
Company logo URL when available |
extractedAt |
ISO 8601 scrape timestamp |
Input
{
"maxItems": 100,
"vcFirms": ["hongshan", "qiming"]
}
| Field | Type | Default | Description |
|---|---|---|---|
maxItems |
integer | 10 | Max total companies to return across all firms |
vcFirms |
array | all 9 firms | Which firms to scrape. Leave empty for all. Valid values: hongshan, idg_china, zhenfund, sinovation, qiming, hillhouse, 5y_capital, matrix_china, legend |
Usage Examples
All 9 firms, up to 500 companies:
{ "maxItems": 500, "vcFirms": [] }
Qiming only (largest fully-SSR dataset, 400+ companies):
{ "maxItems": 500, "vcFirms": ["qiming"] }
HongShan only (sitemap-driven, 130+ companies with Chinese names):
{ "maxItems": 200, "vcFirms": ["hongshan"] }
Cross-firm scan for M&A / sanctions screening:
{ "maxItems": 1000, "vcFirms": ["hongshan", "qiming", "idg_china", "zhenfund", "sinovation"] }
Notes
- All 9 sites return 200 with a standard browser User-Agent — no proxy required.
- HongShan is sitemap-based and fetches one detail page per company — runs slower but returns richer metadata (Chinese name from og:title, English name from slug).
- Qiming returns 400+ companies from a single page load — fastest per-company throughput.
- Sites that are JS-rendered or geo-restricted (5y_capital, matrix_china, legend) return fewer results; the actor fails gracefully and continues with remaining sources.
- The actor uses a ~800ms polite delay between source site requests.
- Chinese sites may have connectivity variance from non-CN egress IPs. Cloud validation (Apify cloud) passes normally.