BetaList Startup Scraper
Scrape BetaList's full catalog of early-stage startup launches. Extract startup profiles including name, tagline, description, website, launch date, category tags, founders, logo, screenshots, and boosted status.
What it does
This scraper crawls BetaList, a directory of pre-launch and recently launched startups, and returns structured data on each listing. You can crawl the full catalog (the default) or provide specific startup URLs to target.
Two operating modes:
- Catalog crawl (default): Starts from the BetaList homepage and paginates through all listing pages, extracting one record per startup detail page. Use `maxItems` to cap the run.
- Direct URL mode: Provide specific BetaList startup URLs via `startUrls`. The actor scrapes those detail pages directly, bypassing the listing crawler.
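As described above, the mode depends on whether `startUrls` is empty. The rule can be sketched in Python as follows. `select_mode` is a hypothetical helper written for illustration, not part of the actor, and treating array entries as plain URL strings is an assumption.

```python
from typing import Any


def select_mode(run_input: dict[str, Any]) -> str:
    """Return the mode a given input would trigger (illustrative only)."""
    # Assumption: a missing or empty startUrls array means catalog crawl.
    start_urls = run_input.get("startUrls") or []
    return "direct" if start_urls else "catalog"


print(select_mode({"maxItems": 50}))  # catalog
print(select_mode({"startUrls": ["https://betalist.com/startups/transvoix"]}))  # direct
```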
Output fields
| Field | Type | Description |
|---|---|---|
| `slug` | string | URL slug (e.g. `fliptip-ai`) |
| `name` | string | Startup name |
| `tagline` | string | Short tagline from the listing |
| `description` | string | Full description from the detail page |
| `website` | string | Visit-redirect URL (resolves to the startup's actual site) |
| `launch_date` | string | Featured launch date on BetaList |
| `category_tags` | string | Topic tags (comma-separated) |
| `founders` | string | Maker display names (comma-separated) |
| `logo_url` | string | Logo image URL |
| `screenshots` | string | Screenshot image URLs (comma-separated) |
| `boosted` | boolean | Whether the startup is a paid-boosted listing |
| `betalist_url` | string | Full BetaList detail page URL |
| `upvotes` | integer | Always null (not available in public HTML) |
| `country` | string | Always null (not shown in public profiles) |
| `city` | string | Always null (not shown in public profiles) |
| `twitter_handle` | string | Always null (requires login to view) |
| `linkedin_url` | string | Always null (requires login to view) |
| `founder_emails` | string | Always null (requires login to view) |
Note: Fields marked "requires login" or "not available in public HTML" are `null` because BetaList gates this data behind authentication. The scraper operates without credentials.
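Because the multi-value fields arrive as comma-separated strings, consumers will usually want to split them into lists. A minimal Python sketch follows. `split_list_field` is a hypothetical helper, and it assumes individual items never contain commas. That assumption looks safe for tags and founder names, but it is unverified for `screenshots`, since image URLs can contain commas.

```python
from typing import Optional


def split_list_field(value: Optional[str]) -> list[str]:
    """Split a comma-separated output field into a list of trimmed items."""
    # null fields (e.g. founder_emails) and empty strings become empty lists.
    if not value:
        return []
    return [item.strip() for item in value.split(",") if item.strip()]


record = {
    "category_tags": "Translation, AI Tools, SaaS, Telecommunications, Productivity",
    "founders": "Tobex",
    "founder_emails": None,
}
print(split_list_field(record["category_tags"]))
# ['Translation', 'AI Tools', 'SaaS', 'Telecommunications', 'Productivity']
print(split_list_field(record["founders"]))        # ['Tobex']
print(split_list_field(record["founder_emails"]))  # []
```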
Example output
    {
      "slug": "transvoix",
      "name": "TransVoix",
      "tagline": "Make calls in different languages with your voice, tone, and accent",
      "description": "TransVoix lets you call someone who speaks a different language...",
      "website": "https://betalist.com/startups/transvoix/visit",
      "launch_date": "April 18, 2026",
      "upvotes": null,
      "category_tags": "Translation, AI Tools, SaaS, Telecommunications, Productivity",
      "founders": "Tobex",
      "founder_emails": null,
      "twitter_handle": null,
      "linkedin_url": null,
      "logo_url": "https://cdn.betalist.com/t8bgyqo4eptsfu90fhl9hxwqqzju",
      "screenshots": "https://resize.imagekit.co/...",
      "country": null,
      "city": null,
      "boosted": true,
      "betalist_url": "https://betalist.com/startups/transvoix"
    }
Input parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| `startUrls` | array | `[]` | Optional BetaList startup URLs to scrape directly. Leave empty to crawl the full catalog. |
| `maxItems` | integer | `10` | Maximum number of records to collect. Set to `0` for unlimited (full catalog run). |
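The defaults and the special meaning of `maxItems: 0` can be captured in a small Python sketch. Both helpers are hypothetical, and rejecting negative values is an assumption. The documentation only defines `0` and positive caps.

```python
from typing import Any, Optional


def build_input(start_urls: Optional[list[str]] = None, max_items: int = 10) -> dict[str, Any]:
    """Assemble an input dict using the documented defaults."""
    # Assumption: negative caps are invalid; the docs only describe 0 and positive values.
    if max_items < 0:
        raise ValueError("maxItems must be 0 (unlimited) or a positive integer")
    return {"startUrls": list(start_urls or []), "maxItems": max_items}


def item_limit(run_input: dict[str, Any]) -> Optional[int]:
    """Translate maxItems into a record cap; None means no cap (maxItems == 0)."""
    max_items = run_input.get("maxItems", 10)
    return None if max_items == 0 else max_items


print(build_input())                            # {'startUrls': [], 'maxItems': 10}
print(item_limit(build_input(max_items=0)))     # None
print(item_limit(build_input(max_items=25)))    # 25
```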
Use cases
- B2B lead generation: Find recently-launched startups that need tools and services
- Newsletter sponsorship: Build a pipeline of founders to pitch for newsletter sponsorships
- Market research: Track what categories of startups are launching each month
- Competitive intelligence: Monitor new entrants in a product category
- Investor prospecting: Identify early-stage companies before they appear on Product Hunt
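As one illustration of the market research use case, the Python sketch below tallies category tags per launch month. `tags_by_month` is a hypothetical helper. It assumes `launch_date` always follows the "April 18, 2026" format seen in the example output, and the second sample row is made up for the demonstration.

```python
from collections import Counter
from datetime import datetime


def tags_by_month(records: list[dict]) -> dict[str, Counter]:
    """Count category tags per launch month, keyed as 'YYYY-MM'."""
    counts: dict[str, Counter] = {}
    for record in records:
        # Assumption: launch_date always looks like "April 18, 2026".
        launched = datetime.strptime(record["launch_date"], "%B %d, %Y")
        month = launched.strftime("%Y-%m")
        raw_tags = record.get("category_tags") or ""
        tags = [t.strip() for t in raw_tags.split(",") if t.strip()]
        counts.setdefault(month, Counter()).update(tags)
    return counts


records = [
    {"launch_date": "April 18, 2026", "category_tags": "Translation, AI Tools, SaaS"},
    {"launch_date": "April 2, 2026", "category_tags": "AI Tools"},  # made-up sample row
]
print(tags_by_month(records)["2026-04"].most_common(1))  # [('AI Tools', 2)]
```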
Notes
- BetaList serves all content server-side — no JavaScript rendering required
- The scraper respects BetaList's rate limits automatically with exponential backoff
- Boosted listings (paid promotions) are flagged with `boosted: true`
- The `website` field is a BetaList redirect URL; follow the redirect to reach the startup's actual homepage
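Resolving the `website` redirect can be done after the run with the Python standard library. The sketch below is a minimal version under two assumptions: that BetaList answers with an ordinary HTTP redirect (not a JavaScript or meta-refresh redirect), and that it accepts requests carrying a generic User-Agent. Both function names are hypothetical.

```python
import urllib.request
from urllib.parse import urlparse


def is_visit_url(url: str) -> bool:
    """True if the URL looks like a BetaList visit-redirect link."""
    parsed = urlparse(url)
    return parsed.netloc == "betalist.com" and parsed.path.rstrip("/").endswith("/visit")


def resolve_website(url: str, timeout: float = 10.0) -> str:
    """Follow redirects and return the final URL (performs a network request)."""
    # Assumption: a generic User-Agent header is enough for the request to succeed.
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    # urlopen follows HTTP redirects by default; geturl() reports the final location.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.geturl()


print(is_visit_url("https://betalist.com/startups/transvoix/visit"))  # True
```

Calling `resolve_website(record["website"])` for each record issues one extra request per startup, so for large runs it is worth pacing those calls.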