BetaList Startup Scraper

Scrape BetaList's full catalog of early-stage startup launches. Extract startup profiles including name, tagline, description, website, launch date, category tags, founders, logo, screenshots, and boosted status.

What it does

This scraper crawls BetaList — a directory of pre-launch and recently-launched startups — and returns structured data on each listing. You can scrape the full catalog in discovery mode or provide specific startup URLs to target.

Two operating modes:

Catalog crawl (default): Starts from the BetaList homepage and paginates through all listing pages, extracting one record per startup detail page. Use maxItems to cap the run.
Direct URL mode: Provide specific BetaList startup URLs via startUrls. The actor scrapes those detail pages directly, bypassing the listing crawler.

Output fields

Field	Type	Description
`slug`	string	URL slug (e.g. `fliptip-ai`)
`name`	string	Startup name
`tagline`	string	Short tagline from the listing
`description`	string	Full description from the detail page
`website`	string	Visit-redirect URL (resolves to the startup's actual site)
`launch_date`	string	Featured launch date on BetaList
`category_tags`	string	Topic tags (comma-separated)
`founders`	string	Maker display names (comma-separated)
`logo_url`	string	Logo image URL
`screenshots`	string	Screenshot image URLs (comma-separated)
`boosted`	boolean	Whether the startup is a paid-boosted listing
`betalist_url`	string	Full BetaList detail page URL
`upvotes`	integer	Always `null` — not available in public HTML
`country`	string	Always `null` — not shown in public profiles
`city`	string	Always `null` — not shown in public profiles
`twitter_handle`	string	Always `null` — requires login to view
`linkedin_url`	string	Always `null` — requires login to view
`founder_emails`	string	Always `null` — requires login to view

Note: Fields marked "requires login" or "not available in public HTML" are null because BetaList gates this data behind authentication. The scraper operates without credentials.

Example output

{
  "slug": "transvoix",
  "name": "TransVoix",
  "tagline": "Make calls in different languages with your voice, tone, and accent",
  "description": "TransVoix lets you call someone who speaks a different language...",
  "website": "https://betalist.com/startups/transvoix/visit",
  "launch_date": "April 18, 2026",
  "upvotes": null,
  "category_tags": "Translation, AI Tools, SaaS, Telecommunications, Productivity",
  "founders": "Tobex",
  "founder_emails": null,
  "twitter_handle": null,
  "linkedin_url": null,
  "logo_url": "https://cdn.betalist.com/t8bgyqo4eptsfu90fhl9hxwqqzju",
  "screenshots": "https://resize.imagekit.co/...",
  "country": null,
  "city": null,
  "boosted": true,
  "betalist_url": "https://betalist.com/startups/transvoix"
}

Input parameters

Parameter	Type	Default	Description
`startUrls`	array	`[]`	Optional BetaList startup URLs to scrape directly. Leave empty to crawl the full catalog.
`maxItems`	integer	10	Maximum number of records to collect. Set to `0` for unlimited (full catalog run).

Use cases

B2B lead generation: Find recently-launched startups that need tools and services
Newsletter sponsorship: Build a pipeline of founders to pitch for newsletter sponsorships
Market research: Track what categories of startups are launching each month
Competitive intelligence: Monitor new entrants in a product category
Investor prospecting: Identify early-stage companies before they appear on ProductHunt

Notes

BetaList serves all content server-side — no JavaScript rendering required
The scraper respects BetaList's rate limits automatically with exponential backoff
Boosted listings (paid promotions) are flagged with boosted: true
The website field is a BetaList redirect URL — follow the redirect to reach the startup's actual homepage

BetaList Startup Scraper

BetaList Startup Scraper

What it does

Output fields

Example output

Input parameters

Use cases

Notes

Related Lead Generation scrapers