OrbTop

BetaList Startup Scraper

LEAD GENERATION

BetaList Startup Scraper

Scrape BetaList's full catalog of early-stage startup launches. Extract startup profiles including name, tagline, description, website, launch date, category tags, founders, logo, screenshots, and boosted status.

What it does

This scraper crawls BetaList — a directory of pre-launch and recently-launched startups — and returns structured data on each listing. You can scrape the full catalog in discovery mode or provide specific startup URLs to target.

Two operating modes:

  1. Catalog crawl (default): Starts from the BetaList homepage and paginates through all listing pages, extracting one record per startup detail page. Use maxItems to cap the run.

  2. Direct URL mode: Provide specific BetaList startup URLs via startUrls. The actor scrapes those detail pages directly, bypassing the listing crawler.

Output fields

Field Type Description
slug string URL slug (e.g. fliptip-ai)
name string Startup name
tagline string Short tagline from the listing
description string Full description from the detail page
website string Visit-redirect URL (resolves to the startup's actual site)
launch_date string Featured launch date on BetaList
category_tags string Topic tags (comma-separated)
founders string Maker display names (comma-separated)
logo_url string Logo image URL
screenshots string Screenshot image URLs (comma-separated)
boosted boolean Whether the startup is a paid-boosted listing
betalist_url string Full BetaList detail page URL
upvotes integer Always null — not available in public HTML
country string Always null — not shown in public profiles
city string Always null — not shown in public profiles
twitter_handle string Always null — requires login to view
linkedin_url string Always null — requires login to view
founder_emails string Always null — requires login to view

Note: Fields marked "requires login" or "not available in public HTML" are null because BetaList gates this data behind authentication. The scraper operates without credentials.

Example output

{
  "slug": "transvoix",
  "name": "TransVoix",
  "tagline": "Make calls in different languages with your voice, tone, and accent",
  "description": "TransVoix lets you call someone who speaks a different language...",
  "website": "https://betalist.com/startups/transvoix/visit",
  "launch_date": "April 18, 2026",
  "upvotes": null,
  "category_tags": "Translation, AI Tools, SaaS, Telecommunications, Productivity",
  "founders": "Tobex",
  "founder_emails": null,
  "twitter_handle": null,
  "linkedin_url": null,
  "logo_url": "https://cdn.betalist.com/t8bgyqo4eptsfu90fhl9hxwqqzju",
  "screenshots": "https://resize.imagekit.co/...",
  "country": null,
  "city": null,
  "boosted": true,
  "betalist_url": "https://betalist.com/startups/transvoix"
}

Input parameters

Parameter Type Default Description
startUrls array [] Optional BetaList startup URLs to scrape directly. Leave empty to crawl the full catalog.
maxItems integer 10 Maximum number of records to collect. Set to 0 for unlimited (full catalog run).

Use cases

  • B2B lead generation: Find recently-launched startups that need tools and services
  • Newsletter sponsorship: Build a pipeline of founders to pitch for newsletter sponsorships
  • Market research: Track what categories of startups are launching each month
  • Competitive intelligence: Monitor new entrants in a product category
  • Investor prospecting: Identify early-stage companies before they appear on ProductHunt

Notes

  • BetaList serves all content server-side — no JavaScript rendering required
  • The scraper respects BetaList's rate limits automatically with exponential backoff
  • Boosted listings (paid promotions) are flagged with boosted: true
  • The website field is a BetaList redirect URL — follow the redirect to reach the startup's actual homepage