OrbTop

Speaker Bureau Directory Scraper - Keynote Speakers & Fees


Speaker Bureau Directory Scraper

Scrape keynote speaker profiles from the All American Speakers Bureau directory. Returns name, tagline, live and virtual fee ranges, travel region, topics, categories, full biography, books authored, and the bureau booking URL for ~16,500 speakers.


Speaker Bureau Directory Scraper Features

  • Extracts 16+ structured fields per speaker profile, including separate live and virtual fee ranges
  • Pulls a normalized topics array and a separate categories taxonomy — most directories give you one or the other
  • Returns the full biography as plain text. No HTML to clean up
  • Lists every book on the speaker's profile page, plus YouTube and Vimeo video URLs
  • Sources from a public sitemap — no proxies, no browser, no CAPTCHA dance
  • Configurable scope: scrape the whole roster (16k speakers) or pin specific profile URLs

Who Uses Speaker Bureau Data?

  • Corporate event planners — Build keynote shortlists with budget bands already attached. The fee fields alone save you ten contact-form submissions
  • PR and media bookers — Source guests for podcasts, panels, and trade press. Eighty percent of speaker outreach goes unanswered, so pre-qualified profile data (topics, fee bands, booking URL) has value
  • Competing speakers bureaus — Track talent rosters and benchmark fee positioning. Most A-list speakers are represented by multiple bureaus anyway
  • Sales teams targeting authors and executives — Speakers are a clean buyer-persona slice. Their booking pages double as a lead source

How the Speaker Bureau Directory Scraper Works

  1. Fetch sitemap — Reads the bureau's public sitemap.xml to discover every speaker profile URL
  2. Filter and seed — Filters to the /speakers/{id}/... URL pattern, slices to maxItems, queues each profile
  3. Extract per profile — Loads each profile page and pulls speaker name, fees, topics, categories, biography, books, and videos into a flat record
  4. Save — Emits one JSON record per speaker to the dataset, tagged with bureau: "aae" so future versions can layer additional bureaus into the same output

Pass a list of direct profile URLs to skip sitemap discovery and crawl only what you specify.
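
The discovery and filtering steps above can be sketched roughly as follows. This is an illustrative sketch, not the scraper's actual code: the function name `profile_urls_from_sitemap`, the exact regular expression, and the inline sample sitemap are assumptions. It uses an inline XML string so it runs without network access.

```python
import re
import xml.etree.ElementTree as ET

# Assumed shape of a profile URL: /speakers/<numeric id>/<slug>
PROFILE_RE = re.compile(r"/speakers/(\d+)/[^/]+/?$")
# Namespace defined by the sitemaps.org protocol
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def profile_urls_from_sitemap(sitemap_xml: str, max_items: int) -> list:
    """Return up to max_items speaker profile URLs found in a sitemap."""
    root = ET.fromstring(sitemap_xml)
    urls = []
    for loc in root.iter(SITEMAP_NS + "loc"):
        url = (loc.text or "").strip()
        # Keep only URLs that look like speaker profiles
        if PROFILE_RE.search(url):
            urls.append(url)
    # Slice to the requested cap, mirroring the maxItems input
    return urls[:max_items]


# Hypothetical three-entry sitemap for demonstration only
sample = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.allamericanspeakers.com/speakers/389198/Science-Bob-Pflugfelder</loc></url>
  <url><loc>https://www.allamericanspeakers.com/contact-us</loc></url>
  <url><loc>https://www.allamericanspeakers.com/speakers/385009/3OH%213</loc></url>
</urlset>"""

for url in profile_urls_from_sitemap(sample, max_items=10):
    print(url)
```

The non-profile URL (`/contact-us`) is dropped by the pattern filter, and the final slice applies the cap after filtering, so `max_items` counts profiles rather than raw sitemap entries.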


Input

{
  "bureau": "aae",
  "maxItems": 10
}

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| bureau | string | "aae" | Source bureau. v1 supports aae (All American Speakers Bureau, ~16k speakers). Additional bureaus arrive in subsequent releases. |
| maxItems | integer | 10 | Maximum number of speaker records to return. Default is intentionally low so a single run finishes within Apify's 5-minute tester window. Increase for larger crawls. |
| profileUrls | array | [] | Optional list of direct speaker profile URLs. When provided, the scraper skips sitemap discovery and crawls only these URLs. |

Targeted scrape — specific speakers only

{
  "bureau": "aae",
  "maxItems": 3,
  "profileUrls": [
    { "url": "https://www.allamericanspeakers.com/speakers/389198/Science-Bob-Pflugfelder" },
    { "url": "https://www.allamericanspeakers.com/speakers/385009/3OH%213" }
  ]
}
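
If you keep a watchlist of profile URLs elsewhere (a CRM export, a spreadsheet), a small helper can turn it into the input shape shown above. This is a minimal sketch: `build_input` is a hypothetical helper, not part of the scraper, and its defaults simply mirror the Input table. Whether `maxItems` also caps a `profileUrls` list is not stated here, so the helper passes the value through unchanged.

```python
import json


def build_input(profile_urls=None, bureau="aae", max_items=10):
    """Build an input payload in the shape shown in the examples above."""
    if max_items < 1:
        raise ValueError("max_items must be at least 1")
    urls = list(profile_urls or [])
    return {
        "bureau": bureau,
        "maxItems": max_items,
        # Each entry is wrapped as {"url": ...}, matching the targeted example
        "profileUrls": [{"url": u} for u in urls],
    }


watchlist = [
    "https://www.allamericanspeakers.com/speakers/389198/Science-Bob-Pflugfelder",
    "https://www.allamericanspeakers.com/speakers/385009/3OH%213",
]

print(json.dumps(build_input(watchlist, max_items=3), indent=2))
```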

Speaker Bureau Directory Scraper Output Fields

{
  "speaker_name": "\"Science Bob\" Pflugfelder",
  "tagline": "Known as \"Science Bob\"; Science Teacher & TV Personality; Co-Author of the \"Nick & Tesla\" Series for Young Readers",
  "bureau": "aae",
  "bureau_label": "All American Speakers Bureau",
  "bureau_speaker_id": "389198",
  "profile_url": "https://www.allamericanspeakers.com/speakers/389198/%22Science-Bob%22-Pflugfelder",
  "profile_image_url": "https://thumbnails.aaehq.com/t_face_aas_md/.../2018_bobpflugfelder_headshot.png",
  "fee_range_live": "$10,000 - $20,000",
  "fee_range_virtual": "$5,000 - $10,000",
  "travel_region": "San Francisco, CA, USA",
  "topics": ["STEM (STEAM) Education", "Science Demonstrations And Innovations"],
  "categories": ["Education", "Science", "STEM", "STEM Education", "Technology"],
  "bio": "Bob Pflugfelder, known as \"Science Bob,\" is a science teacher, author...",
  "books_authored": [
    "Nick and Tesla's High-Voltage Danger Lab",
    "Nick and Tesla's Robot Army Rampage"
  ],
  "featured_videos": ["https://www.youtube.com/watch?v=sAGm50Cvw9g"],
  "bureau_booking_url": "https://www.allamericanspeakers.com/contact-us",
  "scraped_at": "2026-04-30T10:33:46.945Z"
}

| Field | Type | Description |
| --- | --- | --- |
| speaker_name | string | Speaker's full name as shown on the bureau profile |
| tagline | string | Short positioning line (e.g. "CEO of...", "Author of...", "The King of Negotiators") |
| bureau | string | Source bureau slug (aae for All American Speakers Bureau) |
| bureau_label | string | Human-readable bureau name |
| bureau_speaker_id | string | Numeric speaker ID assigned by the bureau (stable, from URL) |
| profile_url | string | Canonical URL of the speaker's profile page |
| profile_image_url | string | Speaker headshot URL |
| fee_range_live | string | Speaking fee range for in-person events (e.g. $10,000 - $20,000, Please Contact) |
| fee_range_virtual | string | Speaking fee range for virtual / online events |
| travel_region | string | Where the speaker travels from, as published |
| topics | array of strings | Speaking topics offered by the speaker |
| categories | array of strings | Bureau taxonomy tags the speaker is filed under |
| bio | string | Full biography, plain text (HTML stripped) |
| books_authored | array of strings | Book titles authored or co-authored by the speaker |
| featured_videos | array of strings | YouTube / Vimeo URLs of featured speaker videos |
| bureau_booking_url | string | URL where event planners can request booking through the bureau |
| scraped_at | string | ISO 8601 timestamp when the record was extracted |
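
Because the fee fields are strings, downstream filtering by budget needs a parsing step. The sketch below is one way to do it, assuming the two formats shown in this document ("$10,000 - $20,000" and "Please Contact"); any other wording is treated as unparseable. `parse_fee_range` and `within_budget` are hypothetical helper names, not part of the scraper.

```python
import re
from typing import Optional, Tuple

# Matches ranges written like "$10,000 - $20,000"
FEE_RE = re.compile(r"\$([\d,]+)\s*-\s*\$([\d,]+)")


def parse_fee_range(text: Optional[str]) -> Optional[Tuple[int, int]]:
    """Return (low, high) in whole dollars, or None if no range is present."""
    match = FEE_RE.search(text or "")
    if not match:
        return None  # e.g. "Please Contact" or a missing value
    low, high = (int(group.replace(",", "")) for group in match.groups())
    return low, high


def within_budget(record: dict, budget: int, field: str = "fee_range_live") -> bool:
    """True when the low end of the published range fits the budget."""
    parsed = parse_fee_range(record.get(field))
    return parsed is not None and parsed[0] <= budget


record = {
    "speaker_name": "\"Science Bob\" Pflugfelder",
    "fee_range_live": "$10,000 - $20,000",
    "fee_range_virtual": "$5,000 - $10,000",
}

print(parse_fee_range(record["fee_range_live"]))
print(within_budget(record, 7500, field="fee_range_virtual"))
```

Records without a parseable range are excluded rather than guessed at, which keeps "Please Contact" speakers out of budget shortlists unless you handle them separately.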

FAQ

How do I scrape allamericanspeakers.com?

The Speaker Bureau Directory Scraper pulls profiles directly from the public AAE sitemap. Set bureau: "aae", pick a maxItems cap, and run. No login, no proxy, no CAPTCHA solver required.

How much does the Speaker Bureau Directory Scraper cost to run?

Pricing is pay-per-event: $0.10 per actor start plus $0.001 per speaker record. A 100-speaker run costs about $0.20. The full ~16,500-speaker AAE roster is around $16.60.
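
The arithmetic behind those figures is one start fee plus a per-record fee. A small sketch, using only the two prices quoted above (`estimate_cost` is a hypothetical helper name):

```python
from decimal import Decimal

START_FEE = Decimal("0.10")        # charged once per actor start
PER_RECORD_FEE = Decimal("0.001")  # charged per speaker record


def estimate_cost(records: int) -> Decimal:
    """Estimated cost in USD for a single run returning `records` speakers."""
    if records < 0:
        raise ValueError("records must be non-negative")
    return START_FEE + PER_RECORD_FEE * records


for n in (100, 16500):
    print(f"{n} records: ${estimate_cost(n):.2f}")
```

Splitting a large crawl across several runs adds one start fee per run, so a single run is the cheapest way to fetch the full roster.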

Can I scrape only specific speakers?

Yes. Pass a profileUrls array of direct speaker URLs and the scraper skips sitemap discovery, fetching only what you list. Useful for refreshing a known watchlist or backfilling a CRM.

Does the Speaker Bureau Directory Scraper need proxies?

No. The target site is public and server-rendered. It sits behind a Cloudflare CDN, but the profile pages are not gated behind a managed challenge. Plain HTTP requests work, which is more than half of the so-called "scraper-ready" sites can claim.

Why doesn't the scraper return agent name, agent email, or speaker direct email?

AAE doesn't publish per-speaker agent contact cards or speaker direct emails on the public profile page — they're gated behind the bureau's contact form. The scraper returns the bureau-wide bureau_booking_url instead, which is the actual path to a booking on this site.

Will more bureaus be supported?

Yes. The dataset schema already carries a bureau field on every record so additional bureaus (BigSpeak, Premier, WSB, Harry Walker) can be layered in without breaking existing output. v1 ships AAE because it's the largest and the cleanest to extract.


Need More Features?

Need additional bureaus, custom fields, or a different filter? File an issue or get in touch.

Why Use the Speaker Bureau Directory Scraper?

  • Affordable — $0.001 per speaker record, $0.10 per run start
  • Multi-bureau ready — Output schema includes a bureau tag on every record, so adding sources doesn't break consumers. Most aggregator scrapers can't say that
  • Structured fees — Returns separate fee_range_live and fee_range_virtual fields instead of one free-text string. The virtual line item didn't exist before 2020; pretending it still doesn't is no longer an option