Speaker Bureau Directory Scraper - Keynote Speakers & Fees
Scrape keynote speaker profiles from the All American Speakers Bureau directory. Returns name, tagline, live and virtual fee ranges, travel region, topics, categories, full biography, books authored, and the bureau booking URL for ~16,500 speakers.
Speaker Bureau Directory Scraper Features
- Extracts 17 structured fields per speaker profile, including separate live and virtual fee ranges
- Pulls a normalized topics array and a separate categories taxonomy; most directories give you one or the other
- Returns the full biography as plain text. No HTML to clean up
- Lists every book on the speaker's profile page, plus YouTube and Vimeo video URLs
- Sources from a public sitemap: no proxies, no browser, no CAPTCHA dance
- Configurable scope: scrape the whole roster (~16,500 speakers) or pin specific profile URLs
Who Uses Speaker Bureau Data?
- Corporate event planners: Build keynote shortlists with budget bands already attached. The fee fields alone save you ten contact-form submissions
- PR and media bookers: Source guests for podcasts, panels, and trade press. Eighty percent of speaker outreach goes unanswered, so pre-qualified contact data has value
- Competing speaker bureaus: Track talent rosters and benchmark fee positioning. Most A-list speakers are represented by multiple bureaus anyway
- Sales teams targeting authors and executives: Speakers are a clean buyer-persona slice. Their booking pages double as a lead source
How the Speaker Bureau Directory Scraper Works
- Fetch sitemap: Reads the bureau's public sitemap.xml to discover every speaker profile URL
- Filter and seed: Filters to the /speakers/{id}/... URL pattern, slices to maxItems, queues each profile
- Extract per profile: Loads each profile page and pulls speaker name, fees, topics, categories, biography, books, and videos into a flat record
- Save: Emits one JSON record per speaker to the dataset, tagged with bureau: "aae" so future versions can layer additional bureaus into the same output
Pass a list of direct profile URLs to skip sitemap discovery and crawl only what you specify.
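The discovery and filtering steps above can be sketched in Python roughly as follows. This is an illustration under stated assumptions, not the scraper's actual source: the function name select_profile_urls, the regular expression, and the sample sitemap are hypothetical, and the sketch assumes the sitemap follows the standard sitemaps.org urlset/loc layout.

```python
import re
import xml.etree.ElementTree as ET

# Standard sitemaps.org namespace (assumed; the real sitemap may differ).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Hypothetical pattern for /speakers/{id}/... profile URLs.
PROFILE_RE = re.compile(r"/speakers/(\d+)/[^/]+")


def select_profile_urls(sitemap_xml: str, max_items: int) -> list[str]:
    """Return up to max_items speaker profile URLs found in a sitemap document."""
    root = ET.fromstring(sitemap_xml)
    urls = [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]
    # Keep only profile pages, then apply the maxItems cap.
    profiles = [u for u in urls if PROFILE_RE.search(u)]
    return profiles[:max_items]


# Small in-memory sitemap so the sketch runs without network access.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.allamericanspeakers.com/contact-us</loc></url>
  <url><loc>https://www.allamericanspeakers.com/speakers/389198/Science-Bob-Pflugfelder</loc></url>
  <url><loc>https://www.allamericanspeakers.com/speakers/385009/3OH%213</loc></url>
</urlset>"""

selected = select_profile_urls(SAMPLE, max_items=1)
print(selected)
```

With max_items=1 only the first matching profile URL is kept; the contact page is dropped because it does not match the profile pattern.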
Input
{
  "bureau": "aae",
  "maxItems": 10
}
| Field | Type | Default | Description |
|---|---|---|---|
| bureau | string | "aae" | Source bureau. v1 supports aae (All American Speakers Bureau, ~16k speakers). Additional bureaus arrive in subsequent releases. |
| maxItems | integer | 10 | Maximum number of speaker records to return. Default is intentionally low so a single run finishes within Apify's 5-minute tester window. Increase for larger crawls. |
| profileUrls | array | [] | Optional list of direct speaker profile URLs. When provided, the scraper skips sitemap discovery and crawls only these URLs. |
Targeted scrape — specific speakers only
{
  "bureau": "aae",
  "maxItems": 3,
  "profileUrls": [
    { "url": "https://www.allamericanspeakers.com/speakers/389198/Science-Bob-Pflugfelder" },
    { "url": "https://www.allamericanspeakers.com/speakers/385009/3OH%213" }
  ]
}
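If the watchlist lives in a spreadsheet or CRM as plain URL strings, a small helper can wrap them in the object form shown in the example above. This is a sketch only: build_actor_input is a hypothetical name, and setting maxItems to the number of URLs assumes the cap also applies to targeted runs, which the input table does not state explicitly.

```python
import json


def build_actor_input(urls: list[str], bureau: str = "aae") -> dict:
    """Wrap plain URL strings in the {"url": ...} objects used by profileUrls."""
    unique = list(dict.fromkeys(urls))  # drop duplicates, keep original order
    return {
        "bureau": bureau,
        # Assumption: maxItems should be at least the number of URLs requested.
        "maxItems": len(unique),
        "profileUrls": [{"url": u} for u in unique],
    }


watchlist = [
    "https://www.allamericanspeakers.com/speakers/389198/Science-Bob-Pflugfelder",
    "https://www.allamericanspeakers.com/speakers/385009/3OH%213",
    "https://www.allamericanspeakers.com/speakers/385009/3OH%213",  # duplicate
]

payload = build_actor_input(watchlist)
print(json.dumps(payload, indent=2))
```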
Speaker Bureau Directory Scraper Output Fields
{
  "speaker_name": "\"Science Bob\" Pflugfelder",
  "tagline": "Known as \"Science Bob\"; Science Teacher & TV Personality; Co-Author of the \"Nick & Tesla\" Series for Young Readers",
  "bureau": "aae",
  "bureau_label": "All American Speakers Bureau",
  "bureau_speaker_id": "389198",
  "profile_url": "https://www.allamericanspeakers.com/speakers/389198/%22Science-Bob%22-Pflugfelder",
  "profile_image_url": "https://thumbnails.aaehq.com/t_face_aas_md/.../2018_bobpflugfelder_headshot.png",
  "fee_range_live": "$10,000 - $20,000",
  "fee_range_virtual": "$5,000 - $10,000",
  "travel_region": "San Francisco, CA, USA",
  "topics": ["STEM (STEAM) Education", "Science Demonstrations And Innovations"],
  "categories": ["Education", "Science", "STEM", "STEM Education", "Technology"],
  "bio": "Bob Pflugfelder, known as \"Science Bob,\" is a science teacher, author...",
  "books_authored": [
    "Nick and Tesla's High-Voltage Danger Lab",
    "Nick and Tesla's Robot Army Rampage"
  ],
  "featured_videos": ["https://www.youtube.com/watch?v=sAGm50Cvw9g"],
  "bureau_booking_url": "https://www.allamericanspeakers.com/contact-us",
  "scraped_at": "2026-04-30T10:33:46.945Z"
}
| Field | Type | Description |
|---|---|---|
| speaker_name | string | Speaker's full name as shown on the bureau profile |
| tagline | string | Short positioning line (e.g. "CEO of...", "Author of...", "The King of Negotiators") |
| bureau | string | Source bureau slug (aae for All American Speakers Bureau) |
| bureau_label | string | Human-readable bureau name |
| bureau_speaker_id | string | Numeric speaker ID assigned by the bureau (stable, from URL) |
| profile_url | string | Canonical URL of the speaker's profile page |
| profile_image_url | string | Speaker headshot URL |
| fee_range_live | string | Speaking fee range for in-person events (e.g. $10,000 - $20,000, Please Contact) |
| fee_range_virtual | string | Speaking fee range for virtual / online events |
| travel_region | string | Where the speaker travels from, as published |
| topics | array of strings | Speaking topics offered by the speaker |
| categories | array of strings | Bureau taxonomy tags the speaker is filed under |
| bio | string | Full biography, plain text (HTML stripped) |
| books_authored | array of strings | Book titles authored or co-authored by the speaker |
| featured_videos | array of strings | YouTube / Vimeo URLs of featured speaker videos |
| bureau_booking_url | string | URL where event planners can request booking through the bureau |
| scraped_at | string | ISO 8601 timestamp when the record was extracted |
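The fee fields are returned as display strings, so budget filtering needs a numeric conversion on the consumer side. The sketch below is one way to do that, assuming fee strings look like the examples in this README ("$10,000 - $20,000" or "Please Contact"); parse_fee_range is a hypothetical helper, not part of the scraper, and other fee formats may exist that it does not handle.

```python
import re
from typing import Optional


def parse_fee_range(value: str) -> Optional[tuple[int, int]]:
    """Convert a string like "$10,000 - $20,000" to (10000, 20000).

    Returns None when no dollar amount is present, e.g. "Please Contact".
    A single amount such as "$5,000" is treated as (5000, 5000) (assumption).
    """
    amounts = [
        int(match.replace(",", ""))
        for match in re.findall(r"\$(\d[\d,]*)", value or "")
    ]
    if not amounts:
        return None
    return (min(amounts), max(amounts))


record = {
    "fee_range_live": "$10,000 - $20,000",
    "fee_range_virtual": "$5,000 - $10,000",
}

live = parse_fee_range(record["fee_range_live"])
virtual = parse_fee_range(record["fee_range_virtual"])
print(live)     # (10000, 20000)
print(virtual)  # (5000, 10000)
print(parse_fee_range("Please Contact"))  # None
```

Keeping the original string alongside the parsed tuple is usually worthwhile, since values such as "Please Contact" carry meaning that a number cannot.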
FAQ
How do I scrape allamericanspeakers.com?
The Speaker Bureau Directory Scraper pulls profiles directly from the public AAE sitemap. Set bureau: "aae", pick a maxItems cap, and run. No login, no proxy, no CAPTCHA solver required.
How much does the Speaker Bureau Directory Scraper cost to run?
Pricing is pay-per-event: $0.10 per actor start plus $0.001 per speaker record. A 100-speaker run costs about $0.20. The full ~16,500-speaker AAE roster is around $16.60.
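The arithmetic behind those figures is the start fee plus the per-record fee times the number of records. A minimal sketch, using the two prices quoted above (the function and constant names are illustrative):

```python
from decimal import Decimal

START_FEE = Decimal("0.10")        # charged once per actor start
PER_RECORD_FEE = Decimal("0.001")  # charged per speaker record


def estimate_run_cost(records: int) -> Decimal:
    """Estimate the cost in USD of a single run returning `records` speakers."""
    total = START_FEE + PER_RECORD_FEE * records
    return total.quantize(Decimal("0.01"))


print(estimate_run_cost(100))    # 0.20
print(estimate_run_cost(16500))  # 16.60
```

Decimal is used instead of float so the cent rounding stays exact.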
Can I scrape only specific speakers?
Yes. Pass a profileUrls array of direct speaker URLs and the scraper skips sitemap discovery, fetching only what you list. Useful for refreshing a known watchlist or backfilling a CRM.
Does the Speaker Bureau Directory Scraper need proxies?
No. The target site is public and server-rendered. It sits behind a Cloudflare CDN, but the profile pages are not gated behind a managed challenge, so plain HTTP requests work. That is more than many "scraper-ready" sites can claim.
Why doesn't the scraper return agent name, agent email, or speaker direct email?
AAE doesn't publish per-speaker agent contact cards or speaker direct emails on the public profile page — they're gated behind the bureau's contact form. The scraper returns the bureau-wide bureau_booking_url instead, which is the actual path to a booking on this site.
Will more bureaus be supported?
Yes. The dataset schema already carries a bureau field on every record so additional bureaus (BigSpeak, Premier, WSB, Harry Walker) can be layered in without breaking existing output. v1 ships AAE because it's the largest and the cleanest to extract.
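On the consumer side, code that keys on the bureau field keeps working when new sources appear. A small sketch of that idea follows; the second slug, "example-bureau", is purely hypothetical and does not correspond to a supported or announced value.

```python
from collections import defaultdict


def group_by_bureau(records: list[dict]) -> dict[str, list[dict]]:
    """Bucket dataset records by their bureau slug."""
    grouped: dict[str, list[dict]] = defaultdict(list)
    for record in records:
        grouped[record["bureau"]].append(record)
    return dict(grouped)


records = [
    {"bureau": "aae", "speaker_name": "\"Science Bob\" Pflugfelder"},
    {"bureau": "example-bureau", "speaker_name": "Hypothetical Speaker"},
    {"bureau": "aae", "speaker_name": "3OH!3"},
]

by_bureau = group_by_bureau(records)
print({slug: len(items) for slug, items in by_bureau.items()})
```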
Need More Features?
Need additional bureaus, custom fields, or a different filter? File an issue or get in touch.
Why Use the Speaker Bureau Directory Scraper?
- Affordable: $0.001 per speaker record, $0.10 per run start
- Multi-bureau ready: Output schema includes a bureau tag on every record, so adding sources doesn't break consumers. Most aggregator scrapers can't say that
- Structured fees: Returns separate fee_range_live and fee_range_virtual fields instead of one free-text string. The virtual line item didn't exist before 2020; pretending it still doesn't exist isn't an option anymore