VOOZH about

URL: https://apify.com/parseforge/websummit-speakers-scraper

⇱ Web Summit Speakers Scraper Β· Apify


Pricing

from $7.50 / 1,000 results

Go to Apify Store

Web Summit Speakers Scraper

Capture the Web Summit speakers lineup including speaker name, title, company, bio, photo, and profile URL. Point at any speakers page and pull a clean list for sponsor research, partnership outreach, competitive intel, or building event attendee databases for the next event.

Pricing

from $7.50 / 1,000 results

Rating

0.0

(0)

Developer

πŸ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

1

Monthly active users

16 days ago

Last modified

Share

πŸ‘ ParseForge Banner

🎀 Web Summit Speakers Scraper

πŸš€ Export Web Summit speaker rosters in seconds. Names, companies, countries, talk topics and biographies, straight from websummit.com to a clean dataset.

πŸ•’ Last updated: 2026-05-29 Β· πŸ“Š 10 fields per record Β· Public directory Β· No login required

The Web Summit Speakers Scraper turns the public Web Summit directory into a clean, structured dataset. Hand it a search query or a category, and it returns one row per profile with every public field flattened and normalized.

You get 10 fields per record - the same ones a human visitor sees on the public site, just structured so you can drop them straight into Excel, BigQuery, or your CRM.

🎯 Target AudienceπŸ’‘ Primary Use Cases
🏒 B2B sales teamsBuild prospect lists from public directories
πŸ“Š Market researchersMap the competitive landscape
πŸ€– ML engineersBuild training sets of real-world profiles
πŸ“° JournalistsSource Web Summit profiles for stories
πŸ‘©β€πŸ’» DevelopersMirror Web Summit listings into your own DB
πŸ§‘β€πŸ’Ό RecruitersFind talent, agencies, or vendors at scale

πŸ“‹ What the Web Summit Speakers Scraper does

  • Calls the public Web Summit pages and parses the listing HTML or JSON payload.
  • Walks pagination and follows each profile link.
  • Extracts 10 fields per record - identifiers, contact info, ratings, and metadata.
  • Surfaces upstream errors as a clean error record instead of crashing.
  • Exports as CSV, Excel, JSON, JSONL, XML, RSS, or HTML.

πŸ’‘ Why it matters: Web Summit is a goldmine of public directory data, but the site is built for browsing, not bulk export. This actor turns it into a structured dataset in seconds.

🎬 Full Demo

🚧 Coming soon.

βš™οΈ Input

FieldTypeRequiredDescription
startUrlstringNoWeb Summit speakers page URL. Leave default to scrape websummit.com/speakers.
maxItemsintegerNoFree users: Limited to 10 items (preview). Paid users: Optional, max 1,000,000

Example 1 - default run:

{
"startUrl":"https://websummit.com/speakers",
"maxItems":10
}

Example 2 - larger pull:

{
"startUrl":"https://websummit.com/speakers",
"maxItems":50
}

⚠️ Good to Know: Free users are auto-limited to 10 items per run. Paid users can pull up to 1,000,000 records. Heavy runs may take longer due to Web Summit's rate limits.

πŸ“Š Output

Each record is a flat object. Image URL is first, error is always last.

FieldTypeDescription
πŸ–ΌοΈ photoUrlstringProfile photo URL.
πŸ‘€ namestringDisplay name.
πŸ’Ό titlestringRole or job title.
🏒 companystringAffiliated company.
🌍 countrystringCountry.
🏷️ topicsarraySpeaking topics or tags.
πŸ“ biostringShort biography.
🎫 sessionIdstringSession identifier.
πŸ”— profileUrlstringSource profile URL.
πŸ•’ scrapedAtstringWhen this row was fetched (ISO 8601).
❌ errorstringSet if the upstream response was an error.

Sample record:

{
"photoUrl":"https://cdn.example.com/img.jpg",
"name":"Sample value",
"title":"Sample value",
"company":"Sample value",
"country":"Sample value",
"topics":[
"item-1",
"item-2"
],
"bio":"Sample value",
"sessionId":"Sample value",
"profileUrl":"https://example.com",
"scrapedAt":"2026-05-29T12:00:00.000Z",
"error":null
}

✨ Why choose this Actor

| πŸ†“ | Free users get 10 items per run to evaluate before upgrading. | | 🧹 | Cleaned and normalized fields - no scraping artifacts. | | πŸ”’ | Numeric fields cast to numbers, dates to ISO, arrays preserved. | | πŸ›Ÿ | Upstream errors surfaced as clean error records, never crashes. | | πŸ”Œ | One input form, one click, dataset ready in seconds. | | πŸ’Ύ | Push to dataset β†’ instant CSV / Excel / JSON / XML / RSS / HTML export. |

πŸ“ˆ How it compares to alternatives

ApproachSetup timeClean fields?Pagination?Rate-limit handling?
Manual copy/pastehours per pagepartialmanualnone
Roll your own scraper4+ hours❌❌❌
This Actor5 sec, no installβœ…βœ…βœ…

πŸš€ How to use

  1. Click Try for free.
  2. Fill in the input form (or use prefilled defaults).
  3. Click Start. Within seconds your dataset is ready - download as CSV, Excel, JSON, or XML, or pipe to your warehouse.
  4. (Optional) Schedule it to re-run daily, weekly, or on a custom cron.

πŸ’Ό Business use cases

πŸ“Š Lead generation. Build a fresh, structured prospect list from Web Summit every week. No more manual copy-paste from a browser.

πŸ” Competitive intelligence. Track who's listed on Web Summit, with what services, and at what price point.

πŸ€– ML feature engineering. Build clean training sets of real-world profiles for matching, ranking, or recommendation models.

πŸ“° Editorial research. Reporters can pull a directory snapshot in 30 seconds, then verify quotes and facts against the structured data.

πŸ”Œ Automating Web Summit Speakers Scraper

  • Make / Zapier: trigger this actor on a schedule, push results to Airtable, Google Sheets, HubSpot, or Slack.
  • Cron schedule: native Apify scheduler - run nightly, weekly, or on any cron expression.
  • Webhooks: get a POST to your endpoint the moment a run finishes.
  • Pipe to BigQuery / Snowflake / Postgres: native Apify integrations move datasets straight into your warehouse.

🌟 Beyond business use cases

πŸŽ“ Education. Teach a data class? Have students pull their own Web Summit dataset in 5 seconds and analyze it in pandas.

πŸ§ͺ Personal research. Track your favourite freelancers, agencies, or speakers over time.

🀝 Non-profit & open data. Build public dashboards of who's working where and on what.

🧰 Tinkering & prototyping. Spin up a real dataset in seconds to test a new visualization library or BI tool.

πŸ€– Ask an AI assistant about this scraper

Pop this README into ChatGPT, Claude, or any AI assistant and ask it to map your specific workflow to the actor's inputs. The schema, examples, and field list above contain everything an LLM needs to design a working pipeline.

❓ Frequently Asked Questions

❓ Do I need an account on Web Summit? No. This actor only reads public pages.

❓ Is this allowed? This actor scrapes only publicly available data. Users are responsible for complying with Web Summit's terms of service and applicable law.

❓ How many records can I pull? Free plan: 10 per run (preview). Paid plans: up to 1,000,000.

❓ Is there a rate limit? Web Summit may throttle aggressive requests. The actor uses respectful pacing and surfaces upstream errors as clean records.

❓ Are values cleaned? Yes. Numeric fields are cast to numbers, dates to ISO strings, arrays preserved as arrays.

❓ How are errors handled? If a profile fails to parse, we push a single record with error populated. The run never crashes mid-batch.

❓ Can I schedule runs? Yes - Apify's native scheduler, or hook this up to Make / Zapier / cron.

❓ Will the schema change? Core identifiers and contact fields are stable. New optional fields may be added; existing fields will not be renamed.

❓ What format can I download? CSV, Excel, JSON, JSONL, XML, RSS, or HTML - straight from the Apify dataset UI.

❓ Can I filter by location, category, or rating? Yes - see the Input section for the full list of supported filters.

πŸ”Œ Integrate with any app

Apify ships native integrations with Make, Zapier, Slack, Discord, Google Drive, Google Sheets, Gmail, Airbyte, Keboola, Telegram, GitHub, and any REST API or webhook endpoint. Trigger runs from a calendar event, a form submission, a cron job, or pipe results straight into BigQuery, Snowflake, or a Postgres warehouse.

πŸ”— Recommended Actors

ActorWhat it does
ParseForge Alpha Vantage ScraperPublic stock, FX, and crypto market data.
ParseForge OurAirports ScraperGlobal airport database.
ParseForge FINRA BrokerCheck ScraperUS broker and adviser public records.
ParseForge FAA Aircraft Registry ScraperUS civil aircraft registry.

πŸ’‘ Pro Tip: browse the complete ParseForge collection for 900+ production-grade scrapers across business intelligence, real estate, e-commerce, sports, finance, and public records.


Disclaimer: This actor scrapes only publicly available data. ParseForge is not affiliated with, endorsed by, or sponsored by Web Summit or any of the third-party services referenced. Users are responsible for complying with the target site's terms of service and applicable law. Create a free account w/ $5 credit.

You might also like

Web Summit Schedule Scraper - Cheap πŸŽ€πŸ“ŠπŸš€πŸŒ

scrapestorm/web-summit-schedule-scraper---cheap

Looking to collect real-time event sessions & speaker insights from Web Summit? πŸŽ€πŸ”Ž With this scraper, you can extract data including session titles, tracks, stages, speakers, schedules, and direct URLs Perfect for event intelligence, networking insights & conference analytics πŸ“ŠπŸš€

2

5.0

Web Summit Events Scraper

piotrv1001/web-summit-events-scraper

The Web Summit Events Scraper extracts paginated event data from Web Summit, capturing titles, dates, times, locations, and participant detailsβ€”ideal for event tracking, networking, and industry research

30

Swapcard Event Scraper β€” Exhibitors, Speakers & Contacts

scrapesage/swapcard-exhibitor-scraper

Scrape any public Swapcard event into structured data: every exhibitor with real company description, email, website, full address, product categories and booth, plus all speakers and sessions. Optional speaker enrichment adds bios, socials, job titles and custom fields.

Sessionize Public Events Scraper

parseforge/sessionize-public-events-scraper

Pull sessions, speakers, or session and speaker pairs from any public Sessionize event using its event ID. Returns talk title, abstract, track, room, time slot, speaker name, bio, company, and social links. Useful for conference research, speaker outreach, and event analytics.

Event Profiles Β· Export Attendees & Speakers

corent1robert/brella-event-profiles

Export the full attendee or speaker list from any Brella event. Get names, companies, titles, emails, and interests in one clean dataset β€” ready for your CRM or outreach tool. Turn any Brella event into a lead list. One run, one spreadsheet, every participant.

πŸ‘ User avatar

Corentin Robert

5

Speaker Bureau Directory Scraper - Keynote Speakers & Fees

jungle_synthesizer/speaker-bureau-directory-scraper

Scrape keynote speaker profiles from major US speakers bureaus. Extract speaker name, tagline, live and virtual fee ranges, travel region, topics, categories, bio, books, profile photo, and bureau booking URL. Built for event planners, PR firms, and competing bureaus.

πŸ‘ User avatar

BowTiedRaccoon

4

Summits Profile Scraper

scrapedrift/summits-profile-scraper

Summits Profile Scraper extracts attendee and speaker profile data, including names, job titles, companies, bios, social links, contact details, and event information from online summit platforms. Ideal for networking, lead generation, event research, and audience analysis.

Linkedin Events Scraper

silentflow/linkedin-events-scraper

Scrape LinkedIn events with full authenticated access. Search by keywords or fetch specific event URLs. Extract titles, descriptions, dates, locations, organizers, attendee counts, speakers, and more. Ideal for sales prospecting, recruiting, market research, and competitive event analysis.

LinkedIn Event Attendees - Speakers & Registrations

alizarin_refrigerator-owner/linkedin-event-attendees-scraper

Export attendees from LinkedIn Events. Get speakers, registered attendees, and interested users.

Related articles

Web crawling vs. web scraping
Read more