OrbTop

Soopage Scraper - Global Business Directory (148 Countries)

LEAD GENERATION, BUSINESS

Soopage Global Business Directory Scraper

Scrape business listings from Soopage.com, a global business directory covering 148 countries. The scraper returns company names, phone and fax numbers, addresses, cities, and countries from over 10 million listings spread across country subdomains such as jpn.soopage.com, cn.soopage.com, and are.soopage.com.


Soopage Scraper Features

  • Extracts structured company data via schema.org microdata — consistent fields across all 148 countries
  • Filters by country subdomain: choose from Japan, China, UAE, India, and 144 more
  • Filters by city: scrape Tokyo's 120K listings or Osaka's 55K without touching the rest
  • Handles Cloudflare's managed challenge transparently by routing requests through residential proxies
  • Respects maxItems so test runs are fast and won't run up your bill

What Can You Do With Soopage Data?

  • B2B sales teams — Build contact lists of local businesses in specific countries or cities for targeted outreach
  • Market researchers — Map business density and distribution across regions by pulling city-level listing counts
  • Data aggregators — Supplement thinner national business datasets with Soopage's global coverage
  • Lead generation agencies — Batch-collect phone and fax numbers for international clients who need country-specific leads

How Soopage Scraper Works

  1. Reads the cities list from {subdomain}.soopage.com/cities.html to discover all available city pages (or uses your cityFilter directly).
  2. Paginates through city listing pages at /city/{City}.html, /city/{City}-page2.html, and so on. Each listing page links to 20 company pages.
  3. Fetches each company detail page and extracts schema.org microdata: name, phone, fax, address, city, country, and description.
  4. Stops when maxItems is reached.
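
The URL patterns in the steps above can be sketched as a few small helpers. This is a minimal illustration, not the actor's own code: the function names are hypothetical, the https scheme is taken from the sample listing_url, and how Soopage encodes city names that contain spaces or non-ASCII characters is not covered here.

```python
PER_PAGE = 20  # company links per city listing page, per the steps above


def cities_index_url(subdomain: str) -> str:
    """URL of the page that lists every city for one country subdomain."""
    return f"https://{subdomain}.soopage.com/cities.html"


def city_page_url(subdomain: str, city: str, page: int = 1) -> str:
    """URL of one city listing page; page 1 has no '-pageN' suffix."""
    if page < 1:
        raise ValueError("page numbers start at 1")
    suffix = "" if page == 1 else f"-page{page}"
    return f"https://{subdomain}.soopage.com/city/{city}{suffix}.html"


def city_page_urls(subdomain: str, city: str, max_items: int) -> list[str]:
    """Listing pages needed to collect max_items links, assuming full pages."""
    pages = -(-max_items // PER_PAGE)  # ceiling division
    return [city_page_url(subdomain, city, p) for p in range(1, pages + 1)]


if __name__ == "__main__":
    for url in city_page_urls("jpn", "Tokyo", max_items=100):
        print(url)
```

With maxItems set to 100 and 20 links per page, the sketch yields five listing page URLs for Tokyo, the first without a page suffix.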

Input

{
  "countrySubdomain": "jpn",
  "cityFilter": "Tokyo",
  "maxItems": 100,
  "proxyConfiguration": {
    "useApifyProxy": true,
    "apifyProxyGroups": ["RESIDENTIAL"]
  }
}

| Field | Type | Default | Description |
|---|---|---|---|
| countrySubdomain | string | "jpn" | Country subdomain to scrape (e.g. jpn for Japan, cn for China, are for UAE) |
| cityFilter | string | "" | Optional city name filter (e.g. Tokyo, Osaka). Leave empty to scrape all cities |
| maxItems | integer | 10 | Maximum number of listings to return |
| proxyConfiguration | object | residential | Proxy settings. A residential proxy is required for the Cloudflare bypass on paginated pages |
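
If you build the input programmatically, a small helper can apply the documented defaults and catch typos in field names before a run starts. The sketch below is an assumption-laden illustration: build_run_input is a hypothetical helper, and the "residential" default for proxyConfiguration is interpreted as the proxy object shown in the example input.

```python
import copy
from typing import Any

# Defaults as listed in the input table; the proxy object mirrors the
# example input and is an interpretation of the "residential" default.
DEFAULTS: dict[str, Any] = {
    "countrySubdomain": "jpn",
    "cityFilter": "",
    "maxItems": 10,
    "proxyConfiguration": {
        "useApifyProxy": True,
        "apifyProxyGroups": ["RESIDENTIAL"],
    },
}


def build_run_input(**overrides: Any) -> dict[str, Any]:
    """Merge overrides into the defaults and run basic sanity checks."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise KeyError(f"unknown input fields: {sorted(unknown)}")
    run_input = copy.deepcopy(DEFAULTS)
    run_input.update(overrides)
    max_items = run_input["maxItems"]
    if not isinstance(max_items, int) or max_items < 1:
        raise ValueError("maxItems must be a positive integer")
    return run_input


if __name__ == "__main__":
    print(build_run_input(cityFilter="Osaka", maxItems=50))
```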

Soopage Scraper Output Fields

{
  "name": "4 LIFE CITY CO., LTD.",
  "phone": "03-3448-9555 (+81-03-3448-9555)",
  "fax": "075-495-6612 (+81-075-495-6612)",
  "address": "2-14-5 Kami-osaki, Shinagawa-ku, Tokyo",
  "city": "Tokyo",
  "country": "Japan",
  "description": "4 LIFE CITY CO., LTD. is a company at Japan, Address is 2-14-5 Kami-osaki, Shinagawa-ku, Tokyo",
  "listing_url": "https://jpn.soopage.com/company/4-LIFE-CITY-CO-LTD_5.html"
}

| Field | Type | Description |
|---|---|---|
| name | string | Company name |
| phone | string | Telephone number, may include country code |
| fax | string | Fax number if available |
| address | string | Street address |
| city | string | City or area |
| country | string | Country name |
| description | string | Auto-generated company description |
| listing_url | string | URL to the Soopage company profile page |
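
In the sample record, phone and fax carry two forms of the same number: a local form followed by an international form in parentheses. A post-processing step can split them. The sketch below assumes every value follows the layout of the sample; split_phone and PhoneParts are hypothetical names, and values in other layouts are returned unchanged as the local part.

```python
import re
from typing import NamedTuple, Optional


class PhoneParts(NamedTuple):
    local: str
    international: Optional[str]


# Local number, then an optional "(+...)" international form.
_PHONE_RE = re.compile(
    r"^\s*(?P<local>[^()]+?)\s*(?:\((?P<intl>\+[^()]+)\))?\s*$"
)


def split_phone(raw: str) -> PhoneParts:
    """Split 'local (+international)' into its two parts."""
    match = _PHONE_RE.match(raw)
    if match is None:
        # Unrecognized layout: keep the raw value rather than guess.
        return PhoneParts(raw.strip(), None)
    return PhoneParts(match.group("local"), match.group("intl"))


if __name__ == "__main__":
    print(split_phone("03-3448-9555 (+81-03-3448-9555)"))
```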

FAQ

Which countries are supported?

All 148 country subdomains on Soopage.com. Common ones: jpn (Japan, ~394K listings), cn / chn (China), are (UAE), in (India), id (Indonesia), sgp (Singapore), kor (South Korea). Check soopage.com/countries.html for the full list.

Why do I need a residential proxy?

Soopage.com sits behind Cloudflare's CDN. The first page of each city listing loads without a challenge, but page 2 and beyond trigger a managed challenge. Residential proxies get past it automatically, whereas datacenter IPs are blocked on paginated pages.
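
If you fetch Soopage pages yourself and want to tell a challenged response from a normal one, one option is to inspect the response headers. Cloudflare's documentation describes a cf-mitigated: challenge response header on challenged requests. Whether this actor relies on that header is not stated, so treat the helper below as a hypothetical sketch.

```python
from typing import Mapping


def is_cloudflare_challenge(headers: Mapping[str, str]) -> bool:
    """True when the response headers mark a Cloudflare challenge page."""
    # Header names are case-insensitive, so normalize before the lookup.
    normalized = {name.lower(): value for name, value in headers.items()}
    return normalized.get("cf-mitigated", "").strip().lower() == "challenge"


if __name__ == "__main__":
    print(is_cloudflare_challenge({"CF-Mitigated": "challenge"}))
    print(is_cloudflare_challenge({"Content-Type": "text/html"}))
```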

How long does a full Japan crawl take?

Japan has ~394K listings across 49 cities. At 20 listings per page, that is roughly 19,700 city-listing page requests plus 394K detail page requests, or about 414K requests in total. Runtime scales with that request count. For targeted runs, use cityFilter to focus on a single city and set maxItems to a reasonable cap.
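
The request count can be reproduced with a short calculation, which also makes it easy to size runs for other countries or cities. The function names are hypothetical, and requests_per_second is a placeholder you supply yourself, not a measured throughput of this actor. The estimate ignores the single cities.html request and partially filled last pages.

```python
import math


def crawl_request_estimate(listings: int, per_page: int = 20) -> dict[str, int]:
    """Approximate request counts for crawling a given number of listings."""
    listing_pages = math.ceil(listings / per_page)
    return {
        "listing_pages": listing_pages,
        "detail_pages": listings,
        "total": listing_pages + listings,
    }


def estimated_hours(total_requests: int, requests_per_second: float) -> float:
    """Wall-clock hours at a sustained request rate that you assume."""
    return total_requests / requests_per_second / 3600


if __name__ == "__main__":
    japan = crawl_request_estimate(394_000)
    print(japan)  # {'listing_pages': 19700, 'detail_pages': 394000, 'total': 413700}
    # 5 requests per second is an arbitrary illustrative rate.
    print(estimated_hours(japan["total"], requests_per_second=5.0))
```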

Does the data include email addresses or websites?

No. Soopage listings are thin by design. The available fields are name, phone, fax, address, city, country, and description. No email, no website, no categories. For richer profiles, consider the japanyello-scraper for Japan or the yello-network-scraper for other countries.


Need More Features?

Open an issue on the actor page or contact support with your use case.

Why Use Soopage Scraper?

  • Zero competition — No other Apify actor covers Soopage.com's 148-country global dataset
  • Scale — 10M+ listings across 148 countries from a single actor, configurable by country and city
  • Clean extraction — schema.org microdata means structured fields, not brittle CSS selectors that break on site redesigns
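
To illustrate what microdata-based extraction looks like, here is a minimal sketch that collects itemprop values using only the Python standard library. It is not the actor's implementation. The sample HTML is invented for the example, and the property names (name, telephone, address) are standard schema.org properties that Soopage may or may not use in exactly this form.

```python
from html.parser import HTMLParser
from typing import Optional

VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class ItempropParser(HTMLParser):
    """Collects the text content of elements that carry an itemprop."""

    def __init__(self) -> None:
        super().__init__()
        self.fields: dict[str, str] = {}
        self._current: Optional[str] = None
        self._depth = 0
        self._buffer: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return  # void elements have no text and no closing tag
        if self._current is not None:
            self._depth += 1  # nested element inside a captured one
            return
        itemprop = dict(attrs).get("itemprop")
        if itemprop:
            self._current, self._depth, self._buffer = itemprop, 1, []

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self._current is None:
            return
        self._depth -= 1
        if self._depth == 0:
            text = " ".join("".join(self._buffer).split())
            self.fields.setdefault(self._current, text)  # keep first value
            self._current = None

    def handle_data(self, data):
        if self._current is not None:
            self._buffer.append(data)


# Invented sample markup, for illustration only.
SAMPLE_HTML = """
<div itemscope itemtype="https://schema.org/Organization">
  <h1 itemprop="name">4 LIFE CITY CO., LTD.</h1>
  <span itemprop="telephone">03-3448-9555 (+81-03-3448-9555)</span>
  <span itemprop="address">2-14-5 Kami-osaki,<br> Shinagawa-ku, Tokyo</span>
</div>
"""

if __name__ == "__main__":
    parser = ItempropParser()
    parser.feed(SAMPLE_HTML)
    print(parser.fields)
```

Because the lookup keys on itemprop attributes rather than on tag names or class names, a change in the surrounding layout does not affect it as long as the microdata attributes stay in place.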