OrbTop

Japan Company Data Scraper (gBizINFO / gBiz)

BUSINESSLEAD GENERATIONOTHER

gBizINFO Japan Company Scraper

Scrape enriched Japanese company profiles from gBizINFO, the METI-operated open data portal covering 4.5M+ Japanese corporations. Returns corporate number (法人番号), capital, employees, representative, address, JSIC industry, government contracts, certifications, subsidies, and patent counts in clean JSON.


gBizINFO Scraper Features

  • Extracts corporate records across 4.5M+ Japanese companies. That's the full METI registry, enriched with data most foreign tools skip.
  • Two search modes — lookup by 法人番号 list, or filter by name/prefecture/industry/capital/employee count.
  • Returns structured JSON with 27 fields covering corporate identity, financials, and workforce breakdown.
  • Optional enrichment: government contracts (procurement), certifications, subsidies, commendations, and patent counts — one API call each, configurable per run.
  • Pure API scraping — no browser, no proxies, no CAPTCHAs.
  • Ships with a built-in METI demo token. Works out of the box. Bring your own token for larger runs.

Who Uses gBizINFO Data?

  • B2B sales teams — Enrich Japanese lead lists with capital, headcount, and industry code for account scoring.
  • M&A advisors — Pull target-company profiles (revenue proxy via capital, JSIC code, government-contract history) before the first call.
  • KYC analysts — Verify corporate registry status, representative, and registered address against 法人番号 without buying a TSR subscription.
  • Supply chain researchers — Map suppliers by JSIC industry code and prefecture, then pull government contracts to gauge stability.
  • Japan market entrants — Identify potential distributors or partners by filtering companies on capital ≥ ¥100M in Tokyo.
  • Procurement analysts — Track which companies are winning government contracts, and in which ministries, over time.

How gBizINFO Scraper Works

  1. Pick a search mode. Supply either a list of 法人番号 (byNumber) or a set of filters like company name, prefecture, capital range (bySearch).
  2. The scraper paginates the gBizINFO search API and collects matching corporate numbers.
  3. Each hit is then fetched from the detail endpoint, which returns the rich record — capital, employees, representative, JSIC industry, and the rest.
  4. If you toggled on enrichments, it fires one extra API call per company for certifications, procurement, subsidies, commendations, or patent counts, then flattens the arrays into formatted strings.

Input

{
    "searchMode": "bySearch",
    "nameQuery": "ソニー",
    "maxItems": 50,
    "includeCertifications": false,
    "includeProcurement": false
}
Field Type Default Description
searchMode string bySearch bySearch (filter queries) or byNumber (explicit 法人番号 list).
corporateNumbers array [] 13-digit 法人番号 list. Used only when searchMode=byNumber.
nameQuery string "" Company name partial match. Accepts Japanese or English.
corporateType string "" Corporate type code. 301 株式会社, 302 有限会社, 305 合同会社, etc. Comma-separate for multiple.
prefectureCode string "" JIS X 0401 2-digit prefecture code. 13 Tokyo, 27 Osaka, 14 Kanagawa.
capitalMin integer 0 Minimum capital (JPY).
capitalMax integer 0 Maximum capital (JPY).
employeeMin integer 0 Minimum employee count.
employeeMax integer 0 Maximum employee count.
existingOnly boolean true Exclude companies with closed/merged registry status.
includeCertifications boolean false Fetch certifications (ISO, DX認定, METI). Adds one API call per record.
includeProcurement boolean false Fetch government contracts received. Adds one API call per record.
includeSubsidies boolean false Fetch subsidies awarded. Adds one API call per record.
includePatents boolean false Fetch total patent count. Adds one API call per record.
includeCommendations boolean false Fetch commendations/awards. Adds one API call per record.
apiToken string "" Optional personal gBizINFO token. Blank uses METI's public demo token.
maxItems integer 50 Maximum records to return.

Enrich a specific 法人番号 list

{
    "searchMode": "byNumber",
    "corporateNumbers": ["7010401022916", "1010001126313", "7000012010029"],
    "includeProcurement": true,
    "includeCertifications": true,
    "maxItems": 3
}

Find Tokyo software companies with ¥100M+ capital

{
    "searchMode": "bySearch",
    "prefectureCode": "13",
    "capitalMin": 100000000,
    "corporateType": "301",
    "maxItems": 100
}

gBizINFO Scraper Output Fields

{
    "corporate_number": "7010401022916",
    "name": "日本電気株式会社",
    "name_en": "NEC Corporation",
    "kana": "にっぽんでんき",
    "representative_name": "取締役代表執行役社長兼CEO 森 田 隆 之",
    "postal_code": "1080014",
    "location": "東京都港区芝5丁目7番1号",
    "status": "-",
    "capital_stock": 427831000000,
    "employee_number": 21004,
    "company_size_male": 16869,
    "company_size_female": 5635,
    "business_items": "107,108,110,111,112,114,115,116,117,118",
    "business_summary": "社会公共事業、社会基盤事業、エンタープライズ事業、ネットワークサービス事業、グローバル事業",
    "company_url": "https://jpn.nec.com/inclusion-diversity/",
    "qualification_grade": "A、A、A、A",
    "date_of_establishment": "1899-07-17",
    "update_date": "2018-11-21",
    "certifications": ["2025-02-01 | DX認定制度 | 経済産業省"],
    "procurement": ["2021-07-20 | 海洋生分解性プラスチックの社会実装に向けた技術開発事業 | 国立研究開発法人新エネルギー・産業技術総合開発機構"],
    "subsidies": [],
    "commendations": [],
    "patent_count": -1,
    "source_url": "https://info.gbiz.go.jp/hojin/ichiran/7010401022916",
    "scraped_at": "2026-04-23T10:48:12.723Z"
}
Field Type Description
corporate_number string 13-digit 法人番号.
name string Corporation name in Japanese.
name_en string English name (if registered).
kana string Name reading in kana.
representative_name string Representative name and title.
postal_code string 7-digit postal code (no hyphen).
location string Registered address in Japanese.
status string Registry status (- active, closed/merged).
capital_stock number Capital stock (JPY).
employee_number number Total employees.
company_size_male number Male employees.
company_size_female number Female employees.
business_items string JSIC-like business-item codes, comma-joined.
business_summary string Business description (Japanese).
company_url string Corporate website URL.
qualification_grade string Government procurement grade (格付け等級).
date_of_establishment string YYYY-MM-DD founding date.
update_date string Last registry update date.
certifications array date | title | grantor strings (if enabled).
procurement array date | title | amount | awarder strings (if enabled).
subsidies array date | title | amount | grantor strings (if enabled).
commendations array date | title | grantor strings (if enabled).
patent_count integer Total patent count. -1 unless includePatents=true.
source_url string gBizINFO web page for this company.
scraped_at string ISO timestamp when the record was collected.

FAQ

How do I scrape Japanese company data from gBizINFO?

gBizINFO Scraper accepts either a list of 法人番号 or a filter query (name, prefecture, capital, industry). Drop your input, run it, collect the JSON. The API returns results fast — a 50-company pull with base fields finishes in under a minute.

Do I need a gBizINFO API token?

No. The actor ships with METI's publicly-embedded demo token and works out of the box. That said, METI publishes the demo token for evaluation, not production volume — bring your own free token from https://info.gbiz.go.jp/hojin/various_registration/form if you plan to run larger jobs.

What data can I get from gBizINFO?

gBizINFO Scraper returns 法人番号, company name (JP/EN/kana), representative, registered address, capital stock, employee count (plus male/female split), JSIC industry codes, company URL, and registry status. Turn on the enrichment flags to also pull government contracts, certifications (ISO, DX認定, METI), subsidies received, commendations, and patent counts.

Can I filter companies by prefecture or capital?

gBizINFO Scraper supports prefecture (JIS 2-digit code), capital range, employee range, corporate type, and name-match filters. Combine them to build specific prospect lists — for example, Tokyo 株式会社 with capital between ¥100M and ¥1B and 50+ employees.

Does gBizINFO Scraper need proxies?

No. The gBizINFO API has no WAF, no CAPTCHA, and generous rate limits. The actor runs direct against info.gbiz.go.jp with a polite ~6 req/s pace. Proxy configuration is exposed in the input, but leave it off by default.

How much does gBizINFO Scraper cost to run?

$0.10 per run start plus $0.001 per company record. A 1,000-company pull with no enrichments costs about $1.10. Enrichment flags add API calls but not charges — cost scales with records, not requests.


Need More Features?

Need different enrichment fields, historical snapshots, or a bulk download format? File an issue or get in touch.

Why Use gBizINFO Scraper?

  • Affordable — $0.001 per company record, so a 10,000-company enrichment run costs ~$10.10.
  • Unique coverage — The only Apify actor wrapping gBizINFO, which itself is richer than the bare 法人番号 registry and an order of magnitude cheaper than commercial Japanese corporate databases.
  • Clean output — Returns flat JSON with consistent field names. No nested arrays to untangle, no katakana encoding surprises, no HTML leftover from a page scrape.