OrbTop

Hatarako Japan Haken & Temp Staffing Job Scraper

JOBSLEAD GENERATION

Hatarako Japan Haken & Temp Staffing Job Scraper

Scrapes haken (派遣) and temporary staffing job listings from hatarako.net — Japan's leading dispatch job aggregator. Returns structured records with hourly wage, occupation, location, staffing agency, work hours, contract type, and start date.

The haken (dispatch) labor model covers roughly 20% of Japan's workforce. It has no meaningful equivalent in Western markets, which is why the data doesn't exist anywhere else. This scraper gives you the structured, field-level data — hourly wage by occupation, by prefecture — that Glassdoor and Indeed don't provide for Japan.

What It Extracts

Covers every prefecture and occupation category listed on hatarako.net. Each record comes from the site's JobPosting JSON-LD and the detail page's structured field table.

Field Description
job_id Numeric job ID from the URL (e.g., 9000547477)
title Job title as listed (Japanese text)
staffing_agency Name of the dispatch (haken) staffing company
hourly_wage_min_jpy Minimum hourly wage in JPY
hourly_wage_max_jpy Maximum hourly wage in JPY, when listed
occupation Job occupation/category (e.g., 一般事務, データ入力, 軽作業)
location Prefecture and ward/city (e.g., 東京都 / 新宿区)
work_hours Daily work hours (e.g., 09:00 ~ 17:00 *休憩60分)
contract_type Contract form (e.g., 派遣(一般派遣), 紹介予定派遣)
start_date Expected start date or period (free text)
required_skills Required qualifications from 応募資格 field
source_url Direct link to the job posting

How It Works

The scraper walks a sitemap that covers 408 area × occupation combinations across Japan. For each listing page, it collects unique job URLs and follows pagination. Then it fetches each job detail page and parses the embedded JobPosting JSON-LD block, supplemented by the dt/dd HTML table for fields like work hours and required skills.

No browser rendering needed. No proxy required. The site returns clean server-side HTML.

For large runs, all 408 area/occupation seeds are used. For capped runs (maxItems ≤ 20), a single high-density listing page (Tokyo / office work) is used as the seed to keep runs fast.

Input

Parameter Type Default Description
maxItems Integer 10 Maximum number of job records to return. Set to 0 for uncapped.

Set maxItems to a few thousand for a representative sample across major prefectures and occupations. Set to 0 only when you want the full corpus — hatarako.net carries tens of thousands of active listings.

Example Output

{
  "job_id": "9000547477",
  "title": "一般事務・OA事務/シフト勤務・日払い・週払い・未経験OK・40代活躍",
  "staffing_agency": "キャリアリンク株式会社(東証プライム市場)",
  "hourly_wage_min_jpy": 1730,
  "hourly_wage_max_jpy": null,
  "occupation": "一般事務・OA事務",
  "location": "東京都 / 新宿区",
  "work_hours": "09:00 ~ 17:00 *休憩60分 [残業予定] ほとんどなし",
  "contract_type": "派遣(一般派遣)",
  "start_date": "6月中旬~長期",
  "required_skills": "・PC基本操作が可能な方(文字入力/Excel使用)",
  "source_url": "https://www.hatarako.net/job/9000547477/"
}

Use Cases

  • Labor market research — hourly wage benchmarking by occupation and prefecture across the haken sector
  • Staffing industry analysis — which dispatch agencies dominate which occupations and regions
  • HR tech data pipelines — structured JP dispatch job data for salary comparison tools
  • Academic research — longitudinal dataset for studying haken wage trends and geographic distribution

Notes

  • Output is in Japanese. Field values are the site's native Japanese text — no machine translation applied.
  • hourly_wage_max_jpy is null for jobs that list only a minimum wage (common for dispatch).
  • required_skills is free text from the 応募資格 section, capped at 500 characters per record.
  • The site's robots.txt disallows / globally but serves 200 responses to standard user-agent requests. The scraper uses a browser-compatible user-agent and polite concurrency.

Data sourced from hatarako.net. This actor is provided by OrbTop.