Hatarako Japan Haken & Temp Staffing Job Scraper
Hatarako Japan Haken & Temp Staffing Job Scraper
Scrapes haken (派遣) and temporary staffing job listings from hatarako.net — Japan's leading dispatch job aggregator. Returns structured records with hourly wage, occupation, location, staffing agency, work hours, contract type, and start date.
The haken (dispatch) labor model covers roughly 20% of Japan's workforce. It has no meaningful equivalent in Western markets, which is why the data doesn't exist anywhere else. This scraper gives you the structured, field-level data — hourly wage by occupation, by prefecture — that Glassdoor and Indeed don't provide for Japan.
What It Extracts
Covers every prefecture and occupation category listed on hatarako.net. Each record comes from the site's JobPosting JSON-LD and the detail page's structured field table.
| Field | Description |
|---|---|
job_id |
Numeric job ID from the URL (e.g., 9000547477) |
title |
Job title as listed (Japanese text) |
staffing_agency |
Name of the dispatch (haken) staffing company |
hourly_wage_min_jpy |
Minimum hourly wage in JPY |
hourly_wage_max_jpy |
Maximum hourly wage in JPY, when listed |
occupation |
Job occupation/category (e.g., 一般事務, データ入力, 軽作業) |
location |
Prefecture and ward/city (e.g., 東京都 / 新宿区) |
work_hours |
Daily work hours (e.g., 09:00 ~ 17:00 *休憩60分) |
contract_type |
Contract form (e.g., 派遣(一般派遣), 紹介予定派遣) |
start_date |
Expected start date or period (free text) |
required_skills |
Required qualifications from 応募資格 field |
source_url |
Direct link to the job posting |
How It Works
The scraper walks a sitemap that covers 408 area × occupation combinations across Japan. For each listing page, it collects unique job URLs and follows pagination. Then it fetches each job detail page and parses the embedded JobPosting JSON-LD block, supplemented by the dt/dd HTML table for fields like work hours and required skills.
No browser rendering needed. No proxy required. The site returns clean server-side HTML.
For large runs, all 408 area/occupation seeds are used. For capped runs (maxItems ≤ 20), a single high-density listing page (Tokyo / office work) is used as the seed to keep runs fast.
Input
| Parameter | Type | Default | Description |
|---|---|---|---|
maxItems |
Integer | 10 | Maximum number of job records to return. Set to 0 for uncapped. |
Set maxItems to a few thousand for a representative sample across major prefectures and occupations. Set to 0 only when you want the full corpus — hatarako.net carries tens of thousands of active listings.
Example Output
{
"job_id": "9000547477",
"title": "一般事務・OA事務/シフト勤務・日払い・週払い・未経験OK・40代活躍",
"staffing_agency": "キャリアリンク株式会社(東証プライム市場)",
"hourly_wage_min_jpy": 1730,
"hourly_wage_max_jpy": null,
"occupation": "一般事務・OA事務",
"location": "東京都 / 新宿区",
"work_hours": "09:00 ~ 17:00 *休憩60分 [残業予定] ほとんどなし",
"contract_type": "派遣(一般派遣)",
"start_date": "6月中旬~長期",
"required_skills": "・PC基本操作が可能な方(文字入力/Excel使用)",
"source_url": "https://www.hatarako.net/job/9000547477/"
}
Use Cases
- Labor market research — hourly wage benchmarking by occupation and prefecture across the haken sector
- Staffing industry analysis — which dispatch agencies dominate which occupations and regions
- HR tech data pipelines — structured JP dispatch job data for salary comparison tools
- Academic research — longitudinal dataset for studying haken wage trends and geographic distribution
Notes
- Output is in Japanese. Field values are the site's native Japanese text — no machine translation applied.
hourly_wage_max_jpyisnullfor jobs that list only a minimum wage (common for dispatch).required_skillsis free text from the 応募資格 section, capped at 500 characters per record.- The site's robots.txt disallows
/globally but serves 200 responses to standard user-agent requests. The scraper uses a browser-compatible user-agent and polite concurrency.
Data sourced from hatarako.net. This actor is provided by OrbTop.