Karastan Carpet & Rug Catalog Scraper
Scrapes the full Karastan residential carpet and area-rug catalog from karastan.com. Returns product specs — fiber type, face weight, pile height, density, colorway, collection name, and 15+ additional technical fields — for every SKU in the catalog.
What It Collects
Karastan doesn't publish a public data feed. This scraper finds all ~385 product URLs via the site's internal content API, then renders each product detail page in a real browser to extract the specs Angular loads after hydration.
Features:
- Discovers the full product catalog automatically — no URL list required
- Accepts a custom list of specific product page URLs if you only need a subset
- Filters by product type (carpet, area-rug, luxury-vinyl, engineered-wood)
- Extracts identity fields from the URL itself, so collection name, colorway code, and product type are always present regardless of page-render timing
- Flags SmartStrand fiber, wool content, pet-friendly certification, and CRI Green Label status
- Returns primary product image URL and dealer locator link
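The URL-derived identity fields in the feature list can be illustrated with a short sketch. It assumes the path layout seen in the example URLs in this README (`/shop/<Type>/detail/<style>/<Collection_Name>/<code>/<Colorway_Name>`); the function name `parse_product_url` and the type normalization are hypothetical, not the scraper's actual code.

```python
from urllib.parse import urlparse, unquote


def parse_product_url(url: str) -> dict:
    """Split a product URL into the identity fields it encodes (illustrative)."""
    parts = [unquote(p) for p in urlparse(url).path.split("/") if p]
    # Assumed path shape:
    # shop/<Type>/detail/<style>/<Collection_Name>/<code>/<Colorway_Name>
    if len(parts) != 7 or parts[0] != "shop" or parts[2] != "detail":
        raise ValueError(f"unrecognized product URL: {url}")
    _, raw_type, _, style, collection, code, colorway = parts
    return {
        "product_id": style,
        # Assumption: the type segment maps to the output value by
        # lowercasing and swapping underscores for hyphens.
        "product_type": raw_type.replace("_", "-").lower(),
        "collection_name": collection.replace("_", " "),
        "colorway_code": code,
        "colorway_name": colorway.replace("_", " "),
    }


if __name__ == "__main__":
    print(parse_product_url(
        "https://www.karastan.com/shop/Carpet/detail/K8918/Yorkshire_Garden/9701/Magnolia"
    ))
```

Because these values come from the URL alone, they are available even if the rendered page fails to expose the spec component in time.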
Use Cases
- Interior designers cross-referencing Karastan specifications against competitor fiber grades and face weights
- Flooring dealers building internal spec sheets and comparison tools without manual data entry
- Procurement teams benchmarking residential carpet specs across collections for specification documents
- Market researchers tracking Karastan's collection catalog for SKU expansion and product lifecycle analysis
How It Works
- Fetches the Karastan CMS product-pixel API — a single call that returns all product URLs without requiring a browser
- Applies any product type filters from your input
- Renders each product detail page using a headless Chromium browser (the spec data is loaded by Angular after the initial page shell)
- Waits for Angular to hydrate the spec component, then extracts all technical specification fields
- Saves records to the dataset, one per product colorway
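As a rough sketch of the filtering step and the `maxItems` cap described above, the following applies both to a list of already-discovered URLs. The helper names (`url_product_type`, `select_urls`) are hypothetical, and the assumption that the product type is the path segment after `/shop/` is inferred from the example URLs in this README, not confirmed for every product type.

```python
from urllib.parse import urlparse


def url_product_type(url: str):
    """Guess the product type from the path segment after /shop/ (assumption)."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) < 2 or parts[0] != "shop":
        return None
    return parts[1].replace("_", "-").lower()


def select_urls(urls, product_types=(), max_items=10):
    """Keep URLs matching the type filter, stopping at max_items (0 = no limit)."""
    wanted = {t.lower() for t in product_types}
    selected = []
    for url in urls:
        # An empty filter means every product type is accepted.
        if wanted and url_product_type(url) not in wanted:
            continue
        selected.append(url)
        if max_items and len(selected) >= max_items:
            break
    return selected
```

Filtering before rendering matters for cost: only the URLs that survive this step would be opened in the browser.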
Input
| Parameter | Type | Default | Description |
|---|---|---|---|
| maxItems | integer | 10 | Maximum number of product records to scrape. Set to 0 for no limit (full catalog ~385 SKUs). |
| startUrls | array | [] | Optional list of specific Karastan product page URLs to scrape. If empty, the full catalog is discovered automatically. |
| productTypes | array | [] | Filter to specific product types: carpet, area-rug, luxury-vinyl, engineered-wood. Leave empty to scrape all types. |
Default input example:
{
  "maxItems": 10
}
BYO URL input example (scrape specific products):
{
  "maxItems": 5,
  "startUrls": [
    { "url": "https://www.karastan.com/shop/Carpet/detail/K8918/Yorkshire_Garden/9701/Magnolia" },
    { "url": "https://www.karastan.com/shop/Carpet/detail/43771/Gossamer_Hues/9782/Ivory" }
  ]
}
Product type filter example:
{
  "maxItems": 50,
  "productTypes": ["carpet", "area-rug"]
}
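If you generate input programmatically, a small builder can catch mistakes before a run starts. This is a minimal sketch based only on the parameter table above; `build_input` and `ALLOWED_TYPES` are hypothetical names, and the scraper itself may validate input differently.

```python
import json

# Product types listed in the parameter table above.
ALLOWED_TYPES = {"carpet", "area-rug", "luxury-vinyl", "engineered-wood"}


def build_input(max_items=10, start_urls=(), product_types=()):
    """Build an input payload shaped like the JSON examples above."""
    if not isinstance(max_items, int) or max_items < 0:
        raise ValueError("maxItems must be a non-negative integer (0 = no limit)")
    unknown = sorted(set(product_types) - ALLOWED_TYPES)
    if unknown:
        raise ValueError(f"unsupported productTypes: {unknown}")
    payload = {"maxItems": max_items}
    # Optional keys are omitted when empty, matching the default example.
    if start_urls:
        payload["startUrls"] = [{"url": u} for u in start_urls]
    if product_types:
        payload["productTypes"] = list(product_types)
    return payload


if __name__ == "__main__":
    print(json.dumps(build_input(50, product_types=["carpet", "area-rug"]), indent=2))
```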
Output
One record per product colorway. Each record contains:
| Field | Type | Description |
|---|---|---|
| product_id | string | Karastan style number (e.g. K8918) |
| sku | string | Same as product_id for Karastan |
| product_type | string | carpet, area-rug, luxury-vinyl, or engineered-wood |
| collection_name | string | Collection name (e.g. Yorkshire Garden) |
| colorway_code | string | Colorway numeric code (e.g. 9701) |
| colorway_name | string | Colorway name (e.g. Magnolia) |
| style_category | string | Casual, Classic, or Modern |
| design_pattern | string | solid, textured, floral, geometric |
| fiber_material | string | Fiber content (e.g. SmartStrand Silk Reserve) |
| face_weight_oz_per_sqyd | number | Face weight in oz/sq yd |
| pile_height_inches | number | Pile height in inches |
| density_oz_per_cubic_yd | number | Density in oz/cubic yd |
| twist_level | string | Twist level or twists per inch |
| backing_type | string | Backing material |
| widths_available | string | Available widths (e.g. 12 ft, 15 ft) |
| is_smartstrand | boolean | SmartStrand fiber technology |
| is_wool | boolean | Contains wool fiber |
| is_pet_friendly | boolean | Pet-friendly certification |
| warranty_years | number | Warranty years (999 = lifetime) |
| cri_green_label | boolean | CRI Green Label Plus certified |
| sample_orderable | boolean | Sample can be ordered from the page |
| retail_price_estimate_usd | number | Retail price if displayed (typically null) |
| dealer_locator_url | string | Karastan dealer locator URL |
| product_url | string | Full product detail page URL |
| primary_image_url | string | Primary product image URL |
| last_modified | string | Last modified date from sitemap if available |
Sample record:
{
  "product_id": "K8918",
  "sku": "K8918",
  "product_type": "carpet",
  "collection_name": "Yorkshire Garden",
  "colorway_code": "9701",
  "colorway_name": "Magnolia",
  "style_category": "Classic",
  "design_pattern": "floral",
  "fiber_material": "SmartStrand Silk Reserve",
  "face_weight_oz_per_sqyd": 60.0,
  "pile_height_inches": 0.5,
  "density_oz_per_cubic_yd": 3600,
  "twist_level": "6",
  "backing_type": "Action Bac",
  "widths_available": "12 ft, 15 ft",
  "is_smartstrand": true,
  "is_wool": false,
  "is_pet_friendly": true,
  "warranty_years": 25,
  "cri_green_label": true,
  "sample_orderable": true,
  "retail_price_estimate_usd": null,
  "dealer_locator_url": "https://www.karastan.com/find-a-retailer",
  "product_url": "https://www.karastan.com/shop/Carpet/detail/K8918/Yorkshire_Garden/9701/Magnolia",
  "primary_image_url": "https://www.karastan.com/product-images/K8918-9701.jpg",
  "last_modified": null
}
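When consuming records downstream, two conventions from the field table need handling: `warranty_years` uses 999 to mean a lifetime warranty, and `retail_price_estimate_usd` is typically null. The sketch below shows one way to turn a record into a readable summary; `summarize` is a hypothetical helper, not part of the scraper.

```python
# Boolean flag fields from the output table, paired with display labels.
FLAG_LABELS = (
    ("is_smartstrand", "SmartStrand"),
    ("is_wool", "wool"),
    ("is_pet_friendly", "pet-friendly"),
    ("cri_green_label", "CRI Green Label Plus"),
)


def summarize(record: dict) -> str:
    """Return a one-line summary of a scraped record (illustrative)."""
    years = record.get("warranty_years")
    if years is None:
        warranty = "warranty not listed"
    elif years == 999:  # sentinel for a lifetime warranty
        warranty = "lifetime warranty"
    else:
        warranty = f"{years}-year warranty"

    price = record.get("retail_price_estimate_usd")
    price_text = "price not displayed" if price is None else f"${price:,.2f}"

    flags = [label for key, label in FLAG_LABELS if record.get(key)]
    return (
        f"{record['collection_name']} / {record['colorway_name']}: "
        f"{warranty}, {price_text}, flags: {', '.join(flags) or 'none'}"
    )
```

Treating 999 as a sentinel rather than a number avoids skewed results if you later average or sort by warranty length.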
🔍 FAQ
How do I scrape the Karastan carpet catalog?
Karastan Carpet & Rug Catalog Scraper handles discovery automatically. Set maxItems to 0 and leave startUrls empty — it finds all ~385 SKUs without any additional configuration.
How much does the Karastan scraper cost to run?
Karastan Carpet & Rug Catalog Scraper uses browser rendering for each product page, which costs more than plain HTTP scraping. A full catalog run (~385 products) at current rates runs around $1–2. For spot-checks on specific collections, keep maxItems under 50 and it stays under $0.25.
What data can I get from karastan.com?
Karastan Carpet & Rug Catalog Scraper returns the full technical spec sheet Karastan publishes on each product page — fiber content, face weight, pile height, density, twist level, backing type, available widths, warranty, certifications, and colorway details. Dealer pricing is not publicly displayed and returns null.
Can I filter by product type?
Karastan Carpet & Rug Catalog Scraper accepts a productTypes array. Pass ["carpet"] to scrape only carpet SKUs, ["area-rug"] for rugs only, or any combination of carpet, area-rug, luxury-vinyl, and engineered-wood.
Does the Karastan scraper need proxies?
Karastan Carpet & Rug Catalog Scraper uses residential proxies. Karastan's Angular SPA requires a real browser to load product specs, and residential routing keeps the runs clean. This is handled automatically, so you don't configure proxies manually.
Why Use Karastan Carpet & Rug Catalog Scraper?
- Full catalog coverage — discovers all ~385 product SKUs via the internal CMS API before a single browser opens, so nothing gets missed by pagination gaps
- Structured spec output — extracts 20+ technical fields per colorway in clean JSON, including fiber flags and certifications that aren't in any public feed
- Handles the SPA — Karastan's product pages are Angular-rendered; the scraper waits for hydration before extracting, so spec fields are actually populated
Need More Features?
Need custom fields, additional product types, or a different Mohawk brand? File an issue or get in touch.