OrbTop

Karastan Carpet & Rug Catalog Scraper

ECOMMERCE

Karastan Carpet & Rug Catalog Scraper

Scrapes the full Karastan residential carpet and area-rug catalog from karastan.com. Returns product specs — fiber type, face weight, pile height, density, colorway, collection name, and 15+ additional technical fields — for every SKU in the catalog.

What It Collects

Karastan doesn't publish a public data feed. This scraper finds all ~385 product URLs via the site's internal content API, then renders each product detail page in a real browser to extract the specs Angular loads after hydration.

Features:

  • Discovers the full product catalog automatically — no URL list required
  • Accepts a custom list of specific product page URLs if you only need a subset
  • Filters by product type (carpet, area-rug, luxury-vinyl, engineered-wood)
  • Extracts identity fields from the URL itself, so collection name, colorway code, and product type are always present regardless of page-render timing
  • Flags SmartStrand fiber, wool content, pet-friendly certification, and CRI Green Label status
  • Returns primary product image URL and dealer locator link

Use Cases

  • Interior designers cross-referencing Karastan specifications against competitor fiber grades and face weights
  • Flooring dealers building internal spec sheets and comparison tools without manual data entry
  • Procurement teams benchmarking residential carpet specs across collections for specification documents
  • Market researchers tracking Karastan's collection catalog for SKU expansion and product lifecycle analysis

How It Works

  1. Fetches the Karastan CMS product-pixel API — a single call that returns all product URLs without requiring a browser
  2. Applies any product type filters from your input
  3. Renders each product detail page using a headless Chromium browser (the spec data is loaded by Angular after the initial page shell)
  4. Waits for Angular to hydrate the spec component, then extracts all technical specification fields
  5. Saves records to the dataset, one per product colorway

Input

Parameter Type Default Description
maxItems integer 10 Maximum number of product records to scrape. Set to 0 for no limit (full catalog ~385 SKUs).
startUrls array [] Optional list of specific Karastan product page URLs to scrape. If empty, the full catalog is discovered automatically.
productTypes array [] Filter to specific product types: carpet, area-rug, luxury-vinyl, engineered-wood. Leave empty to scrape all types.

Default input example:

{
  "maxItems": 10
}

BYO URL input example (scrape specific products):

{
  "maxItems": 5,
  "startUrls": [
    { "url": "https://www.karastan.com/shop/Carpet/detail/K8918/Yorkshire_Garden/9701/Magnolia" },
    { "url": "https://www.karastan.com/shop/Carpet/detail/43771/Gossamer_Hues/9782/Ivory" }
  ]
}

Product type filter example:

{
  "maxItems": 50,
  "productTypes": ["carpet", "area-rug"]
}

Output

One record per product colorway. Each record contains:

Field Type Description
product_id string Karastan style number (e.g. K8918)
sku string Same as product_id for Karastan
product_type string carpet, area-rug, luxury-vinyl, or engineered-wood
collection_name string Collection name (e.g. Yorkshire Garden)
colorway_code string Colorway numeric code (e.g. 9701)
colorway_name string Colorway name (e.g. Magnolia)
style_category string Casual, Classic, or Modern
design_pattern string solid, textured, floral, geometric
fiber_material string Fiber content (e.g. SmartStrand Silk Reserve)
face_weight_oz_per_sqyd number Face weight in oz/sq yd
pile_height_inches number Pile height in inches
density_oz_per_cubic_yd number Density in oz/cubic yd
twist_level string Twist level or twists per inch
backing_type string Backing material
widths_available string Available widths (e.g. 12 ft, 15 ft)
is_smartstrand boolean SmartStrand fiber technology
is_wool boolean Contains wool fiber
is_pet_friendly boolean Pet-friendly certification
warranty_years number Warranty years (999 = lifetime)
cri_green_label boolean CRI Green Label Plus certified
sample_orderable boolean Sample can be ordered from the page
retail_price_estimate_usd number Retail price if displayed (typically null)
dealer_locator_url string Karastan dealer locator URL
product_url string Full product detail page URL
primary_image_url string Primary product image URL
last_modified string Last modified date from sitemap if available

Sample record:

{
  "product_id": "K8918",
  "sku": "K8918",
  "product_type": "carpet",
  "collection_name": "Yorkshire Garden",
  "colorway_code": "9701",
  "colorway_name": "Magnolia",
  "style_category": "Classic",
  "design_pattern": "floral",
  "fiber_material": "SmartStrand Silk Reserve",
  "face_weight_oz_per_sqyd": 60.0,
  "pile_height_inches": 0.5,
  "density_oz_per_cubic_yd": 3600,
  "twist_level": "6",
  "backing_type": "Action Bac",
  "widths_available": "12 ft, 15 ft",
  "is_smartstrand": true,
  "is_wool": false,
  "is_pet_friendly": true,
  "warranty_years": 25,
  "cri_green_label": true,
  "sample_orderable": true,
  "retail_price_estimate_usd": null,
  "dealer_locator_url": "https://www.karastan.com/find-a-retailer",
  "product_url": "https://www.karastan.com/shop/Carpet/detail/K8918/Yorkshire_Garden/9701/Magnolia",
  "primary_image_url": "https://www.karastan.com/product-images/K8918-9701.jpg",
  "last_modified": null
}

🔍 FAQ

How do I scrape the Karastan carpet catalog? Karastan Carpet & Rug Catalog Scraper handles discovery automatically. Set maxItems to 0 and leave startUrls empty — it finds all ~385 SKUs without any additional configuration.

How much does the Karastan scraper cost to run? Karastan Carpet & Rug Catalog Scraper uses browser rendering for each product page, which costs more than plain HTTP scraping. A full catalog run (~385 products) at current rates runs around $1–2. For spot-checks on specific collections, keep maxItems under 50 and it stays under $0.25.

What data can I get from karastan.com? Karastan Carpet & Rug Catalog Scraper returns the full technical spec sheet Karastan publishes on each product page — fiber content, face weight, pile height, density, twist level, backing type, available widths, warranty, certifications, and colorway details. Dealer pricing is not publicly displayed and returns null.

Can I filter by product type? Karastan Carpet & Rug Catalog Scraper accepts a productTypes array. Pass ["carpet"] to scrape only carpet SKUs, ["area-rug"] for rugs only, or any combination of carpet, area-rug, luxury-vinyl, and engineered-wood.

Does the Karastan scraper need proxies? Karastan Carpet & Rug Catalog Scraper uses residential proxies. Karastan's Angular SPA requires a real browser to load product specs, and residential routing keeps the runs clean. This is handled automatically — you don't configure proxies manually.

Why Use Karastan Carpet & Rug Catalog Scraper?

  • Full catalog coverage — discovers all ~385 product SKUs via the internal CMS API before a single browser opens, so nothing gets missed by pagination gaps
  • Structured spec output — extracts 20+ technical fields per colorway in clean JSON, including fiber flags and certifications that aren't in any public feed
  • Handles the SPA — Karastan's product pages are Angular-rendered; the scraper waits for hydration before extracting, so spec fields are actually populated

Need More Features? Need custom fields, additional product types, or a different Mohawk brand? File an issue or get in touch.