OrbTop

Carnaval Rio – Blocos & Agenda Scraper

TRAVELNEWS

Carnaval Rio – Blocos & Agenda Scraper

Scrape the official Riotur carnival agenda at carnaval.rio — blocos de rua, camarotes, bailes, ensaios técnicos, feijoadas, and sambodromo events. Returns structured event records with title, published date, event type classification, bloco name (where identifiable), free-entry detection, full description in Portuguese, and canonical event URL.

What data you get

Each record corresponds to one agenda post published by Riotur (Rio's official tourism authority):

Field Description
event_id WordPress post ID
event_title Title of the carnival event
event_url Canonical URL on carnaval.rio
event_type Classified event type: bloco-rua, camarote, baile, ensaio-tecnico, sambodromo, feijoada, roda-de-samba, or geral
bloco_name Canonical bloco or samba school name (extracted from title when identifiable)
published_at ISO-8601 timestamp when Riotur published the post
description_pt Full event description in Portuguese (HTML stripped)
excerpt_pt Short excerpt/summary in Portuguese
is_free "true" if content mentions gratuito / entrada franca / grátis
categories WordPress categories (pipe-separated)
tags WordPress tags (pipe-separated, when present)
og_image Feature image URL
source_url Source URL (same as event_url)
scrapedAt ISO-8601 timestamp when the record was scraped

Input

Parameter Type Default Description
maxItems integer 10 Maximum number of records to return. Set to a large value to fetch all (~150 posts in the agenda category).

Example input

{
  "maxItems": 50
}

Data source

carnaval.rio is the official carnival agenda published by Riotur (Empresa de Turismo do Município do Rio de Janeiro). It covers all official Grupo Especial and Série Ouro desfiles, blocos de rua, ensaios técnicos, camarotes, bailes, and community events recognized by the city government.

Data is fetched via the WordPress REST API (/wp-json/wp/v2/posts) — structured, clean JSON, no HTML parsing required.

Use cases

  • Tourism platforms building Rio Carnival experience packages
  • Travel media and editorial desks needing structured event feeds in Portuguese
  • Brazilian travel-tech startups layering carnival event data over hotel and booking inventory
  • Research: Rio Carnival economic impact, bloco popularity trends, free vs. ticketed event ratios

Notes

  • The agenda is seasonal — most content is published October–February (pre-carnival season plus event coverage during carnival week)
  • Event types are classified from title and tag keywords; geral means the classification was inconclusive
  • bloco_name extraction uses title pattern matching; not all event posts name a single bloco
  • Robots.txt has Disallow: / for HTML crawlers; the REST API is separately accessible per WordPress convention and confirmed reachable without authentication