Todotango Tango Artist & Composer Discography Scraper
Scrape structured creator profiles and tango work records from todotango.com — the definitive Argentine tango encyclopedia. Extract biographies, birth/death dates, roles, photos, full lyrics, composer/lyricist attribution, and sheet-music references for 10 000+ tango creators across five roles.
What This Actor Does
The actor crawls Todotango.com's complete creator index and extracts two types of records into a single dataset:
Creator profile records — one record per tango artist:
- Creator ID, slug, name, and role (músico, letrista, cantor, cancionista, compositor)
- Birth/death dates and birth place
- Photo URL
- Link to the creator's biography page (Spanish)
- Count of linked lyrics and sheet-music items
- Notable works (JSON array of tango titles)
- Discography entries
Tango work records — one record per linked tango piece:
- Title, genre (tango/vals/milonga/candombe), year
- Composer(s) and lyricist(s)
- Full lyrics text in Spanish
- Sheet-music image URL
Both record types share the same dataset schema. The source_url field identifies the page origin; creator fields are null on tango records and tango fields are null on creator records.
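Because both record types arrive in one dataset, consumers usually need to separate them first. The sketch below assumes that `tango_title` is populated on every tango record and null on every creator record, which follows from the description above but is not guaranteed by it. The helper names and the sample items are hypothetical placeholders, not real scraper output.

```python
from typing import Iterable, List, Tuple


def record_kind(record: dict) -> str:
    """Classify a dataset item as 'tango' or 'creator'.

    Assumption: tango records always carry a non-null tango_title,
    and creator records never do.
    """
    return "tango" if record.get("tango_title") is not None else "creator"


def split_records(items: Iterable[dict]) -> Tuple[List[dict], List[dict]]:
    """Return (creator_records, tango_records) from a mixed dataset."""
    creators, tangos = [], []
    for item in items:
        if record_kind(item) == "tango":
            tangos.append(item)
        else:
            creators.append(item)
    return creators, tangos


# Placeholder items for illustration only (not real scraper output).
sample = [
    {"creator_id": 1, "creator_name": "Example Creator",
     "tango_title": None, "source_url": "https://example.invalid/creator"},
    {"creator_id": None, "creator_name": None,
     "tango_title": "Example Tango", "source_url": "https://example.invalid/tango"},
]

creators, tangos = split_records(sample)
print(len(creators), len(tangos))  # prints: 1 1
```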
Use Cases
- Copyright & public-domain research — YouTube creators and music-licensing firms can verify tango composer attribution and estimate public-domain status based on death dates
- SADAIC royalty research — The Argentine copyright society (SADAIC) does not publish their tango database; this scraper provides a structured alternative
- Ethnomusicology — UBA, Conservatorio Alberto Ginastera, and similar programs use Todotango for composer biography data
- Tango event curation — Buenos Aires milonga organizers building composer-themed event programs
- Digital reissue metadata — Tango-revival labels wanting structured metadata for catalog enrichment
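For the public-domain use case, a rough estimate can be derived from `death_date`. The sketch below assumes the Spanish-locale dates look like `3 de marzo de 1950`; the actual format returned by the actor should be checked before relying on it. The protection term is a parameter with a default of 70 years after death, which is only a starting point: the applicable term depends on the jurisdiction, and a work with several authors is normally governed by the last surviving author's death. Both function names are hypothetical.

```python
import re
from datetime import date
from typing import Optional

# Spanish month names mapped to month numbers ("setiembre" is a common variant).
SPANISH_MONTHS = {
    "enero": 1, "febrero": 2, "marzo": 3, "abril": 4, "mayo": 5, "junio": 6,
    "julio": 7, "agosto": 8, "septiembre": 9, "setiembre": 9,
    "octubre": 10, "noviembre": 11, "diciembre": 12,
}

# Assumed date shape: "<day> de <month> de <year>".
_DATE_RE = re.compile(r"(\d{1,2})\s+de\s+([a-záéíóú]+)\s+de\s+(\d{4})")


def parse_spanish_date(text: Optional[str]) -> Optional[date]:
    """Parse a Spanish long-form date; return None if it does not match."""
    if not text:
        return None
    match = _DATE_RE.search(text.lower())
    if not match:
        return None
    day, month_name, year = match.groups()
    month = SPANISH_MONTHS.get(month_name)
    if month is None:
        return None
    try:
        return date(int(year), month, int(day))
    except ValueError:  # e.g. "31 de febrero"
        return None


def public_domain_year(death: Optional[date], term_years: int = 70) -> Optional[int]:
    """First calendar year after the term expires, or None if death is unknown."""
    if death is None:
        return None
    return death.year + term_years + 1


died = parse_spanish_date("3 de marzo de 1950")
print(died, public_domain_year(died))  # prints: 1950-03-03 2021
```

This is an estimate for triage, not a legal determination.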
Input Parameters
| Field | Type | Description |
|---|---|---|
| `maxItems` | integer | Maximum number of records to return (default: 5) |
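A minimal sketch of starting a run from Python follows. `build_run_input` and `run_actor` are hypothetical helper names, and the `actor_id` argument is a placeholder you must fill in yourself, since this page does not state the actor's ID. The run itself goes through the third-party `apify-client` package, imported lazily so that the input builder can be used without it.

```python
from typing import List, Optional


def build_run_input(max_items: Optional[int] = 5) -> dict:
    """Build the actor input. None means 'leave maxItems empty' (unlimited)."""
    if max_items is None:
        return {}
    if not isinstance(max_items, int) or max_items < 1:
        raise ValueError("max_items must be a positive integer or None")
    return {"maxItems": max_items}


def run_actor(token: str, actor_id: str, max_items: Optional[int] = 5) -> List[dict]:
    """Run the actor and return its dataset items.

    Requires the third-party package: pip install apify-client
    actor_id is a placeholder supplied by the caller.
    """
    from apify_client import ApifyClient  # lazy import, only needed for real runs

    client = ApifyClient(token)
    run = client.actor(actor_id).call(run_input=build_run_input(max_items))
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


print(build_run_input())      # prints: {'maxItems': 5}
print(build_run_input(None))  # prints: {}
```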
Output Schema
| Field | Type | Description |
|---|---|---|
| `creator_id` | integer | Numeric creator ID from Todotango |
| `creator_slug` | string | URL slug for the creator |
| `creator_name` | string | Full display name |
| `creator_role` | string | Role description (e.g. "Pianista, compositor y letrista") |
| `birth_date` | string | Birth date (Spanish locale format) |
| `death_date` | string | Death date (Spanish locale format) |
| `birth_place` | string | City and country of birth |
| `biography_es` | string | URL to the Spanish biography page |
| `photo_url` | string | Creator photo URL |
| `notable_works` | string | JSON array of linked tango titles |
| `discography` | string | JSON array of discography entries |
| `lyrics_count` | integer | Number of linked lyrics entries |
| `sheet_music_count` | integer | Number of linked sheet-music entries |
| `tango_title` | string | Tango piece title (tango records only) |
| `tango_year` | integer | Year of composition |
| `tango_genre` | string | Genre: tango, vals, milonga, candombe |
| `tango_composer` | string | Composer name(s) |
| `tango_lyricist` | string | Lyricist name(s) |
| `tango_lyrics_es` | string | Full lyrics in Spanish |
| `tango_sheet_music_pdf_url` | string | Sheet-music image URL |
| `source_url` | string | Source page URL |
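Since `notable_works` and `discography` are delivered as strings containing JSON arrays, they need decoding before use. The sketch below shows one way to do that with the standard library; treating null or malformed values as an empty list is a choice made here, not documented actor behavior. The sample record is a placeholder.

```python
import json

# Fields the schema describes as JSON arrays serialized into strings.
JSON_FIELDS = ("notable_works", "discography")


def decode_json_fields(record: dict) -> dict:
    """Return a copy of the record with JSON-string fields decoded to lists."""
    decoded = dict(record)
    for field in JSON_FIELDS:
        raw = decoded.get(field)
        if isinstance(raw, str):
            try:
                value = json.loads(raw)
            except json.JSONDecodeError:
                value = []  # malformed JSON: fall back to an empty list
            decoded[field] = value if isinstance(value, list) else []
        elif raw is None:
            decoded[field] = []  # null on tango records
    return decoded


# Placeholder record for illustration only.
record = {"notable_works": '["Title A", "Title B"]', "discography": None}
clean = decode_json_fields(record)
print(clean["notable_works"])  # prints: ['Title A', 'Title B']
print(clean["discography"])    # prints: []
```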
Performance Notes
- Default run scrapes `maxItems=5` records for quick sampling
- Full run across all 10 000+ creators: set `maxItems` to a large value or leave empty for unlimited
- Concurrency is capped at 5 requests to respect the site's rate limit
- No proxy required — datacenter IPs get clean responses