OrbTop

France Pappers + BODACC + INPI Scraper

BUSINESSLEAD GENERATIONOTHER

France Pappers + BODACC + INPI Scraper

Scrape unified French company data from four official sources in a single run: Recherche Entreprises, BODACC, INPI RNE, and Pappers. Returns SIREN/SIRET identifiers, legal officers, revenue figures, and legal announcement history for any of the 26 million French companies on record.


France Pappers + BODACC + INPI Scraper Features

  • Searches by company name, SIREN/SIRET, NAF/APE industry code, or director name
  • Returns canonical Sirene data: legal name, legal form, NAF code, incorporation date, registered address, employee band
  • Extracts up to 15 officers (dirigeants) per company with role and birth year
  • Pulls revenue and net income from the most recent fiscal year — no Pappers key required for this
  • Fetches BODACC legal announcements for a configurable lookback window (1–10 years): insolvencies, acquisitions, capital changes, accounts filed
  • Flags has_insolvency and has_acquisition as booleans for instant screening
  • Optional Pappers integration: supply an API key to add EBITDA, total assets, equity, and beneficial owners
  • Pure API scraping — no browser, no proxy, no Cloudflare to fight
  • Outputs one flat record per company, ready for KYC/KYB pipelines and CRM ingestion

Who Uses French Registry Data?

  • KYC/KYB analysts — screen counterparties against BODACC for insolvency proceedings before onboarding
  • B2B sales teams — filter active companies by NAF code and revenue, feed into outreach sequences
  • M&A and due-diligence teams — retrieve officer lists, capital history, and BODACC acquisition filings
  • Compliance and legal teams — monitor corporate clients for modifications, liquidations, and new insolvency filings
  • Researchers and journalists — trace company structures, beneficial ownership, and historical legal events
  • Data engineers — replace a Societe.com Pro subscription or a chain of five separate Apify actors with one output schema

How France Pappers + BODACC + INPI Scraper Works

  1. Input a query — company name, SIREN, NAF code, or director name — and choose how many records to fetch.
  2. The actor searches Recherche Entreprises, which aggregates Sirene, RNE, and financial data across all 26M French companies.
  3. For each matching company, it optionally fetches BODACC announcements filtered to your lookback window and SIREN.
  4. If you supply a Pappers API key, it enriches each record with EBITDA, total assets, and beneficial owner data.

Input

{
  "mode": "by_name",
  "query": "TotalEnergies SE",
  "includeBodacc": true,
  "bodaccLookbackYears": 3,
  "includePappersFinancials": false,
  "pappersApiKey": "",
  "maxItems": 10
}
Field Type Default Description
mode string by_name Query mode: by_name, by_siren, by_naf_code, by_director_name
query string TotalEnergies SE Company name, 9-digit SIREN, 14-digit SIRET, NAF code (e.g. 6201Z), or director name
includeBodacc boolean true Fetch BODACC legal announcements for each company
bodaccLookbackYears integer 3 Years of BODACC history to include (1-10)
includePappersFinancials boolean false Enrich with Pappers premium financials and beneficial owners
pappersApiKey string Pappers API token. Required when includePappersFinancials is true. Free tier at pappers.fr
maxItems integer 10 Maximum companies to return

By SIREN lookup:

{
  "mode": "by_siren",
  "query": "542051180",
  "includeBodacc": true,
  "maxItems": 1
}

By NAF code (industry):

{
  "mode": "by_naf_code",
  "query": "6201Z",
  "includeBodacc": false,
  "maxItems": 25
}

France Pappers + BODACC + INPI Output Fields

{
  "siren": "542051180",
  "siret": "54205118000066",
  "legal_name": "TOTALENERGIES SE",
  "trade_name": "",
  "legal_form": "5800",
  "naf_ape_code": "7010Z",
  "naf_description": "Professional, scientific and technical activities",
  "status": "A",
  "incorporation_date": "1954-01-01",
  "cessation_date": null,
  "registered_address": "2 PLACE JEAN MILLIER LA DEFENSE 6 92400 COURBEVOIE",
  "employees_range": "51",
  "employees_year": 2023,
  "category": "GE",
  "dirigeants": [
    "POUYANNE PATRICK — (President du conseil d'administration et directeur general) — born 1963",
    "ASCHENBROICH JACQUES ANDRE — (Administrateur) — born 1954"
  ],
  "revenue_eur": 214550000000,
  "net_income_eur": 16031000000,
  "fiscal_year": "2024",
  "has_filed_accounts": true,
  "bodacc_count": 17,
  "bodacc_announcements": [
    "2026-04-02 | Avis initial | Greffe du TAE de Nanterre | Modification survenue sur l'administration.",
    "2025-11-18 | Avis initial | Greffe du TAE de Nanterre | Modification survenue sur le capital."
  ],
  "has_insolvency": false,
  "has_acquisition": false,
  "pappers_ebitda_eur": null,
  "pappers_total_assets_eur": null,
  "pappers_equity_eur": null,
  "pappers_beneficial_owners": [],
  "source_url": "https://recherche-entreprises.api.gouv.fr/search?q=542051180"
}
Field Type Description
siren string 9-digit SIREN — canonical French company identifier
siret string 14-digit SIRET of the head office
legal_name string Legal company name (raison sociale)
trade_name string Trade name or acronym (sigle)
legal_form string Legal form code
naf_ape_code string NAF/APE industry code
naf_description string Industry section label
status string A (active) or C (ceased)
incorporation_date string YYYY-MM-DD
cessation_date string or null YYYY-MM-DD if closed
registered_address string Full registered address
employees_range string INSEE employee band code
employees_year number Reference year for employee band
category string Company size: PME, ETI, GE, or empty
dirigeants string[] Officers: NAME (role) — born YYYY
revenue_eur number Most recent annual revenue (EUR)
net_income_eur number Most recent annual net income (EUR)
fiscal_year string Year of the most recent financial data
has_filed_accounts boolean True if BODACC depot de comptes found
bodacc_count number Total BODACC announcements in lookback window
bodacc_announcements string[] DATE | TYPE | COURT | SUMMARY per row
has_insolvency boolean True if collective proceedings found
has_acquisition boolean True if business sale/acquisition found
pappers_ebitda_eur number EBITDA from Pappers (requires key)
pappers_total_assets_eur number Total assets from Pappers (requires key)
pappers_equity_eur number Equity from Pappers (requires key)
pappers_beneficial_owners string[] Beneficial owners from Pappers (requires key)
source_url string Source API URL

FAQ

What data sources does this scraper use?

It combines four sources. Recherche Entreprises is the primary source — it aggregates INSEE Sirene and INPI RNE into a single free API. BODACC is the official Journal of Legal Announcements, published by the Prime Minister's office. Pappers is an optional private enrichment layer requiring a paid API key.

Do I need any API keys?

No. The primary data (Sirene, RNE, BODACC) comes from free government APIs that require no registration. The Pappers integration is optional and only activates when you provide a key.

What is the employees_range field?

It is the INSEE tranche d'effectif code. Code 21 means 50-99 employees, code 51 means 10,000 or more. The full table is available from INSEE.

How do I find companies by industry sector?

Use mode: by_naf_code with a NAF/APE code. The actor accepts both dot format (62.01Z) and compact format (6201Z).

Can I use this for KYC screening?

The BODACC data covers collective proceedings (insolvency, liquidation, safeguard procedures) and is the official record for French legal announcements. The has_insolvency field is a fast screening flag. For full KYC compliance, supplement with pappers_beneficial_owners via a Pappers key.

How current is the data?

Recherche Entreprises syncs daily from Sirene and RNE. BODACC publishes new announcements every business day. There is no cached or stale layer.


Need More Features?

Open a request at actor-support@orbtop.com to request NAF-code bulk export, INPI actes PDF download, or historical Sirene snapshots.

Why Use France Pappers + BODACC + INPI Scraper?

  • Four sources, one schema — no stitching together recherche-entreprises-scraper, bodacc-announcements-scraper, and pappers-sirene-scraper on your own
  • Government data, not a scrape — primary sources are official public APIs; no HTML parsing, no rate-limit battles
  • BODACC insolvency flagshas_insolvency and has_acquisition as top-level booleans make screening trivial
  • Revenue without a Pappers key — Recherche Entreprises includes annual revenue from RNE filings, covering most large and mid-cap companies
  • KYC-ready output schema — flat records, consistent field names, array fields as formatted strings