Bright Data

brightdata

Bright Data provides a web data platform with Web Unlocker for bypassing anti-bot systems, a SERP API for search engine data, and pre-made scrapers for popular websites. Use it to collect web data at scale.

Actions: 10
Triggers: 0
Authentication (managed OAuth): No
Technical information: the parameter, schema, and trigger details on this page are intended for integration teams. If you only need to know whether your favorite tool is available, the list of actions is enough.

Available actions (10)

Each action is an operation that the agent can run against this connector. The parameters of each action are listed under its name.

Trigger Site Crawl | BRIGHTDATA_CRAWL_API | Action

Tool to trigger a site crawl job to extract content across multiple pages or entire domains. Use when you need to start a crawl for a given dataset and list of URLs.

Input parameters

  • items (object[], required)

    List of objects containing URLs or other parameters required by the crawler.

  • dataset_id (string, required)

    Your dataset ID.

  • include_errors (boolean)

    Whether to include an errors report with the results.

  • custom_output_fields (string)

    Pipe-separated list of output fields to include (e.g., 'url|about.updated_on').

Output parameters

  • data (object, required)

    Data from the action execution.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • successful (boolean, required)

    Whether the action execution was successful.
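
A minimal sketch of assembling the input for BRIGHTDATA_CRAWL_API from the parameters listed above. The helper name `build_crawl_args` and the `{"url": ...}` shape of each item are assumptions made for illustration; this page only says that items are objects containing URLs or other crawler parameters. How the payload reaches the connector depends on your agent runtime and is not shown.

```python
from typing import Any, Optional


def build_crawl_args(
    dataset_id: str,
    urls: list[str],
    include_errors: Optional[bool] = None,
    output_fields: Optional[list[str]] = None,
) -> dict[str, Any]:
    """Assemble an input payload for BRIGHTDATA_CRAWL_API (hypothetical helper)."""
    if not dataset_id:
        raise ValueError("dataset_id is required")
    if not urls:
        raise ValueError("items is required: pass at least one URL")

    args: dict[str, Any] = {
        "dataset_id": dataset_id,
        # Assumption: each item is an object with a "url" key.
        "items": [{"url": u} for u in urls],
    }
    if include_errors is not None:
        args["include_errors"] = include_errors
    if output_fields:
        # custom_output_fields is one pipe-separated string, e.g. "url|about.updated_on".
        args["custom_output_fields"] = "|".join(output_fields)
    return args
```

Optional parameters are left out of the payload when they are not set, so only values you chose explicitly are sent.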

Browse Available Scrapers | BRIGHTDATA_DATASET_LIST | Action

Tool to list all available pre-made scrapers (datasets) from Bright Data's marketplace. Use when you need to browse available data sources for structured scraping.

Input parameters

No parameters.

Output parameters

  • data (object, required)

    Data from the action execution.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • successful (boolean, required)

    Whether the action execution was successful.
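
Most actions on this page return the same three output fields. The sketch below shows one way to handle that envelope in caller code: return `data` on success and raise otherwise. The names `unwrap` and `ActionError` are hypothetical, and the response is assumed to arrive as a plain dictionary.

```python
from typing import Any


class ActionError(RuntimeError):
    """Raised when an action reports successful == False (hypothetical type)."""


def unwrap(response: dict[str, Any]) -> Any:
    """Return the `data` field, or raise if the action did not succeed."""
    if not response.get("successful", False):
        # `error` is optional in the schema, so fall back to a generic message.
        raise ActionError(response.get("error") or "action failed without an error message")
    return response.get("data")
```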

Filter Dataset | BRIGHTDATA_FILTER_DATASET | Action

Tool to apply custom filter criteria to a marketplace dataset (beta). Use after selecting a dataset to generate a filtered snapshot.

Input parameters

  • files (string[])

    Optional list of file paths to CSV or JSON files containing filter values.

  • filter (object, required)

    Filter criteria object. Supports operators: '=', '!=', '<', '<=', '>', '>=', 'in', 'not_in', 'includes', 'not_includes', 'array_includes', 'not_array_includes', 'is_null', 'is_not_null'. Combine conditions using 'and'/'or', with up to 3 levels of nesting.

  • dataset_id (string, required)

    ID of the dataset to filter.

  • records_limit (integer)

    Optional maximum number of records to include in the snapshot; must be >= 1.

Output parameters

  • data (object, required)

    Data from the action execution.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • successful (boolean, required)

    Whether the action execution was successful.
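
The operator list and the nesting limit for the filter parameter can be checked before the action is called. The sketch below is one possible validator. The shape of the filter object (leaf nodes with `name`, `operator`, `value`; groups with `operator` and `filters`) is an assumption that this page does not confirm, as is the choice to count the top-level group as nesting level 1.

```python
from typing import Any

# Comparison operators listed in the filter parameter description.
COMPARISON_OPS = {
    "=", "!=", "<", "<=", ">", ">=", "in", "not_in", "includes", "not_includes",
    "array_includes", "not_array_includes", "is_null", "is_not_null",
}
GROUP_OPS = {"and", "or"}
MAX_NESTING = 3


def check_filter(node: dict[str, Any], depth: int = 1) -> None:
    """Validate a filter object; raise ValueError on the first problem found.

    Assumed shape (not confirmed by this page):
      leaf:  {"name": <field>, "operator": <comparison>, "value": <any>}
      group: {"operator": "and" | "or", "filters": [<node>, ...]}
    """
    op = node.get("operator")
    if op in GROUP_OPS:
        if depth > MAX_NESTING:
            raise ValueError(f"groups nested deeper than {MAX_NESTING} levels")
        children = node.get("filters")
        if not children:
            raise ValueError(f"'{op}' group needs a non-empty 'filters' list")
        for child in children:
            check_filter(child, depth + 1)
    elif op in COMPARISON_OPS:
        if "name" not in node:
            raise ValueError("leaf filter is missing 'name'")
        # The null checks take no comparison value; every other operator does.
        if op not in {"is_null", "is_not_null"} and "value" not in node:
            raise ValueError(f"operator '{op}' needs a 'value'")
    else:
        raise ValueError(f"unsupported operator: {op!r}")
```

A filter that passes `check_filter` can then be sent as the `filter` input together with `dataset_id`.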

Get Available Cities | BRIGHTDATA_GET_LIST_OF_AVAILABLE_CITIES | Action

Tool to get available static network cities for a given country. Use when you need to configure static proxy endpoints after selecting a country.

Input parameters

  • country (string, required)

    ISO 3166-1 alpha-2 country code to list available cities for.

  • pool_ip_type (string enum)

    Type of static proxy pool. 'dc' for datacenter, 'static_res' for residential.

    Allowed values: dc, static_res

Output parameters

  • data (object, required)

    Data from the action execution.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • successful (boolean, required)

    Whether the action execution was successful.

Get Available Countries | BRIGHTDATA_GET_LIST_OF_AVAILABLE_COUNTRIES | Action

Tool to list available countries and their ISO 3166-1 alpha-2 codes. Use when you need to configure zones with valid country codes before provisioning proxies.

Input parameters

No parameters.

Output parameters

  • data (object, required)

    Data from the action execution.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • successful (boolean, required)

    Whether the action execution was successful.

Download Scraped Data | BRIGHTDATA_GET_SNAPSHOT_RESULTS | Action

Tool to retrieve the scraped data from a completed crawl job by snapshot ID. Use after triggering a crawl or filtering a dataset to download the collected data.

Input parameters

  • limit (integer)

    Maximum number of records to return.

  • format (string enum)

    Desired output format.

    Allowed values: json, csv, ndjson

  • offset (integer)

    Number of records to skip before starting to collect the result set.

  • snapshot_id (string, required)

    Identifier of the snapshot to fetch results for.

Output parameters

  • data (any)

    Parsed JSON payload when format is 'json'.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • content (string)

    Raw content for non-JSON formats (csv/ndjson).

  • successful (boolean, required)

    Whether the action execution was successful.
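
The limit and offset inputs allow a large snapshot to be fetched in pages, and the output field to read depends on the requested format. The sketch below illustrates both. The helper names are hypothetical, and `total` (the number of records in the snapshot) is assumed to be known to the caller; this page does not say where that count comes from.

```python
from typing import Any, Iterator

FORMATS = {"json", "csv", "ndjson"}


def page_args(
    snapshot_id: str, page_size: int, total: int, fmt: str = "json"
) -> Iterator[dict[str, Any]]:
    """Yield one BRIGHTDATA_GET_SNAPSHOT_RESULTS payload per page."""
    if fmt not in FORMATS:
        raise ValueError(f"format must be one of {sorted(FORMATS)}")
    if page_size < 1:
        raise ValueError("page_size must be >= 1")
    for offset in range(0, total, page_size):
        yield {
            "snapshot_id": snapshot_id,
            "format": fmt,
            # The last page may be shorter than page_size.
            "limit": min(page_size, total - offset),
            "offset": offset,
        }


def read_result(response: dict[str, Any], fmt: str) -> Any:
    """Pick the output field that matches the requested format."""
    if not response.get("successful"):
        raise RuntimeError(response.get("error") or "download failed")
    # `data` carries the parsed JSON payload; `content` carries raw csv/ndjson text.
    return response.get("data") if fmt == "json" else response.get("content")
```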

Check Crawl Status | BRIGHTDATA_GET_SNAPSHOT_STATUS | Action

Tool to check the processing status of a crawl job using its snapshot ID. Call before attempting to download results to ensure data collection is complete.

Input parameters

  • snapshot_id (string, required)

    Identifier of the snapshot to check.

Output parameters

  • data (object, required)

    Data from the action execution.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • successful (boolean, required)

    Whether the action execution was successful.
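
Because the status check is meant to run before a download, a polling loop is a natural fit. The sketch below takes the status call as an injected function so that it stays independent of any particular agent runtime. It assumes that `data` contains a "status" field with the values "ready" and "failed"; this page does not document the shape of `data`, so treat those names as placeholders to adjust.

```python
import time
from typing import Any, Callable


def wait_until_ready(
    check_status: Callable[[str], dict[str, Any]],
    snapshot_id: str,
    interval_s: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> dict[str, Any]:
    """Poll until the snapshot is ready; raise on failure or when attempts run out.

    Assumption: data["status"] is "ready" on completion and "failed" on error.
    """
    for attempt in range(max_attempts):
        response = check_status(snapshot_id)
        if not response.get("successful"):
            raise RuntimeError(response.get("error") or "status check failed")
        data = response.get("data") or {}
        status = data.get("status")
        if status == "ready":
            return data
        if status == "failed":
            raise RuntimeError(f"snapshot {snapshot_id} failed")
        # Wait between attempts, but not after the final one.
        if attempt < max_attempts - 1:
            sleep(interval_s)
    raise TimeoutError(f"snapshot {snapshot_id} not ready after {max_attempts} checks")
```

Here `check_status` stands in for whatever function invokes BRIGHTDATA_GET_SNAPSHOT_STATUS in your environment, and the `sleep` parameter exists only so the loop can be exercised without real delays.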

List Unlocker Zones | BRIGHTDATA_LIST_WEB_UNLOCKER_ZONES | Action

Tool to list your configured Web Unlocker zones and proxy endpoints. Use to view available zones for web scraping and bot protection bypass.

Input parameters

No parameters.

Output parameters

  • data (object, required)

    Data from the action execution.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • successful (boolean, required)

    Whether the action execution was successful.

SERP Search | BRIGHTDATA_SERP_SEARCH | Action

Tool to perform SERP (search engine results page) searches across different search engines. Use when you need to retrieve search results, trending topics, or competitive analysis data.

Input parameters

  • page (integer)

    Page number of results to fetch.

  • query (string, required)

    The search query to execute.

  • country (string)

    ISO 3166-1 alpha-2 country code for localized search results.

  • language (string)

    Language code for search results (e.g., 'en', 'es', 'fr').

  • num_results (integer)

    Number of search results to return (1-100).

  • safe_search (string enum)

    Safe search filter level.

    Allowed values: off, moderate, strict

  • include_news (boolean)

    Whether to include news search results.

  • search_engine (string enum)

    Search engine to use for the query.

    Allowed values: google, bing, yahoo, duckduckgo

  • include_images (boolean)

    Whether to include image search results.

Output parameters

  • data (object, required)

    Data from the action execution.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • successful (boolean, required)

    Whether the action execution was successful.
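
The SERP Search inputs carry a numeric range and two enumerations that are easy to check before the call. The sketch below builds the payload from keyword options and rejects values outside the documented limits. `build_serp_args` is a hypothetical helper, not part of the connector.

```python
from typing import Any

SEARCH_ENGINES = {"google", "bing", "yahoo", "duckduckgo"}
SAFE_SEARCH_LEVELS = {"off", "moderate", "strict"}
OPTIONAL_KEYS = {
    "page", "country", "language", "num_results",
    "safe_search", "include_news", "search_engine", "include_images",
}


def build_serp_args(query: str, **options: Any) -> dict[str, Any]:
    """Assemble an input payload for BRIGHTDATA_SERP_SEARCH (hypothetical helper)."""
    if not query.strip():
        raise ValueError("query is required")
    unknown = set(options) - OPTIONAL_KEYS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    num_results = options.get("num_results")
    if num_results is not None and not 1 <= num_results <= 100:
        raise ValueError("num_results must be between 1 and 100")
    engine = options.get("search_engine")
    if engine is not None and engine not in SEARCH_ENGINES:
        raise ValueError(f"search_engine must be one of {sorted(SEARCH_ENGINES)}")
    safe = options.get("safe_search")
    if safe is not None and safe not in SAFE_SEARCH_LEVELS:
        raise ValueError(f"safe_search must be one of {sorted(SAFE_SEARCH_LEVELS)}")

    # Options left as None are dropped so that unset values are not sent.
    return {"query": query, **{k: v for k, v in options.items() if v is not None}}
```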

Web Unlocker | BRIGHTDATA_WEB_UNLOCKER | Action

Tool to bypass bot detection, CAPTCHA, and other anti-scraping measures to extract content from websites. Use when you need to scrape websites that block automated access or require JavaScript rendering.

Input parameters

  • url (string, required)

    The URL of the website to unlock and scrape.

  • device (string enum)

    Device type to emulate during scraping.

    Allowed values: desktop, mobile

  • format (string enum)

    Response format: HTML content, PNG screenshot, or JPG screenshot.

    Allowed values: html, png, jpg

  • country (string)

    ISO 3166-1 alpha-2 country code for the proxy location.

  • timeout (integer)

    Request timeout in seconds (1-180).

  • wait_for (string)

    CSS selector to wait for before considering the page loaded.

  • render_js (boolean)

    Whether to render JavaScript on the page.

Output parameters

  • data (object, required)

    Data from the action execution.

  • error (string)

    Error message, if any occurred during the execution of the action.

  • successful (boolean, required)

    Whether the action execution was successful.
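
A sketch of assembling the Web Unlocker input with the documented limits applied: the device and format enumerations and the 1-180 second timeout. `build_unlocker_args` is a hypothetical helper. The check that the URL is an absolute http(s) address is an added assumption and is not stated on this page.

```python
from typing import Any, Optional
from urllib.parse import urlparse

DEVICES = {"desktop", "mobile"}
FORMATS = {"html", "png", "jpg"}


def build_unlocker_args(
    url: str,
    *,
    device: Optional[str] = None,
    fmt: Optional[str] = None,  # sent as "format"; renamed to avoid shadowing the builtin
    country: Optional[str] = None,
    timeout: Optional[int] = None,
    wait_for: Optional[str] = None,
    render_js: Optional[bool] = None,
) -> dict[str, Any]:
    """Assemble an input payload for BRIGHTDATA_WEB_UNLOCKER (hypothetical helper)."""
    parsed = urlparse(url)
    # Assumption: only absolute http(s) URLs are meaningful here.
    if parsed.scheme not in {"http", "https"} or not parsed.netloc:
        raise ValueError("url must be an absolute http(s) URL")
    if device is not None and device not in DEVICES:
        raise ValueError(f"device must be one of {sorted(DEVICES)}")
    if fmt is not None and fmt not in FORMATS:
        raise ValueError(f"format must be one of {sorted(FORMATS)}")
    if timeout is not None and not 1 <= timeout <= 180:
        raise ValueError("timeout must be between 1 and 180 seconds")

    candidates = {
        "device": device,
        "format": fmt,
        "country": country,
        "timeout": timeout,
        "wait_for": wait_for,
        "render_js": render_js,
    }
    # Only options that were set explicitly end up in the payload.
    return {"url": url, **{k: v for k, v in candidates.items() if v is not None}}
```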