Bright Data
brightdataBright Data provides the world's #1 web data platform with Web Unlocker for bypassing anti-bot systems, SERP API for search engine data, and pre-made scrapers for popular websites. Collect any web data at scale.
Acciones disponibles (10)
Cada acción es una operación que el agente puede ejecutar contra este conector. Haz clic en una acción para ver sus parámetros.
Trigger Site CrawlBRIGHTDATA_CRAWL_APIAcciónTool to trigger a site crawl job to extract content across multiple pages or entire domains. use when you need to start a crawl for a given dataset and list of urls.
BRIGHTDATA_CRAWL_APIAcciónTool to trigger a site crawl job to extract content across multiple pages or entire domains. use when you need to start a crawl for a given dataset and list of urls.
Parámetros de entrada
itemsobject[]ObligatorioList of objects containing URLs or other parameters required by the crawler.
dataset_idstringObligatorioYour dataset ID.
include_errorsbooleanWhether to include an errors report with the results.
custom_output_fieldsstringPipe-separated list of output fields to include (e.g., 'url|about.updated_on').
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Browse Available ScrapersBRIGHTDATA_DATASET_LISTAcciónTool to list all available pre-made scrapers (datasets) from bright data's marketplace. use when you need to browse available data sources for structured scraping.
BRIGHTDATA_DATASET_LISTAcciónTool to list all available pre-made scrapers (datasets) from bright data's marketplace. use when you need to browse available data sources for structured scraping.
Parámetros de entrada
Sin parámetros.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Filter DatasetBRIGHTDATA_FILTER_DATASETAcciónTool to apply custom filter criteria to a marketplace dataset (beta). use after selecting a dataset to generate a filtered snapshot.
BRIGHTDATA_FILTER_DATASETAcciónTool to apply custom filter criteria to a marketplace dataset (beta). use after selecting a dataset to generate a filtered snapshot.
Parámetros de entrada
filesstring[]Optional list of file paths to CSV or JSON files containing filter values.
filterobjectObligatorioFilter criteria object. Supports operators: '=', '!=', '<', '<=', '>', '>=', 'in', 'not_in', 'includes', 'not_includes', 'array_includes', 'not_array_includes', 'is_null', 'is_not_null'. Combine using 'and'/'or' up to 3 levels of nesting.
dataset_idstringObligatorioID of the dataset to filter.
records_limitintegerOptional maximum number of records to include in the snapshot, must be >= 1.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Get Available CitiesBRIGHTDATA_GET_LIST_OF_AVAILABLE_CITIESAcciónTool to get available static network cities for a given country. use when you need to configure static proxy endpoints after selecting a country.
BRIGHTDATA_GET_LIST_OF_AVAILABLE_CITIESAcciónTool to get available static network cities for a given country. use when you need to configure static proxy endpoints after selecting a country.
Parámetros de entrada
countrystringObligatorioISO 3166-1 alpha-2 country code to list available cities for.
pool_ip_typestringenumType of static proxy pool. 'dc' for datacenter, 'static_res' for residential.
dcstatic_res
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Get Available CountriesBRIGHTDATA_GET_LIST_OF_AVAILABLE_COUNTRIESAcciónTool to list available countries and their iso 3166-1 alpha-2 codes. use when you need to configure zones with valid country codes before provisioning proxies.
BRIGHTDATA_GET_LIST_OF_AVAILABLE_COUNTRIESAcciónTool to list available countries and their iso 3166-1 alpha-2 codes. use when you need to configure zones with valid country codes before provisioning proxies.
Parámetros de entrada
Sin parámetros.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Download Scraped DataBRIGHTDATA_GET_SNAPSHOT_RESULTSAcciónTool to retrieve the scraped data from a completed crawl job by snapshot id. use after triggering a crawl or filtering a dataset to download the collected data.
BRIGHTDATA_GET_SNAPSHOT_RESULTSAcciónTool to retrieve the scraped data from a completed crawl job by snapshot id. use after triggering a crawl or filtering a dataset to download the collected data.
Parámetros de entrada
limitintegerMaximum number of records to return.
formatstringenumDesired output format.
jsoncsvndjsonoffsetintegerNumber of records to skip before starting to collect the result set.
snapshot_idstringObligatorioIdentifier of the snapshot to fetch results for.
Parámetros de salida
dataanyParsed JSON payload when format is 'json'.
errorstringError if any occurred during the execution of the action
contentstringRaw content for non-JSON formats (csv/ndjson).
successfulbooleanObligatorioWhether or not the action execution was successful or not
Check Crawl StatusBRIGHTDATA_GET_SNAPSHOT_STATUSAcciónTool to check the processing status of a crawl job using snapshot id. call before attempting to download results to ensure data collection is complete.
BRIGHTDATA_GET_SNAPSHOT_STATUSAcciónTool to check the processing status of a crawl job using snapshot id. call before attempting to download results to ensure data collection is complete.
Parámetros de entrada
snapshot_idstringObligatorioIdentifier of the snapshot to check.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
List Unlocker ZonesBRIGHTDATA_LIST_WEB_UNLOCKER_ZONESAcciónTool to list your configured web unlocker zones and proxy endpoints. use to view available zones for web scraping and bot protection bypass.
BRIGHTDATA_LIST_WEB_UNLOCKER_ZONESAcciónTool to list your configured web unlocker zones and proxy endpoints. use to view available zones for web scraping and bot protection bypass.
Parámetros de entrada
Sin parámetros.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
SERP SearchBRIGHTDATA_SERP_SEARCHAcciónTool to perform serp (search engine results page) searches across different search engines. use when you need to retrieve search results, trending topics, or competitive analysis data.
BRIGHTDATA_SERP_SEARCHAcciónTool to perform serp (search engine results page) searches across different search engines. use when you need to retrieve search results, trending topics, or competitive analysis data.
Parámetros de entrada
pageintegerPage number of results to fetch.
querystringObligatorioThe search query to execute.
countrystringISO 3166-1 alpha-2 country code for localized search results.
languagestringLanguage code for search results (e.g., 'en', 'es', 'fr').
num_resultsintegerNumber of search results to return (1-100).
safe_searchstringenumSafe search filter level.
offmoderatestrictinclude_newsbooleanWhether to include news search results.
search_enginestringenumSearch engine to use for the query.
googlebingyahooduckduckgoinclude_imagesbooleanWhether to include image search results.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Web UnlockerBRIGHTDATA_WEB_UNLOCKERAcciónTool to bypass bot detection, captcha, and other anti-scraping measures to extract content from websites. use when you need to scrape websites that block automated access or require javascript rendering.
BRIGHTDATA_WEB_UNLOCKERAcciónTool to bypass bot detection, captcha, and other anti-scraping measures to extract content from websites. use when you need to scrape websites that block automated access or require javascript rendering.
Parámetros de entrada
urlstringObligatorioThe URL of the website to unlock and scrape.
devicestringenumDevice type to emulate during scraping.
desktopmobileformatstringenumResponse format - HTML content, PNG screenshot, or JPG screenshot.
htmlpngjpgcountrystringISO 3166-1 alpha-2 country code for the proxy location.
timeoutintegerRequest timeout in seconds (1-180).
wait_forstringCSS selector to wait for before considering page loaded.
render_jsbooleanWhether to render JavaScript on the page.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not