ScrapingBee
scrapingbee
ScrapingBee is a web scraping API that handles headless browsers and proxy rotation, allowing developers to extract HTML from any website in a single API call.
Available actions (5)
Each action is an operation that the agent can run against this connector.
ScrapingBee Data Extraction
SCRAPINGBEE_DATA_EXTRACTION (Action)
Tool to extract structured data from a webpage using CSS or XPath selectors. Uses ScrapingBee's extract rules feature.
Input parameters
url (string, required, uri): The webpage URL to extract data from.
wait (integer): Seconds to wait before extraction (for dynamic content).
device (string, enum: desktop, mobile): Device type to emulate.
api_key (string, required): Your ScrapingBee API key.
extractor (object, required): JSON object defining the fields to extract and their CSS/XPath selectors. For nested selectors, use an object with a 'selector' key and an optional 'type' key.
javascript (boolean): Whether to render JavaScript before extraction.
country_code (string): Two-letter country code for proxy geolocation (e.g., 'us', 'de').
premium_proxy (boolean): Use a premium proxy for higher reliability.
custom_headers (object): Custom HTTP headers as a JSON object string.
block_resources (boolean): Block images, CSS, and other resources to speed up extraction.
Output parameters
data (object, required): Data from the action execution.
error (string): Error message, if any occurred during the execution of the action.
successful (boolean, required): Whether the action execution was successful.
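As a rough illustration of the `extractor` parameter described above, the sketch below assembles an argument dictionary for SCRAPINGBEE_DATA_EXTRACTION. The helper name `build_extraction_args`, the example selectors, the `"list"` value for `type`, and the placeholder API key are assumptions made for illustration; only the parameter names come from the list above. How the arguments are passed to the connector depends on the agent framework in use.

```python
import json

# Optional parameters documented for SCRAPINGBEE_DATA_EXTRACTION
OPTIONAL_PARAMS = {
    "wait", "device", "javascript", "country_code",
    "premium_proxy", "custom_headers", "block_resources",
}


def build_extraction_args(url, api_key, extractor, **optional):
    """Hypothetical helper: validate and assemble the action arguments."""
    unknown = set(optional) - OPTIONAL_PARAMS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    if optional.get("device") not in (None, "desktop", "mobile"):
        raise ValueError("device must be 'desktop' or 'mobile'")
    if not isinstance(extractor, dict) or not extractor:
        raise ValueError("extractor must be a non-empty object")
    return {"url": url, "api_key": api_key, "extractor": extractor, **optional}


# One plain CSS selector and one nested selector object.
# The "list" type value is an assumption, not taken from the list above.
extractor = {
    "title": "h1",
    "links": {"selector": "a", "type": "list"},
}

args = build_extraction_args(
    "https://example.com",
    "YOUR_API_KEY",  # placeholder, not a real key
    extractor,
    javascript=True,
    device="desktop",
)
print(json.dumps(args, indent=2))
```

Rejecting unknown keys up front is a design choice that surfaces typos such as `render_js` (a parameter of other actions) before the request is sent.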
ScrapingBee HTML Fetch
SCRAPINGBEE_HTML_FETCH (Action)
Tool to fetch HTML or a screenshot via the ScrapingBee HTML API. Use when you need page markup or an image, with optional JavaScript rendering and resource controls.
Input parameters
url (string, required): The URL to scrape.
wait (integer): Milliseconds to wait before returning content.
retry (integer): Number of retries on request failure.
device (string, enum: desktop, mobile): Device type to emulate.
cookies (string): Cookies to send with requests (HTTP header string).
wait_for (string): CSS selector to wait for before returning content.
block_ads (boolean): Block ads and tracking scripts.
render_js (boolean): Render JavaScript before returning HTML.
js_snippet (string): JavaScript snippet to execute before returning content.
screenshot (boolean): Return a screenshot as a base64-encoded PNG.
js_scenario (string): JSON scenario for custom headless browser actions.
country_code (string): Two-letter country code for geolocation (e.g., 'us').
extract_rules (string): Extraction rules (CSS selector or JSONPath).
premium_proxy (boolean): Use a premium proxy for scraping.
stealth_proxy (boolean): Use stealth (undetectable) proxy mode.
block_resources (string): Comma-separated resource types to block (e.g., 'image,stylesheet').
screenshot_selector (string): CSS selector of the element to screenshot.
screenshot_full_page (boolean): Capture a full-page screenshot instead of only the viewport.
Output parameters
data (object, required): Data from the action execution.
error (string): Error message, if any occurred during the execution of the action.
successful (boolean, required): Whether the action execution was successful.
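Several SCRAPINGBEE_HTML_FETCH parameters are typed as strings even though they carry structured values: `js_scenario` is a JSON string and `block_resources` is a comma-separated string. The sketch below shows one way to serialize them. The helper name `build_html_fetch_args` is hypothetical, and the scenario shape (an `instructions` list with `click` and `wait` steps) is an assumption about ScrapingBee's scenario format that should be checked against its documentation.

```python
import json


def build_html_fetch_args(url, **optional):
    """Hypothetical helper: serialize structured values into string parameters."""
    args = {"url": url}

    # js_scenario is documented as a string, so dict scenarios become JSON text
    scenario = optional.pop("js_scenario", None)
    if scenario is not None:
        args["js_scenario"] = (
            scenario if isinstance(scenario, str) else json.dumps(scenario)
        )

    # block_resources is documented as a comma-separated string
    resources = optional.pop("block_resources", None)
    if resources is not None:
        args["block_resources"] = (
            resources if isinstance(resources, str) else ",".join(resources)
        )

    # Remaining keyword arguments are passed through unchanged
    args.update(optional)
    return args


# Assumed scenario shape: click a button, then wait 1000 ms
scenario = {"instructions": [{"click": "#load-more"}, {"wait": 1000}]}

args = build_html_fetch_args(
    "https://example.com/products",
    render_js=True,
    wait_for=".product-card",
    js_scenario=scenario,
    block_resources=["image", "stylesheet"],
)
print(args["block_resources"])
```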
ScrapingBee Proxy Mode
SCRAPINGBEE_SCRAPING_BEE_PROXY_MODE (Action)
Tool to fetch web content via ScrapingBee's proxy mode. Use when you need to route requests through ScrapingBee proxies, with optional JavaScript rendering and resource blocking.
Input parameters
url (string, required, uri): The target URL to scrape through ScrapingBee Proxy Mode.
cookies (object): Cookies to send with the request as a key-value mapping.
headers (object): Additional HTTP headers to send as a key-value mapping.
timeout (integer): Request timeout in milliseconds.
block_ads (boolean): Block ads and tracking scripts to speed up scraping.
render_js (boolean): Enable JavaScript rendering before returning content.
session_id (string): Session identifier to keep the same IP across multiple requests.
js_scenario (string): Custom JavaScript scenario name for advanced interactions.
country_code (string, enum: us, de, fr, uk, ca, it, es): Two-letter country code for a geolocated proxy (e.g., 'us', 'fr').
premium_proxy (boolean): Use premium proxies for higher reliability.
stealth_proxy (boolean): Use stealth proxy mode to reduce the chance of detection.
block_resources (string[]): List of resource types to block (e.g., ['image', 'stylesheet']).
forward_headers (boolean): Forward original request headers to the target site.
Output parameters
data (object, required): Data from the action execution.
error (string): Error message, if any occurred during the execution of the action.
successful (boolean, required): Whether the action execution was successful.
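Unlike the HTML fetch action, the proxy mode action takes `cookies` and `headers` as key-value mappings, `block_resources` as a list, and restricts `country_code` to an enum. A minimal sketch of building such arguments follows. The helper name `build_proxy_mode_args` and the sample values are hypothetical, and the set of allowed country codes is read from the enum values listed above.

```python
# Enum values for country_code, as listed for this action
ALLOWED_COUNTRIES = {"us", "de", "fr", "uk", "ca", "it", "es"}


def build_proxy_mode_args(url, country_code=None, cookies=None,
                          headers=None, block_resources=None, **flags):
    """Hypothetical helper: assemble arguments for the proxy mode action."""
    args = {"url": url}
    if country_code is not None:
        code = country_code.lower()
        if code not in ALLOWED_COUNTRIES:
            raise ValueError(f"unsupported country_code: {country_code!r}")
        args["country_code"] = code
    if cookies is not None:
        args["cookies"] = dict(cookies)        # key-value mapping
    if headers is not None:
        args["headers"] = dict(headers)        # key-value mapping
    if block_resources is not None:
        args["block_resources"] = list(block_resources)  # list of strings
    # Boolean switches and other scalars (render_js, premium_proxy, ...) pass through
    args.update(flags)
    return args


args = build_proxy_mode_args(
    "https://example.com",
    country_code="FR",
    cookies={"session": "abc123"},
    headers={"Accept-Language": "fr-FR"},
    block_resources=["image", "stylesheet"],
    render_js=False,
    session_id="job-42",
)
print(args)
```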
ScrapingBee Stealth Proxy
SCRAPINGBEE_STEALTH_PROXY (Action)
Tool to perform stealth scraping via ScrapingBee's stealth proxy mode. Use when you encounter anti-bot measures that require hard-to-detect requests.
Input parameters
url (string, required, uri): The URL of the webpage to retrieve using the stealth proxy.
wait (integer): Wait time in milliseconds before returning the response.
device (string, enum: desktop, mobile, tablet): Device type to emulate during rendering.
cookies (string): Custom cookies as a JSON-encoded string.
headers (string): Additional HTTP headers as a JSON-encoded string.
js_render (boolean): Render JavaScript on the page before returning the response.
country_code (string): Two-letter country code for proxy geolocation (e.g., 'us', 'de').
extract_rules (string): Extraction rules as a JSON string for structured data.
premium_proxy (boolean): Use premium proxies for higher reliability.
stealth_proxy (boolean): Enable stealth proxy mode. Use when the target site blocks bots.
block_resources (boolean): Block images, styles, and fonts for faster loads.
forward_headers (boolean): Forward original request headers from the browser.
return_page_source (boolean): Return the raw page source instead of text.
Output parameters
data (object, required): Data from the action execution.
error (string): Error message, if any occurred during the execution of the action.
successful (boolean, required): Whether the action execution was successful.
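For the stealth proxy action, `cookies`, `headers`, and `extract_rules` are JSON-encoded strings rather than objects, which is easy to get wrong when switching from the proxy mode action. The sketch below encodes them with `json.dumps`. The helper name `build_stealth_args` and the sample values are hypothetical, and setting `stealth_proxy` to true by default is an assumption about how this action would typically be called.

```python
import json


def build_stealth_args(url, cookies=None, headers=None,
                       extract_rules=None, **optional):
    """Hypothetical helper: JSON-encode the string-typed structured parameters."""
    # Assumption: stealth mode is switched on unless the caller overrides it
    args = {"url": url, "stealth_proxy": True}
    for name, value in (("cookies", cookies),
                        ("headers", headers),
                        ("extract_rules", extract_rules)):
        if value is not None:
            # These three parameters are documented as JSON strings
            args[name] = json.dumps(value)
    args.update(optional)
    return args


args = build_stealth_args(
    "https://example.com/pricing",
    cookies={"consent": "yes"},
    headers={"Accept-Language": "en-US"},
    extract_rules={"price": ".price"},
    js_render=True,
    device="tablet",
    wait=2000,  # milliseconds for this action
)
print(args["headers"])
```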
ScrapingBee Usage Stats
SCRAPINGBEE_USAGE_STATS (Action)
Tool to retrieve usage statistics for your ScrapingBee account. Use when you need to monitor remaining credits and request count.
Input parameters
No parameters.
Output parameters
data (object, required): Data from the action execution.
error (string): Error message, if any occurred during the execution of the action.
successful (boolean, required): Whether the action execution was successful.
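Every action on this page returns the same three output fields: `data`, `error`, and `successful`. The sketch below shows one way a caller might unwrap that envelope. The names `ActionError` and `unwrap_result` are hypothetical, and because the keys inside `data` are not documented here, the sample results use placeholder content rather than real usage fields.

```python
class ActionError(RuntimeError):
    """Raised when an action result reports successful == False."""


def unwrap_result(result):
    """Return result['data'] on success, otherwise raise ActionError."""
    if not result.get("successful", False):
        # Fall back to a generic message when the error field is empty
        raise ActionError(result.get("error") or "action failed without an error message")
    return result["data"]


# Placeholder results; the real contents of `data` depend on the action
ok = {"data": {"placeholder": 1}, "error": None, "successful": True}
failed = {"data": {}, "error": "request rejected", "successful": False}

print(unwrap_result(ok))
try:
    unwrap_result(failed)
except ActionError as exc:
    print(f"action failed: {exc}")
```

Checking `successful` before reading `data` keeps failure handling in one place instead of scattering it across each action call.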