NAiOS IconNAiOS Logo
Volver al catálogo

WebScraping.AI

webscraping_ai

WebScraping.AI provides an API for web scraping with features like Chrome JS rendering, rotating proxies, and HTML parsing.

Acciones
4
Triggers
0
Autenticación
OAuth gestionado
No
Información técnica: el detalle de parámetros, schemas y triggers de esta página está pensado para equipos de integración. Si solo necesitas saber si tu herramienta favorita está disponible, basta con ver la lista de acciones.

Acciones disponibles (4)

Cada acción es una operación que el agente puede ejecutar contra este conector. Haz clic en una acción para ver sus parámetros.

Get account usage and quotaWEBSCRAPING_AI_ACCOUNT_INFOAcción

Tool to retrieve account api call quota and usage. use when checking remaining requests and subscription details.

Parámetros de entrada

Sin parámetros.

Parámetros de salida

  • dataobjectObligatorio

    Data from the action execution

  • errorstring

    Error if any occurred during the execution of the action

  • successfulbooleanObligatorio

    Whether or not the action execution was successful or not

Retrieve HTML ContentWEBSCRAPING_AI_GET_HTMLAcción

Tool to retrieve html content of a web page. use when you need raw page html, optionally rendered with javascript.

Parámetros de entrada

  • jsboolean

    Whether to render JavaScript before fetching.

  • urlstringObligatoriouri

    The target URL to scrape.

  • proxystring

    Proxy location/country code, e.g., 'us', 'de'.

  • devicestringenum

    Device type to spoof the user-agent as.

    mobiledesktop
  • cookiesobject

    Custom cookies to include in the browser session.

  • headersobject

    Custom HTTP headers to include in the request.

Parámetros de salida

  • dataobjectObligatorio

    Data from the action execution

  • errorstring

    Error if any occurred during the execution of the action

  • successfulbooleanObligatorio

    Whether or not the action execution was successful or not

Get Rendered HTMLWEBSCRAPING_AI_GET_RENDERED_HTMLAcción

Tool to retrieve fully rendered html of a webpage. use when js-generated content must be included.

Parámetros de entrada

  • jsstring

    Base64-encoded JavaScript to execute after rendering.

  • urlstringObligatoriouri

    The target URL to render and fetch HTML.

  • waitinteger

    Wait time before capture, in milliseconds.

  • devicestringenum

    Browser device mode to simulate.

    desktopmobile
  • localestring

    Browser locale (RFC5646 code).

  • cookiesstring

    Cookies in 'key1=value1; key2=value2;' format.

  • headersobject

    Extra HTTP headers as JSON object.

  • refererstringuri

    Referer header value.

  • timeoutinteger

    Request timeout, in milliseconds.

  • proxy_typestringenum

    Proxy type to use for the request.

    datacenterresidential
  • user_agentstring

    Custom User-Agent string.

  • disable_imagesboolean

    Whether to disable image loading.

Parámetros de salida

  • dataobjectObligatorio

    Data from the action execution

  • errorstring

    Error if any occurred during the execution of the action

  • successfulbooleanObligatorio

    Whether or not the action execution was successful or not

Get TextWEBSCRAPING_AI_GET_TEXTAcción

Tool to retrieve raw text content from a specified web page. use when you need plain text extraction from a url.

Parámetros de entrada

  • urlstringObligatorio

    The target URL to scrape text from.

  • proxystringenum

    Proxy region to use for the request (e.g., 'us' or 'eu').

    useu
  • localestring

    Browser locale/language (e.g., 'en-US').

  • sessionstring

    Session ID for preserving cookies across multiple calls.

  • timeoutinteger

    Request timeout in seconds (must be >= 1).

  • render_jsboolean

    Whether to render JavaScript on the page before extracting text.

Parámetros de salida

  • dataobjectObligatorio

    Data from the action execution

  • errorstring

    Error if any occurred during the execution of the action

  • successfulbooleanObligatorio

    Whether or not the action execution was successful or not