Scrapegraph Ai
scrapegraph_aiScrapeGraphAI is an AI-powered web scraping API that enables developers to extract structured data from any website using natural language prompts. Website https://scrapegraphai.com
Acciones disponibles (10)
Cada acción es una operación que el agente puede ejecutar contra este conector. Haz clic en una acción para ver sus parámetros.
Get CreditsSCRAPEGRAPH_AI_GET_CREDITSAcciónRetrieve remaining and used credits for your scrapegraphai account.
SCRAPEGRAPH_AI_GET_CREDITSAcciónRetrieve remaining and used credits for your scrapegraphai account.
Parámetros de entrada
Sin parámetros.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Convert Webpage to MarkdownSCRAPEGRAPH_AI_MARKDOWNIFYAcciónTool to convert a specified webpage into clean, formatted markdown.
SCRAPEGRAPH_AI_MARKDOWNIFYAcciónTool to convert a specified webpage into clean, formatted markdown.
Parámetros de entrada
waitbooleanIf true, wait for job completion and return full results instead of request_id.
website_urlstringObligatorioThe URL of the webpage to convert to Markdown.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Markdownify StatusSCRAPEGRAPH_AI_MARKDOWNIFY_STATUSAcciónCheck status and results of webpage-to-markdown conversion jobs.
SCRAPEGRAPH_AI_MARKDOWNIFY_STATUSAcciónCheck status and results of webpage-to-markdown conversion jobs.
Parámetros de entrada
request_idstringObligatorioThe unique identifier of the Markdownify request
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Search ScraperSCRAPEGRAPH_AI_SEARCH_SCRAPERAcciónPerform ai-powered web searches with structured, parsed results.
SCRAPEGRAPH_AI_SEARCH_SCRAPERAcciónPerform ai-powered web searches with structured, parsed results.
Parámetros de entrada
parsebooleanIf true, parses individual result pages for additional content.
querystringObligatorioThe search query to execute.
languagestringISO 639-1 language code for the search (e.g., 'en').
num_resultsintegerMaximum number of search results to return. Must be ≥ 1.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Check SearchScraper StatusSCRAPEGRAPH_AI_SEARCH_SCRAPER_STATUSAcciónCheck the status and results of an asynchronous searchscraper job.
SCRAPEGRAPH_AI_SEARCH_SCRAPER_STATUSAcciónCheck the status and results of an asynchronous searchscraper job.
Parámetros de entrada
request_idstringObligatorioThe unique identifier of the SearchScraper request
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Start Smart CrawlerSCRAPEGRAPH_AI_SMART_CRAWLER_STARTAcciónStart ai-powered web crawling with structured data extraction across multiple pages.
SCRAPEGRAPH_AI_SMART_CRAWLER_STARTAcciónStart ai-powered web crawling with structured data extraction across multiple pages.
Parámetros de entrada
urlstringObligatorioStarting URL for the crawl.
waitbooleanIf true, wait for job completion and return full results instead of task_id.
depthintegerHow many link levels to follow from the start URL; default is 1.
rulesobjectCustom crawl rules, e.g., include/exclude URL patterns.
promptstringObligatorioInstructions for what data to extract during the crawl.
headersobjectCustom HTTP headers to include in crawl requests.
sitemapbooleanWhether to use sitemap.xml for discovery; default is false.
max_pagesintegerMaximum number of pages to crawl; default is 20.
website_htmlstringRaw HTML content to seed the crawl; overrides `url` if provided (max 2 MB).
output_schemaobjectOptional Pydantic or Zod schema for structured output.
number_of_scrollsintegerNumber of infinite scroll actions per page.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
SmartCrawler StatusSCRAPEGRAPH_AI_SMART_CRAWLER_STATUSAcciónCheck status and results of smartcrawler web crawling jobs.
SCRAPEGRAPH_AI_SMART_CRAWLER_STATUSAcciónCheck status and results of smartcrawler web crawling jobs.
Parámetros de entrada
task_idstringObligatorioThe unique identifier of the SmartCrawler task
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Start Smart ScraperSCRAPEGRAPH_AI_SMART_SCRAPER_STARTAcciónStart ai-powered web scraping with natural language extraction prompts.
SCRAPEGRAPH_AI_SMART_SCRAPER_STARTAcciónStart ai-powered web scraping with natural language extraction prompts.
Parámetros de entrada
waitbooleanIf true, wait for job completion and return full results instead of request_id.
headersobjectAdditional HTTP headers to include in the scraping request.
user_promptstringObligatorioNatural language description of the information to extract.
website_urlstringURL of the webpage to scrape. Required if `website_html` is not provided.
website_htmlstringRaw HTML content of the webpage. Required if `website_url` is not provided.
output_schemaobjectOptional JSON Schema to structure the output.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
SmartScraper StatusSCRAPEGRAPH_AI_SMART_SCRAPER_STATUSAcciónCheck status and results of smartscraper web scraping jobs.
SCRAPEGRAPH_AI_SMART_SCRAPER_STATUSAcciónCheck status and results of smartscraper web scraping jobs.
Parámetros de entrada
request_idstringObligatorioThe unique identifier of the SmartScraper request
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Submit FeedbackSCRAPEGRAPH_AI_SUBMIT_FEEDBACKAcciónSubmit feedback and ratings for completed scrapegraphai requests.
SCRAPEGRAPH_AI_SUBMIT_FEEDBACKAcciónSubmit feedback and ratings for completed scrapegraphai requests.
Parámetros de entrada
ratingintegerObligatorioStar rating from 0 (lowest) to 5 (highest)
request_idstringObligatorioUUID of the request/session this feedback is for
feedback_textstringOptional comments about the request
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not